# **Thinker-IM: An Energy-Efficient Mixed Signal RNN Engine with Computing-in-Memory Techniques and Predictive Execution**

**Ruiqi Guo<sup>1</sup>**, Yonggang Liu<sup>1</sup>, Shixuan Zheng<sup>1</sup>, Ssu-Yen Wu<sup>2</sup>, Peng Ouyang<sup>3</sup>, Win-San Khwa<sup>2</sup>, Xi Chen<sup>1</sup>, Jia-Jing Chen<sup>2</sup>, Xiudong Li<sup>3</sup>, Leibo Liu<sup>1</sup>, Meng-Fan Chang<sup>2</sup>, Shaojun Wei<sup>1</sup>, <u>Shouyi Yin<sup>1</sup></u>\*

<sup>1</sup>Tsinghua University, Beijing; <sup>2</sup>National Tsing Hua University, Hsinchu; <sup>3</sup>TsingMicro Tech, Beijing; \*yinsy@tinghua.edu.cn





### Low Power Techniques

#### **<u>1. SRAM-CIM Macro Design and CIM-aware Weights Adaptation</u> Key features of SRAM-CIM macro:**

A. Dual-split-control 6T memory cell to achieve XNOR; B. Serial-phase triple sensing controller to support 3-b output



## Verification

#### **Demonstration System**

#### **Test chip communicates with PC**

- neural network weights, BN
- parameters, *etc*,
- testing data;
- configurations;
- recognition result.
- **Oscilloscope measures the working** current









### 2.6 mm \_\_ **RNN Engine** using 16 CIM SRAM Macros VAD & & Mel Filter Compressed Quantization Data Memory Unit D Main Controller

#### **Chip Summary:**

- Process: 65 nm CMOS
- Supply Voltage: 0.9 1.1 V
- Frequency: 5 75 MHZ
- Core Size: 3.1×2 mm<sup>2</sup>
- **Die Size: 3.7 × 2.6 mm<sup>2</sup>**
- Neural Energy Efficiency: 5.1 pJ/Neuron @0.9 V, 75 MHZ
- **Arithmetic Energy Efficiency:** 11.7 TOPS/W @0.9 V, 75 MHZ

#### **Key Features:**

- A. Multiple SRAM-CIM architecture
- **B.** Muti-bit output SRAM-CIM
- **C.** Low-current training flow for SRAM-CIM architecture
- **D.** Predictive early BN and binarization method