ALCHEM: Research

Over the last decade, large-scale deep neural networks (DNNs) have made breakthroughs in many fields, such as image recognition, speech recognition, game playing, complicated control systems, driver-less cars and unmanned aerial systems (UAS). High computational complexity and large model size are two key challenges of deep neural networks (DNNs) that motivated the research efforts on model compression techniques and hardware acceleration.

Our research group made research contributions to accelerate machine learning with architecture and algorithm innovations. PipeLayer [HPCA'17] is a ReRAM-based PIM accelerator for CNNs that support both training and testing. SC-DCNN [ASPLOS'17] is the first comprehensive design and optimization framework of stochastic computing based DNNs, using a bottom-up approach. CirCNN [MICRO'17] is a principled approach to represent weights and process neural networks using block-circulant matrices. CirCNN utilizes the Fast Fourier Transform (FFT)-based fast multiplication, simultaneously reducing the computational complexity (both in inference and training) from O(n2) to O(nlogn) and the storage complexity from O(n2) to O(n), with negligible accuracy loss. VIBNN [ASPLOS'18] is an FPGA-based hardware accelerator design for variational inference on BNNs.

SparseCore: Stream ISA and Processor Specialization for Sparse Computation
Gengyu Rao, Jingji Chen, Jason Yik, Xuehai Qian
ASPLOS'22 The 22nd Conference on Architectural Support for Programming Languages and Operating Systems, 2022
Non-Structured DNN Weight Pruning--Is It Beneficial in Any Platform?
Xiaolong Ma, Sheng Lin, Shaokai Ye, Zhezhi He, Linfeng Zhang, Geng Yuan, Sia Huat Tan, Zhengang Li, Deliang Fan, Xuehai Qian, Xue Lin, Kaisheng Ma, Yanzhi Wang
TNNLS'21 IEEE Transactions on Neural Networks and Learning Systems, 2021
ESCALATE: Boosting the Efficiency of Sparse CNN Accelerator with Kernel Decomposition
Shiyu Li, Edward Hanson, Xuehai Qian, Hai Li, Yiran Chen
MICRO'21 The 54th International Symposium on Microarchitecture, 2021
HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation
Qincheng Xiao, Size Zheng, Bingzhe Wu, Pengcheng Xu, Xuehai Qian, Yun Liang
ISCA'21 The 48th International Symposium on Computer Architecture, 2021
FORMS: Fine-grained Polarized ReRAM-based In-situ Computation for Mixed-signal DNN Accelerator
Geng Yuan, Payman Behnam, Zhengang Li, Ali Shafiee, Sheng Lin, Xiaolong Ma, Hang Liu, Xuehai Qian, Mahdi Nazm Bojnordi, Yanzhi Wang, Caiwen Ding
ISCA'21 The 48th International Symposium on Computer Architecture, 2021
GoSPA: An Energy-efficient High-performance Globally Optimized SParse Convolutional Neural Network Accelerator
Chunhua Deng, Siyu Liao, Yang Sui, Xuehai Qian, Bo Yuan
ISCA'21 The 48th International Symposium on Computer Architecture, 2021
Low-Cost Floating-Point Processing in ReRAM for Scientific Computing
Linghao Song, Fan Chen, Xuehai Qian, Hai Li, Yiran Chen
arXiv'20 arXiv:2011.03190, 2020
Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework
Sung-En Chang, Yanyu Li, Mengshu Sun, Runbin Shi, Hayden K.-H. So, Yanzhi Wang, Xuehai Qian, Xue Lin,
HPCA'21 The 27th International Symposium on High Performance Computer Architecture, 2021
Non-Structured DNN Weight Pruning -- Is It Beneficial in Any Platform?
Xiaolong Ma, Sheng Lin, Shaokai Ye, Zhezhi He, Linfeng Zhang, Geng Yuan, Sia Huat Tan, Zhengang Li, Deliang Fan, Xuehai Qian, Xue Lin, Kaisheng Ma, Yanzhi Wang
arXiv 1907.02124, 2020
CELIA: A Full-Stack Framework for STT-MRAM-Based Deep Learning Acceleration
Hao Yan, Hebin R. Cherian, Ethan C. Ahn, Xuehai Qian, Lide Duan
TPDS'20 IEEE Transactions on Parallel and Distributed Systems, 2020
Prague: High-Performance Heterogeneity-Aware Asynchronouos Decentralized Training
Qinyi Luo, Jiaao He, Youwei Zhuo, Xuehai Qian
ASPLOS'20 The 25th International Conference on Architectural Support for Programming Languages and Operating Systems, 2020
Capuchin: Tensor-based GPU Memory Management for Deep Learning
Xuan Peng, Xuanhua Shi, Hulin Dai, Hai Jin, Weiliang Ma, Fan Yang, Xuehai Qian
ASPLOS'20 The 25th International Conference on Architectural Support for Programming Languages and Operating Systems, 2020
DNN-Guard: An Elastic Heterogeneous Architecture for DNN Accelerator against Adversarial Attacks
Xingbin Wang, Rui Hou, Boyan Zhao, Fengkai Yuan, Jun Zhang, Dan Meng, Xuehai Qian
ASPLOS'20 The 25th International Conference on Architectural Support for Programming Languages and Operating Systems, 2020
PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Wei Niu, Xiaolong Ma, Sheng Lin, Shihao Wang, Xuehai Qian, Xue Lin, Yanzhi Wang, Bin Ren
ASPLOS'20 The 25th International Conference on Architectural Support for Programming Languages and Operating Systems, 2020
AccPar: Tensor Partitioning for Heterogeneous Deep Learning Accelerator Arrays
Linghao Song, Fan Chen, Youwei Zhuo, Xuehai Qian, Hai Li, Yiran Chen
HPCA'20 The 26th IEEE International Symposium on High Performance Computer Architecture
TIE: Energy-Eﬃcient Tensor Train-Based Inference Engine for Deep Neural Network
Chunhua Deng, Fangxuan Sun, Xuehai Qian, Jun Lin, Zhongfeng Wang, Bo Yuan
ISCA'19The 46th International Symposium on Computer Architecture, 2019
A Stochastic-Computing based Deep Learning Framework using Adiabatic Quantum-Flux-Parametron Superconducting Technology
Ruizhe Cai, Ao Ren, Olivia Chen, Ning Liu, Caiwen Ding, Xuehai Qian, Jie Han, Wenhui Luo, Yoshikawa Nobuyuki, Yanzhi Wang
ISCA'19The 46th International Symposium on Computer Architecture, 2019
HOP: Heterogeneity-Aware Decentralized Training
Qinyi Luo, Jinkun Lin, Youwei Zhuo, Xuehai Qian
ASPLOS'19 The 24th International Conference on Architectural Support for Programming Languages and Operating Systems, 2019
ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Methods of Multipliers
Ao Ren, Jiayu Li, Tianyun Zhang, Shaokai Ye, Wenyao Xu, Xuehai Qian, Xue Lin, Yanzhi Wang
ASPLOS'19 The 24th International Conference on Architectural Support for Programming Languages and Operating Systems, 2019
HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
Linghao Song, Jiachen Mao, Youwei Zhuo, Xuehai Qian, Hai Li, Yiran Chen
HPCA'19 The 25th IEEE International Symposium on High Performance Computer Architecture, 2019
E-RNN: Design Optimization for Eﬃcient Recurrent Neural Networks in FPGAs
Zhe Li, Caiwen Ding, Siyue Wang, Wujie Wen, Youwei Zhuo, Chang Liu, Qinru Qiu, Wenyao Xu, Xue Lin, Xuehai Qian, Yanzhi Wang
HPCA'19 The 25th IEEE International Symposium on High Performance Computer Architecture, 2019
HEIF: Highly Efficient Stochastic Computing based Inference Framework for Deep Neural Networks
Zhe Li, Ji Li, Ao Ren, Ruizhe Cai, Caiwen Ding, Xuehai Qian, Jeffrey Draper, Bo Yuan, Jian Tang, Qinru Qiu, Yanzhi Wang
TCAD 2019 IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
PermDNN: Efficient Compressed Deep Neural Network Architecture with Permuted Diagonal Matrices
Chunhua Deng, Siyu Liao, Yi Xie, Keshab K. Parhi, Xuehai Qian, Bo Yuan
MICRO'18 The 51st IEEE/ACM International Symposium on Microarchitectureng Systems
VIBNN: Hardware Acceleration of Bayesian Neural Networks
Ruizhe Cai, Ao Ren, Ning Liu, Caiwen Ding, Luhao Wang, Xuehai Qian, Massoud Pedram, Yanzhi Wang
ASPLOS'18 The 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2018
Neu-NoC: A high-efficient interconnection network for accelerated neuromorphic systems
Xiaoxiao Liu, Wei Wen, Xuehai Qian, Hai Li, Yiran Chen
ASP-DAC 2018 Asia and South Pacific Design Automation Conference
ReRAM-based accelerator for deep learning
Bing Li, Linghao Song, Fan Chen, Xuehai Qian, Yiran Chen, Hai Helen Li
DATE 2018 Design Automation and Test in Europe
CIRCNN: Accelerating and Compressing Deep Neural Networks Using Block-Circulant Weight Matrices
Caiwen Ding, Yanzhi Wang, Siyu Liao, Zhe Li, Yu Bai, Youwei Zhuo, Chao Wang, Xuehai Qian, Ning Liu, Geng Yuan, Xiaolong Ma, Yipeng Zhang, Xue Lin, Jian Tang, Qinru Qiu, Bo Yuan
MICRO'17 The 50th Annual IEEE/ACM International Symposium on Microarchitecture, 2017
SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing
Ao Ren, Ji Li, Zhe Li, Caiwen Ding, Xuehai Qian, Qinru Qiu, Bo Yuan and Yanzhi Wang
ASPLOS '17 The 22nd International Conference on Architectural Support for Programming Languages and Operating Systems, 2017
PipeLayer: A Pipelined ReRAM-Based Accelerator for Deep Learning
Linghao Song Xuehai Qian, Hai Li and Yiran Chen
HPCA'17 The 23rd IEEE International Symposium on High Performance Computer Architecture, 2017

Machine Learning Acceleration