Over the last decade, large-scale deep neural networks (DNNs) have made breakthroughs in many fields, such as image recognition, speech recognition, game playing, complicated control systems, driver-less cars and unmanned aerial systems (UAS). High computational complexity and large model size are two key challenges of deep neural networks (DNNs) that motivated the research efforts on model compression techniques and hardware acceleration.
Our research group made research contributions to accelerate machine learning with architecture and algorithm innovations. PipeLayer [HPCA'17] is a ReRAM-based PIM accelerator for CNNs that support both training and testing. SC-DCNN [ASPLOS'17] is the first comprehensive design and optimization framework of stochastic computing based DNNs, using a bottom-up approach. CirCNN [MICRO'17] is a principled approach to represent weights and process neural networks using block-circulant matrices. CirCNN utilizes the Fast Fourier Transform (FFT)-based fast multiplication, simultaneously reducing the computational complexity (both in inference and training) from O(n2) to O(nlogn) and the storage complexity from O(n2) to O(n), with negligible accuracy loss. VIBNN [ASPLOS'18] is an FPGA-based hardware accelerator design for variational inference on BNNs.
-
SparseCore: Stream ISA and Processor Specialization for Sparse Computation
Gengyu Rao, Jingji Chen, Jason Yik, Xuehai Qian
ASPLOS'22 The 22nd Conference on Architectural Support for Programming Languages and Operating Systems, 2022
-
Non-Structured DNN Weight Pruning--Is It Beneficial in Any Platform?
Xiaolong Ma, Sheng Lin, Shaokai Ye, Zhezhi He, Linfeng Zhang, Geng Yuan, Sia Huat Tan, Zhengang Li, Deliang Fan, Xuehai Qian, Xue Lin, Kaisheng Ma, Yanzhi Wang
TNNLS'21 IEEE Transactions on Neural Networks and Learning Systems, 2021
-
ESCALATE: Boosting the Efficiency of Sparse CNN Accelerator with Kernel Decomposition
Shiyu Li, Edward Hanson, Xuehai Qian, Hai Li, Yiran Chen
MICRO'21 The 54th International Symposium on Microarchitecture, 2021
-
HASCO: Towards Agile HArdware and Software CO-design for Tensor Computation
Qincheng Xiao, Size Zheng, Bingzhe Wu, Pengcheng Xu, Xuehai Qian, Yun Liang
ISCA'21 The 48th International Symposium on Computer Architecture, 2021
-
FORMS: Fine-grained Polarized ReRAM-based In-situ Computation for Mixed-signal DNN Accelerator
Geng Yuan, Payman Behnam, Zhengang Li, Ali Shafiee, Sheng Lin, Xiaolong Ma, Hang Liu, Xuehai Qian, Mahdi Nazm Bojnordi, Yanzhi Wang, Caiwen Ding
ISCA'21 The 48th International Symposium on Computer Architecture, 2021
-
GoSPA: An Energy-efficient High-performance Globally Optimized SParse Convolutional Neural Network Accelerator
Chunhua Deng, Siyu Liao, Yang Sui, Xuehai Qian, Bo Yuan
ISCA'21 The 48th International Symposium on Computer Architecture, 2021
-
Low-Cost Floating-Point Processing in ReRAM for Scientific Computing
Linghao Song, Fan Chen, Xuehai Qian, Hai Li, Yiran Chen
arXiv'20 arXiv:2011.03190, 2020
-
Mix and Match: A Novel FPGA-Centric Deep Neural Network Quantization Framework
Sung-En Chang, Yanyu Li, Mengshu Sun, Runbin Shi, Hayden K.-H. So, Yanzhi Wang, Xuehai Qian, Xue Lin,
HPCA'21 The 27th International Symposium on High Performance Computer Architecture, 2021
-
Non-Structured DNN Weight Pruning -- Is It Beneficial in Any Platform?
Xiaolong Ma, Sheng Lin, Shaokai Ye, Zhezhi He, Linfeng Zhang, Geng Yuan, Sia Huat Tan, Zhengang Li, Deliang Fan, Xuehai Qian, Xue Lin, Kaisheng Ma, Yanzhi Wang
arXiv 1907.02124, 2020
-
CELIA: A Full-Stack Framework for STT-MRAM-Based Deep Learning Acceleration
Hao Yan, Hebin R. Cherian, Ethan C. Ahn, Xuehai Qian, Lide Duan
TPDS'20 IEEE Transactions on Parallel and Distributed Systems, 2020
-
Prague: High-Performance Heterogeneity-Aware Asynchronouos Decentralized Training
Qinyi Luo, Jiaao He, Youwei Zhuo, Xuehai Qian
ASPLOS'20 The 25th International Conference on Architectural Support for Programming Languages and Operating Systems, 2020
-
Capuchin: Tensor-based GPU Memory Management for Deep Learning
Xuan Peng, Xuanhua Shi, Hulin Dai, Hai Jin, Weiliang Ma, Fan Yang, Xuehai Qian
ASPLOS'20 The 25th International Conference on Architectural Support for Programming Languages and Operating Systems, 2020
-
DNN-Guard: An Elastic Heterogeneous Architecture for DNN Accelerator against Adversarial Attacks
Xingbin Wang, Rui Hou, Boyan Zhao, Fengkai Yuan, Jun Zhang, Dan Meng, Xuehai Qian
ASPLOS'20 The 25th International Conference on Architectural Support for Programming Languages and Operating Systems, 2020
-
PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Wei Niu, Xiaolong Ma, Sheng Lin, Shihao Wang, Xuehai Qian, Xue Lin, Yanzhi Wang, Bin Ren
ASPLOS'20 The 25th International Conference on Architectural Support for Programming Languages and Operating Systems, 2020
-
AccPar: Tensor Partitioning for Heterogeneous Deep Learning Accelerator Arrays
Linghao Song, Fan Chen, Youwei Zhuo, Xuehai Qian, Hai Li, Yiran Chen
HPCA'20 The 26th IEEE International Symposium on High Performance Computer Architecture
-
TIE: Energy-Efficient Tensor Train-Based Inference Engine for Deep Neural Network
Chunhua Deng, Fangxuan Sun, Xuehai Qian, Jun Lin, Zhongfeng Wang, Bo Yuan
ISCA'19The 46th International Symposium on Computer Architecture, 2019
-
A Stochastic-Computing based Deep Learning Framework using Adiabatic Quantum-Flux-Parametron Superconducting Technology
Ruizhe Cai, Ao Ren, Olivia Chen, Ning Liu, Caiwen Ding, Xuehai Qian, Jie Han, Wenhui Luo,
Yoshikawa Nobuyuki, Yanzhi Wang
ISCA'19The 46th International Symposium on Computer Architecture, 2019
-
HOP: Heterogeneity-Aware Decentralized Training
Qinyi Luo, Jinkun Lin, Youwei Zhuo, Xuehai Qian
ASPLOS'19 The 24th International Conference
on Architectural Support for Programming Languages and Operating
Systems, 2019
-
ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction
Methods of Multipliers
Ao Ren, Jiayu Li, Tianyun Zhang, Shaokai Ye, Wenyao Xu, Xuehai Qian, Xue Lin, Yanzhi Wang
ASPLOS'19 The 24th International Conference
on Architectural Support for Programming Languages and Operating
Systems, 2019
-
HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
Linghao Song, Jiachen Mao, Youwei Zhuo, Xuehai Qian, Hai Li, Yiran Chen
HPCA'19 The 25th IEEE International Symposium on High Performance Computer Architecture, 2019
-
E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs
Zhe Li, Caiwen Ding, Siyue Wang, Wujie Wen, Youwei Zhuo, Chang Liu, Qinru Qiu, Wenyao Xu, Xue Lin, Xuehai Qian, Yanzhi Wang
HPCA'19 The 25th IEEE International Symposium on High Performance Computer Architecture, 2019
-
HEIF: Highly Efficient Stochastic Computing based Inference Framework for Deep Neural Networks
Zhe Li, Ji Li, Ao Ren, Ruizhe Cai, Caiwen Ding, Xuehai Qian, Jeffrey Draper, Bo Yuan, Jian Tang, Qinru Qiu, Yanzhi Wang
TCAD 2019 IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
-
PermDNN: Efficient Compressed Deep Neural Network Architecture with Permuted Diagonal Matrices
Chunhua Deng, Siyu Liao, Yi Xie, Keshab K. Parhi, Xuehai Qian, Bo Yuan
MICRO'18 The 51st IEEE/ACM International Symposium on Microarchitectureng Systems
-
VIBNN: Hardware Acceleration of Bayesian Neural Networks
Ruizhe Cai, Ao Ren, Ning Liu, Caiwen Ding, Luhao Wang, Xuehai Qian, Massoud Pedram, Yanzhi Wang
ASPLOS'18 The 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2018
-
Neu-NoC: A high-efficient interconnection network for accelerated neuromorphic systems
Xiaoxiao Liu, Wei Wen, Xuehai Qian, Hai Li, Yiran Chen
ASP-DAC 2018 Asia and South Pacific Design Automation Conference
-
ReRAM-based accelerator for deep learning
Bing Li, Linghao Song, Fan Chen, Xuehai Qian, Yiran Chen, Hai Helen Li
DATE 2018 Design Automation and Test in Europe
-
CIRCNN: Accelerating and Compressing Deep Neural Networks Using Block-Circulant Weight Matrices
Caiwen Ding, Yanzhi Wang, Siyu Liao, Zhe Li, Yu Bai, Youwei Zhuo, Chao Wang, Xuehai Qian, Ning Liu, Geng Yuan, Xiaolong Ma, Yipeng Zhang, Xue Lin, Jian Tang, Qinru Qiu, Bo Yuan
MICRO'17 The 50th Annual IEEE/ACM
International Symposium on Microarchitecture, 2017
-
SC-DCNN: Highly-Scalable Deep Convolutional Neural Network using Stochastic Computing
Ao Ren, Ji Li, Zhe Li, Caiwen Ding, Xuehai Qian, Qinru Qiu, Bo Yuan and Yanzhi Wang
ASPLOS '17 The 22nd International Conference
on Architectural Support for Programming Languages and Operating
Systems, 2017
-
PipeLayer: A Pipelined ReRAM-Based Accelerator for Deep Learning
Linghao Song
Xuehai Qian, Hai Li and Yiran Chen
HPCA'17 The 23rd IEEE International
Symposium on High Performance Computer Architecture, 2017