2020
- [PLDI'20]
SympleGraph: Distributed Graph Processing with Precise Loop-carried Dependency Guarantee
Youwei Zhuo, Jingji Chen, Qinyi Luo, Yanzhi Wang, Hailong Yang, Depei Qian, Xuehai Qian
The 41st ACM SIGPLAN Conference on Programming Language Design and Implementation - [ASPLOS'20]
Prague: High-Performance Heterogeneity-Aware Asynchronouos Decentralized Training
Qinyi Luo, Jiaao He, Youwei Zhuo, Xuehai Qian
The 25th International Conference on Architectural Support for Programming Languages and Operating Systems - [HPCA'20]
AccPar: Tensor Partitioning for Heterogeneous Deep Learning Accelerator Arrays
Linghao Song, Fan Chen, Youwei Zhuo, Xuehai Qian, Hai Li, Yiran Chen
The 26th International Symposium on High Performance Computer Architecture
2019
- [MICRO'19]
GraphQ: Scalable PIM-Based Graph Processing
Youwei Zhuo, Chao Wang, Mingxing Zhang, Rui Wang, Dimin Niu, Yanzhi Wang, Xuehai Qian
The 52nd International Symposium on Microarchitecture - [ASPLOS'19]
HOP: Heterogeneity-Aware Decentralized Training
Qinyi Luo, Jinkun Lin, Youwei Zhuo, Xuehai Qian
The 24th International Conference on Architectural Support for Programming Languages and Operating Systems - [HPCA'19]
E-RNN: Design Optimization for Effient Recurrent Neural Networks in FPGAs
Zhe Li, Caiwen Ding, Siyue Wang, Wujie Wen, Youwei Zhuo, Chang Liu, Qinru Qiu, Wenyao Xu, Xue Lin, Xuehai Qian, Yanzhi Wang
The 25th International Symposium on High Performance Computer Architecture - [HPCA'19]
HyPar: Towards Hybrid Parallelism for Deep Learning Accelerator Array
Linghao Song, Jiachen Mao, Youwei Zhuo, Xuehai Qian, Hai Li, Yiran Chen
The 25th International Symposium on High Performance Computer Architecture
2018
- [MICRO'18]
CSE: Parallel Finite State Machines with Convergence Set Enumeration
Youwei Zhuo, Jinglei Cheng, Qinyi Luo, Jidong Zhai, Yanzhi Wang, Zhongzhi Luan, Xuehai Qian
The 51th International Symposium on Microarchitecture - [HPCA'18]
GraphR: Accelerating Graph Processing Using ReRAM
Linghao Song, Youwei Zhuo, Xuehai Qian, Miao Hu, Hai Li, Yiran Chen
The 24th International Symposium on High Performance Computer Architecture - [HPCA'18]
GraphP: Reducing Communication of PIM-based Graph Processing with Efficient Data Partition
Mingxing Zhang, Youwei Zhuo (equal contribution), Chao Wang, Mingyu Gao, Yongwei Wu, Kang Chen, Christos Kozyrakis, Xuehai Qian
The 24th International Symposium on High Performance Computer Architecture - [ASPLOS'18]
Wonderland: A Novel Abstraction-Based Out-Of-Core Graph Processing System
Mingxing Zhang, Yongwei Wu, Youwei Zhuo, Xuehai Qian, Chenyin Huan, Kang Chen
The 23rd International Conference on Architectural Support for Programming Languages and Operating Systems - [TVLSI'17]
Performance Evaluation and Optimization of HBM-Enabled GPU for Data-intensive Applications
Maohua Zhu, Youwei Zhuo, Chao Wang, Wenguang Chen, Yuan Xie
The IEEE Transactions on VLSI Systems
2017
- [MICRO'17]
CIRCNN: Accelerating and Compressing Deep Neural Networks Using Block-Circulant Weight Matrices
Caiwen Ding, Yanzhi Wang, Siyu Liao, Zhe Li, Yu Bai, Youwei Zhuo, Chao Wang, Xuehai Qian, Ning Liu, Geng Yuan, Xiaolong Ma, Yipeng Zhang, Xue Lin, Jian Tang, Qinru Qiu, Bo Yuan
The 50th International Symposium on Microarchitecture - [DATE'17]
Performance Evaluation and Optimization of HBM-Enabled GPU for Data-intensive Applications
Maohua Zhu, Youwei Zhuo, Chao Wang, Wenguang Chen, Yuan Xie
The 2017 Conference on Design Automation and Test in Europe - [IPDPS'17]
Scalable Graph Traversal on Sunway TaihuLight with Ten Million Cores
Heng Lin, Xiongchao Tang, Bowen Yu, Youwei Zhuo, Wenguang Chen, Jidong Zhai, Wanwang Yin and Weimin Zhen
The 19th IEEE International Parallel and Distributed Processing Symposium