I am a fourth-year Ph.D. student in the Department of Computer Science and Technology, Tsinghua University. I am advised by Jidong Zhai in the PACMAN group. Currently, I am visiting the Catalyst group of CMU and also advised by Zhihao Jia.

Research

My passion lies in efficient machine learning systems, covering compilation optimization, efficient service, and long-term operation. I am always eager to engage in conversations and collaborate with others who share my passion for this field.

Projects

  • InfiniTensor is a high-performance inference engine tailored for GPUs and AI accelerators. Its design focuses on effective deployment and swift academic validation.
  • Vapro is a performance profiler to detect and diagnose performance variance (i.e., performance degradation and jitters) for parallel applications.

Publications

  • [OSDI’23] EinNet: Optimizing Tensor Programs with Derivation-Based Transformations
    Liyan Zheng, Haojie Wang, Jidong Zhai, Muyan Hu, Zixuan Ma, Tuowei Wang, Shuhong Huang, Xupeng Miao, Shizhi Tang, Kezhao Huang, Zhihao Jia
    [Paper] [Slides] [Poster] [Code]

  • [PPoPP’22] Vapro: performance variance detection and diagnosis for production-run parallel applications
    Liyan Zheng, Jidong Zhai, Xiongchao Tang, Haojie Wang, Teng Yu, Yuyang Jin, Shuaiwen Leon Song, Wenguang Chen
    [Paper] [Slides] [Code]

  • [TPDS’22] Detecting Performance Variance for Parallel Applications Without Source Code (Best Paper Award Runner-Up)
    Jidong Zhai, Liyan Zheng, Jinghan Sun, Feng Zhang, Xiongchao Tang, Xuehai Qian, Bingsheng He, Wei Xue, Wenguang Chen, Weimin Zheng
    [Paper]

  • [TPDS’22] Leveraging Code Snippets to Detect Variations in the Performance of HPC Systems
    Jidong Zhai, Liyan Zheng, Jinghan Sun, Feng Zhang, Xiongchao Tang, Xuehai Qian, Bingsheng He, Wei Xue, Wenguang Chen, Weimin Zheng
    [Paper]

  • [PPoPP’22] BaGuaLu: targeting brain scale pretrained models with over 37 million cores
    Zixuan Ma, Jiaao He, Jiezhong Qiu, Huanqi Cao, Yuanwei Wang, Zhenbo Sun, Liyan Zheng, Haojie Wang, Shizhi Tang, Tianyu Zheng, Junyang Lin, Guanyu Feng, Zeqiang Huang, Jie Gao, Aohan Zeng, Jianwei Zhang, Runxin Zhong, Tianhui Shi, Sha Liu, Weimin Zheng, Jie Tang, Hongxia Yang, Xin Liu, Jidong Zhai, Wenguang Chen:

  • [PLDI’22] FreeTensor: a free-form DSL with holistic optimizations for irregular tensor programs
    Shizhi Tang, Jidong Zhai, Haojie Wang, Lin Jiang, Liyan Zheng, Zhenhao Yuan, Chen Zhang

  • [OSDI’21] PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections
    Haojie Wang, Jidong Zhai, Mingyu Gao, Zixuan Ma, Shizhi Tang, Liyan Zheng, Yuanzhi Li, Kaiyuan Rong, Yuanyong Chen, Zhihao Jia

Talks

EinNet: Optimizing Tensor Programs with Derivation-Based Transformations

  • ChinaSys, TURC, Wuhan, July 2023
  • OSDI, Boston, July 2023

Vapro: performance variance detection and diagnosis for production-run parallel applications

  • Meta, Online, August 2023
  • PPoPP, Online, April 2022