Publications

2025

  1. ISCA
    Magellan: A High-Performance Loop-Guided Prefetcher for Indirect Memory Access
    Gelin Fu, Tian Xia, Mingzhuo Yin, Prashant J. Nair, Mieszko Lis, and Pengju Ren
    In Proceedings of the 52nd Annual International Symposium on Computer Architecture (ISCA), 2025
  2. TCAS-I
    FP2: A 2-bit Floating-Point Format for Edge-AI Inference and Fine-Tuning
    Qiwei Dang, Chengyu Ma, Haiduo Huang, Gelin Fu, Zhiwang Huo, Guoming Yang, Pengchen Zong, Tian Xia, Wenzhe Zhao, and Pengju Ren
    IEEE Transactions on Circuits and Systems I: Regular Papers, 2025
  3. TCAD
    Hierarchical-ISA Supporting Row-wise Operands for Efficient DNN Computation
    Zhiwang Huo, Wenzhe Zhao, Qiwei Dang, Chengyu Ma, Guoming Yang, Gelin Fu, Tian Xia, and Pengju Ren
    IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2025

2024

  1. HPCA
    Differential-Matching Prefetcher for Indirect Memory Access
    Gelin Fu, Tian Xia, Zhongpei Luo, Ruiyang Chen, Wenzhe Zhao, and Pengju Ren
    In 2024 IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2024

2023

  1. ICCD
    PrSpMV: An Efficient Predictable Kernel for SpMV
    Gelin Fu, Tian Xia, Shaoru Qu, Zhongpei Luo, Shuyu Li, Pengyu Cheng, Runfan Guo, Yitong Ding, and Pengju Ren
    In 2023 IEEE 41st International Conference on Computer Design (ICCD), 2023

2022

  1. TCAS-I
    An Energy-and-Area-Efficient CNN Accelerator for Universal Powers-of-Two Quantization
    Tian Xia, Boran Zhao, Jian Ma, Gelin Fu, Wenzhe Zhao, Nanning Zheng, and Pengju Ren
    IEEE Transactions on Circuits and Systems I: Regular Papers, 2022
  2. TPDS
    A Comprehensive Performance Model of Sparse Matrix-Vector Multiplication to Guide Kernel Optimization
    Tian Xia, Gelin Fu, Chenyang Li, Zhongpei Luo, Lucheng Zhang, Ruiyang Chen, Wenzhe Zhao, Nanning Zheng, and Pengju Ren
    IEEE Transactions on Parallel and Distributed Systems, 2022