Yufeng Gu

I am a final-year Ph.D. candidate at University of Michigan, advised by Prof. Reetuparna Das. I am working closely with Prof. David Blaauw and Prof. Satish Narayanasamy. My research focuses on computer architecture and system. I am developing novel hardware and system on accelerating large scale AI workloads. Before UMich, I obtained a bachelor's degree from Zhejiang University in 2020.

Email  /  CV  /  Linkedin  /  Google Scholar  /  Blogs

profile photo

Research

My research focuses on computer architecture, hardware/software co-design, near-memory processing and quality of service optimization. I am developing novel hardware and system solutions on accelerating large scale emerging applications, such as generative artificial intelligence (GenAI) and precision health.

News

Selected Publications

[Full list]

GenDP: A Framework of Dynamic Programming Acceleration for Genome Sequencing Analysis (Invited Paper)
Yufeng Gu, Arun Subramaniyan, Tim Dunn, Alireza Khadem, Kuan-yu Chen, Somnath Paul, Md Vasimuddin, Sanchit Misra, David Blaauw, Satish Narayanasamy, Reetuparna Das
CACM 2025  /  Paper  /  Technical Perspective  /  University News

DX100: A Programmable Data Access Accelerator for Indirection
Alireza Khadem, Kamalavasan Kamalakkannan, Zhenyan Zhu, Akash Poptani, Yufeng Gu, Jered Benjamin Dominguez-Trujillo, Nishil Talati, Daichi Fujiki, Scott Mahlke, Galen Shipman, Reetuparna Das
ISCA 2025  /  Paper  /  Code    Artifact Available Artifact Functional Artifact Reproduced   GitHub stars

PIM Is All You Need: A CXL-Enabled GPU-Free System for Large Language Model Inference
Yufeng Gu*, Alireza Khadem*, Sumanth Umesh, Ning Liang, Xavier Servot, Onur Mutlu, Ravi Iyer, and Reetuparna Das
ASPLOS 2025  /  Paper  /  Slides  /  Code    Artifact Available Artifact Functional Artifact Reproduced   GitHub stars

Multi-Dimensional Vector ISA Extension for Mobile In-Cache Computing
Alireza Khadem, Daichi Fujiki, Hilbert Chen, Yufeng Gu, Nishil Talati, Scott Mahlke, Reetuparna Das
HPCA 2025  /  Paper  /  Code    Artifact Available Artifact Functional Artifact Reproduced   GitHub stars
Distinguished Artifact Honorable Mention

GenDP: A Framework of Dynamic Programming Acceleration for Genome Sequencing Analysis
Yufeng Gu, Arun Subramaniyan, Tim Dunn, Alireza Khadem, Kuan-yu Chen, Somnath Paul, Md Vasimuddin, Sanchit Misra, David Blaauw, Satish Narayanasamy, Reetuparna Das
ISCA 2023  /  Paper  /  Slides  /  Code  /  Lightning Talk  /  Artifact Available Artifact Functional Artifact Reproduced   GitHub stars
Communications of ACM Research Highlights  /  Technical Perspective

GenomicsBench: A Benchmark Suite for Genomics
Arun Subramaniyan, Yufeng Gu, Tim Dunn, Somnath Paul, Md Vasimuddin, Sanchit Misra, Satish Narayanasamy, David Blaauw, Reetuparna Das
ISPASS 2021  /  Paper  /  Slides  /  Code  /  Lightning Talk   GitHub stars

Multi-site fMRI Analysis Using Privacy-preserving Federated Learning and Domain Adaptation: ABIDE Results
Xiaoxiao Li, Yufeng Gu, Nicha Dvornek, Lawrence Staib, Pamela Ventola, James S. Duncan
MedIA 2020  /  Paper  /  Code /  GitHub stars

Awards and Honors

  • Distinguished Artifact Honorable Mention in HPCA 2025, selected from 3/29 artifacts.
    (Multi-Dimensional Vector ISA Extension for Mobile In-Cache Computing)
  • Communications of ACM Research Highlights, selected among 24 papers from all ACM conferences in 2023.
    (GenDP: A Framework of Dynamic Programming Acceleration for Genome Sequencing Analysis)
  • Rackham Graduate Student Research Grant at University of Michigan.
    (Pangenome Sequence Alignment Benchmark Suite, Role: PI, $3,000)
  • Rackham Conference Travel Grant at University of Michigan, 2023, 2025.
  • Student Travel Grant for ISCA 2023, HPCA 2025.
  • Summer@EPFL Fellowship (2% applicants awarded), 2019.
  • Tang Lixin Fellowship (60/60,000+ students awarded), 2017, 2018, 2019.

Talks

Services

Industry Experiences

  • Intel Labs, Graduate Technical Intern, June 2022 - Aug. 2022.
  • Tenstorrent Inc., Performance Architect Intern, May 2023 - Aug. 2023.

Design and source code from Jon Barron and Jiacheng Ma