I am Minhui Xie, currently a Lecturer (equivalent to Assistant Professor in the US academic system) in the Department of Computer Science and Technology at Renmin University of China. Before that, I obtained my Ph.D. from the Department of Computer Science and Technology, Tsinghua University, supervised by Prof. Youyou Lu and Prof. Jiwu Shu in the Storage Group.

My research interests include:

  • Systems Designed for At-Scale Machine Learning (System4AI)
  • Machine Learning Aided Storage System Design (AI4Storage)

I am excited about the intersection of machine learning and systems.

🎓 Work & Education Experience

  • [2024.09 - now] Lecturer, Department of Computer Science, Renmin University of China, Beijing, China
  • [2019.09 - 2024.06] Ph.D., Department of Computer Science, Tsinghua University, Beijing, China
  • [2015.09 - 2019.06] B.S., Department of Computer Science, Nanjing University, Jiangsu, China

📝 Publications

  • Frugal: Efficient and Economic Embedding Model Training with Commodity GPUs.
    Minhui Xie, Shaoxun Zeng, Hao Guo, Shiwei Gao, Youyou Lu
    The 30th Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'25) (CCF-A), 2025
    Paper
  • Medusa: Accelerating Serverless LLM Inference with Materialization.
    Shaoxun Zeng, Minhui Xie, Shiwei Gao, Youmin Chen, Youyou Lu
    The 30th Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'25) (CCF-A), 2025
    Paper
  • MaxEmbed: Maximizing SSD Bandwidth Utilization for Huge Embedding Models Serving.
    Ruwen Fan, Minhui Xie, Haodi Jiang, Youyou Lu
    The 29th Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'24) (CCF-A), 2024
    Paper
  • Challenges and Technical Development of Large Model Training Storage Systems.
    Yangyang Feng, Qing Wang, Minhui Xie, Jiwu Shu
    Journal of Computer Research and Development (计算机研究与发展), 2024
    Paper
  • PetPS: Supporting Huge Embedding Models with Persistent Memory.
    Minhui Xie, Youyou Lu, Qing Wang, Yangyang Feng, Jiaqiang Liu, Kai Ren, Jiwu Shu
    The 49th International Conference on Very Large Data Bases (VLDB'23) (CCF-A), 2023
    Paper Slides Star
  • Citron: Distributed Range Lock Management with One-sided RDMA.
    Jian Gao, Youyou Lu, Minhui Xie, Qing Wang, Jiwu Shu
    The 21st USENIX Conference on File and Storage Technologies (FAST'23) (CCF-A), 2023
    Paper
  • Patronus: High-Performance and Protective Remote Memory.
    Bin Yan, Youyou Lu, Qing Wang, Minhui Xie, Jiwu Shu
    The 21st USENIX Conference on File and Storage Technologies (FAST'23) (CCF-A), 2023
    Paper Slides Star
  • Mobius: Fine Tuning Large-scale Models on Commodity GPU Servers.
    Yangyang Feng, Minhui Xie, Zijie Tian, Shuo Wang, Youyou Lu, Jiwu Shu
    The 28th Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'23) (CCF-A), 2023
    Paper Slides
  • A Recommendation Model Inference System with GPU Direct Storage Access.
    Minhui Xie, Youyou Lu, Yangyang Feng, Jiwu Shu
    Journal of Computer Research and Development (计算机研究与发展), 2024
    Paper
  • Pacman: An Efficient Compaction Approach for Log-Structured Key-Value Store on Persistent Memory.
    Jing Wang, Youyou Lu, Qing Wang, Minhui Xie, Keji Huang, Jiwu Shu
    USENIX Annual Technical Conference (USENIX ATC'22) (CCF-A), 2022
    Paper Slides Star
  • Fleche: An Efficient GPU Embedding Cache for Personalized Recommendations.
    Minhui Xie, Youyou Lu, Jiazhen Lin, Qing Wang, Jian Gao, Kai Ren, Jiwu Shu
    The 17th European Conference on Computer Systems (EuroSys'22) (CCF-A), 2022
    Paper Slides
  • Nap: Persistent Memory Indexes for NUMA Architectures.
    Qing Wang, Youyou Lu, Junru Li, Minhui Xie, Jiwu Shu
    ACM Transactions on Storage (TOS) (CCF-A), 2022
    Paper
  • Kraken: Memory Efficient Continual Learning for Large-Scale Real-Time Recommendations.
    Minhui Xie, Kai Ren, Youyou Lu, Guangxu Yang, Qingxing Xu, Bihai Wu, Jiazhen Lin, Hongbo Ao, Wanhong Xu, Jiwu Shu
    Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC'20) (CCF-A), 2020
    Paper Slides Star


🏛️ Invited Talks

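  • [2024.12] CCF Storage Conference (CCF 存储大会), Research on storage systems for large recommendation models
  • [2024.11] Huawei ICT Storage Product Line, Research on storage systems for large recommendation models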
  • [2023.09] VLDB’23, Persistent-memory-supported parameter server
  • [2023.03] ByteDance, LLM training
  • [2022.05] NVIDIA, GPU resident cache
  • [2022.04] EuroSys’22, Fleche - efficient GPU resident embedding cache
  • [2021.12] Huawei Singapore, memory efficient learning system
  • [2020.11] SC’20, Kraken - memory efficient continual learning

👨🏾‍💻 Services

  • IEEE Transactions on Computers (TC), 2024, Reviewer
  • FAST, 2024, AEC, Reviewer
  • EuroSys, 2024, 2023, 2022, AEC, Reviewer
  • SOSP, 2023, AEC, Reviewer
  • OSDI, 2023, 2022, AEC, Reviewer
  • USENIX ATC, 2023, 2022, AEC, Reviewer
  • SIGCOMM, 2022, AEC, Reviewer
  • IEEE Transactions on Parallel and Distributed Systems (TPDS), 2022, Reviewer
  • ChinaSys (中国系统学术协会), long-term volunteer

👨🏼‍🏫 Teaching

  • TA, Computer Organization and Architecture, Tsinghua University, Spring 2023
  • TA, Computer Organization and Architecture, Tsinghua University, Spring 2022
  • TA, Computer Organization and Architecture, Tsinghua University, Spring 2021
  • TA, Computer Organization and Architecture, Tsinghua University, Spring 2020
  • TA, Introduction to Computer Systems, Nanjing University, Fall 2017