MADSys (original) (raw)
The MADSys (Machine Learning, AI, Big Data Systems) group is dedicated to the design, implementation, evaluation, and application of parallel and distributed systems. Our research spans various methods aimed at accelerating data processing. Although the foundational principles like caching, batching, and overlapping are consistent, the strategic and innovative application of these techniques allows system researchers to optimally utilize diverse hardware resources across different scenarios.
Recent News

Jan 12th, 2026 - Our project "Trading storage for computation: High-Performance Large Model Inference System" has been selected as one of the "2025 Top 10 Most Noteworthy Achievements" at Tsinghua University.

Oct 13th, 2025 - Mingxing Zhang participated in the workshop discussion on "The Future is Now: System Research in the Era of AI" at SOSP25.

Oct 23rd, 2025 - MADSys attended the 2025 CNCC held in Harbin.

Sep 12th, 2025 - Our "Trading storage for computation" technology solution received Huawei's 2024 Olympus Award! Congratulations!

Jul 9th, 2025 - Xun Sun presented the paper titled "Scalio: Scaling up DPU-based JBOF Key-value Store with NVMe-oF Target Offload" at the OSDI 2025 conference!

Jul 9th, 2025 - Ruili Liu presented the paper titled "DSA-2LM: A CPU-Free Tiered Memory Architecture with Intel DSA" at the ATC 2025 conference!

May 25th, 2025 - Ruoyu Qin presented the paper titled "MOONCAKE: Trading More Storage for Less Computation — A KVCache-centric Architecture for Serving LLM Chatbot" at the 28th Chinasys Workshop!

May 24th, 2025 - Boxin Zhang delivered a presentation on KTransformers titled "A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations" at the 28th Chinasys Workshop!

May 16th, 2025 - Hao Wu attended the doctoral dissertation defense in the Department of Computer Science at Tsinghua University.

May 16th, 2025 - Zhicheng Ji attended the doctoral dissertation defense in the Department of Computer Science at Tsinghua University.

May 16th, 2025 - Yan Chen attended the doctoral dissertation defense in the Department of Computer Science at Tsinghua University.

Apr 2nd, 2025 - Weiyu Xie presented the paper titled "VertexSurge: Variable Length Graph Pattern Match on Billion-edge Graphs" at the ASPLOS 2025 conference!
Award
Sep 12, 2025 - Our “Trading storage for computation” technology solution received Huawei’s 2024 Olympus Award! Congratulations!
Jul 28, 2025 - Our joint project with Haizhi, “Joint Application Platform for Graph Models Based on Large-scale Graph Data Analysis Technology” has won the 2nd “Zu Chongzhi Award” - Frontiers of Artificial Intelligence Innovation Award. Congratulations!
Apr 22, 2025 - Congratulations to Xun Sun and Yutong Lin for winning the second prize in the 35th Student Laboratory Construction Contribution Award at Tsinghua University for their cloud platform.
Feb 25, 2025 - Our paper "MOONCAKE: Trading More Storage for Less Computation —— A KVCache-centric Architecture for Serving LLM Chatbot" gets 🏆FAST 2025 Best Paper!
Feb 19, 2025 - Our opensource project “KTransformers” has earned 10k Stars⭐ at Github! Congratulations!
Sep 15, 2023 - Wang Leping and his team participated in the “Changchengbei” cyber security competition and won the first place. Congratulations!
Publications
Nov 25, 2025 - Our paper “From Prefix Cache to Fusion RAG Cache: Accelerating LLM Inference in Retrieval-Augmented” is accepted by SIGMOD 26 (The 2026 ACM Special Interest Group on Management of Data)! Congratulations!
Oct 9, 2025 - Our paper “A Declarative Slice Spraying Engine for Performant and Resilient Data Movement in Disaggregated LLM Serving” is accepted by FAISys 2025 (The 1st Frontier AI Systems Workshop)! Congratulations!
Aug 24, 2025 - Our paper “Accelerating Stream Processing Engines via Hardware Offloading” is accepted by SIGMOD 2026!
Jul 15, 2025 - Our paper “KTransformers: Unleashing the Full Potential of CPU/GPU Hybrid Inference for MoE Models” is accepted by SOSP 2025. Congratulations!
Mar 26, 2025 - Our paper “Scalio: Scaling up DPU-based JBOF Key-value Store with NVMe-oF Target Offload” is accepted by OSDI 2025!
Feb 25, 2025 - Our paper “Scaling Asynchronous Graph Query Processing via Partitioned Stateful Traversal Machines” is accepted by ICDE 2025!
Feb 25, 2025 - Our paper “OOCC: One-round Optimistic Concurrency Control for Read-Only Disaggregated Transactions” is accepted by ICDE 2025!
People
Jan 11, 2026 - The MADSys Laboratory hosted a New Year celebration dinner.
Sep 1, 2025 - Jingqi Tang, Chengyu Qiu, Yuening Zhu, Ruili Liu and Qingliang Ou are welcome to join the MadSys Group.
Jun 15, 2025 - The MADSys Laboratory held the 2024-2025 annual meeting, inviting alumni, graduates and all the faculty and students of the lab!
Sep 1, 2024 - Research Assistant Professor Shan Yingdi joins MADSys Group, Welcome!
Sep 1, 2024 - Sixing Lin, Ruoyu Qin, Boxin Zhang, Jianfeng Li, Jingbo Shan, Jianwei Dong, Chen Lin and Yuanyong Chen are welcome to join the MadSys Group.
Dec 20, 2023 - Professor Wu Yongwei has been elevated to CCF Fellow. Congratulations!
Sep 1, 2023 - Jinqi Hua, Xun Sun, Shaofeng Ding and Ziyu Zeng are welcome to join the MadSys Group.
Faculty
Jing Zheng
Lizhi Deng
Projects
Essentially, a decoder-only Transformer model transforms data from any modality into KVCache, positioning it as a central element in LLM serving optimizations. These optimizations include, but are not limited to, caching, scheduling, compression, and offloading. KVCache.AI is a collaborative endeavor with leading industry partners such as Approaching.AI and Moonshot AI. The project focuses on developing effective and practical techniques that enrich both academic research and open-source development.
- Mooncake is the serving platform for Kimi, a leading LLM service provided by MadSys, Moonshot AI et.al.
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations.
Prompted by advancements in modern interconnect technologies like RDMA and CXL, this project aims to revisit the implementation and application of distributed shared memory (DSM). The objective is to facilitate the development of resilient distributed applications that can tolerate partial failures, making this process as straightforward and efficient as programming concurrent applications on a single machine. RDSM is a collaborative endeavor with leading industry partners such as Alibaba and Intel, dedicated to establishing fundamental frameworks that enhance both academic research and open-source development.
Students
Ph.D. Students
Zeqing Li
Jianglang Zhu
Xin Zhou
Yilei Lu
Shaoyuan Chen
Ao Zhang
Yutong Lin
Jialiang Huang
Jingjia Luo
Zhengyan Guo
Hongtao Chen
Xun Sun
Jinqi Hua
Sixing Lin
Ruoyu Qin
Boxin Zhang
Jianfeng Li
Jingbo Shan
Jingqi Tang
Chengyu Qiu
Yuening Zhu
Ruili Liu
Master Students
Ziyu Zeng
Jianwei Dong
Chen Lin
Yuanyong Chen
Qingliang Ou
Alumni
| Year | Name | Degree | First Job | Pos |
|---|---|---|---|---|
| 2025 | Hao Wu | Ph.D. | Qiyuan Laboratory | Beijing, China |
| 2025 | Yan Chen | Ph.D. | Sangfor Technologies | Shenzhen, China |
| 2025 | Zhicheng Ji | Ph.D. | China Mobile Communications Corporation | Beijing, China |
| 2024 | Leping Wang | Master | Huawei Inc. | Beijing, China |
| 2024 | Juguang Yan | Master | Tencent Inc. | Chengdu, China |
| 2023 | Shuke Wang | Ph.D. | Ministry of Finance of the People's Republic of China | Beijing, China |
| 2023 | Hanyang Mao | Master | Entrepreneurship, Beijing Sigaoruichi Technology Co., Ltd. | Beijing, China |
| 2023 | Yifeng Ding | Ph.D. | Entrepreneurship | - |
| 2023 | Feng Ren | Ph.D. | Qiyuan Laboratory | Beijing, China |
| 2023 | Qi Chen | Ph.D. | Wuxi Jiangnan Institute of Computing Technology | Wuxi, China |
| 2023 | Shaonan Ma | Ph.D. | Qiyuan Laboratory | Beijing, China |
| 2022 | Yingdi Shan | Ph.D. | Zhongguancun Laboratory | Beijing, China |
| 2022 | Chengying Huan | Ph.D. | Institute of Software Chinese Academy of Sciences | Beijing, China |
| 2022 | Ke Yang | Ph.D. | Beijing HaiZhi XingTu Technology Co., Ltd. | Beijing, China |
| 2022 | Jinfan Zheng | Master | Beijing HaiZhi XingTu Technology Co., Ltd. | Beijing, China |
| 2022 | Runji Wang | Master | Singularity | Beijing, China |
| 2021 | Teng Ma | Ph.D. | Alibaba Inc. | Beijing, China |
| 2021 | Xuyang Liu | Master | Beijing HaiZhi XingTu Technology Co., Ltd. | Beijing, China |
| 2021 | Shiyu Cai | Master | Beijing HaiZhi XingTu Technology Co., Ltd. | Beijing, China |
| 2021 | Junqi Huang | Master | Microsoft Inc. | Suzhou, Jiangsu |
| 2021 | Yiyuan Liu | Master | Qiyuan Laboratory | Beijing, China |
| 2020 | Xue Li | Ph.D. | Alibaba Inc. | Hangzhou, China |
| 2020 | Mengxing Liu | Ph.D. | Huawei Inc. | Beijing, China |
| 2020 | Yihua Liu | Master | Arxan FinTech | Hangzhou, China |
| 2020 | Tuoyu Gong | Master | Tencent Inc. | Shenzhen, China |
| 2020 | Haoyu Dong | Master | - | - |
| 2020 | Xianglin Chen | Master | - | Shenzhen, China |
| 2019 | Xin Xia | Master | - | - |
| 2019 | Kun Gong | Master | - | - |
| 2019 | Zhitao Li | Ph.D. | Tencent Inc. | Shenzhen, China |
| 2018 | Pin Gao | Ph.D. | Tencent Inc. | Shenzhen, China |
| 2018 | Jiang Wang | Master | Huawei Inc. | Hangzhou, China |
| 2018 | Huan Feng | Ph.D. | Huawei Inc. | Beijing, China |
| 2017 | Jian Huang | Master | the Organization Department of Jiangxi Province Party Committee | Nanchang, China |
| 2017 | Zheyi Chen | Master | University Of Exeter | Exeter, United Kingdom |
| 2016 | Maomeng Su | Ph.D. | Huawei Inc. | Beijing, China |
| 2016 | Weichao Guo | Ph.D. | Huawei Inc. | Beijing, China |
| 2016 | Jinglei Ren | Ph.D. | MicroSoft Research Asia | Beijing, China |
| 2016 | Bin Wang | Master | Netease Inc. | Guangzhou, China |
| 2015 | Xun Zhao | Ph.D. | China Merchants Bank | Shenzhen, China |
| 2015 | Shuai Mu | Ph.D. | New York University | New York, United States |
| 2015 | Daokuan Ma | Master | Netease Inc. | Hangzhou, China |
| 2015 | Wenlei Zhu | Master | Startups | Beijing, China |
| 2015 | Lei Zhang | Bachelor | Georgia Institute of Technology | Atlanta, United States |
| 2010/2014 | Pengzhi Xu | Ph.D./Postdocs | Startups | Beijing, China |
| 2014 | Jiazao Lin | Postdoc | Startups | - |
| 2014 | Waseem Ahmed | Ph.D. | back to Pakistan | - |
| 2014 | Zhenqiang Li | Master | Huawei Inc. | Beijing, China |
| 2014 | Zhenzhao Wang | Master | Alibaba Inc. | Beijing, China |
| 2014 | Zhihao Wu | Master | Netease Inc. | Guangzhou, China |
| 2013 | Feng Ye | Master | Startups | Beijing, China |
| 2012 | Gang Huang | Master | Google Inc. | California, United States |
| 2012 | Qiuping Wang | Master | Hulu Inc. | Beijing, China |