Feng Ren,
Ruoyu Qin,
Teng Ma,
Shangming Cai,
Zheng Liu,
Chao Lei,
Dejiang Zhu,
Ke Yang,
Jinyang Su,
Weixiao Huang,
Yikai Zhao,
Yongwei Wu,
Weimin Zheng,
Mingxing Zhang
(2025).
A Declarative Slice Spraying Engine for Performant and Resilient Data Movement in Disaggregated LLM Serving.
FAISys 2025.
Hongtao Chen,
Weiyu Xie,
Boxin Zhang,
Jingqi Tang,
Jiahao Wang,
Jianwei Dong,
Shaoyuan Chen,
Ziwei Yuan,
Chen Lin,
Chengyu Qiu,
Yuening Zhu,
Qingliang Ou,
Jiaqi Liao,
Xianglin Chen,
Zhiyuan Ai,
Yongwei Wu,
Mingxing Zhang
(2025).
KTransformers: Unleashing the Full Potential of CPU/GPU Hybrid Inference for MoE Models.
SOSP 2025.
Jiahao Li,
Biao Cao,
Jielong Jian,
Cheng Li,
Sen Han,
Yiduo Wang,
Yufei Wu,
Kang Chen,
Liguo Duan,
Jie Zhao,
Zhihui Yin,
Qiushi Chen,
Jiwei Xiong,
Fengyuan Liu,
Yan Xing,
Ran Zheng,
Miao Yu,
Feng Wu,
Xianjun Meng
(2025).
Mantle: Efficient Hierarchical Metadata Management for Cloud Object Storage Services.
SOSP 2025.
Yinchao Zhang,
Su Yao,
Yong Feng,
Kang Chen,
Tong Li,
Zhuotao Liu,
Yi Zhao,
Lexuan Zhang,
Xiangyu Gao,
Feng Xiong,
Qi Li,
Ke Xu
(2025).
Pegasus: A Universal Framework for Scalable Deep Learning Inference on the Dataplane.
SIGCOMM 2025.
Xiaohu Chai,
Tianyu Zhou,
Keyang Hu,
Jianfeng Tan,
Tiwei Bie,
Anqi Shen,
Dawei Shen,
Qi Xing,
Shun Song,
Tongkai Yang,
Le Gao,
Feng Yu,
Zhengyu He,
Dong Du,
Yubin Xia,
Kang Chen,
Yu Chen
(2025).
Fork in the Road: Reflections and Optimizations for Cold Start Latency in Production Serverless Systems.
OSDI 2025.
Ruili Liu,
Teng Ma,
Mingxing Zhang,
Jialiang Huang,
Yingdi Shan,
Zheng Liu,
Lingfeng Xiang,
Zhen Lin,
Hui Lu,
Jia Rao,
Kang Chen,
Yongwei Wu
(2025).
DSA-2LM: A CPU-Free Tiered Memory Architecture with Intel DSA.
ATC 2025.
Shaoyuan Chen,
Hongtao Chen,
Shaonan Ma,
Yajie Qin,
Zheng Wang,
Weiyu Xie,
Mingxing Zhang,
Kang Chen,
Xia Liao,
Yingdi Shan,
Jinlei Jiang,
Yongwei Wu
(2025).
Scaling Asynchronous Graph Query Processing via Partitioned Stateful Traversal Machines.
ICDE 2025.
Enzhe Lu,
Zhejun Jiang,
Jingyuan Liu,
Yulun Du,
Tao Jiang,
Chao Hong,
Shaowei Liu,
Weiran He,
Enming Yuan,
Yuzhi Wang,
Zhiqi Huang,
Huan Yuan,
Suting Xu,
Xinran Xu,
Guokun Lai,
Yanru Chen,
Huabin Zheng,
Junjie Yan,
Jianlin Su,
Yuxin Wu,
Neo Y. Zhang,
Zhilin Yang,
Xinyu Zhou,
Mingxing Zhang,
Jiezhong Qiu
(2025).
MoBA: Mixture of Block Attention for Long-Context LLMs.
NeurIPS 2025.
Jialiang Huang,
Mingxing Zhang,
Teng Ma,
Zheng Liu,
Sixing Lin,
Kang Chen,
Jinlei Jiang,
Xia Liao,
Yingdi Shan,
Ning Zhang,
Mengting Lu,
Tao Ma,
Haifeng Gong,
Yongwei Wu
(2024).
TrEnv: Transparently Share Serverless Execution Environments Across Different Functions and Nodes.
SOSP 2024.
Qian Xu,
Juan Yang,
Feng Zhang,
Zheng Chen,
Jiawei Guan,
Kang Chen,
Ju Fan,
Youren Shen,
Ke Yang,
Yu Zhang,
Xiaoyong Du
(2024).
Improving Graph Compression for Efficient Resource-Constrained Graph Analytics.
VLDB 2024.
Teng Ma,
Zheng Liu,
Chengkun Wei,
Jialiang Huang,
Youwei Zhuo,
Haoyu Li,
Ning Zhang,
Yijin Guan,
Dimin Niu,
Mingxing Zhang,
Tao Ma
(2024).
HydraRPC: RPC in the CXL Era.
ATC 2024.
Zuoning Chen,
Kang Chen,
Jinlei Jiang,
Lufei Zhang,
Song Wu,
Zhengwei Qi,
Chunming Hu,
Yongwei Wu,
Yuzhong Sun,
Hong Tang,
Aobing Sun,
Zilu Kang
(2017).
Evolution of Cloud Operating System: From Technology to Ecosystem.
JCST 2017.