Publications

2026

  1. ASPLOS ’26
    Towards High-Goodput LLM Serving with Prefill-decode Multiplexing
    Yukang Chen*, Weihao Cui*, Han Zhao*, Ziyi Xu, Xiaoze Fan, Xusheng Chen, Yangjie Zhou, Shixuan Sun, Bingsheng He, and Quan Chen
    In Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026