-
PD-Multiplexing: Unlocking High-Goodput LLM Serving with GreenContext | LMSYS Org
This post highlights our initial efforts to support a new serving paradigm, PD-Multiplexing, in SGLang. It is designed t...
This post highlights our initial efforts to support a new serving paradigm, PD-Multiplexing, in SGLang. It is designed t...