Hao AI Lab @ UCSD

    Blogs

    MuxServe: Flexible Spatial-Temporal Multiplexing for Multiple LLM Serving

    May 20, 2024

    Jiangfei Duan, Runyu Lu, Haojie Duanmu, Xiuhong Li, Xingcheng Zhang, Dahua Lin, Ion Stoica, Hao Zhang

    Consistency Large Language Models: A Family of Efficient Parallel Decoders

    May 6, 2024

    Siqi Kou*, Lanxiang Hu*, Zhezhi He, Zhijie Deng, Hao Zhang

    Throughput is Not All You Need: Maximizing Goodput in LLM Serving using Prefill-Decode Disaggregation

    March 17, 2024

    Junda Chen, Yinmin Zhong, Shengyu Liu, Yibo Zhu, Xin Jin, Hao Zhang

    © 2026 Hao AI Lab @ UCSD Powered by Hugo & PaperMod, Adapted by Lanxiang Hu, Junda Chen & Hao Zhang