Hao AI Lab @ UCSD

    Blogs

    Attn-QAT: Making 4-Bit Attention Actually Work

    April 8, 2026

    Peiyuan Zhang*, Matthew Noto*, Wenxuan Tan*, Chengquan Jiang, Will Lin, Wei Zhou, Hao Zhang

    Into the Dreamverse: Vibe Directing in FastVideo

    March 15, 2026

    FastVideo Team

    Create a 5s 1080p Video in 4.5s with FastVideo on a Single GPU

    March 11, 2026

    FastVideo Team

    From Physical Commonsense to Scientific Reasoning: Why World Modeling in Video Matters

    February 12, 2026

    Lanxiang Hu, Abhilash Shankarampeta, Yixin Huang, Zilin Dai, Haoyang Yu, Yujie Zhao, Haoqiang Kang, Daniel Zhao, Tajana Rosing, Hao Zhang

    CAD: Disaggregating Core Attention for Efficient Long-context Language Model Training

    December 17, 2025

    Yonghao Zhuang*, Junda Chen*, Bo Pang, Yi Gu, Yibo Zhu, Yimin Jiang, Ion Stoica, Eric Xing, Hao Zhang

    Fast and Accurate Causal Parallel Decoding using Jacobi Forcing

    December 16, 2025

    Lanxiang Hu*, Siqi Kou*, Yichao Fu, Samyam Rajbhandari, Tajana Rosing, Yuxiong He, Zhijie Deng, Hao Zhang

    AUP: when Accuracy Meets Parallelism in Diffusion Language Models

    December 10, 2025

    Yu-Yang Qian, Junda Su, Lanxiang Hu, Peiyuan Zhang, Zhijie Deng, Peng Zhao, Hao Zhang

    CausalWan-MoE Preview: Applying Self-Forcing Distillation To Wan2.2

    November 18, 2025

    FastVideo Team

    Disaggregated Inference: 18 Months Later

    November 3, 2025

    Junda Chen, Yonghao Zhuang, Hao Zhang

    Scaling Speculative Decoding with Lookahead Reasoning

    September 22, 2025

    Yichao Fu, Yiming Zhao, Rui Ge, Hao Zhang

    © 2026 Hao AI Lab @ UCSD. Powered by Hugo & PaperMod, adapted by Lanxiang Hu, Junda Chen & Hao Zhang