Can RL-based LLM post-training on games generalize to other tasks? (GRL)August 27, 2025Game Arena Team
FastWan: Generating a 5-Second Video in 5 Seconds via Sparse DistillationAugust 4, 2025FastVideo Team
ReFoRCE: A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column ExplorationApril 10, 2025Minghang Deng, Ashwin Ramachandran, Canwen Xu, Lanxiang Hu, Zhewei Yao, Anupam Datta, Hao Zhang
Fast Video Generation with Sliding Tile AttentionFebruary 18, 2025Peiyuan Zhang, Yongqi Chen*, Runlong Su*, Hangliang Ding, Ion Stoica, Zhengzhong Liu, Hao Zhang
Dynasor: More Efficient Chain-of-Thought Through Certainty ProbingFebruary 16, 2025Yichao Fu*, Junda Chen*, Yonghao Zhuang, Zheyu Fu, Ion Stoica, Hao Zhang
Efficient LLM Scheduling by Learning to RankJanuary 13, 2025Yichao Fu, Siqi Zhu, Runlong Su, Aurick Qiao, Ion Stoica, Hao Zhang