Gaokai Zhang
An LLM researcher who gazes at the starlit skies of Artificial General Intelligence
M.S. in Intelligent Information Systems
Carnegie Mellon University
Language Technologies Institute
Hi there! I’m Gaokai Zhang, an M.S. student in Intelligent Information Systems at CMU LTI since Fall 2025. I hold dual B.S. degrees from ZJUI (CompE @ UIUC, ECE @ ZJU).
Currently, I’m training code-generation agents for SWE-Bench with Kexun Zhang in Prof. Lei Li’s lab, focusing on supervised fine-tuning and reinforcement learning. I am also working with Yiqing Xie and Prof. Daniel Fried on code-generation data synthesis and agent training.
Open to LLM-related MLE/RS opportunities as I’m graduating in December 2026!
Experience
Microsoft Research Asia (Jul 2024 - Jul 2025)
Research Intern, Systems & Networking Group
Mentored by Dr. Li Lyna Zhang
- Led LoongRL: Novel data synthesis + reinforcement learning enabling 7B models to surpass 32B LRMs in long-context reasoning (100k-200k tokens) (ICLR 2026 Oral)
- Contributed to LongRoPE2: Extended LLM context windows to 128K tokens while retaining 98.5% short-context accuracy (ICML 2025 poster)
- Built parallel pipeline for large-scale user-query processing; delivered production-ready long-context recommendation models to Microsoft Asia-Pacific R&D
Carnegie Mellon University (Oct 2025 – Present)
Research Assistant, Language Technologies Institute
- Synthetic task generation for training coding agents to generalize across repository-level environments (Hybrid-Gym)
- Synthetic issue generation and test-time training for improving LLM-based coding agents under sparse rewards
University of Illinois Urbana-Champaign
Research Assistant with Prof. Fan Lai and Prof. Minjia Zhang
- Monte-Carlo-Tree-Search planning for cost-efficient LLM training on heterogeneous GPUs/TPUs
- Robustness benchmarking of LLMs (Stochastic Monkeys)
Research Interests
- Long-context reasoning & scaling
- Reinforcement learning for LLMs
- Efficient training architectures
- Code generation agents
Beyond Research
Outside work, I enjoy gaming (lifetime Faker fan), vibe to rap, and hunt for the perfect omakase bite.
Feel free to reach out - always happy to connect with like-minded friends and collaborators!
news
| Date | News |
|---|---|
| Jan 26, 2026 | LoongRL accepted as Oral at ICLR 2026 - my first co-first-authored paper! |
| Aug 01, 2025 | Started M.S. in Intelligent Information Systems at CMU LTI! |
| May 01, 2025 | LongRoPE2 accepted at ICML 2025 as a poster presentation! |
| Jul 01, 2024 | Joined Microsoft Research Asia as a research intern in the Systems & Networking Group. |