I am a Ph.D. student at Princeton University, where my research focuses on building scalable and capable large language models (LLMs). My work explores how to improve LLM reasoning, align model behavior with human preferences through general preference models, and develop new attention mechanisms and model architectures.
Previously, I was fortunate to study and conduct research at the Institute for Interdisciplinary Information Sciences (IIIS) at Tsinghua University and at the UCLA AGI Lab.