Publications

(2025). A Careful Examination of Large Behavior Models for Multitask Dexterous Manipulation. arXiv preprint.
(2025). OpenThoughts: Data Recipes for Reasoning Models. arXiv preprint.
(2025). Should VLMs be Pre-trained with Image Data?. In ICLR 2025.
(2024). DataComp-LM: In Search of the Next Generation of Training Sets for Language Models.
(2024). Linearizing Large Language Models. In COLM.
(2024). DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset. In RSS 2024.
(2024). Language Models Scale Reliably with Over-Training and on Downstream Tasks. In NeurIPS 2024.
(2023). Residual Q-Learning: Offline and Online Policy Customization without Value. In NeurIPS 2023.
(2022). RAP: Risk-Aware Prediction for Robust Planning.. In CoRL.
(2022). CAPO: Control-Aware Prediction Objectives for Autonomous Driving.. In ICRA.