Jean Mercat
Jean Mercat John

Research Scientist

About Me

I’m a machine learning research scientist at the Toyota Research Institute, where I specialize in transformers, large language models, vision language models, and large behavior models. My passion for ML has led me to explore a variety of fields, including self-driving cars, robotics, and language processing. Whether it’s making a language model smarter or teaching robots new tricks, I’m always on the lookout for the next big breakthrough!

Download CV
Interests
  • Artificial Intelligence
  • Natural Language Processing
  • Multi-modal Language Models
Education
  • PhD Machine Learning

    Paris Saclay University, L2S and Renault

  • MEng in Scientific Computing

    ENSEIRB-MatMéca, Bordeaux, France

📚 My Research

I’m a research scientist at Toyota Research Institute. I pre-train, uptrain, fine-tune, experiment, and do research with Large Language Models, Vision Language Models, and Large Behavior Models.

I attempt to understand and improve transformers. I apply large models to robotic manipulation to push the boundary of open-ended embodied intelligence.

Featured Publications
Recent Publications
(2024). DataComp-LM: In Search of the Next Generation of Training Sets for Language Models.
(2024). Linearizing Large Language Models. In COLM.
(2024). DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset. In RSS 2024.
(2024). Language Models Scale Reliably with Over-Training and on Downstream Tasks. In NeurIPS 2024.
(2023). Residual Q-Learning: Offline and Online Policy Customization without Value. In NeurIPS 2023.
Recent & Upcoming Talks