Exploring intelligence through language, vision, and learning.
I am a Staff Research Scientist at Google DeepMind, contributing to the development of the next generation of AI models on the Gemini and Gemma teams. Most recently, I have had the pleasure of leading the vision workstream on Gemma 4 and Gemma 3, state-of-the art vision and language models that are open source! I have also been working on the Gemini team since December 2023.
Previously, I earned my PhD on Fine-Grained Vision and Language Understanding at New York University's Center for Data Science advised by Prof. Yann LeCun and Prof. Kyunghyun Cho, and a Masters at University of Massachusetts Amherst advised by Prof. Andrew McCallum.
Email: firstnamelastname20@gmail.com