Hi! I am a Ph.D. student at the Univeristy of British Columbia (UBC) CS advised by Vered Shwartz and Raymond NG in the Natural Language Processing group. I frequently collaborate with Leonid Sigal in the Computer Vision group.
The goal of my research is to develop robust and safe AI models that move beyond pattern recognition to achieve a deeper, context-sensitive understanding of the world. Specifically, I evaluate and improve critical reasoning capabilities across multiple modalities (text, images, videos) - which are fundamental for safe and effective deployment in real-world applications.
My focus is on the following interconnected areas:
➊ Robustness to Expectation-Violating Video Events
I design benchmarks and models that adapt to surprising or expectation-violating events, which is crucial for real-world safety. My recent work Black Swan evaluates models on abductive and defeasible reasoning in unpredictable video settings, testing whether models can revise their beliefs when presented with new evidence. I am currently working on adapting models to be resilient to such scenarios.
➋ Feedback Driven Learning
To scale complex reasoning capabilities such as creativity, which do not adhere to a step-by-step reasoning paradigm, I have worked on feedback-driven iterative DPO and showed that it improves performance. I am currently exploring the role of multi-aspect feedback in improving such generation.
➌ Grounded Reasoning
I have developed models that infer real-world events and object dynamics using world knowledge. This includes reasoning over interconnected events in text, event coreference, and grounded vision-language understanding models.
➍ Inclusive Models
I have worked on evaluating VLMs on retrieval and grounding in multicultural settings and analyzing the understanding of cultural norms in LLMs (CulturalBench).
If you are interested in any of the above areas, feel free to reach out.
Experience

Research Intern, FAIR Communication and Language
Generating fine-grained facial expressions using semantically meaningful pose tokens, improving predictability and precise control crucial for controllable video generation.

Research Intern, Microsoft Research, HCAIX Group
Evaluating 3D spatial reasoning abilities of Vision Language Models in ego-centric videos.

AI Research Scientist Intern, Meta Reality Labs
Iterative-DPO and ranking methods for teaching creative abilities to smaller LMs.

AI Research Engineer, OpenML & TU/e
I worked as an AI research engineer in the Machine Learning group at Eindhoven University of Technology. Together with an amazing team supervised by Joaquin Vanschoren, I worked on research and development of open-source software for open and automated machine learning. I also worked with Mykola Pechenizkiy on active learning and interpretability of NLP models.