CV

Research Interests

My work focuses on building and optimizing large language models for real-world applications, particularly efficient and scalable inference, agentic search, and real-time query understanding. I have trained small- to mid-sized language models for shopping applications and deployed medium-sized models in search systems, with a strong focus on latency optimization and system efficiency.

Keywords: Large Language Models (LLMs), LLM Pre-training, Inference Optimization, Agentic Search, Retrieval-Augmented Generation (RAG), Query Understanding

Education

Work Experience

Awards

Publications

Talks

Academic Service

Scholarship

References