Hi! I am a fifth-year Ph.D. candidate in the School of Electrical and Computer Engineering at Purdue University, where I work with Prof. Qiang Qiu. My research focuses on advancing generative and foundation models for visual reasoning and reconstruction, with applications in inverse problems, controllable image generation, and video understanding.
Through internships at Apple, Samsung Research America (MPI Lab), and AMD, I have gained hands-on industry experience in large-scale training, personalization, fine-tuning, and alignment of diffusion, vision-language, and multimodal models on GPU clusters, along with hardware-aware model deployment.
Prior to my doctoral studies, I worked as a research intern at the Visual Computing and Analytics Lab (VCA), IIT (BHU), where I contributed to research on meta-learning and federated learning under Prof. Sanjay Kumar Singh. I also spent two years as an engineer in the Medical Electronics division at Tata Elxsi, developing embedded software and hardware solutions for medical devices.
- Multimodal foundation models and generative AI
- World models and video understanding
- Inverse problems and image restoration
- Controllable image and video generation