Hi, I'm Sourav Panda.

I like to apply reinforcement learning to strategy games, generative models, and beyond.

About Me

I am a Ph.D. student at the College of IST at Penn State, where I am fortunate to be advised by Dr. Jonathan Dodge.

My research focuses on:

Reinforcement learning for strategy planning and decision-making.
Integrating RL into generative models.
Building AI systems that are not only high-performing but also easy to understand.

Before my Ph.D., I earned my M.S. in Informatics from Penn State, where I was advised by Dr. Jonathan Dodge and had the opportunity to work with Dr. Abhinav Verma.

Outside the lab, I enjoy maintaining an active lifestyle through gym workouts, football, biking, and swimming. I am also an avid dog lover, always eager to meet new furry friends! Tell your dog I said hi 👋

Email / LinkedIn / GitHub / Scholar / X / CV (Last Updated - 10/25)

I am actively looking for Summer 2026 internships. Feel free to reach out.

News

Nov, 2025 Three workshop papers accepted at AAAI 2026: two posters (MURE) and one oral (FAST).
Aug, 2025 Started my Ph.D. in Informatics at Penn State.
May, 2025 Graduated from Penn State with an M.S. in Informatics, GPA: 4.0/4.0.
Mar, 2025 Successfully defended my Master's thesis.
Oct, 2024 Two workshop papers accepted at NeurIPS 2024: IMOL and Red Teaming GenAI.
Aug, 2023 Started my M.S. in Informatics (Thesis Track) at Penn State.

Selected Publications

Unlocking New Strategies: Intrinsic Exploration for Evolving Macro and Micro Actions [Paper]

Sourav Panda, Aviral Srivastava, and Jonathan Dodge

Intrinsically Motivated Open-Ended Learning Workshop @ NeurIPS 2024 [Link]

MIXTAPE: Middleware for Interactive XAI with Tree-Based AI Performance Evaluation

Tanmay Ambadkar, Hayden Moore, Sourav Panda, Shreyash Kale, Connor Greenwell, Brianna Major, Aashish Chaudhary, Jonathan Dodge, Abhinav Verma, and Brian Hu.

Simulation Interoperability Standards Organization 2025 [Abstract]

MIXTAPE: Middleware for Interactive XAI with Tree-Based AI Performance Evaluation

Brian Hu, Jonathan Dodge, Abhinav Verma, Tanmay Ambadkar, Sourav Panda, Sujay Koujalgi, Aashish Chaudhary, Brianna Major, and Bryon Lewis.

Simulation Interoperability Standards Organization 2024 [Abstract]

How to Measure Human-AI Prediction Accuracy in Explainable AI Systems

Sujay Koujalgi, Andrew Anderson, Iyadunni Adenuga, Shikha Soneji, Rupika Dikkala, Teresita Guzman Nader, Leo Soccio, Sourav Panda, Rupak Kumar Das, Margaret Burnett, and Jonathan Dodge

arXiv [Paper]