Joykirat Singh

Research Fellow, Microsoft Research

prof_pic.jpg

New Delhi, India

I’m Joykirat Singh, an incoming PhD student in the Computer Science Department at the University of North Carolina at Chapel Hill and will be advised by Prof. Mohit Bansal. Currently I am working at Microsoft Research India as a Research Fellow, working under the mentorship of Dr. Akshay Nambi.

My research interest lies in building Interpretable AI models that have reasoning capabilities. Study how does LLM’s behavior emerge as a function o fits training data and how doe internal mechanisms evolve/emerge during training. I also want to focus on the reasoning capabilities of LLMs and explore if models can truly reason and do long horizon planning without relying on biased priors and pattern recognition. Furthermore, research on building AI systems that improve their reasoning capabilities while minimizing the reliance on superficial patterns learned during pre-training.

I graduated in 2023 with a B.Tech in Computer Science and Design with a silver medal for academic excellance from the Indraprastha Institute of Information Technology (IIIT), Delhi, where I had the opportunity to collaborate with Dr. Md Shad Akhtar.

Previously, I served as a Research Assistant at the Indian Institute of Technology (IIT) Delhi, working alongside Dr. Tanmoy Chakraborty and Dr. Soumen Chakrabarti on various research projects.

news

Jun 01, 2025 Joined University of North Carolina at Chapel Hill as a Computer Science PhD Student and will be advised by Prof. Mohit Bansal. I will be working on language models, focusing on reasoning, tool use and interpretability. Excited to be part of the vibrant research community at UNC.
May 20, 2025 ARTIST enables models to autonomously decide when, how, and which tools to invoke within multi-turn reasoning chains. Check out the preprint to learn how it enhances reasoning capabilities in language models by leveraging tool invocation strategies.
May 16, 2025 MWP-MISTAKE and PROMPTWIZARD accepted at ACL 2025 MAINS and ACL 2025 FINDINGS, respectively!
Mar 04, 2025 Preprint SPHERE builds a self-evolving data generation pipeline that enhances reasoning in small language models (SLMs) by iteratively generating, correcting, and diversifying reasoning chains.
Oct 03, 2024 Preprint PROMPTWIZARD is out! PromptWizard introduces a novel, fully automated framework for discrete prompt optimization, utilizing a self-evolving, self-adapting mechanism.
Jun 16, 2024 Preprint MWP-MISTAKE on exploring language models mistake detection and their correction capabilities is out.
Jun 01, 2024 Joined Microsoft Research as a Research Fellow under the mentorship of Dr. Akshay Nambi. Excited to explore the fascinating world of Large Language Models and their reasoning capabilities! 🚀

Publications

  1. artist.png
    Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning
    Joykirat Singh, Raghav Magazine, Yash Pandya, and 1 more author
    Preprint, 2025
  2. self.png
    Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models
    Joykirat Singh, Tanmoy Chakraborty, and Akshay Nambi
    Preprint, 2025
  3. reasoning.png
    Exposing the Achilles’ Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning
    Joykirat Singh, Akshay Nambi, and Vibhav Vineet
    2024
  4. PW.png
    PromptWizard: Task-Aware Agent-driven Prompt Optimization Framework
    Eshaan Agarwal, Joykirat Singh, Vivek Dani, and 3 more authors
    2024
  5. behaviorEditing.png
    Mechanistic Behavior Editing of Language Models
    Joykirat Singh*, Subhabrata Dutta*, and Tanmoy Chakraborty
    Preprint, 2024
  6. tmlr.png
    How to think step-by-step: A mechanistic understanding of chain-of-thought reasoning
    Subhabrata Dutta*, Joykirat Singh*, Soumen Chakrabarti, and 1 more author
    2024
  7. frugal.png
    Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning
    Subhabrata Dutta*, Joykirat Singh*, Ishan Pandey*, and 3 more authors
    In Proceedings of the AAAI Conference on Artificial Intelligence, 2024
  8. eros.png
    EROS: Entity-Driven Controlled Policy Document Summarization
    Joykirat Singh*, Sehban Fazili*, Rohan Jain, and 1 more author
    2024