AI & Advanced Computing
Karthik Narasimhan

Program: AI Safety Science
Institution: Princeton University
Location: USA
Dr. Karthik Narasimhan’s research focuses on problems at the intersection of language and decision making. He builds autonomous agents that learn to operate in the world using both their own experience and existing human knowledge. Karthik received his PhD from MIT in 2017 and spent a year as a visiting research scientist at OpenAI, contributing to the GPT language model, before joining Princeton in 2018. He is the recipient of a Google Research Scholar Award (2022), an Amazon Research Award (2019), and best paper awards/nominations at EMNLP (2015, 2016).
This project will create benchmarks, tests, and metrics to assess the reliability and consistency of AI software engineering (SWE) agents. It will specifically focus on identifying potential catastrophic failures, similar to the approach used by ToolEmu for evaluating LLM tool use.