I’m Miles Wang. I’m a researcher at OpenAI trying to build beneficial and safe AGI.
I’m on the RL team, but my interests span alignment, evaluations, reasoning, and science. I’ve worked on a number of research directions, including:
- Scalable oversight of increasingly capable models, such as monitoring chains-of-thought for reward hacking.
- Frontier evaluations for high-compute RL runs.
- AI for science (especially biology) with agents that learn online.
- Frontier risk evaluations for models, including maximal capability elicitation.
- Alignment of model behavior, including understanding when misalignment generalizes.
- Adversarial robustness to jailbreaks.
- Machines that learn over long horizons (currently top of mind).
I studied Computer Science at Harvard before leaving to join OpenAI in March 2024. Feel free to contact me at milesw [at] openai [dot] com.
Selected Papers
- Monitoring Monitorability
- FrontierScience: Evaluating AI’s ability to perform scientific research tasks
- Measuring AI’s capability to accelerate biological research in the wet lab
- Estimating worst case frontier risks of open weight LLMs
- Persona Features Control Emergent Misalignment
- Forbidden Facts: An Investigation of Competing Objectives in Llama-2