Research Scientist | Meta Superintelligence Labs
Pioneered chain-of-thought prompting and instruction tuning (FLAN), and characterized the emergent abilities of LLMs, at Google Brain. Co-created OpenAI's o1 reasoning model. Now at Meta Superintelligence Labs working on reasoning and reinforcement learning.
Biography
Jason Wei is an American AI researcher currently at Meta Superintelligence Labs, known for pioneering chain-of-thought prompting, instruction tuning (FLAN), and the concept of emergent abilities in large language models. He earned his undergraduate degree from Dartmouth College (2020). He joined Google as an AI Resident in October 2020, rising to Research Scientist at Google Brain, where he authored three of the most influential papers in modern LLM research: chain-of-thought prompting (5,000+ citations), FLAN instruction tuning, and emergent abilities of LLMs. In February 2023 he joined OpenAI, where he co-created the o1 reasoning model and contributed to o3 and Deep Research. In July 2025 he and colleague Hyung Won Chung left OpenAI to join Meta's newly formed Superintelligence Labs, where he focuses on reasoning and reinforcement learning for frontier AI systems.
Key Contributions
Introduced chain-of-thought (CoT) prompting in January 2022, demonstrating that large language models can solve complex reasoning tasks by generating intermediate reasoning steps. The paper has been cited over 5,000 times and fundamentally changed how practitioners interact with LLMs, directly inspiring OpenAI's o1 reasoning model.
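The technique is simple to illustrate: each few-shot exemplar pairs a question with worked-out intermediate steps, so the model continues in the same step-by-step style. A minimal sketch (the exemplar is the well-known tennis-ball problem used in the paper; the follow-up question here is an illustrative assumption):

```python
# One few-shot exemplar whose answer spells out intermediate reasoning steps.
COT_EXEMPLAR = (
    "Q: Roger has 5 tennis balls. He buys 2 more cans of tennis balls. "
    "Each can has 3 tennis balls. How many tennis balls does he have now?\n"
    "A: Roger started with 5 balls. 2 cans of 3 tennis balls each is 6 "
    "tennis balls. 5 + 6 = 11. The answer is 11.\n\n"
)

def build_cot_prompt(question: str) -> str:
    """Prepend the worked exemplar so the model imitates the reasoning format."""
    return COT_EXEMPLAR + f"Q: {question}\nA:"

# Illustrative question, not from the paper.
prompt = build_cot_prompt(
    "If a train travels 60 miles in 1.5 hours, what is its average speed?"
)
```

Because the prompt ends at "A:", the model's continuation naturally produces the intermediate steps before the final answer.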
Led the FLAN series of research showing that finetuning language models on tasks described via natural language instructions dramatically improves zero-shot and few-shot performance. The initial FLAN (2021) used 60+ tasks on a 137B model; Flan-PaLM (2022) scaled to 1,800 tasks on PaLM 540B, achieving SOTA on MMLU. This work laid the foundation for instruction-tuned models like ChatGPT.
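The core data-preparation idea can be sketched as rendering each supervised example through varied natural-language instruction templates; the templates below are illustrative stand-ins, not FLAN's exact ones:

```python
import random

# Illustrative instruction templates for a sentiment task (assumed, not FLAN's).
TEMPLATES = [
    "Is the sentiment of this review positive or negative?\n\nReview: {text}",
    "{text}\n\nWhat is the sentiment of the review above? Positive or negative?",
    "Classify the following review as positive or negative: {text}",
]

def format_example(text: str, label: str, rng: random.Random) -> dict:
    """Render one example through a randomly chosen instruction template,
    mirroring how instruction-tuning mixtures vary phrasing per example."""
    template = rng.choice(TEMPLATES)
    return {"input": template.format(text=text), "target": label}

example = format_example(
    "The movie was a delight from start to finish.", "positive", random.Random(0)
)
```

Training on many tasks phrased this way teaches the model to follow unseen instructions at zero-shot time, rather than memorizing one task format.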
Defined and cataloged 137 emergent abilities — capabilities that appear unpredictably as models scale — providing a conceptual framework for understanding why larger models exhibit qualitatively new behaviors. Published in TMLR 2022.
Co-created OpenAI's o1 model (launched September 2024), which uses reinforcement learning to train models to perform chain-of-thought reasoning at inference time. This shifted the field from prompt-based CoT to learned reasoning, enabling adaptive test-time compute scaling.
His first paper, EDA (Easy Data Augmentation, 2019), presented four simple text augmentation techniques (synonym replacement, random insertion, random swap, random deletion) that boosted text classification performance, especially on small datasets. Accepted at EMNLP 2019.
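The four operations are easy to sketch. The original paper draws synonyms from WordNet; the tiny hand-written synonym table below is an assumed stand-in to keep the example self-contained:

```python
import random

# Stand-in synonym table; EDA itself uses WordNet synonyms.
SYNONYMS = {"quick": ["fast"], "jumps": ["leaps"]}

def synonym_replacement(words, n, rng):
    """Replace up to n words that have entries in the synonym table."""
    out = list(words)
    candidates = [i for i, w in enumerate(out) if w in SYNONYMS]
    for i in rng.sample(candidates, min(n, len(candidates))):
        out[i] = rng.choice(SYNONYMS[out[i]])
    return out

def random_insertion(words, n, rng):
    """Insert a synonym of a random word at a random position, n times."""
    out = list(words)
    for _ in range(n):
        candidates = [w for w in out if w in SYNONYMS]
        if not candidates:
            break
        syn = rng.choice(SYNONYMS[rng.choice(candidates)])
        out.insert(rng.randrange(len(out) + 1), syn)
    return out

def random_swap(words, n, rng):
    """Swap the words at two random positions, n times."""
    out = list(words)
    for _ in range(n):
        i, j = rng.randrange(len(out)), rng.randrange(len(out))
        out[i], out[j] = out[j], out[i]
    return out

def random_deletion(words, p, rng):
    """Delete each word with probability p, keeping at least one word."""
    kept = [w for w in words if rng.random() > p]
    return kept or [rng.choice(words)]

rng = random.Random(0)
sentence = "the quick brown fox jumps over the lazy dog".split()
augmented = random_swap(sentence, 2, rng)
```

Each operation perturbs a sentence while mostly preserving its label, which is why the gains are largest on small training sets.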
Quotes
Don't do chain of thought purely via prompting. Train models to do better chain of thought using RL.
In the history of deep learning we have always tried to scale training compute, but chain of thought is a form of adaptive compute that can also be scaled at inference time.
o1-mini is the most surprising research result I've seen in the past year.
Beating the teacher requires walking your own path and taking risks and rewards from the environment.
There will be no fast takeoff, because there is a jagged edge of intelligence capability and rate of improvement.
Research generated March 19, 2026