Jason Liu

Creator & Founder |Instructor / 567 Studios

Created Instructor (structured LLM outputs via Pydantic), cited by OpenAI as inspiration for their structured output feature. AI consultant and educator.

GitHubjxnl

Biography

Jason Liu is a staff-level machine learning engineer, angel investor, a16z scout, and the creator of Instructor, a Python library for structured outputs from LLMs with 12,500+ GitHub stars and 6M+ monthly downloads. The OpenAI team cited Instructor as direct inspiration for their native structured output feature. Liu studied Computational Mathematics & Statistics at the University of Waterloo (2012-2017). He worked as a Data Scientist at Meta (Facebook) in 2017 on content detection algorithms at 2B+ user scale, then spent five years as a Staff ML Engineer at Stitch Fix (2018-2023), where he led a team of 6-7 engineers building multimodal embedding systems and created the Flight framework processing 350M+ daily requests. In 2023 he founded 567 Studios, a solo AI consulting practice advising seed-to-Series B startups including Zapier, HubSpot, Limitless AI, Weights & Biases, Modal Labs, Timescale, and Pydantic. He ran cohort-based training programs on Maven with students from OpenAI, Anthropic, Google, Microsoft, Amazon, and McKinsey. In February 2026 he sunset 567 Labs and open-sourced all course content. He is based in New York, where he works as a freelance ML consultant and angel investor.

Structured LLM OutputsPydantic IntegrationRAG SystemsContext EngineeringAI ConsultingMultimodal EmbeddingsEvaluation FrameworksFunction CallingOpen SourceAI Engineering Education

github twitter linkedin website blog

Timeline

12 Research12 total

2026

2026-02Research

Sunset 567 Labs; open-sourced all course content (RAG Playbook + Consulting Archive)

2025

2025-01Research

Instructor surpassed 6M+ monthly PyPI downloads and 12,000+ GitHub stars

2024

2024-01Research

Appeared on Weaviate Podcast #88 discussing Instructor and AI consulting

2024-01Research

Launched Maven course 'Systematically Improving RAG Applications' (6-week cohort-based)

2024-04Research

Appeared on Latent Space podcast: 'High Agency Pydantic > VC Backed Frameworks'

2024-06Research

Appeared on devtools.fm podcast: 'Instructor, Shipping LLMs to Production'

2024-08Research

OpenAI launched native structured outputs, citing Instructor as inspiration

2023

2023-01Research

Left Stitch Fix; founded 567 Studios as an independent AI consultant

2023-06Research

Open-sourced Instructor (originally 'OpenAI Function Call and Pydantic Integration Module') on GitHub

2018

2018-01Research

Joined Stitch Fix as Staff ML Engineer; led multimodal AI and search team (6-7 engineers)

2017

2017-01Research

Data Scientist at Meta (Facebook), building content detection algorithms at 2B+ user scale

2012

2012-01Research

Enrolled at University of Waterloo, Computational Mathematics & Statistics

Key Contributions

Instructor

Python library for structured outputs from LLMs, patching provider SDKs to return Pydantic models. 12,500+ GitHub stars, 6M+ monthly downloads, cited by OpenAI as inspiration for their structured output feature.

Instructor JS/TS

TypeScript port of Instructor for structured extraction from LLMs in the JavaScript ecosystem.

Flight Framework (Stitch Fix)

Internal ML pipeline framework at Stitch Fix processing 350M+ daily requests with 80% internal adoption, serving as a semantic bridge integrating multiple systems.

Systematically Improving RAG (Maven Course)

6-week hands-on course covering synthetic evaluation, embedding fine-tuning for 20-40% gains, query segmentation, and multimodal indices. Students from OpenAI, Anthropic, Google, Microsoft, Amazon.

567 Labs Open-Source Course Archive

Open-sourced written content from RAG Playbook and Consulting courses as freely available ebooks after sunsetting 567 Labs.

Context Engineering for RAG

Prolific writing on building better agentic RAG systems, evaluation frameworks, and practical AI engineering patterns.

Notable Quotes

“

It's just Python, right? Like, if you're going to use the LLM SDKs, you're obviously going to install instructor.