Machine Learning Engineer | Jane Street
Creator of Hugging Face Accelerate, core maintainer of Transformers, and co-author of the fastai book with Jeremy Howard.
Biography
Sylvain Gugger is a Machine Learning Engineer at Jane Street on the ML-infra team, where he helps traders and researchers accelerate their model training and inference. Before Jane Street, he spent three years at Hugging Face (2020-2023) as a core maintainer of the Transformers library (1,250+ commits) and creator of the Accelerate library (290+ commits, 9.5k+ GitHub stars), which simplifies distributed PyTorch training across GPUs and TPUs, with mixed-precision support, at the cost of only minimal code changes. Prior to Hugging Face, he worked at fast.ai (2017-2019) alongside Jeremy Howard, co-authoring "Deep Learning for Coders with fastai and PyTorch" (O'Reilly, 2020) and helping build the fastai library and courses.

Gugger began his career as a mathematics and computer science teacher in France, spending seven years teaching in CPGE (Classes Préparatoires aux Grandes Écoles) and authoring ten math textbooks published by Dunod. He discovered machine learning through Jeremy Howard's fast.ai MOOC after relocating to New York City around 2015. A key moment in his early ML career was the Stanford DAWNBench competition, where his implementation of Leslie Smith's super-convergence method helped the fast.ai team take first place on CIFAR-10 for both fastest and cheapest training. His widely read blog posts on the 1-cycle learning rate policy and learning rate finders have become foundational references in the deep learning community.

He holds a master's degree in Mathematics and Computer Science from the École normale supérieure (2003-2007).
Highlights

Created the Accelerate library (9.5k+ GitHub stars), a thin PyTorch wrapper that simplifies distributed training across GPUs and TPUs, including mixed-precision setups, with only about five lines of changed code. Supports FSDP, DeepSpeed, and data/pipeline/tensor parallelism.
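As a rough illustration of that pattern, here is a plain PyTorch loop adapted to run under Accelerate; the lines marked "# +" are the additions, and the toy model, data, and hyperparameters are illustrative, not taken from the source.

```python
# A plain PyTorch training loop adapted for Accelerate.
# Lines marked "# +" are the additions; everything else is standard PyTorch.
import torch
from torch.utils.data import DataLoader, TensorDataset
from accelerate import Accelerator  # +

accelerator = Accelerator()  # + detects devices, processes, mixed precision

model = torch.nn.Linear(10, 2)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
dataset = TensorDataset(torch.randn(64, 10), torch.randint(0, 2, (64,)))
dataloader = DataLoader(dataset, batch_size=8)

model, optimizer, dataloader = accelerator.prepare(  # + wraps for the setup
    model, optimizer, dataloader
)

for inputs, targets in dataloader:
    optimizer.zero_grad()
    loss = torch.nn.functional.cross_entropy(model(inputs), targets)
    accelerator.backward(loss)  # + replaces loss.backward()
    optimizer.step()
```

The same script then runs unchanged on a single GPU, multiple GPUs, or a TPU, with the launch configuration handled outside the code.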
Contributed 1,250+ commits to the Transformers library as a core maintainer, working on the Trainer API, model implementations, and training infrastructure that powers millions of ML workflows.
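A minimal sketch of fine-tuning through the Trainer API he maintained; the checkpoint, dataset, and hyperparameters below are placeholder assumptions, not details from the source.

```python
# Fine-tuning a sequence classifier through the Transformers Trainer API.
from datasets import load_dataset
from transformers import (
    AutoModelForSequenceClassification,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
)

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2
)

dataset = load_dataset("imdb")

def tokenize(batch):
    # Pad to a fixed length so the default collator can batch examples.
    return tokenizer(
        batch["text"], truncation=True, padding="max_length", max_length=128
    )

tokenized = dataset.map(tokenize, batched=True)

args = TrainingArguments(
    output_dir="out",
    per_device_train_batch_size=16,
    num_train_epochs=1,
)
trainer = Trainer(model=model, args=args, train_dataset=tokenized["train"])
trainer.train()
```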
Co-authored "Deep Learning for Coders with fastai and PyTorch" (O'Reilly, 2020) with Jeremy Howard, along with its accompanying Jupyter notebooks (fastbook), making deep learning accessible to programmers without a PhD. Cited by 16,500+ researchers.
Popularized and implemented Leslie Smith's 1-cycle policy and super-convergence technique, enabling dramatically faster training. His blog posts on these methods became foundational references, and the techniques were adopted across the PyTorch, TensorFlow, and Keras ecosystems.
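The 1-cycle schedule now ships in PyTorch itself as torch.optim.lr_scheduler.OneCycleLR; a toy sketch, with an assumed model and step counts chosen only for illustration:

```python
# The 1-cycle policy via PyTorch's built-in OneCycleLR scheduler:
# the learning rate ramps up to max_lr, then anneals back down,
# while momentum follows the inverse cycle by default.
import torch

model = torch.nn.Linear(10, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)

epochs, steps_per_epoch = 3, 100
scheduler = torch.optim.lr_scheduler.OneCycleLR(
    optimizer, max_lr=0.1, epochs=epochs, steps_per_epoch=steps_per_epoch
)

for _ in range(epochs):
    for _ in range(steps_per_epoch):
        inputs = torch.randn(8, 10)
        loss = model(inputs).pow(2).mean()
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        scheduler.step()  # step once per batch, not per epoch
```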
Key contributor to the fast.ai team that won Stanford's DAWNBench CIFAR-10 competition for the fastest and cheapest training on publicly available infrastructure, beating entries from Google and Intel that ran on large clusters.
Co-developed the fastai deep learning library with Jeremy Howard, introducing progressive resizing, learning rate finders, and one-cycle training as default practices that influenced the broader PyTorch ecosystem.
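A sketch of those defaults as they appear in recent fastai versions; exact names such as vision_learner and the valley suggestion vary across releases.

```python
# Learning rate finder plus one-cycle training with fastai defaults.
# Uses fastai's small bundled MNIST sample so the example is self-contained.
from fastai.vision.all import *

path = untar_data(URLs.MNIST_SAMPLE)
dls = ImageDataLoaders.from_folder(path)
learn = vision_learner(dls, resnet18, metrics=accuracy)

suggested = learn.lr_find()               # sweep LRs and suggest a good value
learn.fit_one_cycle(1, suggested.valley)  # train with the 1-cycle policy
```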
Quotes

I created a new open source library to make it much more lightweight to help people with our trainings.
No one knows anything about machine learning. Like, it's really just a cooking sense.
Making sure that your code is there... you can change a small line of code in your model and think 'Oh, this is totally harmless,' but then it actually destroys the performance.
Yesterday was my last day at Hugging Face. The past three years have been exhilarating and I am very proud of what the team has accomplished during that time!
The way we tune all the other hyper-parameters of the model will impact the best learning rate.