Co-Founder & CEO |Chroma
Previously co-founded Standard Cyborg. Chroma is the leading open-source AI-native embeddings database.
GitHub
12 repositories · 40 total stars
Languages
Episodes
Videos










Biography
**Jeff Huber** is a multi-time founder who first launched Standard Cyborg (YC W15), a startup building affordable 3D-printed prosthetics using computer vision and iPhone scanning technology. He co-founded Chroma in 2022 with Anton Troynikov, creating the leading open-source vector database that reached 1M+ users within 9 months and secured $18M in seed funding led by Quiet Capital. As Chroma's CEO, Huber has championed "context engineering" over traditional RAG, publishing influential technical reports on context rot and generative benchmarking while positioning Chroma as modern search infrastructure for AI applications. His unique perspective combines deep technical focus on retrieval systems with a contrarian product philosophy that prioritizes developer experience over rapid feature releases, evident in Chroma's deliberate transition to cloud services and integration with major platforms like Microsoft's Semantic Kernel.
Open-source AI-native vector database designed specifically for LLM applications, providing efficient storage and retrieval of embeddings with a developer-friendly Python/JavaScript API. It bridges the gap between AI models and production by making knowledge, facts, and skills pluggable for LLMs through semantic search capabilities.
Groundbreaking technical report demonstrating that LLM performance degrades non-uniformly as input context length increases, coining the term 'context rot' to describe this phenomenon. The research systematically evaluated 18 state-of-the-art LLMs including GPT-4.1, Claude 4, and Gemini 2.5, revealing that models don't use their context uniformly and performance becomes increasingly unreliable with longer inputs.
Novel approach to evaluating retrieval systems using synthetic benchmarks generated from user-specific data, addressing limitations of traditional benchmarks like MTEB that fail to represent real-world query patterns. This framework enables custom evaluation sets that reflect actual production queries and document distributions.
Technical report demonstrating that simple query-only linear adapters can improve retrieval accuracy by up to 70% without requiring full model retraining. This work showed how lightweight adaptation layers can significantly enhance embedding model performance for specific domains or use cases.
Comprehensive analysis of text chunking methods for retrieval-augmented generation systems, providing empirical evaluation of different strategies including fixed-size, semantic, and hierarchical approaches. The research introduced token-level evaluation metrics and revealed performance gaps of up to 9% in recall between different chunking methods.
Hallucinations happen when the AI model does not have enough factual information to ground its response.
Chroma is designed first and foremost for developer productivity and happiness.
The reason that Chroma got started is because we had worked for many years in applied machine learning and we'd seen how demos were easy to build, but building a production reliable system was incredibly challenging and that the gap between demo and production didn't really feel like engineering. It felt a lot more like alchemy.
RAG, like, never meant anything to anybody, and as a result meant everything to everybody as well. And like it's just overall a poor phrase to help shape the actual job to be done versus context engineering is a great phrase and implies the existence of a context engineer, which is literally the job of most AI teams today.
I think toxic cultures really only evolve when people have the strong opinion that they are right. That's different than having the strong opinion that we should find the right answer.