What is TeaRAGs

TeaRAGs (Trajectory Enrichment-Aware Retrieval-Augmented Generation system) is a high-performance code RAG system exposed as an MCP server. Built for large monorepos and actively growing codebases, it combines semantic retrieval with development history signals — authorship, churn patterns, change volatility, bug-fix rates — to rerank results beyond pure similarity.

What It's For

Semantic code search — find code and documentation by intent, not by identifier names. Ask "how does authentication work?" and get the actual implementation, even if it's called Pipeline::StageClient
Agentic data-driven engineering — AI agents making code decisions backed by empirical evidence from your repository's history: stable templates, domain owner's style, proven patterns — not pattern matching intuition
Deep codebase analysis — hotspot detection, ownership mapping, tech debt scoring, blast radius estimation, churn volatility tracking — all at function-level granularity, not just per file, and all queryable through semantic search

What It Does

TeaRAGs indexes your codebase into searchable vector embeddings, then enriches each code chunk with git-derived quality signals. When an AI agent searches your code, results are re-scored using these signals — so the agent finds not just code that looks right, but code that is stable, well-owned, and battle-tested.

Learn more in Core Concepts:

Code Vectorization — AST-aware chunking pipeline
Trajectory Enrichment Awareness — 19 git-derived signals at function-level
Reranking — weighted scoring presets for quality-aware retrieval

What It Can Do

Find code by intent, not by name. Ask "How does user authentication work?" — get the actual implementation, even if it's called Pipeline::StageClient or InfoRequest. No need to guess identifiers. Works in seconds, consuming 2–3x fewer tokens than grep-based exploration.

Detect the most dangerous code in your system. Query with rerank: "hotspots" — instantly surface high-churn, frequently-fixed code that is statistically likely to break next. A single query replaces hours of manual git log archaeology.

Find stable, battle-tested templates for code generation. Query with rerank: "stable" — the agent copies from code that has survived production for months with minimal changes and near-zero bug fixes, instead of copying from the first search hit.

Map code ownership and knowledge silos. Query with rerank: "ownership" — identify who owns which part of the codebase, where knowledge is concentrated in a single author (bus factor risk), and whose coding style to match when contributing to a domain.

Migrate patterns across a 3.5M LOC codebase. Use semantic search to analyze how the first migration was done — the AI completes 95% of the next one. Rewriting batch operations, moving to new frameworks, standardizing error handling — all become systematic instead of manual.

Investigate production bugs in minutes. Describe the problem in natural language: "Where does the system handle failed payment retries?" — get relevant code fragments with their stability and churn history. High churn on a code fragment? That's probably where the bug lives.

Prepare for audits and compliance. Search for "personal data handling", "access control checks", "logging of sensitive operations" — get a structured map of where security-critical logic lives, who owns it, and when it was last modified.

Onboard to an unfamiliar codebase in hours, not weeks. Ask "How does background processing work here?" or "Where is the main business logic?" — build a mental model of any system by asking questions about behavior, not by reading directory trees.

Why TeaRAGs

Agent on Grep vs Agent on Semantic Search

Without semantic search, an AI coding agent explores your codebase through brute force: launching subagents, running dozens of glob/grep calls, reading files speculatively, and burning through tokens on trial-and-error navigation. With semantic search, the agent asks one question and gets the right code immediately.

A controlled benchmark by grepai on the Excalidraw codebase (155K+ LOC TypeScript) measured the difference:

Metric	Agent + Grep	Agent + Semantic Search	Change
Subagent launches	5	0	-100%
Tool calls	139	62	-55%
Fresh input tokens	51,147	1,326	-97%
Cache creation tokens	563,883	162,289	-71%
Total billed cost	$6.78	$4.92	-27.5%

The cost savings come from eliminating the most expensive operations: subagent launches (which create expensive cache contexts) and speculative file reads. The agent no longer needs to guess where code lives — it knows.

2x faster discovery — the agent doesn't waste turns on dead-end searches
2x fewer tokens — the agent reads only relevant files instead of scanning entire directories
Comparable result quality — the same correct answer, just reached faster and cheaper

The savings scale with codebase size. On a 155K LOC codebase, semantic search reduced token consumption by 97%. On larger codebases (1M+ LOC, deep nesting, sprawling domains), the gap widens further — grep-based exploration becomes exponentially more expensive as the agent searches through more directories, while semantic search remains constant: one query, immediate answer, regardless of project size.

Research: grepai: Benchmark grepai vs grep on Claude Code (detailed token/cost breakdown) | Zilliz: Why I'm Against Claude Code's Grep-Only Retrieval (40%+ token reduction, qualitative analysis)

Agent on Semantic Search vs Agent on TeaRAGs

Plain semantic search finds code that looks like your query. That's a massive improvement over grep — but the agent still has no idea whether the code it found is good. It copies the first match, blind to quality.

TeaRAGs adds a trajectory enrichment layer: every search result carries 19 git-derived signals — churn, stability, authorship, bug-fix rates, code age — at function-level granularity. The agent doesn't just find code faster. It finds better code.

Capability	Semantic Search	TeaRAGs
Find code by meaning	✅	✅
Hybrid search (BM25 + vector)	⚠️ some tools	✅ RRF fusion
Know if code is stable or volatile	❌	✅ `churnVolatility`, `commitCount`
Know who owns the code	❌	✅ `dominantAuthor`, `authors[]`
Know if code is buggy	❌	✅ `bugFixRate` per function
Know when code was last touched	❌	✅ `ageDays`, `lastModifiedAt`
Link code to JIRA/GitHub tickets	❌	✅ `taskIds[]`
Find stable templates to copy	❌ guessing	✅ `rerank: "stable"`
Avoid high-risk code	❌ guessing	✅ `rerank: "hotspots"`
Match domain owner's style	❌	✅ `rerank: "ownership"`
Assess tech debt before modifying	❌	✅ `rerank: "techDebt"`
Function-level metrics	❌ file-level at best	✅ per function/method

The difference in practice: a plain semantic search agent copies the first match and hopes for the best. A TeaRAGs-powered agent finds code with a 0–20% bug-fix rate, written by the domain owner, stable for 6+ months — and copies that instead.

This is the shift from "find similar code" to agentic data-driven engineering — code generation decisions backed by empirical evidence.

Who It's For

Enterprise developers working in large, actively growing codebases (1M+ LOC) where grep stops being effective and deep domain knowledge is scattered across teams and timezones. As a privacy-first, local-first solution, TeaRAGs runs entirely on your machine — your code never leaves the perimeter, no cloud APIs required
Developers exploring unfamiliar codebases — engineers who need to understand architecture conceptually rather than search for specific implementations. Ask "how does the system handle X?" or "what are the core abstractions?" and build a mental model by asking questions about behavior, patterns, and responsibilities — not by reading file trees or grepping for class names
Projects with deep domain knowledge and naming challenges — codebases where identifiers don't match their purpose (Pipeline::StageClient for authentication, InfoRequest for user data), where business logic is buried behind generic abstractions, or where the same concept has different names across modules. Semantic search cuts through naming inconsistency to find code by what it does, not what it's called
Large monorepo teams where code ownership matters, patterns evolve across hundreds of contributors, and the cost of copying the wrong template is measured in production incidents
AI-assisted development enthusiasts who want to push coding agents beyond naive context injection — using empirical signals to make agents genuinely smarter about code quality and risk

Who It's Not For (For Now)

Small teams with small codebases — if your project fits in a single developer's head and grep finds everything you need, the overhead of a vector database and embedding pipeline isn't justified yet. TeaRAGs shines when scale makes intuition unreliable.
Microservice architectures with many small repos — TeaRAGs is optimized for monorepos and large codebases. If each service is 5–20K LOC in its own repository, the trajectory enrichment signals (churn, ownership, cross-file patterns) have less data to work with. That said, indexing multiple repos into separate collections is supported — it just won't deliver the same depth of insight as a monorepo with rich git history.

Next Steps

Origin — why TeaRAGs was created, the journey from frustration to a working tool
Comparison — TeaRAGs vs claude-context, grepai, serena, and others
Core Concepts — how code vectorization, trajectory enrichment, and reranking work together
Use Cases — 5 real-world semantic search scenarios from an enterprise codebase
Non-Goals — what TeaRAGs deliberately doesn't do
Quickstart — get up and running in 15 minutes

What It's For​

What It Does​

What It Can Do​

Why TeaRAGs​

Agent on Grep vs Agent on Semantic Search​

Agent on Semantic Search vs Agent on TeaRAGs​

Who It's For​

Who It's Not For (For Now)​

Next Steps​