
Embedding Providers

Provider Comparison

| Provider | Models | Dimensions | Rate Limit | Notes |
|----------|--------|------------|------------|-------|
| Ollama | unclemusclez/jina-embeddings-v2-base-code (default), nomic-embed-text, mxbai-embed-large | 768, 768, 1024 | None | Local, no API key |
| OpenAI | text-embedding-3-small, text-embedding-3-large | 1536, 3072 | 3500/min | Cloud API |
| Cohere | embed-english-v3.0, embed-multilingual-v3.0 | 1024 | 100/min | Multilingual support |
| Voyage | voyage-2, voyage-large-2, voyage-code-2 | 1024, 1536 | 300/min | Code-specialized |
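
Because the vector dimension differs between models, an index built for one model generally cannot be reused after switching to another. The sketch below shows one way to encode the table as a lookup and check a vector's length before storing it. The names `EMBEDDING_DIMENSIONS`, `expected_dimension`, and `check_vector` are hypothetical and not part of any provider SDK. Voyage is left out because the table lists three models but only two dimension values, so the per-model mapping cannot be read off unambiguously.

```python
# Dimensions copied from the comparison table; all names here are illustrative.
EMBEDDING_DIMENSIONS = {
    "ollama": {
        "unclemusclez/jina-embeddings-v2-base-code": 768,
        "nomic-embed-text": 768,
        "mxbai-embed-large": 1024,
    },
    "openai": {
        "text-embedding-3-small": 1536,
        "text-embedding-3-large": 3072,
    },
    "cohere": {
        "embed-english-v3.0": 1024,
        "embed-multilingual-v3.0": 1024,
    },
}


def expected_dimension(provider: str, model: str) -> int:
    """Return the vector size the table lists for a provider/model pair."""
    # Ollama model names may carry a tag such as ":latest"; drop it for lookup.
    base_name = model.split(":", 1)[0]
    try:
        return EMBEDDING_DIMENSIONS[provider.lower()][base_name]
    except KeyError:
        raise ValueError(f"unknown provider/model: {provider}/{model}") from None


def check_vector(provider: str, model: str, vector: list) -> None:
    """Raise ValueError if a vector's length does not match the table."""
    want = expected_dimension(provider, model)
    if len(vector) != want:
        raise ValueError(f"expected {want} dimensions, got {len(vector)}")
```

Running a check like this once at startup, against a single test embedding, tends to catch a mismatched index earlier than a failed similarity query would.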

For code search, we recommend unclemusclez/jina-embeddings-v2-base-code (default):

ollama pull unclemusclez/jina-embeddings-v2-base-code:latest
export EMBEDDING_MODEL="unclemusclez/jina-embeddings-v2-base-code:latest"

| Aspect | Benefit |
|--------|---------|
| Code-optimized | Trained specifically on source code |
| Multilingual | 30+ programming languages |
| Enterprise-proven | Battle-tested on 3.5M+ LOC codebases |
| Best performance/quality | Optimal balance for local/on-premise setups |
| CPU-friendly | Runs efficiently without GPU (great for Ollama) |
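
Once the model is pulled, a quick way to confirm it responds is to request a single embedding from the local Ollama server. The following is a minimal sketch, assuming Ollama is listening on its default address `http://localhost:11434` and exposes the `/api/embeddings` endpoint. The helper names `build_embedding_request` and `embed` are hypothetical, and the project this page documents may call Ollama differently.

```python
import json
import os
import urllib.request

# Fallback used when EMBEDDING_MODEL is not set in the environment.
DEFAULT_MODEL = "unclemusclez/jina-embeddings-v2-base-code:latest"


def build_embedding_request(text, model=None, base_url="http://localhost:11434"):
    """Build (but do not send) a POST request for Ollama's embeddings endpoint."""
    model = model or os.environ.get("EMBEDDING_MODEL", DEFAULT_MODEL)
    payload = json.dumps({"model": model, "prompt": text}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/api/embeddings",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def embed(text, **kwargs):
    """Send the request and return the embedding as a list of floats."""
    request = build_embedding_request(text, **kwargs)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)["embedding"]


# Usage (requires a running Ollama server with the model pulled):
#   vector = embed("def add(a, b): return a + b")
#   print(len(vector))
```

Splitting request construction from sending keeps the payload easy to inspect without a running server. If the table above is accurate, the printed length for the default model should be 768.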