jina-embeddings-v3 Collection • Multilingual multi-task general text embedding model • 6 items • Updated 18 days ago • 12
jina-embeddings-v3: Multilingual Embeddings With Task LoRA Paper • 2409.10173 • Published 21 days ago • 21
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published Jun 25 • 85
Multi-Task Contrastive Learning for 8192-Token Bilingual Text Embeddings Paper • 2402.17016 • Published Feb 26 • 5
Jina Embeddings 2: 8192-Token General-Purpose Text Embeddings for Long Documents Paper • 2310.19923 • Published Oct 30, 2023 • 13
Generalist embedding models are better at short-context clinical semantic search than specialized embedding models Paper • 2401.01943 • Published Jan 3 • 6
jina-embeddings-v2 Collection • The V2 family of Jina Embeddings supports encoding large documents with 8k sequence length. • 8 items • Updated 20 days ago • 15
Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models Paper • 2307.11224 • Published Jul 20, 2023 • 5