💼 LanceDB Powers the Heart of AI at Dosu
- 90% label accuracy
- 70% less manual triage
- Millisecond search on millions of vectors
Those are the numbers that became real the moment the team switched to LanceDB inside Dosu. Learn more about how Dosu uses LanceDB as their living knowledge for codebases in this new Case Study: Meet Dosu - the Intelligent Knowledge Base for Software Teams and Agents
“LanceDB powers the heart of our AI at Dosu . Its lightning-fast search, built-in versioning, and effortless scalability let us move faster, ship smarter, and deliver cutting-edge intelligence to millions of developers without compromise.”
🎤 Event Recap!
In case you missed some of the cool events in June, the recordings are finally out! Check out the joint talks we did with Harvey and Databricks on how LanceDB helps scale enterprise AI systems in production!
- Scaling Enterprise-Grade RAG: Lessons from Legal Frontier : A joint talk with Calvin Qi from Harvey at the AI Engineer World’s Fair.
- Lakehouse Architecture for AI Data: Search, Analytics, Processing, Training : Our session at the Databricks DATA+AI SUMMIT with Chang She and Zhidong Qu .
📰 Product News: New Enterprise Features
Feature | Description |
---|---|
Seamless streaming ingestion | Automatically optimize indexes during streaming writes, providing consistently fast performance even under continuous data flow. |
Blazing-fast range queries | Range filters like “value >= 1000 and value < 2000” now can execute hundreds of times faster, supercharging vector search with metadata filtering. |
Multivector support in JavaScript SDK | LanceDB JavaScript/TypeScript SDK enables you to store and search multiple vector embeddings for a single item. |
Easy data exploration from LanceDB Cloud UI | Filter table data with SQL and select specific columns for a customizable view of your LanceDB datasets. |
Community Contributions
-
@jiaoew1991
Enwen Jiao previously worked on Data Engine at Minimax and now at Luma.ai. He has made multiple contributions to support Lance-Ray integration:
add_column
supports Ray to add columns to existing tables in a distributed manner PR1 , andadd_dataset_options
parameter supports exposing more native parameters when reading lance tables based on version and tag PR2 . - @geruh Drew Gallardo from AWS worked on AWS Glue integration with Lance PR1 , PR2
- @kazuhitoT Kazuhito Takeuchi for native integration with Lindera tokenizer PR1 , PR2
- A heartfelt thank you to our community contributors of Lance and LanceDB this past month: @yanghua @fangbo @emmanuel-ferdman @adi-ray @ddupg @chenghao-guo @LeoReeYang @majin1102 @HaochengLIU @Jay-ju @SaintBacchus @Sbargaoui @lalitx17 @allenanswerzq @bjurkovski @TaoKevinKK @wojiaodoubao @KazuhitoT @niyue @kilavvy @b4l @Dig-Doug @xhwhis @tristanz @chenkovsky @aniaan @yihong0618 @Kilerd @CyrusAttoun @kemingy @wengh @geruh
Open Source Releases Spotlight
Project | Version | Updates |
---|---|---|
LanceDB | v0.21.2 | Various improvements including support for ngram tokenizer , multivector for JavaScript , session support in Python and JavaScript |
LanceDB | v0.21.1 | Various improvements including batch Ollama embed calls |
Lance | v0.32.0 | Breaking changes: use FilteredReadExec in the planner & consolidated index cache |
Lance | v0.31.1 | New documentation website built with mkdocs |