DEV Community

Gaurav Tarlok Kakkar

How About Ditching the Hype: Do We Really Need a Specialized Vector Database?

With the emergence of Generative AI, vector databases have surged in popularity, finding their niche in powering Retrieval Augmented Generation (RAG) applications. Look across the database landscape, though, and a common trend emerges: nearly every database provider is adding vector search to its offering. It's a strategic move, since vector search is integral to capturing a substantial share of the RAG workload.

Some of the major releases:

So, the big question is: will all this effort make the difference between vector databases and other databases disappear over time? Some open thoughts 🤔

  • Why might customers consider moving to a separate database for vector search when their current database provider already offers vector search capabilities?
  • Will these databases come with RAG capabilities right out of the box, or will libraries like LangChain and LlamaIndex serve as ETL pipelines on top of them to facilitate RAG?
  • On the other hand, can these extensions and bolt-on vector search features meet the scalability, latency, cost, and index freshness requirements of applications?
  • What if a specialized architectural change is needed to handle vector search due to the massive embedding size?
  • Perhaps both options will coexist, but for smaller workloads, the difference in performance and cost between specialized vector databases and built-in support may not be significant enough to justify maintaining a new database.
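
To make the "smaller workloads" point a bit more concrete, here is a minimal sketch of what vector search boils down to when the data is small: score every stored embedding against the query and keep the best matches. Everything in it is an assumption for illustration (the `ROWS` table, the `top_k` helper, the three-dimensional embeddings); it is not how any particular database implements the feature, and real systems typically use approximate indexes rather than a full scan.

```python
import heapq
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as matching nothing
    return dot / (norm_a * norm_b)


def top_k(query, rows, k=2):
    """Exact (brute-force) search: score every row, return the k best ids."""
    scored = ((cosine_similarity(query, emb), doc_id) for doc_id, emb in rows)
    return [doc_id for _, doc_id in heapq.nlargest(k, scored)]


# Hypothetical toy table of (id, embedding) rows, standing in for a column
# of embeddings stored next to the rest of your data.
ROWS = [
    ("doc-a", [1.0, 0.0, 0.0]),
    ("doc-b", [0.9, 0.1, 0.0]),
    ("doc-c", [0.0, 1.0, 0.0]),
    ("doc-d", [0.0, 0.0, 1.0]),
]

print(top_k([1.0, 0.05, 0.0], ROWS, k=2))
```

A full scan like this costs time proportional to the number of rows times the embedding dimension, which is exactly where the scalability and latency questions above start to matter as the table grows.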

Sources:

  1. https://news.ycombinator.com/item?id=37747534
  2. https://nextword.substack.com/p/vector-database-is-not-a-separate

Top comments (1)

Gaurav Tarlok Kakkar

Let me know if I missed any other major release.