DEV Community

🚀 pgai Vectorizer: Automate AI Embeddings With One SQL Command in PostgreSQL

Avthar Sewrathan on October 29, 2024

Learn how to automate AI embedding creation using the PostgreSQL you know and love. Managing embedding workflows for AI systems like RAG, search a...

Ben Halpern • Oct 29 '24

Really excited about this challenge tomorrow

Avthar Sewrathan • Oct 30 '24

Thanks Ben, excited to see what you build in the OSS AI challenge!

Rob Benzo • Oct 29 '24

exciting!!!!

Avthar Sewrathan • Oct 30 '24

Thanks Rob!

Melody Mbewe • Oct 30 '24

Really exciting

Avthar Sewrathan • Oct 30 '24

Thanks Melody!

Sameer Kulkarni • Oct 30 '24

Very interesting and timely...
We are building a RAG-based app, and this article is certainly going to help us.

Avthar Sewrathan • Oct 30 '24

Great to know Sameer -- excited to hear what you think!

Fahmi Noor Fiqri • Nov 7 '24 • Edited

Does the pgai-vectorizer-worker image support Ollama? The documentation does not provide an example config using Ollama.

Avthar Sewrathan • Nov 8 '24

The pgai-vectorizer-worker does not support Ollama at this time, only OpenAI. But you can still use Ollama for generation models and OpenAI as the embedding model. Our team will add Ollama support very soon!

Fahmi Noor Fiqri • Nov 10 '24

Follow up question:

I have a vectorizer and timescale-ha in a Docker Compose setup, but I'm always getting a rate limit error from OpenAI. I have set the concurrency to 1, but it still happens.

I tried to embed just a single PDF file (the "Attention Is All You Need" paper) for a RAG project, but no matter how I configure the vectorizer, it always seems to hit the OpenAI rate limit (I'm on a Tier 1 account).

Is there a way to slow down the vectorizer? This is a bit of a dilemma: I can't use Ollama in the vectorizer, and when I use OpenAI, I always get a rate limit error.

Fahmi Noor Fiqri • Nov 8 '24

Thanks for the info!

Restless Coder • Nov 12 '24

So it still requires a paid OpenAI key?
Then why would we use it?