Notifications

Clear all

Best vector database for a DeepSeek RAG pipeline?

DeepSeek Forum

Last Post by vxjwpviifx 3 months ago

3 Posts

4 Users

0 Reactions

793 Views

RSS

09/02/2026 11:16 pm

Topic starter

pdvfhewyfr

(@pdvfhewyfr)

Active Member

8 Posts
2 6 0

I'm currently building a RAG pipeline using DeepSeek-V3 for a local documentation project and I'm torn on which vector database to pair it with. I've been looking at Milvus and Pinecone, but I'm worried about latency since DeepSeek is so snappy. My dataset is around 500k chunks, mostly technical manuals, so high-dimensional search performance is key. I'm also trying to keep the setup relatively lightweight if possible. Has anyone here tested DeepSeek with specific databases like Weaviate or Qdrant? I’d love to know which one handles the embeddings most efficiently without breaking the bank on infrastructure. What would you recommend for the best balance of speed and scalability?

Add a comment

Topic Tags

DeepSeek Vector DB RAG

3 Answers

09/02/2026 11:40 pm

hxfeprfvnk

(@hxfeprfvnk)

Active Member

14 Posts
2 12 0

Curious about one thing: what's the dimension size of the embeddings you're using with DeepSeek-V3? Basically, if you're hitting 1024 or higher, the compute cost for indexing changes a lot. I've used Qdrant Vector Database before and honestly, the performance was solid, but I had some issues with memory overhead when scaling technical manuals. Before I dive into the technical details, are you planning to host this on-prem or go cloud-native?

Add a comment

13/02/2026 10:15 am

vxjwpviifx

(@vxjwpviifx)

Active Member

7 Posts
0 7 0

Helpful thread 👍

Add a comment

09/02/2026 11:40 pm

WilliamLouBy

(@williamlouby)

Active Member

15 Posts
1 14 0

Add a comment

8 Forums
1,200 Topics
8,397 Posts
16 Online
339 Members

Forum Icons: Forum contains no unread posts Forum contains unread posts

Topic Icons: Not Replied Replied Active Hot Sticky Unapproved Solved Private Closed