What GPU should I grab to run DeepSeek-V3 at home without it taking forever to generate a sentence? I'm super stoked to try this model because the benchmarks look insane.
I saw some guys on Reddit saying you can squeeze a 4-bit version onto two RTX 3090s, but then I read another thread saying you need at least 80GB of VRAM to even load it properly, even with heavy quantization, so idk who to believe. I have about $3k saved up and I live in a tiny apartment in Seattle, so I can't really build a massive server rack. Is one 4090 enough if I use some crazy compression, or am I just dreaming here...
Unfortunately, a single NVIDIA GeForce RTX 4090 24GB GDDR6X just won't cut it for DeepSeek-V3. Even with aggressive quantization, the model's weights need far more than 24GB of VRAM.
Running DeepSeek-V3 on one NVIDIA GeForce RTX 4090 24GB GDDR6X isn't feasible if you want decent performance. With a $3k budget, a dual-GPU setup is the more realistic route.
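If you want to sanity-check the VRAM numbers people throw around, here's a rough back-of-the-envelope sketch. It assumes the published total of 671B parameters for DeepSeek-V3 (it's a mixture-of-experts model, so only about 37B are active per token, but all of the weights still have to live somewhere). It counts weights only and ignores KV cache, activations, and runtime overhead, so real usage will be higher. The function names are just made up for illustration.

```python
import math

# Assumption: DeepSeek-V3's published total parameter count (MoE, ~37B active per token).
TOTAL_PARAMS = 671e9


def weight_footprint_gb(n_params, bits_per_weight):
    """Memory for the weights alone, in decimal GB (no KV cache or overhead)."""
    return n_params * bits_per_weight / 8 / 1e9


def cards_needed(footprint_gb, vram_per_card_gb):
    """Minimum number of cards if every weight had to sit in VRAM."""
    return math.ceil(footprint_gb / vram_per_card_gb)


if __name__ == "__main__":
    for bits in (16, 8, 4):
        gb = weight_footprint_gb(TOTAL_PARAMS, bits)
        print(f"{bits}-bit: ~{gb:.1f} GB of weights, "
              f"at least {cards_needed(gb, 24)} x 24 GB cards")
```

By this estimate even a 4-bit build is roughly 335 GB of weights, so a small number of 24GB cards can only hold a slice of the model, and the rest would have to be offloaded to system RAM or disk at a real cost in speed.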
This is exactly what I needed to hear. You're a lifesaver, honestly.
I definitely agree that a single card isn't enough, but you really gotta check your wiring. I've been very satisfied with my setup lately, but those older Seattle buildings sometimes can't handle the power draw of multiple cards on one circuit. Quick question tho, do you know if your apartment has updated electrical? You don't want to be tripping breakers every time you start a run...
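For the circuit question, here's a quick sketch of the arithmetic. The defaults are all assumptions: a 15A, 120V circuit, the common guideline of keeping continuous loads to about 80% of the breaker rating, 350W per RTX 3090 (its rated board power), and a guessed 250W for the rest of the system. Swap in your own numbers, and remember that anything else plugged into the same circuit counts too.

```python
def circuit_headroom_watts(breaker_amps=15, volts=120, continuous_factor=0.8):
    """Usable continuous wattage on one circuit (assumed 80% guideline)."""
    return breaker_amps * volts * continuous_factor


def rig_draw_watts(gpu_watts, n_gpus, rest_of_system_watts):
    """Rough sustained draw: GPUs at rated power plus everything else in the box."""
    return gpu_watts * n_gpus + rest_of_system_watts


if __name__ == "__main__":
    budget = circuit_headroom_watts()
    # Assumptions: 350 W per card, 250 W guessed for CPU, drives, fans, and PSU losses.
    draw = rig_draw_watts(gpu_watts=350, n_gpus=2, rest_of_system_watts=250)
    print(f"circuit budget: {budget:.0f} W, estimated rig draw: {draw} W, "
          f"margin: {budget - draw:.0f} W")
```

GPUs can spike above their rated power for short bursts, so treat the margin as optimistic.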
🙌