What is the best GPU for running DeepSeek models locally?

Question

I am so incredibly hyped about these new DeepSeek models that everyone is talking about lately! I saw a few demos of the coding version and it looks like it could literally change my life for this small app project i'm trying to build. I really want to run it locally on my own computer because i'm a bit of a privacy freak and I dont want my data being sent off to some random server somewhere. Plus I think it would just be cool to have it working right there on my desk.

The thing is... I have absolutely no clue what I'm doing when it comes to computer parts. I've got about $900 saved up and I'm planning to head to the store this weekend to pick something up but looking at the GPU aisle makes my head spin. Like what even is VRAM?? Is that different from the normal memory in the computer? I see people mentioning the RTX 3060 or maybe a 4070 but then others say you need like 24 gigs of something to make the big models work and I'm just totally lost. I really dont want to buy the wrong thing and end up with a super expensive paperweight that cant even run the software. Sorry if this is a total beginner question but I'm just starting out and its all very overwhelming. What is actually the best GPU I should be looking for to run DeepSeek at home without breaking the bank?

YorkshireTeaGold · Accepted Answer

To give an accurate recommendation, which specific DeepSeek parameter size are you looking to run? The memory requirements scale linearly with the model size. A quick tip for your $900 budget is to prioritize the price-per-GB of VRAM. While the NVIDIA GeForce RTX 4060 Ti 16GB GDDR6 is a decent new entry, a used NVIDIA GeForce RTX 3090 24GB GDDR6X often fits your budget and provides the capacity needed for larger models.

CXrMrgor · Answer

In my experience, VRAM is the single most important factor for running DeepSeek locally. It is dedicated memory on the graphics card, which is totally different from your regular system RAM. Over the years, I have found that 16GB is the absolute minimum sweet spot for these coding models to run at a usable speed. Without enough VRAM, your computer will basically just crawl. With a $900 budget, you should probably get the ASUS GeForce RTX 4070 Ti Super 16GB GDDR6X. It fits your price range and handles the quantized versions of DeepSeek-Coder really well. If you want to spend less, the MSI Ventus GeForce RTX 3060 12GB GDDR6 is a decent entry point tho you will definitely notice the speed drop. Just make sure you dont buy an 8GB card... it wont work for what you want to do. Honestly, VRAM is king here.

ChampionshipHope · Answer

Re: "In my experience, VRAM is the single most..." - absolutely, but unfortunately capacity isnt the only thing that matters for reliability. I learned that lesson the hard way with the one I got last year.

My previous setup had plenty of memory but the cooling was just not as good as expected.

I found that running deepseek for hours literally cooked the components because i didnt account for the sustained load.

The system would just shut down right when the model was about to finish a long block of code. It was pretty disappointing tbh. Ngl, i focused so much on the specs that I ignored the power requirements. You really gotta make sure your power supply can handle the spikes because these cards pull way more than the advertised tdp when they're actually working... i learned my lesson the hard way.