I'm looking to set up a dedicated rig to run DeepSeek models locally for my coding projects. I'm specifically curious about the VRAM requirements for the 67B version versus the newer R1 models. Should I prioritize multiple 3090s or go for a high-end Mac? What's the most cost-effective hardware setup for smooth inference right now?
For your situation, I honestly think the hardware landscape for these big models is kinda frustrating right now. I've spent thousands trying to get smooth inference on the full R1, and unfortunately, it's just not as good as expected on consumer gear because of the sheer weight of the parameters.
Just saw this and honestly dont waste ur cash. I spent way too much on gear before realizing 4-bit quants are the way to go for R1.
I have been obsessing over this for the last few weeks while trying to figure out my own setup. Tbh the market is wild right now and it feels like there is a huge divide between the two main ecosystems.
Works great for me
Yo, ngl I'm still learning the ropes with DeepSeek but I've built rigs for years. I think those R1 models are way heavier on VRAM than the 67B ones. IIRC you're gonna need massive memory for smooth inference.
Late to the party but I've been running these big quants for years. Ngl, if you want to run the heavy R1 stuff without the headache of heat and power spikes from multiple consumer cards, check out the NVIDIA RTX A6000 48GB. In my experience, single-card stability wins every time when youre deep in a coding session. That 48GB VRAM pool handles the 67B version perfectly and can squeeze in decent quants of the R1 models without breaking a sweat. If you want a quieter life and need to run the larger quants, look for a refurbished Apple Mac Studio M1 Ultra 128GB RAM. I've tried many setups over the years and for VRAM capacity per dollar, Apples unified memory is basically the easiest route for local inference right now. Just keep in mind NVIDIA is still king if you ever want to fine-tune your own models. But for just running DeepSeek while you work? The Mac is dead silent and just works.