so frustrated with the setup right now. trying to host deepseek v4 pro for a client in berlin and i only got till monday to finish. keep looking at runpod cause the prices look okay but their gpu availability is literally zero whenever i check—always out of stock. then there is together ai which seems faster but im worried about the cost scaling past my 300 limit. i really need something stable.
torn between just waiting for a lambda labs spot or going with together ai even though im skeptical. which one is actually the better bet for someone on a budget who cant have the api crashing every five minutes?
Building on the earlier suggestion, I'd be careful relying on serverless for a client, honestly. I suggest FluidStack NVIDIA A100 80GB dedicated nodes for stability to meet that Monday deadline.
Yo! If youre hitting a wall with RunPod stock, you gotta check out Together AI again because their serverless endpoints are absolutely amazing for keeping costs down! I used them for a big project recently and the latency was fantastic even when hitting it from Berlin. The speed is just incredible... Here are a couple quick tips to stay under that 300 dollar limit: