So Ive been messing around with the DeepSeek API lately for this little side project Im doing for a local bakery nearby. Im trying to help them automate their inventory tracking and order summaries but GPT-4 was just eating through my tiny $20 budget way too fast so I switched over to DeepSeek since the pricing is way better. It seems pretty solid for the most part but honestly Im struggling to get the output format consistent and sometimes it just ignores my negative constraints entirely.
I did some digging online and saw a few people saying that DeepSeek handles instructions better if you use very specific XML tags like tags for instructions or examples but then I read another thread on a dev forum saying that the newer models actually get confused if you over-engineer the prompts too much. Some people swear by few-shot prompting but when I try it the model sometimes just starts hallucinating more examples instead of following the logic for my actual data. Im also not sure if I should be using the R1 reasoning style prompts or just stick to the standard V3 stuff... like does it really need me to tell it to think step by step if its already a reasoning model? It feels kinda redundant and Im worried Im just wasting tokens at this point.
What are the actually effective prompt engineering techniques you guys have found specifically for DeepSeek to keep it from drifting or losing the plot mid-response?
DeepSeek is a solid choice for local business stuff since the cost per token is basically negligible compared to OpenAI. I have been using the DeepSeek-V3 API for a few months now and found that it likes structure but hates fluff. To keep your inventory summaries consistent, you should try these specific tweaks:
Re: "DeepSeek is a solid choice for local business..."
Gonna try this over the weekend. Will report back if it works!
Can confirm this works. Did the same thing on mine and its been solid ever since.