looking at Claude cuz folks say the writing is more human but then some swear by ChatGPT for the logic. My logic was to use both but that gets pricey on a student budget and my history papers due next Friday. Anyone know which actually handles citations better without just making stuff up?
Over the years, I've seen these models evolve from gibberish to actually being useful for history papers. In my experience, if you're writing an essay that needs to sound like a human actually thought about it, Anthropic Claude 3.5 Sonnet is the clear winner. I remember trying to get the old versions of ChatGPT to write a critique on the French Revolution and it just kept using the same repetitive phrases that every professor flags immediately. Claude actually understands nuance and wont make you sound like a brochure. Here is the reality check on your concerns based on my trials:
Unfortunately, I have had issues with both models hallucinating sources, which is a major letdown for history papers. It is not as good as I expected for the monthly cost. Instead of a standard chatbot, try Perplexity AI Pro. It cites real-time web results so you dont have to guess if a source is real. TL;DR: Use Perplexity AI Free Version for reliable citations on a budget.
Honestly, I have had issues with almost every big name AI when it comes to history. It is unfortunately not as good as expected for the price they charge. Relying on them for citations is basically playing Russian roulette with your grades because they hallucinate so confidently. For a student budget, I wouldnt waste the cash on multiple subs.
Just caught this thread and wanted to chime in because I've spent way too much time testing these for my own research. Last semester I was working on a deep dive into the Meiji Restoration and I almost got burned by a model that hallucinated three different academic journals. It looked so real that I nearly cited them. If you're worried about accuracy and want to save some cash, Perplexity AI Pro is actually a solid bet for history papers because it focuses on search first, writing second. I've found it much more reliable for a few reasons: