this post was submitted on 28 Aug 2024
5 points (69.2% liked)


I wanted to extract some crime statistics broken down by type of crime and by population group, all normalized by population size, of course. I got a nice set of tables summarizing the data for each year I requested.
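For anyone who wants to spot-check tables like these against a primary source, here is a minimal sketch of the per-capita normalization, assuming the usual "incidents per 100,000 residents" convention. The function name, the group labels, and all numbers are made-up placeholders, not real statistics.

```python
def rate_per_100k(incidents: int, population: int) -> float:
    """Return incidents per 100,000 residents."""
    if population <= 0:
        raise ValueError("population must be positive")
    # Multiply before dividing to keep round-number cases exact.
    return incidents * 100_000 / population


# Placeholder inputs for illustration only (NOT real data).
counts = {"group_a": 150, "group_b": 30}
populations = {"group_a": 1_000_000, "group_b": 100_000}

# Normalize each group's raw count by that group's population.
rates = {group: rate_per_100k(counts[group], populations[group]) for group in counts}

for group, rate in rates.items():
    print(f"{group}: {rate:.1f} per 100,000")
```

If the raw counts and population figures in the official source produce the same rates as the LLM's tables, that is at least some evidence the numbers were not invented.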

When I shared these summaries, I was told they are entirely unreliable due to hallucinations. So my question to you is: how common a problem is this?

I compared results from ChatGPT (GPT-4), Copilot, and Grok, and they are the same. Gemini says the data is unavailable, btw :)

So are LLMs reliable for research like that?

[–] ViaFedi@lemmy.ml 0 points 2 months ago (1 children)

Solutions exist where you give the LLM a bunch of files, e.g. PDFs, which it will then base its knowledge on exclusively.

[–] jet@hackertalks.com 6 points 2 months ago (1 children)

It's still a probabilistic token generator; you're just training it on your local data. Hallucinations will absolutely happen.

[–] slacktoid@lemmy.ml 0 points 2 months ago* (last edited 2 months ago)

This isn't training; it's called a RAG (retrieval-augmented generation) workflow, and there is no training step per se.
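To make the difference concrete, here is a toy sketch of the retrieval half of a RAG workflow. Everything in it is an assumption for illustration: the function names are hypothetical, the "documents" are placeholder strings, and the word-overlap score stands in for the embedding-based vector search that real systems typically use. No model weights are touched; the retrieved text is just pasted into the prompt.

```python
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def score(query: str, chunk: str) -> int:
    """Toy relevance score: how often the query's words occur in the chunk."""
    query_words = set(tokenize(query))
    chunk_counts = Counter(tokenize(chunk))
    return sum(chunk_counts[word] for word in query_words)


def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    """Return the k highest-scoring chunks for the query."""
    ranked = sorted(chunks, key=lambda c: score(query, c), reverse=True)
    return ranked[:k]


def build_prompt(query: str, chunks: list[str], k: int = 2) -> str:
    """Assemble the text that would be sent to the LLM (no model call here)."""
    context = "\n".join(f"- {c}" for c in retrieve(query, chunks, k))
    return (
        "Answer using only the context below. "
        "If the answer is not there, say so.\n"
        f"Context:\n{context}\n"
        f"Question: {query}"
    )


# Placeholder chunks standing in for text extracted from your PDFs.
chunks = [
    "Table 3 lists burglary counts by year.",
    "The appendix describes the survey method.",
    "Population estimates by year are in Table 1.",
]
prompt = build_prompt("burglary counts by year", chunks, k=2)
print(prompt)
```

The model still generates the answer token by token from that prompt, so it can misread or ignore the context. Retrieval narrows what the model sees; it does not guarantee the output matches the source.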