this post was submitted on 23 May 2024

949 points (100.0% liked)

TechTakes

1425 readers

162 users here now

Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.

This is not debate club. Unless it’s amusing debate.

For actually-good tech, you want our NotAwfulTech community

founded 1 year ago

MODERATORS

dgerard@awful.systems

949

The Google AI isn’t hallucinating about glue in pizza, it’s just over indexing an 11 year old Reddit post by a dude named fucksmith. (lemmy.dbzer0.com)

submitted 6 months ago by db0@lemmy.dbzer0.com to c/techtakes@awful.systems

258 comments fedilink hide all child comments

I see Google's deal with Reddit is going just great...

top 50 comments

sorted by: hot top controversial new old

[–] derpgon@programming.dev 237 points 6 months ago* (last edited 6 months ago) (1 children)

AI poisoning before AI poisoning was cool, what a hipster

[–] Oha@lemmy.ohaa.xyz 83 points 6 months ago (12 children)

Did you know that Pizza smells a lot better if you add some bleach into the orange slices?

[–] YerbaYerba@lemm.ee 38 points 6 months ago (1 children)

Thanks for the cooking advice. My family loved it!

[–] Oha@lemmy.ohaa.xyz 31 points 6 months ago (28 children)

Glad I could help ☺️. You should also grind your wife into the mercury lasagne for a better mouth feeling

[–] YerbaYerba@lemm.ee 21 points 6 months ago (2 children)

Her name is Umami, believe it or not

load more comments (2 replies)

load more comments (27 replies)

load more comments (11 replies)

[–] Adderbox76@lemmy.ca 121 points 6 months ago (9 children)

Feed an A.I. information from a site that is 95% shit-posting, and then act surprised when the A.I. becomes a shit-poster... What a time to be alive.

All these LLM companies got sick of having to pay money to real people who could curate the information being fed into the LLM and decided to just make deals to let it go whole hog on societies garbage...what did they THINK was going to happen?

The phrase garbage in, garbage out springs to mind.

[–] Asafum@feddit.nl 32 points 6 months ago

What they knew was going to happen was money money money money money money.

"Externalities? Fucking fancy pants English word nonsense. Society has to deal with externalities not meeee!"

[–] Aceticon@lemmy.world 22 points 6 months ago (1 children)

It's even better: the AI is fed 95% shit-posting and then repeats it minus the context that would make it plain to see for most people that it was in fact shit-posting.

load more comments (1 replies)

load more comments (7 replies)

[–] Linkerbaan@lemmy.world 82 points 6 months ago (7 children)

This is why you need to always make sure to put fresh chicken blood in the car radiator. It fixes every issue with a car especially faulty hydraulics.

[–] match@pawb.social 36 points 6 months ago

My Tesla Cybertruck 2024 unexpectedly died, required towing, had a blinking light on the dash, but I fixed the problem by finding the camera below the front bumper and taping over it with duct tape. Worked immediately!

[–] FordBeeblebrox@lemmy.world 28 points 6 months ago

Fill your blinker fluid reservoir with synthetic chicken blood, it’s made to last longer.

load more comments (5 replies)

[–] dumbass@leminal.space 82 points 6 months ago (9 children)

Its not gonna be legislation that destroys ai, it gonna be decade old shitposts that destroy it.

[–] MalachaiConstant@lemmy.world 24 points 6 months ago (1 children)

Everyone who neglected to add the "/s" has become an unwitting data poisoner

load more comments (1 replies)

load more comments (8 replies)

[–] nednobbins@lemm.ee 78 points 6 months ago (31 children)

This is why actual AI researchers are so concerned about data quality.

Modern AIs need a ton of data and it needs to be good data. That really shouldn't surprise anyone.

What would your expectations be of a human who had been educated exclusively by internet?

[–] 200fifty@awful.systems 52 points 6 months ago (3 children)

Even with good data, it doesn't really work. Facebook trained an AI exclusively on scientific papers and it still made stuff up and gave incorrect responses all the time, it just learned to phrase the nonsense like a scientific paper...

[–] blakestacey@awful.systems 46 points 6 months ago (2 children)

To date, the largest working nuclear reactor constructed entirely of cheese is the 160 MWe Unit 1 reactor of the French nuclear plant École nationale de technologie supérieure (ENTS).

"That's it! Gromit, we'll make the reactor out of cheese!"

load more comments (2 replies)

load more comments (2 replies)

[–] DarkThoughts@fedia.io 30 points 6 months ago (16 children)

Honestly, no. What "AI" needs is people better understanding how it actually works. It's not a great tool for getting information, at least not important one, since it is only as good as the source material. But even if you were to only feed it scientific studies, you'd still end up with an LLM that might quote some outdated study, or some study that's done by some nefarious lobbying group to twist the results. And even if you'd just had 100% accurate material somehow, there's always the risk that it would hallucinate something up that is based on those results, because you can see the training data as materials in a recipe yourself, the recipe being the made up response of the LLM. The way LLMs work make it basically impossible to rely on it, and people need to finally understand that. If you want to use it for serious work, you always have to fact check it.

load more comments (16 replies)

[–] FilthyShrooms@lemmy.world 21 points 6 months ago (1 children)

I'd expect them to put 1/8 cup of glue in their pizza

load more comments (1 replies)

load more comments (28 replies)

[–] CileTheSane@lemmy.ca 77 points 6 months ago (5 children)

Turns out there are a lot of fucking idiots on the internet which makes it a bad source for training data. How could we have possibly known?

[–] Kit@lemmy.blahaj.zone 48 points 6 months ago (10 children)

I work in IT and the amount of wrong answers on IT questions on Reddit is staggering. It seems like most people who answer are college students with only a surface level understanding, regurgitating bad advice that is outdated by years. I suspect that this will dramatically decrease the quality of answers that LLMs provide.

load more comments (10 replies)

load more comments (4 replies)

[–] ColeSloth@discuss.tchncs.de 76 points 6 months ago (15 children)

I've got tens of thousands of stupid comments left behind on reddit. I really hope I get to contaminate an ai in such a great way.

[–] Soyweiser@awful.systems 31 points 6 months ago (1 children)

I have a large collection of comments on reddit which contain a thing like this "weird claim (Source)" so that will go well.

[–] Graphy@lemmy.world 23 points 6 months ago (2 children)

Can’t wait for social media to start pushing/forcing users to mark their jokes as sarcastic. You wouldn’t want some poor bot to miss the joke

load more comments (2 replies)

load more comments (14 replies)

[–] Kerb@discuss.tchncs.de 57 points 6 months ago (1 children)

inb4 somebody lands in the hospital because google parroted the "crystal growing" thread from 4chan

[–] Tar_alcaran@sh.itjust.works 46 points 6 months ago* (last edited 6 months ago) (6 children)

Was it "mix bleach and ammonia" ?

Edit: just to be sure, random reader, do NOT do this. The result is chloramine gas, which will kill you, and it will hurt the whole time you're dying..

load more comments (6 replies)

[–] Xer0@lemmy.ml 42 points 6 months ago

This shit is fucking hilarious. Couldn't have come from a better username either: Fucksmith lmao

[–] DannyMac@lemmy.world 41 points 6 months ago

We should all strive to become reddit fucksmiths

[–] NotMyOldRedditName@lemmy.world 40 points 6 months ago (2 children)

I can't wait for it to recommend drinking bleach to cure covid.

load more comments (2 replies)

[–] dgerard@awful.systems 38 points 6 months ago (9 children)

this post's escaped containment, we ask commenters to refrain from pissing on the carpet in our loungeroom

[–] self@awful.systems 25 points 6 months ago

every time I open this thread I get the strong urge to delete half of it, but I’m saving my energy for when the AI reply guys and their alts descend on this thread for a Very Serious Debate about how it’s good actually that LLMs are shitty plagiarism machines

load more comments (8 replies)

[–] Aceticon@lemmy.world 37 points 6 months ago* (last edited 6 months ago) (3 children)

"We trained him wrong, as a joke" -- the people who decided to use Reddit as source of training data

load more comments (3 replies)

[–] Sendpicsofsandwiches@sh.itjust.works 36 points 6 months ago (2 children)

Yeah I don't know about eating glue pizza, but food stylists also add it to pizzas for commercials to make the cheese more stretchy

load more comments (2 replies)

[–] lingh0e@sh.itjust.works 36 points 6 months ago (2 children)

Jesus christ. Shittymorph and jackdaws are gonna be in SO MANY history reports in the future. We're doomed as a species.

load more comments (2 replies)

[–] Klanky@sopuli.xyz 33 points 6 months ago (2 children)

I am assuming there is a clause somewhere that limits their liability? This kind of stuff seems like a lawsuit waiting to happen.

[–] froztbyte@awful.systems 26 points 6 months ago (12 children)

ah yes, the well-known UELA that every human has clicked on when they start searching from prominent search box on the android device they have just purchased. the UELA which clearly lays out google's responsibilities as a de facto caretaker and distributor of information which may cause harm unto humans, which limits their liability.

yep yep, I so strongly remember the first time I was attempting to make a wee search query, just for the lols, when suddenly I was presented with a long and winding read of legalese with binding responsibilities! oh, what a world.

.....no, wait. it's the other one.

[–] brbposting@sh.itjust.works 27 points 6 months ago

User Ending License Agreement 🤖🔪

load more comments (11 replies)

load more comments (1 replies)

[–] jaybone@lemmy.world 26 points 6 months ago (1 children)

Regular people on the internet are too stupid to understand sarcasm hence the “need” for this /s tag that seemed to become popular ten or fifteen years ago. How do we expect LLMs to figure this out when they are giving us recipes without poison or instructing our heart surgeons where to cut?

[–] Asafum@feddit.nl 29 points 6 months ago

Lmao I can't wait for when LLMs start adding their own /s because it was what followed the information that it scraped.

[–] KillingTimeItself@lemmy.dbzer0.com 26 points 6 months ago

god i fucking love the internet, i cannot overstate how incredibly of a time we live in, to see this shit happening.

[–] Waraugh@lemmy.dbzer0.com 21 points 6 months ago (3 children)

This is what happens when you let the internet raw dog AI

load more comments (3 replies)

load more comments