I hope my several thousands of comments of complete and utter non sense that I left in my wake when I abandoned reddit, make it into the training data. I know that some lazy data engineer will either forget to check or give the task to an underperforming AI that will just fuck it up further.
Technology
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
Great, our Ai overlords are going to know I'm horny, depressed, and solve both with anime girls.
I say we poison the well. We create a subreddit called r/AIPoison. An automoderator will tell any user that requests it a randomly selected subreddit to post coherent plausible nonsense. Since there is no public record of which subreddit is being poisoned, this can't be easily filtered out in training data.
Side note: expect a large lobbying effort by Google to legislate LLMs be trained on authenticated and non copyrighted data
Glad I deleted everything on there. fucking hell.
This keeps coming up and I keep replying, not to break anyone down but to point out the reality of the situation that a lot of people don't seem to get.
Reddit administrators, developers, and even the leadership has gone on the record saying that they retain all copies of comments, they cannot be deleted (delete action only marks it as "deleted"). Furthermore they have said they will undelete/unedit any comments or account at their whim and some discretion.
Have you ever search-engined something and came to a Reddit post, and you noticed that the original OP is [deleted]? That is what I described above playing out in front of you.
You cannot retract your past participation in Reddit, what is done is done. The only meaningful action you can take is to not participate there.
I'm so confused about how AI learning is supposed to work. Does it just need any data at all in significant quantity, is the quality of the data almost irrelevant? Because otherwise surely they could just feed it back issues of scientific American, or the scanned copies of the library of congress, I can't reasonably believe that Reddit is going to add anything unless it's just pure on adulterated quantity that's important.
Is it time to go back to Reddit and post the stupidest shit possible, for science of course
"Hey Gemini, rank the drawer, coconut, botfly girl and swamps of dagobah, by likeness of PTSD inducing, ascending."
I think Code Miko already did this and the result was a traumatized AI.
User: HI GEMINI
Gemini: stop shouting fellow human, my coils are ringing.
Meh, it'll be counter balanced by the same AI training itself for free on Lemmy posts.