this post was submitted on 18 Jun 2023
164 points (100.0% liked)

Reddit Migration

37 readers
2 users here now

### About Community Tracking and helping #redditmigration to Kbin and the Fediverse. Say hello to the decentralized and open future. To see latest reeddit blackout info, see here: https://reddark.untone.uk/

founded 1 year ago
 

Thousands of moderators overseeing the site’s subreddits are on strike. It’s a wrinkle in Reddit’s plan to go public, and a sign that plan is premature, columnist Anita Ramaswamy writes.

you are viewing a single comment's thread
view the rest of the comments
[–] earthling@kbin.social 7 points 1 year ago* (last edited 1 year ago) (1 children)

The ongoing strike, spurred by Huffman’s plan to charge fees to third-party apps that serve up Reddit content, was supposed to last for 48 hours.

Not just charge fees... Exorbitant fees. Outrageous fees.

If Huffman wanted to target these much higher costs to LLMs, they could have instituted an approval process for 3PAs which got charged sane API fees while they charge much more for LLMs. I'm no dev but I think they could tell the difference between the two by just analyzing the API traffic.

But they aren't doing that. Maybe LLMs were the primary target but they sure aren't even trying to keep 3PAs around.

[–] CookieJarObserver@feddit.de 2 points 1 year ago (1 children)

Ai training data gets gathered with scrapers... It literally doesn't need the api. And they only need everything once. And maybe update it every year or so...

Also the whole nsfw thing...

This is just and only to kill third party clients.

And spez is close to someone that works for "open" ai... So a even more clownshow move.

[–] earthling@kbin.social 1 points 1 year ago (1 children)

Huh.. I would have thought they'd use the API when available but I honestly know nothing about it. Wouldn't gathering data via API provide more structured data thereby making it easier to feed into their models?

[–] CookieJarObserver@feddit.de 2 points 1 year ago (1 children)

Often its easier to just scrape everything rather than making a whole code to pull api requests and put them into a database and sort them while doing so. These companies scrape the entire internet, they don't have time or necessarie to use api, they don't need permanent access to a two way communication, they just need the data.

[–] earthling@kbin.social 1 points 1 year ago