this post was submitted on 17 Feb 2024
1088 points (98.7% liked)

Technology

59575 readers
3259 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] mtchristo@lemm.ee 35 points 9 months ago (4 children)

I bet they can scrape Lemmy content for free then. There are no legal mechanisms to prevent them from doing so.

[–] FiskFisk33@startrek.website 25 points 9 months ago (1 children)

I rather my data I've chosen to make public is free and accessible to all, than it being sold to the highest bidder.

[–] baseless_discourse@mander.xyz 10 points 9 months ago* (last edited 9 months ago) (2 children)

With that being said, I am not pleased that my content is packaged into a proprietary AI, and sold for money.

I think there are ways to opt-out of AI collection, at least for big companies. I wonder if it is implemented in Lemmy-UI and/or terms and conditions.

[–] FiskFisk33@startrek.website 7 points 9 months ago

on the other hand, if there's troves of free data, that takes the upper hand from the companies that can afford paying for it, and gives open source a much better chance at staying competitive.

[–] General_Effort@lemmy.world 4 points 9 months ago

You opt-out so that there is less free training data, making Reddit's data all the more valuable. I'm sure spez will be thankful.

[–] Trollception@lemmy.world 23 points 9 months ago (1 children)

Yes but i think reddit is many times more valuable than Lemmy. I just haven't found the same level of very specific subreddits that have lots and lots of activity. Most of the traffic here is memes, politics, news and Linux lovin. On reddit if I needed to find a community about my local town it's no problem and there are tens or hundreds of daily posts. The same community does exist on Lemmy but the last post was 6 months ago.

[–] Link@rentadrunk.org 1 points 9 months ago

I completely agree. There are lots of communities on Reddit that are missing on Lemmy. Have you tried posting your community? It might entice people to participate!

[–] SpaceCowboy@lemmy.ca 7 points 9 months ago

Well there's copyright law. There's already lawsuits happening so we'll have to see how this shakes out.

But even if the AI companies lose the lawsuits, I think it's likely they'll still have access to content where the T&C of the site says they're allowed to sell the data.

[–] Wappen@lemmy.world 4 points 9 months ago (1 children)

Hm but don't you automatically own the stuff you create yourself, as long as you don't consent to giving it away? I don't know the terms and conditions of my Lemmy instance though.

[–] dgmib@lemmy.world 6 points 9 months ago (1 children)

When was the last time anyone read the T&Cs of a social media website?

They basically all have a clause to the effect that you grant them a permanent, irrevocable license do whatever they want with anything you post.

You might still own the copyright to any content you produce, but by posting you’re granting them permission to do basically anything with it, including reselling it.

[–] Wappen@lemmy.world 1 points 9 months ago

Yeah I know but what about Lemmy instances?