this post was submitted on 29 Jun 2023
1 points (100.0% liked)

Technology

72263 readers
3172 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 

The lawsuit alleges OpenAI crawled the web to amass huge amounts of data without people's permission.

all 9 comments
sorted by: hot top controversial new old
[–] Protegee9850@lemmy.world 1 points 2 years ago

Scraping is protected. GPT and the line are more akin to fair use machines than plagiarism machines. This is a lot of hot air to go nowhere. Rage bait

[–] 44swagnum@lemmy.world 1 points 2 years ago* (last edited 2 years ago) (1 children)

"We have to protect the children"

[–] Protegee9850@lemmy.world 1 points 2 years ago

The worst rush to legislation is done in the name of stopping terrorists and saving the children. Always.

[–] Hick@lemmy.world 0 points 2 years ago (2 children)

Scraping social media posts and reddit posts doesn’t sound like stealing, they’re public posts.

[–] SamB@lemmy.world 0 points 2 years ago (1 children)

I doubt it’s only about some Reddit posts. The scrapping was done on the whole web, capturing everything it could. So besides stealing data and presenting it as its own, it seems to have collected some even more problematic data which wasn’t properly protected.

[–] tallwookie@lemmy.world 0 points 2 years ago

if it was unsecured it's basically public. whomever put that data on a publicly accessible server is at fault