Technology

72263 readers

3172 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

L4s@hackingne.ws

A lawsuit claims OpenAI stole 'massive amounts of personal data,' including medical records and information about children, to train ChatGPT (www.businessinsider.com)

submitted 2 years ago by L4s@lemmy.world to c/technology@lemmy.world

7 comments fedilink hide all child comments

The lawsuit alleges OpenAI crawled the web to amass huge amounts of data without people's permission.

all 9 comments

sorted by: hot top controversial new old

[–] Protegee9850@lemmy.world 1 points 2 years ago

Scraping is protected. GPT and the line are more akin to fair use machines than plagiarism machines. This is a lot of hot air to go nowhere. Rage bait

[–] 44swagnum@lemmy.world 1 points 2 years ago* (last edited 2 years ago) (1 children)

"We have to protect the children"

[–] Protegee9850@lemmy.world 1 points 2 years ago

The worst rush to legislation is done in the name of stopping terrorists and saving the children. Always.

[–] Hick@lemmy.world 0 points 2 years ago (2 children)

Scraping social media posts and reddit posts doesn’t sound like stealing, they’re public posts.

[–] SamB@lemmy.world 0 points 2 years ago (1 children)

I doubt it’s only about some Reddit posts. The scrapping was done on the whole web, capturing everything it could. So besides stealing data and presenting it as its own, it seems to have collected some even more problematic data which wasn’t properly protected.

[–] tallwookie@lemmy.world 0 points 2 years ago

if it was unsecured it's basically public. whomever put that data on a publicly accessible server is at fault