this post was submitted on 17 Oct 2025

116 points (94.6% liked)

Showerthoughts

37958 readers

1299 users here now

A "Showerthought" is a simple term used to describe the thoughts that pop into your head while you're doing everyday things like taking a shower, driving, or just daydreaming. The most popular seem to be lighthearted clever little truths, hidden in daily life.

Here are some examples to inspire your own showerthoughts:

Rules

All posts must be showerthoughts
The entire showerthought must be in the title
No politics
- If your topic is in a grey area, please phrase it to emphasize the fascinating aspects, not the dramatic aspects. You can do this by avoiding overly politicized terms such as "capitalism" and "communism". If you must make comparisons, you can say something is different without saying something is better/worse.
- A good place for politics is c/politicaldiscussion
Posts must be original/unique
Adhere to Lemmy's Code of Conduct and the TOS

If you made it this far, showerthoughts is accepting new mods. This community is generally tame so its not a lot of work, but having a few more mods would help reports get addressed a little sooner.

Whats it like to be a mod? Reports just show up as messages in your Lemmy inbox, and if a different mod has already addressed the report, the message goes away and you never worry about it.

founded 2 years ago

MODERATORS

vatlark@lemmy.world

forkball@lemmy.world

SuperEars@lemmy.world

116

The fact that users are encouraged to include text descriptions with media content makes it perfect training data for AI. (lemmy.world)

submitted 2 weeks ago by phtheven@lemmy.world to c/showerthoughts@lemmy.world

14 comments fedilink hide all child comments

Lemmy hates AI.

I'm fully supportive of the accessibility for persons with disabilities, to be clear. It's ironic though. Does Lemmy's open source code make it easier for bots to scrape it?

top 14 comments

sorted by: hot top controversial new old

[–] FriendOfDeSoto@startrek.website 46 points 2 weeks ago (1 children)

It's not perfect training data. Being encouraged to add alt text and actually doing it are two different things. Writing good alt text is another matter all together. And anything that's on the internet is training data whether people want it to be or not. The only difference is ethical whether the scraper accepts and respects a version of robots dot txt, i.e. "do not scrape," that communicates the training data's holders' intentions. And if they torrent books you can guess how respectful they are.

[–] errer@lemmy.world 3 points 2 weeks ago

At this point all the imagery data they need is already out there. Not like your picture of a cat you post to Lemmy is gonna help these companies make a better model.

[–] Rhaedas@fedia.io 32 points 2 weeks ago

Web content should always strive to be more accessible. Things like AI should be better regulated instead. I think we've missed the boat on a big part of that though, should have legally clamped down on activities a long time ago.

[–] ramble81@lemmy.zip 29 points 2 weeks ago (1 children)

Something I mention every time this comes up. AI doesn’t need to scrape Lemmy. All someone has to do is set up their own federated instance and AcitivityPub will wrap it up in a nice JSON format for them to parse however they want. And there’s fundamentally nothing a person can do about it.

It’s just best to realize anything and everything on Lemmy is publicly available for any use, good or bad.

[–] MarriedCavelady50@lemmy.ml 3 points 1 week ago

This.

Every nation state with internet surveillance spun up a server to collect and archive ActivityPub posts.

In some ways it’s more freeing. No need to worry what a corporation will see fit to disclose to a gov agency.

[–] dontsayaword@piefed.social 21 points 2 weeks ago (1 children)

Yes, but that doesn't mean we should eschew accessibility

[–] morto@piefed.social 3 points 2 weeks ago

We can just use an AI to describe the images /s

[–] Blaster_M@lemmy.world 20 points 2 weeks ago (2 children)

alt text is meant for screen readers, so blind people can hear the image description.

[–] tourist@lemmy.world 4 points 2 weeks ago* (last edited 2 weeks ago)

Also, the alt texts vary in descriptiveness for that exact purpose. They're meant to be useful for humans, not for training data.

What would a blind person rather have as the alt text:

(there are no photos here, for the blind people listening)

A cute Alsatian puppy looking into the camera with a dog toy in its mouth

A 14 week old black/brown dog sitting on a tiled floor with a synthetic-rubber cuboid-cylindrical-shaped, blue-green-gradient chew toy in its mouth with its eyes and nose poised at a 30° angle towards the photographer's origin. Each tile on the floor is approximately 1.47m^2 and are a pearlescent shade of off-white. There is an unidentifiable black speck on the first tile in the top left quadrant of the image. The cameraman's fat finger is covering 1.97% of the bottom right quadrant. Focal length is set to 100mm. Exposure settings appear to be increased. The dog's genitals are not visible.

[–] ethaver@kbin.earth 4 points 2 weeks ago

and I actually really like that one particular use-case of ai because less required human interaction gives the blind user more independence. The remaining issue of corporatization and private ownership of something that should be a publicly owned resource (as with many other assistive technologies) is a society-wide issue and framing it as a futurist vs Luddite discussion is a powerful misdirection.

[–] badgermurphy@lemmy.world 9 points 2 weeks ago

At first I was concerned about these huge tech companies stealing all of human knowledge and using it to make a fortune and drive everyone that created the knowledge into poverty.

Now I see that they are stealing all of human knowledge to make LLMs, giant digital babbling talkers. It can't work how they want the way they're doing it, so it doesn't matter what data they consume. They seem to lose money on every LLM query, even if you're paying for the highest tier.

When they stop subsidizing the cost to cash in, the already lukewarm interest in LLMs will cool further as costs rise.

Shower response: I don't like that they're gobbling my data, but at least they're choking on it.

[–] Eyekaytee@aussie.zone 9 points 2 weeks ago (1 children)

ai is already amazing at image recognition, training or no training it’s already here

[–] Madrigal@lemmy.world 6 points 2 weeks ago

We've been training it through the Captcha system for decades now.

[–] Jankatarch@lemmy.world 4 points 1 week ago

All technicality aside, models trained on images and their text-descriptions for blind people falls under one of the few good use-cases to Machine Learning.