It's not perfect training data. Being encouraged to add alt text and actually doing it are two different things. Writing good alt text is another matter all together. And anything that's on the internet is training data whether people want it to be or not. The only difference is ethical whether the scraper accepts and respects a version of robots dot txt, i.e. "do not scrape," that communicates the training data's holders' intentions. And if they torrent books you can guess how respectful they are.
Showerthoughts
A "Showerthought" is a simple term used to describe the thoughts that pop into your head while you're doing everyday things like taking a shower, driving, or just daydreaming. The most popular seem to be lighthearted clever little truths, hidden in daily life.
Here are some examples to inspire your own showerthoughts:
- Both “200” and “160” are 2 minutes in microwave math
- When you’re a kid, you don’t realize you’re also watching your mom and dad grow up.
- More dreams have been destroyed by alarm clocks than anything else
Rules
- All posts must be showerthoughts
- The entire showerthought must be in the title
- No politics
- If your topic is in a grey area, please phrase it to emphasize the fascinating aspects, not the dramatic aspects. You can do this by avoiding overly politicized terms such as "capitalism" and "communism". If you must make comparisons, you can say something is different without saying something is better/worse.
- A good place for politics is c/politicaldiscussion
- Posts must be original/unique
- Adhere to Lemmy's Code of Conduct and the TOS
If you made it this far, showerthoughts is accepting new mods. This community is generally tame so its not a lot of work, but having a few more mods would help reports get addressed a little sooner.
Whats it like to be a mod? Reports just show up as messages in your Lemmy inbox, and if a different mod has already addressed the report, the message goes away and you never worry about it.
At this point all the imagery data they need is already out there. Not like your picture of a cat you post to Lemmy is gonna help these companies make a better model.
Web content should always strive to be more accessible. Things like AI should be better regulated instead. I think we've missed the boat on a big part of that though, should have legally clamped down on activities a long time ago.
Something I mention every time this comes up. AI doesn’t need to scrape Lemmy. All someone has to do is set up their own federated instance and AcitivityPub will wrap it up in a nice JSON format for them to parse however they want. And there’s fundamentally nothing a person can do about it.
It’s just best to realize anything and everything on Lemmy is publicly available for any use, good or bad.
This.
Every nation state with internet surveillance spun up a server to collect and archive ActivityPub posts.
In some ways it’s more freeing. No need to worry what a corporation will see fit to disclose to a gov agency.
Yes, but that doesn't mean we should eschew accessibility
We can just use an AI to describe the images /s
alt text is meant for screen readers, so blind people can hear the image description.
Also, the alt texts vary in descriptiveness for that exact purpose. They're meant to be useful for humans, not for training data.
What would a blind person rather have as the alt text:
(there are no photos here, for the blind people listening)
1:
A cute Alsatian puppy looking into the camera with a dog toy in its mouth
2:
A 14 week old black/brown dog sitting on a tiled floor with a synthetic-rubber cuboid-cylindrical-shaped, blue-green-gradient chew toy in its mouth with its eyes and nose poised at a 30° angle towards the photographer's origin. Each tile on the floor is approximately 1.47m^2 and are a pearlescent shade of off-white. There is an unidentifiable black speck on the first tile in the top left quadrant of the image. The cameraman's fat finger is covering 1.97% of the bottom right quadrant. Focal length is set to 100mm. Exposure settings appear to be increased. The dog's genitals are not visible.
and I actually really like that one particular use-case of ai because less required human interaction gives the blind user more independence. The remaining issue of corporatization and private ownership of something that should be a publicly owned resource (as with many other assistive technologies) is a society-wide issue and framing it as a futurist vs Luddite discussion is a powerful misdirection.
At first I was concerned about these huge tech companies stealing all of human knowledge and using it to make a fortune and drive everyone that created the knowledge into poverty.
Now I see that they are stealing all of human knowledge to make LLMs, giant digital babbling talkers. It can't work how they want the way they're doing it, so it doesn't matter what data they consume. They seem to lose money on every LLM query, even if you're paying for the highest tier.
When they stop subsidizing the cost to cash in, the already lukewarm interest in LLMs will cool further as costs rise.
Shower response: I don't like that they're gobbling my data, but at least they're choking on it.
ai is already amazing at image recognition, training or no training it’s already here
We've been training it through the Captcha system for decades now.
All technicality aside, models trained on images and their text-descriptions for blind people falls under one of the few good use-cases to Machine Learning.