this post was submitted on 09 Mar 2025
336 points (98.8% liked)
Technology
There are lists of bots that instance Admins can block for a range of reasons.
Anything online can be scraped but big firms might run into regulatory trouble if they are caught randomly scraping sites without consent. At the moment, the big social media apps have a tonne of content to train on in tightly controlled conditions, so they don't really need to go into the wild, yet. However, we need to be vigilant, block them and make a fuss if we catch them at it.
What's to stop a company from standing up their own instance?
If they create only an admin account and then federate with every instance, they now have everyone's content.
I'm suddenly realizing the anti-AI blurbs people add to their comments now make sense.
IANAL, but federation by necessity copies your posts and information to every instance there is, and for that to happen it all needs to be under a licence that allows it. So those blurbs are almost certainly legally meaningless. The only thing I can think of is claiming a non-commercial use violation, but that could put every instance that runs on donations under fire as well.
That’s a very good shout; I wasn’t aware there were pre-existing lists. That’s a great step, and definitely one I will look to add to my own instance.
We just added it as the old frontend was getting hammered by bots - it helped a lot.
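For anyone wondering what blocking from one of those bot lists looks like in practice, here is a rough sketch in Python. It is not how Lemmy or any particular frontend does it; the pattern list, `is_blocked_agent`, and `status_for_request` are all made-up names for illustration, and a real list would be much longer and maintained by someone else. It only matches on the User-Agent header, so it catches bots that identify themselves honestly and nothing more.

```python
import re

# Illustrative entries only (an assumption, not a list from this thread).
# Community-maintained blocklists are far longer and change over time.
BLOCKED_AGENT_PATTERNS = ["GPTBot", "CCBot", "ClaudeBot", "Bytespider"]

# One case-insensitive regex that matches any listed token as a substring.
_BLOCKED_RE = re.compile(
    "|".join(re.escape(p) for p in BLOCKED_AGENT_PATTERNS),
    re.IGNORECASE,
)


def is_blocked_agent(user_agent):
    """Return True if the User-Agent string contains a blocked token."""
    if not user_agent:
        # A missing header is not treated as a bot in this sketch.
        return False
    return _BLOCKED_RE.search(user_agent) is not None


def status_for_request(headers):
    """Return the HTTP status a hypothetical frontend would answer with."""
    user_agent = headers.get("User-Agent", "")
    return 403 if is_blocked_agent(user_agent) else 200
```

In a real setup this check usually lives in the reverse proxy rather than in application code, and since a User-Agent is trivial to spoof, it does nothing against a scraper that lies about who it is or one that federates as its own instance.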