this post was submitted on 22 Apr 2025
1504 points (98.9% liked)

Memes

50114 readers
360 users here now

Rules:

  1. Be civil and nice.
  2. Try not to excessively repost, as a rule of thumb, wait at least 2 months to do it if you have to.

founded 6 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] saigot@lemmy.ca 4 points 1 week ago (1 children)

If it was done with enough regularity to eb a problem, one could just put an LLM model like this in-between to preprocess the data.

[–] Azzu@lemm.ee 4 points 1 week ago (2 children)

That doesn't work, you can't train models on another model's output without degrading the quality. At least not currently.

[–] Vashtea@sh.itjust.works 1 points 1 week ago* (last edited 1 week ago)

I don't think he was suggesting training on another model's output, just using ai to filter the training data before it is used.

[–] FooBarrington@lemmy.world 1 points 1 week ago

No, that's not true. All current models use output from previous models as part of their training data. You can't solely rely on it, but that's not strictly necessary.