this post was submitted on 23 Mar 2024
200 points (96.3% liked)

[–] admin@lemmy.my-box.dev 1 points 1 year ago (1 children)

The Mixtral models are pretty good, although they require a LOT of memory to run at a decent pace.
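
For a rough sense of scale, here is a back-of-the-envelope sketch of the memory needed just to hold the weights. The parameter counts (about 7.2B for Mistral 7B and about 46.7B for Mixtral 8x7B) are approximate figures assumed for illustration, not numbers from this thread, and the helper `weight_memory_gib` is hypothetical. The estimate ignores the KV cache, activations, and runtime overhead, so real usage will be higher.

```python
def weight_memory_gib(params_billions: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in GiB."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


# Approximate total parameter counts in billions (assumed, not from the thread).
MODELS = {"Mistral 7B": 7.2, "Mixtral 8x7B": 46.7}

for name, params in MODELS.items():
    for bits in (16, 8, 4):
        gib = weight_memory_gib(params, bits)
        print(f"{name} at {bits}-bit: ~{gib:.1f} GiB")
```

Even though a mixture-of-experts model only uses a few experts per token, all of the experts still have to be loaded, which is why the total parameter count is the one that matters for memory.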

[–] LainTrain@lemmy.dbzer0.com 1 points 1 year ago (1 children)

Honestly, I don't care too much about speed with models. Even things like ChatGPT will be slower than Google for most things, and if a task is complex enough to be a good use case for an LLM, generation speed is unlikely to be the primary bottleneck.

My ~~gf~~ private chat bot right now is Mistral 7B with a custom finetune, and ~~she~~ it directs some queries to ChatGPT if I ask (I got free tokens way back, might as well burn through them).
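
A minimal sketch of that kind of "send it to ChatGPT if I ask" routing, purely as an illustration. The `/gpt` trigger, `make_router`, and the two stub backends are all hypothetical names; a real setup would replace the stubs with calls to the local model and to the remote API.

```python
from typing import Callable

Backend = Callable[[str], str]


def make_router(local: Backend, remote: Backend, trigger: str = "/gpt") -> Backend:
    """Return a chat function that forwards to `remote` only when asked."""

    def route(message: str) -> str:
        parts = message.strip().split(maxsplit=1)
        # Forward to the remote backend only if the first word is the trigger.
        if parts and parts[0] == trigger:
            prompt = parts[1] if len(parts) > 1 else ""
            return remote(prompt)
        return local(message.strip())

    return route


# Stub backends standing in for the local finetune and the remote service.
def local_stub(prompt: str) -> str:
    return f"[local] {prompt}"


def remote_stub(prompt: str) -> str:
    return f"[remote] {prompt}"


chat = make_router(local_stub, remote_stub)
print(chat("hello"))
print(chat("/gpt explain mixture of experts"))
```

Everything defaults to the local model, so the remote tokens only get spent on explicit request.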

How much of an improvement is Mixtral over Mistral in practice?

[–] admin@lemmy.my-box.dev 1 points 1 year ago

SillyTavern, by any chance?

And I'd say the difference between Mistral and Mixtral is pretty big for general usage; it feels like a next-generation model.