BigMuffin69

joined 1 year ago
[–] BigMuffin69@awful.systems 7 points 2 months ago* (last edited 2 months ago) (3 children)

So they had the new Claude hooked up to some tools so that it could play Pokemon Red. Somewhat impressive (at least to me!). It was able to beat Lt. Surge after several days of play. They had a stream demo'ing it on Twitch and, despite the on-paper result of getting 3 gym badges, the poor fella got stuck in Viridian Forest trying to find the exit to the maze.

As far as finding the exit goes... I guess you could say he was stumped? (MODS PLEASE DONT BAN)

strim if anyone is curious. Yes, I know this is clever advertising for Anthropic, but I do find it cute and maybe someone else will?

https://www.twitch.tv/claudeplayspokemon

[–] BigMuffin69@awful.systems 12 points 2 months ago (13 children)

Bruh, Big Yud was yapping that this means the orthogonality thesis is false and mankind is saved b.c. of this. But then he immediately retreated to, "we are all still doomed b.c. recursive self-improvement." I wonder what it's like to never have to update your priors.

Also, I saw other papers that showed almost all prompt rejection responses share common activation weights, and tweaking them can basically jailbreak any model. So what is probably happening here is that by finetuning to intentionally make malicious code, you are undoing those rejection weights. Plus, until this is reproduced by non-safety cranks, I'm pressing X to doubt.

[–] BigMuffin69@awful.systems 8 points 2 months ago* (last edited 2 months ago)

Bruh, Anthropic is so cooked. < 1 billion in rev, and 5 billion cash burn. No wonder Dario looks so panicked promising super intelligence + the end of disease in t minus 2 years, he needs to find the world's biggest suckers to shovel the money into the furnace.

As a side note, rumored Claude 3.7(12378752395) benchmarks are making the rounds and they are uh, not great. Still trailing o1/o3/grok except in the "Agentic coding benchmark" (kek), so I guess they went all in on the AI swe angle. But if they aren't pushing the frontier, then there's no way for them to pull customers from Xcels or people who have never heard of Claude in the first place.

On second thought, this is a big brain move. If no one is making API calls to Clauderino, they aren't wasting money on the compute they can't afford. The only winning move is to not play.

[–] BigMuffin69@awful.systems 10 points 2 months ago* (last edited 2 months ago)

Yud be like: "kek you absolute rubes. ofc I simply meant AI would be like a super accountant. I didn't literally mean it would be able to analyze gov't waste from studying the flow of matter at the molecular level... heh, I was just kidding... unless 🥺 ? "

[–] BigMuffin69@awful.systems 26 points 2 months ago* (last edited 2 months ago) (12 children)

Deep thinker asks why?

Thus spoketh the Yud: "The weird part is that DOGE is happening 0.5-2 years before the point where you actually could get an AGI cluster to go in and judge every molecule of government. Out of all the American generations, why is this happening now, that bare bit too early?"

Yud, you sweet naive smol uwu baby~~esian~~ boi, how gullible do you have to be to believe that a) tminus 6 months to AGI kek (do people track these dog shit predictions?) b) the purpose of DOGE is just accountability and definitely not the weaponized manifestation of techno oligarchy ripping apart our society for the copper wiring in the walls?

[–] BigMuffin69@awful.systems 6 points 2 months ago (1 children)

Dawg, I didn't even survive the basic training in the game

[–] BigMuffin69@awful.systems 5 points 2 months ago (3 children)

My life for super Earth 🫡

[–] BigMuffin69@awful.systems 9 points 3 months ago (5 children)

"listen up jack, we're losing this election"

[–] BigMuffin69@awful.systems 11 points 3 months ago

Made the fatal mistake of posting a sneer on my main, only to have my friend let me know they had been assigned the same dorm room as Dan. Same friend was later roommates with my wife's best friend (and former cohabitant). Small world!

[–] BigMuffin69@awful.systems 27 points 3 months ago* (last edited 3 months ago) (1 children)

Bruh. This is the moment I go full on Frank Grimes.
