xcancel link: https://xcancel.com/jxmnop/status/1953899426075816164

this thing is clearly trained via RL to think and solve tasks for specific reasoning benchmarks, and nothing else. it truly is a tortured model. here the model hallucinates a programming problem about dominoes and attempts to solve it, spending over 30,000 tokens in the process, completely unprompted. the model generated and tried to solve this domino problem over 5,000 separate times.

  • istewart@awful.systems · 12 points · 4 days ago

    they seem to have trained on nearly everything you’ve ever heard of. especially a lot of Perl

    This is profoundly hilarious to me for some reason. AppleScript, of all things, also seems suspiciously high on that graph. As does Pascal running neck and neck with Swift.