ErikJonker, to ai
@ErikJonker@mastodon.social avatar

The score of Llama3 70B on the LMSYS leaderboard is impressive. Although it's also clear that the latest GPT-4 is still a lot better. However Llama3 is opensource and freely available and a larger version (400B parameters) is on the way and will be closer to GPT4 with regard to performance on the various benchmarks.
https://chat.lmsys.org/?leaderboard
#AI #GPT4 #LMSYS #Leaderboard #Llama3 #opensource

ErikJonker, to ai
@ErikJonker@mastodon.social avatar

Interesting , GPT-4-Turbo is on top again, beating Claude3 and GPT-5 hasn't even arrived. At the same time a lot of people actually prefer Claude3 , leaderboards don't tell the whole story probably.

https://chat.lmsys.org/?leaderboard

ErikJonker, (edited ) to ai
@ErikJonker@mastodon.social avatar

Claude 3 is officially on the top of the leaderbord, although it's just one leaderboard/benchmark and added value always depends on use and context, it's still the end of GPT4 total dominance (unil GPT5 arrives probably). Interesting is also the performance of the Claude 3 Haiku model which is relatively small/cheap.
https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard

  • All
  • Subscribed
  • Moderated
  • Favorites
  • megavids
  • tacticalgear
  • magazineikmin
  • thenastyranch
  • Youngstown
  • mdbf
  • rosin
  • slotface
  • InstantRegret
  • khanakhh
  • Durango
  • kavyap
  • osvaldo12
  • DreamBathrooms
  • JUstTest
  • GTA5RPClips
  • ngwrru68w68
  • everett
  • tester
  • ethstaker
  • cisconetworking
  • cubers
  • modclub
  • provamag3
  • anitta
  • normalnudes
  • Leos
  • lostlight
  • All magazines