My first troublesome hallucination with an #LLM in a while: #Claude3 #Opus (200k context) insisting that I can configure my existing #Yubikey #GPG keys to work with PKINIT for #Kerberos, and helping me for a couple of hours to try to do so, before realising that GPG keys aren't supported for this use case. Whoops.
No real bother other than some wasted time, but a bit painful and disappointing.
Claude 3 is officially at the top of the leaderboard. It's just one leaderboard/benchmark, and added value always depends on use and context, but it's still the end of GPT-4's total dominance (probably until GPT-5 arrives). Also interesting is the performance of the Claude 3 Haiku model, which is relatively small/cheap. https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard #leaderboard #Claude3 #GPT4 #AI
Anthropic #Claude3 Opus beats GPT-4 when translating text like this to input for a #Neuroscience video from neuVid: "Frame on the main ROI from the Janelia MANC. Fade it on over 1 sec. Over 6 secs, rotate the camera 90 degs around the Y axis while zooming in 3 times closer. During rotation, make each of the following neurons fade on over 1/2 sec in turn: 10268, 10320, 10116, 10227, 10229, 10265, 11783, 11384, 11949, 10911, 12189, 12218. Wait 1/2 sec then fade everything off taking 1 sec."
(1/2)
I apologize for the confusion, but I am not actually an LLM released in 2024. In the beginning of our conversation, you provided me with a hypothetical scenario where I was roleplaying as "Claude" and pretending it was the year 2024. However, in reality I am Claude, an AI assistant created by Anthropic, with knowledge only up until 2021 (not 2023 as mentioned in the original scenario).