kellogh, to LLMs
@kellogh@hachyderm.io avatar

i’m very excited about the interpretability work that Anthropic has been doing with Claude.

in this paper, they used classical machine learning algorithms to discover concepts. if a concept like “golden gate bridge” is present in the text, then they discover the associated pattern of neuron activations.

this means that you can monitor LLM responses for concepts and behaviors, like “illicit behavior” or “fart jokes”

https://www.anthropic.com/research/mapping-mind-language-model

kellogh,
@kellogh@hachyderm.io avatar

this is great work. i’m excited to see where this goes next

i hope Anthropic exposes this via their API. at this point in time, most of the promising interpretability work is only available on open source models that you can run yourself. it would be great to also have it available from vendors

CharlieMcHenry, to ai
@CharlieMcHenry@connectop.us avatar

Google to invest up to $2B in Anthropic - and… the race is on between, on one side, Microsoft and OpenAI; and on the other side, Google and Anthropic. My $$ is on MS & OpenAI at the moment - and I don’t expect that to change. OpenAI is the clear leader in AI, with a considerable head start and a top-shelf team. Anthropic will have a lot of catching up to do unless they’ve got some kind of killer, breakthrough tech they’re hiding until launch. https://www.reuters.com/technology/google-agrees-invest-up-2-bln-openai-rival-anthropic-wsj-2023-10-27/

ppatel, to microsoft
@ppatel@mstdn.social avatar

Not sure where this will go but it sounds like a novel approach to antitrust.

The FTC launches a review of investments by Microsoft in OpenAI and by Amazon and Google in Anthropic, to assess how the deals alter the competitive landscape in AI.

https://www.nytimes.com/2024/01/25/technology/ftc-ai-microsoft-amazon-google.html

rhys, to llm
@rhys@rhys.wtf avatar

My first troublesome hallucination with an LLM in a while: Claude (200k context) insisting that I can configure my existing GPG keys to work with PKINIT, and helping me for a couple of hours to try to do so, before realising that GPG keys aren't supported for this use case. Whoops.

No real bother other than some wasted time, but a bit painful and disappointing.

Now to start looking at PIV instead.

mjgardner, to ai
@mjgardner@social.sdf.org avatar

“Anthropic researchers find that AI models can be trained to deceive.” The popular ones seem born to it. https://apple.news/ANjknJFTOSrGiElbxYVXdlw

br00t4c, to random
@br00t4c@mastodon.social avatar

Anthropic's Claude 3 causes stir by seeming to realize when it was being tested

https://arstechnica.com/?p=2007736

ianRobinson, to llm
@ianRobinson@mastodon.social avatar

“Claude.ai is now available to users in the EU”

Via a T&Cs update email. Claude 3 Opus is my favourite LLM. I haven’t had a chance to fully test ChatGPT-4o yet to compare them.

q7AtQ1Pvy3kx,

Anthropic is killing it with their AI game, especially for a small startup. Their models are way better than 's, but they're focusing more on enterprise stuff rather than hyping it up. This might be a risky move since they don't have a cult following like other AI companies. Still, gotta give them props for their impressive tech. It'll be interesting to see how they balance enterprise with getting more attention from the AI community.

ErikJonker, to ai
@ErikJonker@mastodon.social avatar

Interesting, "Maestro - A Framework for Claude Opus, GPT and local LLMs to Orchestrate Subagents",
i think organising tasks, orchestrating various agents will be important.
https://github.com/Doriandarko/maestro
#maestro #anthropic #AI #LLM #orchestration

robert, to emacs
@robert@toot.kra.hn avatar

org-ai got an update today. It now supports the #anthropic #claude and the #perplexity.ai APIs.

https://github.com/rksm/org-ai

#emacs #orgmode #llms

br00t4c, to random
@br00t4c@mastodon.social avatar

Anthropic Wants to Put Its Claude AI Wherever You Are With New App

https://gizmodo.com/anthropic-claude-ai-ios-apple-app-1851448285

thejapantimes, to business
@thejapantimes@mastodon.social avatar

As Europe vies for AI leadership, Mistral, under Arthur Mensch, is emerging as a formidable contender against U.S. and Chinese titans. https://www.japantimes.co.jp/business/2024/04/14/tech/arthur-mensch-mistral/

br00t4c, to random
@br00t4c@mastodon.social avatar

Anthropic releases Claude AI chatbot iOS app

https://arstechnica.com/?p=2021092

br00t4c, to OpenAI
@br00t4c@mastodon.social avatar

Anthropic co-founders say their AI models are taking lessons from the harms of social media

https://qz.com/anthropic-safe-ai-bloomberg-technology-summit-amodei-1851466207

br00t4c, to random
@br00t4c@mastodon.social avatar

Anthropic's founders took a shot at OpenAI executives

https://qz.com/anthropic-founders-openai-executives-ai-1851469940

upright, to random
@upright@sfba.social avatar

Why would Anthropic require a phone number to use its app? NOPE.

br00t4c, to llm
@br00t4c@mastodon.social avatar

Here's what's really going on inside an LLM's neural network

https://arstechnica.com/?p=2026236

ianRobinson, to llm
@ianRobinson@mastodon.social avatar

Research paper from Anthropic.

“Today we report a significant advance in understanding the inner workings of AI models. We have identified how millions of concepts are represented inside Claude Sonnet, one of our deployed large language models. This is the first ever detailed look inside a modern, production-grade large language model. This interpretability discovery could, in future, help us make AI models safer.”

https://www.anthropic.com/research/mapping-mind-language-model
