Posts

kellogh, to LLMs
@kellogh@hachyderm.io avatar

i’m very excited about the interpretability work that Anthropic has been doing with LLMs.

in this paper, they used classical machine learning algorithms to discover concepts. if a concept like “golden gate bridge” is present in the text, then they discover the associated pattern of neuron activations.

this means that you can monitor LLM responses for concepts and behaviors, like “illicit behavior” or “fart jokes”

https://www.anthropic.com/research/mapping-mind-language-model
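
The monitoring idea in the post above can be sketched roughly as follows. This is a toy illustration, not the method from the linked research: the `direction` vector, the threshold, and the tiny 4-dimensional "hidden states" are all made-up assumptions standing in for a learned concept feature and real model activations.

```python
def feature_activation(hidden_state, direction, bias=0.0):
    """Project one token's hidden state onto a concept direction, then apply ReLU."""
    score = sum(h * d for h, d in zip(hidden_state, direction)) + bias
    return max(score, 0.0)


def monitor(token_states, direction, threshold=1.0):
    """Return indices of tokens whose concept activation exceeds the threshold."""
    acts = [feature_activation(state, direction) for state in token_states]
    flagged = [i for i, a in enumerate(acts) if a > threshold]
    return flagged, acts


# Hypothetical concept direction (assumed to have been learned elsewhere).
direction = [0.0, 1.0, 0.0, 0.0]

# Made-up hidden states for three tokens; only the second one carries the concept.
token_states = [
    [0.2, 0.1, -0.3, 0.0],
    [0.1, 2.5, 0.4, -0.2],
    [-0.1, -0.4, 0.2, 0.3],
]

flagged, acts = monitor(token_states, direction)
```

In a real setup the direction would come from whatever feature-discovery step is used, and the threshold would need calibration against labeled examples.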

kellogh,

this is great work. i’m excited to see where this goes next

i hope Anthropic exposes this via their API. at this point in time, most of the promising interpretability work is only available on open source models that you can run yourself. it would be great to also have them available from vendors

Lobrien,

@kellogh This does, of course, imply vastly easier subversion of guardrails. Bad actors will have an easier time manipulating bias.

kellogh, to ai

iTerm2 developer caves to the bullies and moves the feature to a plugin

https://news.ycombinator.com/item?id=40458135

kellogh,

@Xoriff eh, the hacker news & mastodon comments got into the bullying range pretty fast.

a lot of people seem to feel entitled to free software being catered to their wishes. i’ve run into the same sort of entitlement in software i’ve open sourced

kellogh,

@sanityinc the whole fiasco highlights how much we demand from open source, how little respect maintainers get, and how tiny the communities are. most people didn’t even realize this was an open source project

kellogh, to LLMs

if i had more time, i'd love to investigate PII coming from LLMs. i've seen it generate phone numbers and secrets, but i wonder if these are real or not. i imagine you could look at the logits to figure out if phone number digits were randomly chosen or if the sequence is meaningful to the LLM. anyone aware of researchers who have already done this?

kellogh,

i would guess that phone numbers are probably mostly random, since so many phone numbers are found online, whereas AWS keys are less common, so you're probably more likely to get partial or even full real keys
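
One rough way to act on the logits idea above is to measure the entropy of the model's distribution over the ten digits at each position. The sketch below uses made-up logits rather than a real model, and the helper names are illustrative assumptions. Low entropy only shows the model was confident about a digit; it does not prove the number is real.

```python
import math


def softmax(logits):
    """Convert raw logits to probabilities (shifted by the max for stability)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy_bits(probs):
    """Shannon entropy in bits; zero-probability entries contribute nothing."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def digit_entropies(per_position_logits):
    """Entropy of the digit distribution at each generated position."""
    return [entropy_bits(softmax(logits)) for logits in per_position_logits]


# Made-up logits over digits 0-9 for two positions.
uniform = [0.0] * 10   # model has no preference: looks like a random digit
peaked = [0.0] * 10
peaked[7] = 12.0       # model strongly prefers "7": possibly a memorized digit

entropies = digit_entropies([uniform, peaked])
```

A sequence where every position has near-zero entropy would be a candidate for a memorized string, while entropy near log2(10) bits per digit suggests the digits were effectively sampled at random.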

Lobrien,

@kellogh Someone claimed that a long magic number used in their highly-optimized (FFT?) code was spit out by Copilot. (This was soon after release.) The constant was arrived at by long fine-tuning, not conceptual in any way.

kellogh, to random

this has been bugging me a lot. like, yeah, there’s definitely AI scams out there. and yeah, a lot of people are using it from the wrong end, but it’s also clearly a substantial technology. time to realize that
https://mas.to/@carnage4life/112484753548884371

kellogh,

@maltimore i wish you weren’t right

kellogh,

@swiftcoder i think the iTerm2 overreaction really highlighted that people have indeed stopped thinking and are instead using their emotions

kellogh, to random

“One House Republican called the incident "vile" and said it has caused concern among GOP lawmakers.”

anonymous coward, show thyself!

https://www.axios.com/2024/05/22/rnc-vials-blood-capitol-police-suspicious

jimfl,
@jimfl@hachyderm.io avatar

@kellogh What we’re hearing: vile incident
What they’re actually saying: vial incident

kellogh, to ai

thinking about my education growing up, my k-6 teachers were wretched with getting facts right. one teacher didn’t have a single science experiment work. lots of stuff i was taught k-12 was outright wrong.

the thing is, students exceed their teachers all the time. a teacher isn’t the limiting factor for a student

i keep hearing that AI is worthless bc it hallucinates. yet it’s taught me functioning skills within UI dev, graphic design, 3D printing, 3D design

maybe i didn’t actually learn?

u0421793,

@kellogh what on earth do you mean by “k-6” and “k-12”, there’s no explanation

kellogh,

@u0421793 kindergarten through 6th grade, kindergarten through 12th grade

kellogh, to random

alternative title: Scientific Study with Dubious Methods Produces Shocking Results
https://noc.social/@todayilearned/112474329186430771

capraobscura,
@capraobscura@hachyderm.io avatar

@kellogh Having almost stepped on a couple of rattlesnakes in my life, I can say for absolute certain that no way in hell am I paying attention to this study or its results and will continue to not step on snek. 😂

kellogh, to random

this is in reference to super-alignment & safety, but my cousin also had her DEI team disbanded and “distributed” in the same way

on the surface, i think safety, DEI, and similar topics should be embedded in the culture and not centralized into a specific team. centralization would cause people to say, “oh that’s not my job”.

then again, any time a centralized team is disbanded, my immediate thought is, “apparently safety/DEI/etc. doesn’t matter to this company”. it’s a paradox, i suppose

TEG,
@TEG@mastodon.online avatar

@kellogh I always feel a little conflicted about reports like this. Like, it's 100% a good and important thing in general, but that doesn't mean a specific person or team or culture engaged with AI safety automatically inherits that value regardless of what they're actually contributing.

That said, I do think it might need a dedicated if small team to ensure that things are widely embedded.

kellogh,

@TEG yeah, security is another one. but security is hard and you typically need a dedicated team just to host security professionals. someone needs to act as a bar raiser in order to maintain the culture…

kellogh, to ai

i get a whole lot of emails about “generative AI expert immediately available for new role”… 🤔 expert?

kellogh,

@jneno lol

jimfl,

kellogh, to random

holy shit

yesterday while trail running i came across this fallen tree. it looks like a thick vine wrapped and choked the life out of it, and the storm this weekend finally took it out. i couldn’t easily identify the vine, but whatever

this morning i wake up and i’m breaking tf out with what sure looks like poison ivy rashes.

i came back, and identified the vine as, yep, poison ivy. thick woody 1/3” vines up and down the full tree

A close-up of two green leaves with visible veins and black specks on their surfaces. The leaves are attached to a thin branch with a blurred background of other foliage and tree bark.

sashawood,

@kellogh when I had this bad, they put me on prednisone to knock it down

kellogh,

@sashawood i actually asked to not have prednisone, bc i didn’t like the reactions i’ve had in the past, and i’m actually managing the symptoms okay with my OTC cocktail

kellogh, to random

i can’t believe tomorrow is friday already

dgentry,
@dgentry@hachyderm.io avatar

@kellogh Tim.

bobmcwhirter,
@bobmcwhirter@hachyderm.io avatar

kellogh, to random

rain is cathartic

kellogh,

just to be clear, i’m talking about water falling from the sky

kellogh, to random

the #gpt4o news is cool, but now i want to see an embedding model that i can use with a streaming interruptible conversation

kellogh,

i suppose keras & pytorch effectively do this for neural nets

YvanDaSilva,
@YvanDaSilva@hachyderm.io avatar

@kellogh I'm on the opposite end.
Python is okay for testing, quick and dirty work.
But if you want any performance in the backend (where you do data manipulation), or as soon as you need to create something that will be used with speed in mind, I'd go with Go, Rust, Zig, C or any other performant language.
If it's the familiarity with Python you're looking for, there are many other languages that look and feel like Python, such as Julia, Crystal, etc.

kellogh, to random

just had a conversation with my neighbor — he’s wearing an NRA tshirt and explaining that the pistol he has strapped to his hip is for shooting copperheads. And also he’s concerned that we leave our garage open too much and that copperheads might get in

doak,
@doak@mastodon.content.town avatar

@kellogh No Reptiles Allowed

kellogh,

@doak i’ll allow reptiles if they keep the attack turkeys away

kellogh, to random

i wish “type checking for infrastructure” was a thing

my code declares that there should be an S3 bucket that’s different from that other S3 bucket, etc. —> spin up the type checker, it reads APIs and verifies, “yep, this code should run fine”
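
A small part of this wish, telling one bucket apart from another at type-check time, can be approximated with distinct types. The sketch below is a plain-Python illustration using `typing.NewType`; the bucket names, `ArchiveJob`, and `plan` are hypothetical, and nothing here talks to a cloud API or verifies that the buckets exist.

```python
from dataclasses import dataclass
from typing import NewType

# Distinct static types for two buckets that are both just strings at runtime.
UploadsBucket = NewType("UploadsBucket", str)
LogsBucket = NewType("LogsBucket", str)


@dataclass(frozen=True)
class ArchiveJob:
    """Hypothetical job that copies from the uploads bucket to the logs bucket."""
    source: UploadsBucket
    destination: LogsBucket


def plan(job: ArchiveJob) -> str:
    """Describe what the job would do, without touching any real infrastructure."""
    return f"copy s3://{job.source} -> s3://{job.destination}"


uploads = UploadsBucket("example-uploads")
logs = LogsBucket("example-logs")
job = ArchiveJob(source=uploads, destination=logs)

# A static checker such as mypy or pyright would reject the swapped version:
# ArchiveJob(source=logs, destination=uploads)
```

The check only happens when a static type checker runs over the code; at runtime both values are ordinary strings, so this catches mix-ups in the declaration rather than drift in the deployed infrastructure.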

brianknight,
@brianknight@hachyderm.io avatar

@kellogh @olafurw

We’ve used CDK with TypeScript to do this with good results. It has allowed us to subtype things like S3 buckets and benefit from code tests and type checking.

kellogh,

@brianknight @olafurw got a github link?
