cigitalgem

@cigitalgem@sigmoid.social

software security #swsec machine learning security #mlsec Tech | Life | Music


nitashatiku, 11 months ago to random

My latest dispatch from the AI boom for the Washington Post looks at a billionaire-backed movement to push concerns about "existential risks" around AI from the fringes of tech culture into the mainstream. Recently tech philanthropists have helped fund about 20 student-led “AI Safety” clubs at elite colleges like Stanford, Harvard, MIT, NYU, Columbia as part of an effort to recruit idealistic talent to focus their time on the speculative risk that AI could kill us all.
https://www.washingtonpost.com/technology/2023/07/05/ai-apocalypse-college-students



cigitalgem, 11 months ago

@nitashatiku excellent work


65dBnoise, 11 months ago to llm

"Over both evolutionary time and every individual’s lived experience, natural language to-and-fro has always been with fellow human beings. As we encounter synthetic language output, it is very difficult not to extend trust in the same way as we would with a human. We argue that systems need to be very carefully designed so as not to abuse this trust."

By @emilymbender and Chirag Shah

https://iai.tv/articles/all-knowing-machines-are-a-fantasy-auid-2334

#LLM #AI #ChatGPT



cigitalgem, 11 months ago

@65dBnoise you trust humans???


cigitalgem, 10 months ago

@65dBnoise LOL


lauren, 10 months ago to random

Glad to see that the new "Oppenheimer" movie didn't bomb.



cigitalgem, 10 months ago

@lauren you didn't


cigitalgem, 10 months ago to random

You can't fix an LLM by red teaming. It does exactly what it was designed to do. Autoassociative predictive word generation.

So what do you prove when you do prompt injection? Not a damn thing.

Always ask this. How does someone FIX what comes out of a pen test? If there is no fix, there is no change in security posture.

#MLsec
https://www.washingtonpost.com/technology/2023/08/08/ai-red-team-defcon/?wpisrc=nl_technology202



cigitalgem, 10 months ago

@ojensen you can demonstrate that with one exploit, but you can't "prove" anything. I agree that some people don't get this yet. But the disingenuous press coverage that pretends this will secure AI is hogwash.


simplenomad, 10 months ago to random

After more than 2 decades, my primary care physician retired 2.5 years ago. It took me several months to find a suitable replacement, who after 6 months decided to stop seeing patients and focus on clinical research full time. Another search commenced: I found a doctor and she's been great for the past year. Yesterday in the mail I received a notice that she's moving out of state.

This is the United States. It is hard to find a doctor who 1) is not in bed with the pharmaceutical companies, 2) is not moving me through a quick-fire assembly line, and 3) actually considers alternate health solutions, i.e., a DO instead of an MD. The search begins again.... #FirstWorldProblems #USHealthCare



cigitalgem, 10 months ago (edited 10 months ago)

@simplenomad I see this looming on the horizon for myself.


judell, 10 months ago to llm

"This suggests that the speed of fine-tuning LLMs is far exceeding that of peer review publications (OK, that’s not saying too much!) and we are clearly going to see considerable more improvements of these LLMs in the times ahead."

https://erictopol.substack.com/p/medical-ai-is-on-a-tear#%C2%A7large-language-models-are-answering-medical-questions-increasingly-correctly

#llm #medicine


cigitalgem, 10 months ago

@judell gack!


exteriorpower, 11 months ago to random

What would you recommend someone read if they are skeptical of the Yudkowsky/AI “doomer” perspective, but curious to learn more and open to having their mind changed by good arguments? I’m especially interested in arguments that might be convincing to a logical, thoughtful, open minded person coming from outside the rationalist/EA/utilitarian worldview.


cigitalgem, 10 months ago

@exteriorpower the top five papers in the annotated bibliography.


cigitalgem, 11 months ago

@exteriorpower https://berryvilleiml.com/references/


cigitalgem, 10 months ago to random

"starting to doubt" my ass. #MLsec

https://fortune.com/2023/08/01/can-ai-chatgpt-hallucinations-be-fixed-experts-doubt-altman-openai/


cigitalgem, 10 months ago to random

We know better than this. #MLsec https://www.nytimes.com/2023/08/06/technology/facial-recognition-false-arrest.html


cigitalgem, 10 months ago to random

BIML via vid. This week we talked about two ideas articles:

dughof on ML https://www.theatlantic.com/ideas/archive/2023/07/godel-escher-bach-geb-ai/674589/

data data data https://knowingmachines.org/9-ways-to-see/investigating-datasets

#MLsec


cigitalgem, 10 months ago

I forgot to wear a shirt today. But it's just zoom!


cigitalgem, 10 months ago to random

Mitnick https://www.dignitymemorial.com/obituaries/las-vegas-nv/kevin-mitnick-11371668


cigitalgem, 10 months ago to infosec

I was the victim (er, guest) on a recent podcast. Have a listen. #swsec @appsec #infosec #MLsec

https://www.synopsys.com/blogs/software-security/building-security-in-podcast-machine-learning-ai/


cigitalgem, 10 months ago to random

"He adds that the main method used to fine-tune models to get them to behave, which involves having human testers provide feedback, may not, in fact, adjust their behavior that much."

Another reason that Red Teaming of the sort DefCon plans to do is a waste of time. #MLsec

https://www.wired.com/story/ai-adversarial-attacks/


cigitalgem, 10 months ago (edited 10 months ago) to random

Just made 28 new KIVA micro-loans with recycled loan paybacks. Join Team BIML today!

https://bit.ly/cigitalgem-kiva


cigitalgem, 10 months ago

@virome_girl So am I. I have been doing Kiva for many years and love to watch the loan pile grow and grow.


cigitalgem, 10 months ago to random

Can you code using predictive statistical patterns? Nope.

#MLsec

https://www.theregister.com/2023/08/07/chatgpt_stack_overflow_ai/


cigitalgem, 10 months ago

"From semi-structured interviews, it is apparent that polite language, articulated and text-book style answers, comprehensiveness, and affiliation in answers make completely wrong answers seem correct,"


exteriorpower, 10 months ago to random

Getting ready to head to ICML in Honolulu tomorrow. I haven’t traveled much since 2020. How are hotels doing with HEPA filters in room AC vents these days, and how are people handling it when A/C filtering is not in place?


cigitalgem, 10 months ago

@exteriorpower I know. Humans are absurd.


cigitalgem, 10 months ago

@exteriorpower it is as if COVID does not exist.


cigitalgem, 11 months ago to random

This is just complete nonsense and does nothing to enhance #MLsec.

Frustrating.

"Red Teaming" https://arxiv.org/pdf/2209.07858.pdf


yaleman, 11 months ago to random

I'm so glad that my replacement reading glasses with the new prescription that helps me see haven't arrived before I go on an important business trip 😡


cigitalgem, 11 months ago

@yaleman who needs see? Who needs hear?


cigitalgem, 11 months ago to random

Dughof (my thesis advisor way back when) wrote this https://www.theatlantic.com/ideas/archive/2023/07/godel-escher-bach-geb-ai/674589/
