cigitalgem

@cigitalgem@sigmoid.social

software security #swsec machine learning security #mlsec Tech | Life | Music

judell, 9 months ago to llm

"Just as ChatGPT can make up facts, it’s apparently willing to lie about ensuring that the code it writes passes the tests you give it. It can also behave like a recalcitrant child who knows, but must constantly be reminded, to follow the rules. But if you hold its feet to the fire, tests can be a great way to focus its attention on the code you’re asking it to write."

#llm #programming #testing

https://thenewstack.io/test-driven-development-with-llms-never-trust-always-verify/

cigitalgem, 9 months ago

@judell you can also hire ten million monkeys. Same tests apply to the code they type out.

LOL

cigitalgem, 9 months ago

@judell maybe flying monkeys... like The Wizard of Oz

cigitalgem, 9 months ago to random

Even a cursory read of this rudderless article shows the futility of the DEF CON AI red teaming bullshit. We need to do better as a discipline. #MLsec

https://cyberscoop.com/def-con-ai-hacking-red-team/

kimw, 9 months ago to random

Superheroes 🦸‍♀️🦸‍♂️, privacy 😎, and threat modeling ⚡️
What's not to like?!

Are you ready for the clash of privacy vs. security?✊️
Check the recording of this epic battle between Professor Privacy and Captain Security (@sec_tigger) at @WEareTROOPERS

youtu.be/rBdcupIhkDc

For this fun talk, I had the pleasure to join forces with Avi Douglen. Together we explained the need to protect privacy, the power of threat modeling, and how privacy can be a force multiplier when combined with security.

image/jpeg
image/jpeg

cigitalgem, 9 months ago

@kimw how did it go???

cigitalgem, 9 months ago

@kimw lol

Free_Press, 9 months ago to random

Incredible video shows lightning 'strikes upwards' from Agua Volcano in Guatemala
#AureFreePress

video/mp4

cigitalgem, 9 months ago

@Free_Press lightning always goes up

georgetakei, 9 months ago to random

There’s a HURRICANE this weekend heading for Baja and Southern California. Folks, that is not in any way normal.

I may not live long enough to see the worst of the impacts of climate change, but I want to use my voice and platform today to urge action.

We can start by ending subsidies and investments into fossil fuels. Full stop.

cigitalgem, 9 months ago

@georgetakei everything is fine! #climatechange

cigitalgem, 9 months ago to random

NEW BIML Bibliography entry

https://knowingmachines.org/publications/9_ways_to_see_a_dataset

Knowing Machines

This is a rather vacuous treatment of a critically important problem. How do we represent things in ML and what implications do such representations have? We were hoping for more treatment of: distributedness, bigness, sparseness, and modeling.

#MLsec

https://berryvilleiml.com/references/

cigitalgem, 9 months ago (edited 9 months ago) to random

New BIML Bibliography entry (under popular press)

https://www.theatlantic.com/ideas/archive/2023/07/godel-escher-bach-geb-ai/674589/

Doug Hofstadter

An excellent view of LLM production as seen by a top cognitive scientist

#MLsec

https://berryvilleiml.com/references/

cigitalgem, 9 months ago

@Hippasus500 my pleasure.

dughof was my thesis advisor way back when.

you will enjoy this article.

cigitalgem, 9 months ago to random

NEW BIML Bibliography entry

DATA VALIDATION FOR MACHINE LEARNING

Breck, et al.

This basic paper is about validating input data (as opposed to the validation set as linked to the training set).

#MLsec

https://berryvilleiml.com/references/

cigitalgem, 9 months ago to random

NEW BIML Bibliography entry

Red Teaming Language Models to Reduce Harms:
Methods, Scaling Behaviors, and Lessons Learned

Anthropic

https://arxiv.org/pdf/2209.07858.pdf

Absolute malarky informed by zero understanding of security, pen testing, and what a real red team does.

#MLsec
https://berryvilleiml.com/references/

cigitalgem, 9 months ago to random

NEW BIML Bibliography top 5 entry!

THE CURSE OF RECURSION:
TRAINING ON GENERATED DATA MAKES MODELS FORGET

Shumailov, et al.

https://arxiv.org/pdf/2305.17493.pdf

A very easy-to-grasp discourse covering the math of eating your own tail. This is directly relevant to LLMs and the pollution of large datasets. We pointed out this risk in 2020. This is the math.

#MLsec
https://berryvilleiml.com/references/

cigitalgem, 9 months ago to random

Data walls going up #MLsec

https://news.slashdot.org/story/23/08/15/2242238/nyt-prohibits-using-its-content-to-train-ai-models?utm_source=rss1.0mainlinkanon

cigitalgem, 9 months ago to random

Dear everyone. This is just stupid.

#MLsec for preschoolers

https://foreignpolicy.com/2023/08/15/defcon-ai-red-team-vegas-white-house-chatbots-llm/?utm_source=dlvr.it&utm_medium=mastodon

cigitalgem, 9 months ago

@danielcornell yeah. Pretend security for the win!

cigitalgem, 10 months ago to random

About that AI "red teaming"

#MLsec

https://apnews.com/article/ai-cybersecurity-malware-microsoft-google-openai-redteaming-1f4c8d874195c9ffcc2cdffa71e4f44b

cigitalgem, 10 months ago to random

Can you code using predictive statistical patterns? Nope.

#MLsec

https://www.theregister.com/2023/08/07/chatgpt_stack_overflow_ai/

cigitalgem, 10 months ago

"From semi-structured interviews, it is apparent that polite language, articulated and text-book style answers, comprehensiveness, and affiliation in answers make completely wrong answers seem correct,"

cigitalgem, 10 months ago to random

You can't fix an LLM by red teaming. It does exactly what it was designed to do. Autoassociative predictive word generation.

So what do you prove when you do prompt injection? Not a damn thing.

Always ask this. How does someone FIX what comes out of a pen test? If there is no fix, there is no change in security posture.

#MLsec
https://www.washingtonpost.com/technology/2023/08/08/ai-red-team-defcon/?wpisrc=nl_technology202

cigitalgem, 10 months ago

@ojensen you can demonstrate that with one exploit, but you can't "prove" anything. I agree that some people don't get this yet. But the disingenuous press coverage that pretends this will secure AI is hogwash.

cigitalgem, 9 months ago

See https://apnews.com/article/ai-cybersecurity-malware-microsoft-google-openai-redteaming-1f4c8d874195c9ffcc2cdffa71e4f44b

cigitalgem, 10 months ago to random

Repeat after me. AI "red teaming" is bullshit. Do real #MLsec and stop the nonsense.

https://www.washingtonpost.com/technology/2023/08/08/ai-red-team-defcon/?wpisrc=nl_technology202

cigitalgem, 9 months ago

Here. I said it in the press.

#MLsec

https://apnews.com/article/ai-cybersecurity-malware-microsoft-google-openai-redteaming-1f4c8d874195c9ffcc2cdffa71e4f44b

cigitalgem, 10 months ago to random

We know better than this. #MLsec https://www.nytimes.com/2023/08/06/technology/facial-recognition-false-arrest.html
