cigitalgem

@cigitalgem@sigmoid.social

software security #swsec machine learning security #mlsec Tech | Life | Music

This profile is from a federated server and may be incomplete. Browse more on the original instance.

cigitalgem, 9 months ago to random

You can't fix an LLM by red teaming. It does exactly what it was designed to do. Autoassociative predictive word generation.

So what do you prove when you do prompt injection? Not a damn thing.

Always ask this. How does someone FIX what comes out of a pen test? If there is no fix, there is no change in security posture.

#MLsec
https://www.washingtonpost.com/technology/2023/08/08/ai-red-team-defcon/?wpisrc=nl_technology202

reply

expand (20)

collapse (20)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ devinprater, cigitalgem

cigitalgem, 1 month ago to random

We just lost another great light of rationalism. Dan Dennett helped get me started in philosophy of mind way back in the late '80s. Dan was right about lots of things. https://dailynous.com/2024/04/19/daniel-dennett-death-1942-2024/

reply

expand (16)

collapse (16)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem, dgoldsmith, tokensane, gvwilson +3 more

cigitalgem, 5 months ago to ML

#ML systems can leak confidential data in their training set even with a very silly attack. This is a direct and clear #MLsec issue that applies well beyond the #LLM case

https://www.engadget.com/a-silly-attack-made-chatgpt-reveal-real-phone-numbers-and-email-addresses-200546649.html

reply

expand (10)

collapse (10)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem, joe, ErikJonker, briankrebs

cigitalgem, 2 months ago to ML

Have a look at the Usenix login; interview featuring myself and the BIML LLM work. #MLsec #ML #AI #LLM

https://berryvilleiml.com/2024/03/15/rik-farrow-interviews-mcgraw-for-login/

reply

expand (6)

collapse (6)

report

activity

copy /kbin url

copy original url

open original url

Loading...

cigitalgem, 3 months ago to llm

Lets do a TOP TEN LLM Risks list #MLsec #LLM #ML

Recursive pollution

Get the full paper here https://berryvilleiml.com/results/

reply

expand (6)

collapse (6)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 2 months ago to ML

It's the data, dummy.

"The AI company, for example, says it has an advantage of having access to X’s trove of posts."

Musk bought twitter for the data pile. #MLsec #ML #AI #LLM

https://www.wsj.com/tech/ai/elon-musks-x-leans-on-his-ai-startup-9038380d

reply

expand (5)

collapse (5)

report

activity

copy /kbin url

copy original url

open original url

Loading...

cigitalgem, 11 months ago to random

My view is that the "API off" and/or "API expensive" decisions at both twitter and reddit have only to do with building a data moat so that LLMs from "outside" not in clear partnership with the pile in question are prohibited from accessing possible training data. #MLsec

reply

expand (5)

collapse (5)

report

activity

copy /kbin url

copy original url

open original url

Loading...

cigitalgem, 11 days ago to random

Nothing to see here, just armed robots https://arstechnica.com/gadgets/2024/05/robot-dogs-armed-with-ai-targeting-rifles-undergo-us-marines-special-ops-evaluation/

reply

expand (5)

collapse (5)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 3 months ago to llm

Lets do a TOP TEN LLM Risks list #MLsec #LLM #ML

7: Reproducibility economics

Get the full paper here https://berryvilleiml.com/results/

reply

expand (5)

collapse (5)

report

activity

copy /kbin url

copy original url

open original url

Loading...

cigitalgem, 4 months ago to ML

It's not just authors anymore. The NY Times sues OpenAI and Microsoft over ML copyright issues.

#ML systems leak training data consistently.

#MLsec

https://www.wsj.com/tech/ai/new-york-times-sues-microsoft-and-openai-alleging-copyright-infringement-fd85e1c4?mod=mhp

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

cigitalgem, 11 months ago to random

Today's talk at secappdev was all about the flaw #swsec #appsec

You do ARA aka threat modelling, right?

Flag in front of the secappdev venue in leuven.

reply

expand (4)

collapse (4)

report

activity

copy /kbin url

copy original url

open original url

Loading...

cigitalgem, 4 months ago to random

Roy is well past his date range

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 3 months ago to llm

Lets do a TOP TEN LLM Risks list #MLsec #LLM #ML

Data debt

Get the full paper here https://berryvilleiml.com/results/

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

cigitalgem, 3 months ago to llm

Lets do a TOP TEN LLM Risks list #MLsec #LLM #ML

6: Poison in the data

Get the full paper here https://berryvilleiml.com/results/

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

cigitalgem, 6 months ago to random

Reminder: in my view, recursion pollution is the number one LLM risk by a long shot #MLsec

https://www.darkreading.com/dr-tech/will-the-ai-arms-race-lead-to-the-pollution-of-the-internet-

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

cigitalgem, 7 months ago to random

#MLsec is an enormous challenge growing faster than we can comprehend https://www.nytimes.com/2023/09/20/technology/chatgpt-dalle3-images-openai.html

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

cigitalgem, 3 months ago to random

The "sleeper agents" paper from anthropic is such a complete bullshit I don't even know where to start. Good grief...such terrible "science."

#MLsec

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

cigitalgem, 2 months ago to ML

NEW Security Ledger podcast features BIML's LLM risk analysis, recursive pollution, and data feudalism. Always a great time chatting with Paul Roberts! @securityledger
#MLsec #ML #AI #LLM

https://securityledger.com/2024/02/episode-256-recursive-pollution-data-feudalism-gary-mcgraw-on-llm-insecurity/

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 2 months ago to random

Dear press people, you can't fix generative AI by blocking prompts. Really. If you need to talk about why that is, call me up. This credulous coverage is just silly.

#MLsec #reporters

https://www.cnbc.com/2024/03/08/microsoft-blocking-terms-that-cause-its-ai-to-create-violent-images.html

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ Paxxi, cigitalgem

cigitalgem, 5 months ago to ML

This is absolutely excellent work from a great reporter/fiction author. Spot on. #ML #MLsec

"A.I.’s errors have an endearingly anthropomorphic name — hallucinations — but this year made clear just how high the stakes can be."

https://www.nytimes.com/2023/12/19/opinion/artificial-intelligence-chatgpt.html

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

cigitalgem, 3 months ago to ML

So, what about that NIST AI attack taxonomy? Here's what BIML thinks:

https://berryvilleiml.com/2024/01/23/another-round-of-adversarial-machine-learning-from-nist/

#MLsec #ML #AI #LLM

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ gimulnautti

cigitalgem, 9 months ago to random

Dear everyone. This is just stupid.

#MLsec for preschoolers

https://foreignpolicy.com/2023/08/15/defcon-ai-red-team-vegas-white-house-chatbots-llm/?utm_source=dlvr.it&utm_medium=mastodon

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

cigitalgem, 1 month ago to ML

I don't believe we can filter our way out of drinking a polluted ocean of training data. #MLsec #ML #AI #LLM https://www.techtarget.com/searchEnterpriseAI/news/366574580/Microsoft-hires-DeepMind-co-founder-amid-Google-Apple-news

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 6 months ago to random

Executive order on ML/AI guardrails #MLsec https://apnews.com/article/biden-ai-artificial-intelligence-executive-order-cb86162000d894f238f28ac029005059

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

cigitalgem, 9 months ago (edited 9 months ago) to random

New BIML Bibliography entry (under popular press)

https://www.theatlantic.com/ideas/archive/2023/07/godel-escher-bach-geb-ai/674589/

Doug Hofstadter

An excellent view of LLM production as seen by a top cognitive scientist

#MLsec

https://berryvilleiml.com/references/

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...