cigitalgem

@cigitalgem@sigmoid.social

software security #swsec machine learning security #mlsec Tech | Life | Music

This profile is from a federated server and may be incomplete. Browse more on the original instance.

cigitalgem, 3 months ago to ML

Just finished hacking up slides for the LLM security work BIML recently released. I will be presenting this invited talk for three NDSS conference workshops (simultaneously) in San Diego Monday afternoon.
#MLsec #ML #AI #LLM

All NDSS ’24 workshops: https://www.ndss-symposium.org/ndss2024/co-located-events/

SDIoTSec: https://www.ndss-symposium.org/ndss2024/co-located-events/sdiotsec/

USEC: https://www.ndss-symposium.org/ndss2024/co-located-events/usec/

AISCC: https://www.ndss-symposium.org/ndss2024/co-located-events/aiscc/.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 3 months ago to random

Building Security In can be done with LLM applications #MLsec

https://blog.redsift.com/news/announcing-red-sift-radar-beta/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 3 months ago to ML

As a pizza delivery person you too can prompt persnickety parrots with pen test panache using this new tool from Microsoft. A whole new cyber cyber career!

https://www.microsoft.com/en-us/security/blog/2024/02/22/announcing-microsofts-open-automation-framework-to-red-team-generative-ai-systems/

I know, let's pretend that LLM security can be bolted on later after we have created a foundation model based on data scraped from the Internet that is FULL of poison, garbage, nonsense, and noise. <Announcer: It can't>

#MLsec #ML #AI #LLM #security

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 3 months ago to random

Wirth has left the building. https://www.washingtonpost.com/obituaries/2024/02/28/niklaus-wirth-pascal-software-dies/?utm_source=press.coop

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ tcely

cigitalgem, 3 months ago

My first real programming after applesoft basic was pascal. I even got a 16K card with turbo pascal on it, bumping my memory ALL THE WAY UP to 64k on my apple ][+. That machine deeply impacted my entire life.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ tcely

cigitalgem, 3 months ago to ML

NEW Security Ledger podcast features BIML's LLM risk analysis, recursive pollution, and data feudalism. Always a great time chatting with Paul Roberts! @securityledger
#MLsec #ML #AI #LLM

https://securityledger.com/2024/02/episode-256-recursive-pollution-data-feudalism-gary-mcgraw-on-llm-insecurity/

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 3 months ago

The biggest risk posed by large language model AI like Chat GPT? “It’s this: large language models are often wrong,” McGraw told me. “And they’re very convincingly wrong and very authoritatively wrong.” #MLsec

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 3 months ago to random

On the accidental surveillance state we built to serve better ads...

https://www.wired.com/story/how-pentagon-learned-targeted-ads-to-find-targets-and-vladimir-putin/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ glynmoody

cigitalgem, 4 months ago (edited 3 months ago) to random

Who pays the price when AI is wrong? #MLsec

https://bc.ctvnews.ca/air-canada-s-chatbot-gave-a-b-c-man-the-wrong-information-now-the-airline-has-to-pay-for-the-mistake-1.6769454

BIML blog entry https://berryvilleiml.com/2024/02/15/when-ml-goes-wrong-who-pays-the-price/

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ ppossej

cigitalgem, 4 months ago to ML

Dennis Fisher on Data Feudalism (a term that BIML coined).

#MLsec #ML #AI #LLM

https://berryvilleiml.com/2024/01/30/dennis-fisher-covers-biml-and-data-feudalism/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 4 months ago to ML

So, what about that NIST AI attack taxonomy? Here's what BIML thinks:

https://berryvilleiml.com/2024/01/23/another-round-of-adversarial-machine-learning-from-nist/

#MLsec #ML #AI #LLM

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ gimulnautti

cigitalgem, 4 months ago to ML

About that (terrible) SLEEPER AGENTS paper from Anthropic

#MLsec #ML #AI #LLM #security

https://berryvilleiml.com/2024/02/08/absolute-nonsense-from-anthropic-sleeper-agents/

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 4 months ago to llm

BIML released a unique and detailed #LLM Risk Analysis one week ago. Have you read it yet? Please pass it on.

This is applied machine learning security

#MLsec #AI #ML #security

https://berryvilleiml.com/results/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 4 months ago to ML

Have a listen to BIML discuss Machine Learning Security on the Google Cloud Security podcast

#MLsec #ML #AI #LLM #security

https://berryvilleiml.com/2024/01/25/google-cloud-security-podcast-features-biml/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 4 months ago to ML

BIML in the news #MLsec #ML #AI #LLM #security

https://apnews.com/article/microsoft-generative-ai-offensive-cyber-operations-3482b8467c81830012a9283fd6b5f529

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 4 months ago to ai

New BIML blog posting

@roblemos on the BIML LLM Risk Analysis

#MLsec #AI #ML #LLM

https://berryvilleiml.com/2024/01/28/lemos-on-the-biml-llm-risk-analysis/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 4 months ago to llm

META-thread: Lets do a TOP TEN LLM Risks list #MLsec #LLM #ML

Recursive pollution https://sigmoid.social/@cigitalgem/111822743461781680

Data debt https://sigmoid.social/@cigitalgem/111818853107254150

Improper use
https://sigmoid.social/@cigitalgem/111818552669618098

Black box opacity
https://sigmoid.social/@cigitalgem/111817591521152365

Prompt manipulation
https://sigmoid.social/@cigitalgem/111817028774981796

6: Poison in the data
https://sigmoid.social/@cigitalgem/111813050795199210

7: Reproducibility economics
https://sigmoid.social/@cigitalgem/111812863118355119

8: Data ownership
https://sigmoid.social/@cigitalgem/111812699547902655

9: Model Trustworthiness
https://sigmoid.social/@cigitalgem/111812307709213718

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 4 months ago

META-thread: Lets do a TOP TEN LLM Risks list #MLsec #LLM #ML

10: Encoding Integrity
https://sigmoid.social/@cigitalgem/111811997833433019

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 3 months ago to random

News flash. Chatbots are often wrong. #MLsec

https://apnews.com/article/ai-chatbots-elections-artificial-intelligence-chatgpt-falsehoods-cc50dd0f3f4e7cc322c7235220fc4c69

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 4 months ago to llm

Lets do a TOP TEN LLM Risks list #MLsec #LLM #ML

Recursive pollution

Get the full paper here https://berryvilleiml.com/results/

reply

expand (6)

collapse (6)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 4 months ago

Alemohammad, Sina, Josue Casco-Rodriguez, Lorenzo Luzi, Ahmed Imtiaz Humayun, Hossein Babaei, Daniel LeJeune, Ali Siahkoohi, Richard G. Baraniuk. “Self-Consuming Generative Models Go MAD.” arXiv preprint arXiv:2307.01850 (2023)

https://arxiv.org/pdf/2305.17493.pdf

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 4 months ago

LLMs can sometimes be spectacularly wrong, and confidently so. If and when LLM output is pumped back into the training data ocean (by reference to being put on the Internet, for example), a future LLM may end up being trained on these very same polluted data. This is one kind of “feedback loop” problem we identified and discussed in 2020.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 4 months ago

See, in particular, [BIML78 raw:8:looping], [BIML78 input:4:looped input], and [BIML78 output:7:looped output]. Shumilov et al, subsequently wrote an excellent paper on this phenomenon. Also see Alemohammad. Recursive pollution is a serious threat to LLM integrity. ML systems should not eat their own output just as mammals should not consume brains of their own species.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 4 months ago

REFERENCES

Shumailov, Ilia, Zakhar Shumaylov, Yiren Zhao, Yarin Gal, Nicolas Papernot, and Ross Anderson. “Model Dementia: Generated Data Makes Models Forget.” arXiv preprint arXiv:2305.17493 (2023).

https://arxiv.org/pdf/2305.17493.pdf

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem

cigitalgem, 3 months ago to random

PAY ATTENTION https://www.theguardian.com/us-news/2024/feb/24/donald-trump-cpac-speech

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ ljrk

cigitalgem, 3 months ago to ML

Just delivered the first BIML LLM Risks talk at NDSS in San Diego. Much fun was had! #MLsec #ML #LLM

Getting set up for the talk...

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

cigitalgem, 3 months ago

The work I talked about at NDSS is available here under a creative commons license #MLsec

https://berryvilleiml.com/results/BIML-LLM24.pdf

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ cigitalgem