@cigitalgem@sigmoid.social
@cigitalgem@sigmoid.social avatar

cigitalgem

@cigitalgem@sigmoid.social

software security #swsec machine learning security #mlsec Tech | Life | Music

This profile is from a federated server and may be incomplete. Browse more on the original instance.

judell, to llm
@judell@social.coop avatar

"Just as ChatGPT can make up facts, it’s apparently willing to lie about ensuring that the code it writes passes the tests you give it. It can also behave like a recalcitrant child who knows, but must constantly be reminded, to follow the rules. But if you hold its feet to the fire, tests can be a great way to focus its attention on the code you’re asking it to write."

https://thenewstack.io/test-driven-development-with-llms-never-trust-always-verify/

cigitalgem,
@cigitalgem@sigmoid.social avatar

@judell you can also hire ten million monkeys. Same tests apply to the code they type out.

LOL

cigitalgem,
@cigitalgem@sigmoid.social avatar

@judell maybe flying monkeys...like the wizard of oz

cigitalgem, to random
@cigitalgem@sigmoid.social avatar

Even a cursory read of this rudderless article shows the futility of the DEF CON AI red teaming bullshit. We need to do better as a discipline.

https://cyberscoop.com/def-con-ai-hacking-red-team/

kimw, to random

Superheroes 🦸‍♀️🦸‍♂️, privacy 😎, and threat modeling ⚡️
What's not to like?!

Are you ready for the clash of privacy vs. security?✊️
Check the recording of this epic battle between Professor Privacy and Captain Security (@sec_tigger) at @WEareTROOPERS

youtu.be/rBdcupIhkDc

For this fun talk, I had the pleasure to join forces with Avi Douglen. Together we explained the need to protect privacy, the power of threat modeling, and how privacy can be a force multiplier when combined with security.

image/jpeg
image/jpeg

cigitalgem,
@cigitalgem@sigmoid.social avatar

@kimw how did it go???

cigitalgem,
@cigitalgem@sigmoid.social avatar

@kimw lol

Free_Press, to random
@Free_Press@mstdn.social avatar

Incredible video shows lightning 'strikes upwards' from Agua Volcano in Guatemala

video/mp4

cigitalgem,
@cigitalgem@sigmoid.social avatar

@Free_Press lightning always goes up

georgetakei, to random

There’s a HURRICANE this weekend heading for Baja and Southern California. Folks, that is not in any way normal.

I may not live long enough to see the worst of the impacts of climate change, but I want to use my voice and platform today to urge action.

We can start by ending subsidies and investments into fossil fuels. Full stop.

cigitalgem,
@cigitalgem@sigmoid.social avatar

@georgetakei everything is fine!

cigitalgem, to random
@cigitalgem@sigmoid.social avatar

NEW BIML Bibliography entry

https://knowingmachines.org/publications/9_ways_to_see_a_dataset

Knowing Machines

This is a rather vacuous treatment of a critically-important problem. How do we represent things in ML and what implications do such representations have? We were hoping for more treatment of: distributedness, bigness, sparseness, and modling.

https://berryvilleiml.com/references/

cigitalgem, (edited ) to random
@cigitalgem@sigmoid.social avatar

New BIML Bibliography entry (under popular press)

https://www.theatlantic.com/ideas/archive/2023/07/godel-escher-bach-geb-ai/674589/

Doug Hofstadter

An excellent view of LLM production as seen by a top cognitive scientist

https://berryvilleiml.com/references/

cigitalgem,
@cigitalgem@sigmoid.social avatar

@Hippasus500 my pleasure.

dughof was my thesis advisor way back when.

you will enjoy this article.

cigitalgem, to random
@cigitalgem@sigmoid.social avatar

NEW BIML Bibliography entry

DATA VALIDATION FOR MACHINE LEARNING

Breck, et al.

This basic paper is about validating input data (as opposed to the validation set as linked to the training set).

https://berryvilleiml.com/references/

cigitalgem, to random
@cigitalgem@sigmoid.social avatar

NEW BIML Bibliography entry

Red Teaming Language Models to Reduce Harms:
Methods, Scaling Behaviors, and Lessons Learned

Anthropic

https://arxiv.org/pdf/2209.07858.pdf

Absolute malarky informed by zero understanding of security, pen testing, and what a real red team does.


https://berryvilleiml.com/references/

cigitalgem, to random
@cigitalgem@sigmoid.social avatar

NEW BIML Bibliography top 5 entry!

THE CURSE OF RECURSION:
TRAINING ON GENERATED DATA MAKES MODELS FORGET

Shumailov, et al.

https://arxiv.org/pdf/2305.17493.pdf

A very easy to grasp discourse covering the math of eating your own tail. This is directly relevant to LLMs and the pollution of large datasets. We pointed out this risk in 2020. This is the math.


https://berryvilleiml.com/references/

cigitalgem, to random
@cigitalgem@sigmoid.social avatar
cigitalgem, to random
@cigitalgem@sigmoid.social avatar
cigitalgem,
@cigitalgem@sigmoid.social avatar

@danielcornell yeah. Pretend security for the win!

cigitalgem, to random
@cigitalgem@sigmoid.social avatar
cigitalgem, to random
@cigitalgem@sigmoid.social avatar

Can you code using predictive statistical patterns? Nope.

https://www.theregister.com/2023/08/07/chatgpt_stack_overflow_ai/

cigitalgem,
@cigitalgem@sigmoid.social avatar

"From semi-structured interviews, it is apparent that polite language, articulated and text-book style answers, comprehensiveness, and affiliation in answers make completely wrong answers seem correct,"

cigitalgem, to random
@cigitalgem@sigmoid.social avatar

You can't fix an LLM by red teaming. It does exactly what it was designed to do. Autoassociative predictive word generation.

So what do you prove when you do prompt injection? Not a damn thing.

Always ask this. How does someone FIX what comes out of a pen test? If there is no fix, there is no change in security posture.


https://www.washingtonpost.com/technology/2023/08/08/ai-red-team-defcon/?wpisrc=nl_technology202

cigitalgem,
@cigitalgem@sigmoid.social avatar

@ojensen you can demonstrate that with one exploit, but you can't "prove" anything. I agree that some people don't get get this yet. But the disingenuous press coverage that pretends this will secure AI is hogwash.

cigitalgem,
@cigitalgem@sigmoid.social avatar
cigitalgem, to random
@cigitalgem@sigmoid.social avatar

Repeat after me. AI "red teaming" is bullshit. Do real and stop the nonsense.

https://www.washingtonpost.com/technology/2023/08/08/ai-red-team-defcon/?wpisrc=nl_technology202

cigitalgem,
@cigitalgem@sigmoid.social avatar
cigitalgem, to random
@cigitalgem@sigmoid.social avatar
  • All
  • Subscribed
  • Moderated
  • Favorites
  • JUstTest
  • mdbf
  • everett
  • osvaldo12
  • magazineikmin
  • thenastyranch
  • rosin
  • normalnudes
  • Youngstown
  • Durango
  • slotface
  • ngwrru68w68
  • kavyap
  • DreamBathrooms
  • tester
  • InstantRegret
  • ethstaker
  • GTA5RPClips
  • tacticalgear
  • Leos
  • anitta
  • modclub
  • khanakhh
  • cubers
  • cisconetworking
  • megavids
  • provamag3
  • lostlight
  • All magazines