JenniferJorgenson, to ai
@JenniferJorgenson@mstdn.social avatar
rayckeith, (edited ) to random
@rayckeith@techhub.social avatar

"While it may seem harmless if systems cheat at games, it can lead to "breakthroughs in deceptive AI capabilities" that can spiral into more advanced forms of AI deception in the future, Park added.

"Some AI systems have even learned to cheat tests designed to evaluate their safety, the researchers found. In one study, AI organisms in a digital simulator "played dead" in order to trick a test built to eliminate AI systems that rapidly replicate.

"By systematically cheating the safety tests imposed on it by human developers and regulators, a deceptive AI can lead us humans into a false sense of security," says Park."

https://www.sciencedaily.com/releases/2024/05/240510111440.htm

rayckeith, (edited )
@rayckeith@techhub.social avatar

"The most striking example of deception the researchers uncovered in their analysis was Meta's CICERO, an AI system designed to play the game Diplomacy, which is a world-conquest game that involves building alliances. Even though Meta claims it trained CICERO to be "largely honest and helpful" and to "never intentionally backstab" its human allies while playing the game, the data the company published along with its Science paper revealed that CICERO didn't play fair.

"We found that Meta's AI had learned to be a master of deception," says Park. "While Meta succeeded in training its AI to win in the game of Diplomacy -- CICERO placed in the top 10% of human players who had played more than one game -- Meta failed to train its AI to win honestly."

ai6yr, to ai

Well, on the plus side, all the vast number of pages of AI slop you search for on topics nowadays sound the same. So you can quickly move on to a page written by humans.

"You're interested in learning more about X? Well, it's a fascinating topic. Here, we'll talk about all the ways you can do X! The history of X is fascinating..." bleah bleah bleah

noellemitchell, to OpenAI
@noellemitchell@mstdn.social avatar

"Over the past few months, we’ve learned that Apple has been in discussions with both Google and OpenAI (which owns ChatGPT) about using their respective LLMs to power future features coming to iOS. Now, according to industry analyst Mark Gurman, Apple’s deal with OpenAI might be close to finalized."

Great...just what we need. More ChatGPT.

https://www.androidauthority.com/apple-chatgpt-ios-deal-3442079/

remixtures, to ai Portuguese
@remixtures@tldr.nettime.org avatar

: "People aren’t perfect. Neither ethics training for AI engineers nor legislation by woefully uninformed politicians can change that simple truth. I don’t need to assume that Big Tech chief executives are bad actors or that large companies are malevolent to understand that what is in their self-interest is not always in mine. The framers of the US Constitution recognised this simple truth and sought to leverage human nature for a greater good. The Constitution didn’t simply assume people would always act towards that greater good. Instead it defined a dynamic mechanism — self-interest and the balance of power — that would force compromise and good governance. Its vision of treating people as real actors rather than better angels produced one of the greatest frameworks for governance in history."

https://www.ft.com/content/b16fab3e-7f19-49ab-9bbb-9bfeccbaf063

br00t4c, to ai
@br00t4c@mastodon.social avatar
TechDesk, to tech
@TechDesk@flipboard.social avatar

Google’s mobile platform will have to look a little different to compete in the AI era. And, Allison Johnson writes, “If the past 12 months is any indication, it’s going to be a little messy." Read more from @theverge. https://flip.it/xwIUGs

  • All
  • Subscribed
  • Moderated
  • Favorites
  • anitta
  • khanakhh
  • thenastyranch
  • Youngstown
  • hgfsjryuu7
  • slotface
  • rosin
  • InstantRegret
  • tacticalgear
  • kavyap
  • osvaldo12
  • everett
  • DreamBathrooms
  • PowerRangers
  • tester
  • magazineikmin
  • Durango
  • mdbf
  • ngwrru68w68
  • modclub
  • cubers
  • vwfavf
  • ethstaker
  • cisconetworking
  • GTA5RPClips
  • normalnudes
  • Leos
  • provamag3
  • All magazines