U1 News
  • Home
  • World
  • U.S.
  • Business
  • Technology
  • Science
  • Entertainment
  • Sport
  • Health
Global News

Israel targets Hezbollah commander in Beirut strike after deadly Golan Heights attack

July 30, 2024

Taylor Swift speaks out after Southport mass stabbing at dance class

July 30, 2024

3 girls killed in stabbing at Taylor Swift-themed UK dance class. 7 people still critically wounded

July 30, 2024
Facebook Twitter Instagram
Trending
  • Worst cities for allergies revealed, along with tips to manage symptoms
  • FDA approves first at-home HPV test to screen for cervical cancer
  • Brain stimulation technology improves Parkinson’s treatment for music conductor
  • Left-handedness linked to autism, schizophrenia in major neurological study
  • Heart health unexpectedly affected by shingles vaccine
  • Doctors remove spinal cancer through eye socket in revolutionary surgery
  • Laundry done at home by healthcare workers may spread superbugs, says new study
  • Longevity and organ function predicted in new ‘body clock’ tool
Sunday, May 11
U1 News
  • Home
  • World

    Israel targets Hezbollah commander in Beirut strike after deadly Golan Heights attack

    July 30, 2024

    Taylor Swift speaks out after Southport mass stabbing at dance class

    July 30, 2024

    3 girls killed in stabbing at Taylor Swift-themed UK dance class. 7 people still critically wounded

    July 30, 2024

    Kerala, India, hit by landslides, killing at least 99

    July 30, 2024

    Taylor Swift ‘in shock’ after horrific UK stabbing, as police say 3rd child dies

    July 30, 2024
  • U.S.

    Biden criticises ‘extreme’ Supreme Court in push for reform

    July 30, 2024

    FBI details shooter’s search history before Trump assassination attempt

    July 30, 2024

    Reps. Mike Kelly, Jason Crow to lead task force on Trump rally shooting

    July 29, 2024

    Biden to call for major Supreme Court reforms, including term limits, at Civil Rights Act event Monday

    July 29, 2024

    Sonya Massey’s death revives pain for Breonna Taylor, Floyd activists

    July 29, 2024
  • Business

    AMD stock jumps on earnings beat driven by AI chip sales

    July 30, 2024

    Amazon is responsible for dangerous products sold on its site, federal agency rules

    July 30, 2024

    Microsoft investigating new outages of services after global CrowdStrike chaos

    July 30, 2024

    S&P 500, Nasdaq Tumble as Chip Stocks Slide Ahead of Big Tech Earnings

    July 30, 2024

    American consumers feeling more confident in July as expectations of future improve

    July 30, 2024
  • Technology

    Apple says Safari protects your privacy. We fact checked those claims.

    July 30, 2024

    GameStop Dunks On Xbox 360 Store Closing And Gets Savaged

    July 30, 2024

    Logitech has an idea for a “forever mouse” that requires a subscription

    July 30, 2024

    Friend: a new digital companion for the AI age

    July 30, 2024

    London Sports Mod Community Devolves Into War

    July 30, 2024
  • Science

    NASA’s Lunar Gateway has a big visiting vehicles problem

    August 1, 2024

    Boeing’s Cursed ISS Mission May Finally Make It Back to Earth

    July 30, 2024

    Should you floss before or after you brush your teeth?

    July 30, 2024

    Ancient swimming sea bug ‘taco’ had mandibles, new fossils show

    July 30, 2024

    NASA’s DART asteroid impact mission revealed ages of twin space rock targets (images)

    July 30, 2024
  • Entertainment

    Richard Gadd Backs Netflix to Get ‘Baby Reindeer’ Lawsuit Dismissed

    July 30, 2024

    Batman: Caped Crusader review: a pulpy throwback to DC’s Golden Age

    July 30, 2024

    Channing Tatum Praises Ryan Reynolds For Taking Gamble On Gambit

    July 30, 2024

    ‘Star Wars Outlaws’ somehow made me fall in love with Star Wars again

    July 30, 2024

    Great Scott and O’Brien’s Pub find new life in Allston

    July 30, 2024
  • Sport

    How Snoop Dogg became a fixture of the Paris Olympics

    July 30, 2024

    Team USA’s Coco Gauff exits Olympics singles tournament with a third-round loss : NPR

    July 30, 2024

    French police investigating abuse targeting Olympic opening ceremony DJ over ‘Last Supper’ scene

    July 30, 2024

    French DJ Takes Legal Action

    July 30, 2024

    Why BYU’s Jimmer Fredette is at the 2024 Paris Olympics

    July 30, 2024
  • Health

    Worst cities for allergies revealed, along with tips to manage symptoms

    May 11, 2025

    FDA approves first at-home HPV test to screen for cervical cancer

    May 10, 2025

    Brain stimulation technology improves Parkinson’s treatment for music conductor

    May 10, 2025

    Left-handedness linked to autism, schizophrenia in major neurological study

    May 10, 2025

    Heart health unexpectedly affected by shingles vaccine

    May 9, 2025
U1 News
Home»Technology»“Superhuman” Go AIs still have trouble defending against these simple exploits
Technology

“Superhuman” Go AIs still have trouble defending against these simple exploits

u1news-staffBy u1news-staffJuly 12, 2024No Comments5 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
Gettyimages 1257709162 760x380.jpg
Share
Facebook Twitter LinkedIn Pinterest Email
Expanding / Man versus machine in a sea of ​​stone.

Getty Images

In ancient Chinese games goState-of-the-art artificial intelligence was generally able to beat the best human players At least since 2016But in recent years, researchers Flaws found in these top-level AIs go algorithm that Give humanity a fighting chanceBy using unorthodox “cyclical” strategies – strategies that even novice humans can detect and defeat – crafty humans can exploit gaps in the strategies of top-level AI, tricking the algorithm and causing it to lose.

With MIT researchers Fur AI They wanted to see whether they could improve on this “worst” performance in an otherwise “superhuman” AI Go algorithm, so they tested three ways of strengthening the top-level algorithm. KataGo algorithmDefending against adversarial attacks.,The results show that creating a truly robust and,unexploitable AI can be difficult, even in a,domain as tightly controlled as board games.

Three strategies that failed

In the preprint paper “can go Will AI become adversarially robust?The researchers go AI that is truly “robust” against all attacks – that is, not one that makes “game-losing blunders” that humans wouldn’t make, but one that requires competing AI algorithms to expend significant computing resources to beat it. Ideally, a robust algorithm should be able to use additional computing resources when faced with unknown situations to overcome potential attacks.

A real-world example of the original cyclic attack.
Expanding / A real-world example of the original cyclic attack.

The researchers tried three methods to generate such robust data. go Algorithm. In the first stage, the researchers simply tweaked the KataGo model with more examples of unorthodox patrol strategies that had previously beaten the KataGo model, in the hope that by seeing more patterns, KataGo could learn to detect and beat these patterns.

This strategy initially looked promising, enabling KataGo to win 100 percent of its games against a periodic “attacker.” But after the attacker itself was tweaked (a process that used much less computing power than KataGo’s tweaks), its win rate dropped to 9 percent against small variations on the original attack.

In the second defense attempt, the researchers repeated multiple rounds of an “arms race,” in which new adversarial models discovered new exploits and new defense models attempted to plug those newly discovered holes. Even after 10 rounds of such iterative training, the final defense algorithm only managed to win 19 percent of games against the final attack algorithm, which had discovered never-before-seen variations of the exploit. This was true even when the updated algorithm maintained an advantage over the previous attacker that had been trained in the past.

With the right algorithmic strategies, even kids can become world-class <em>Go</em> You can beat the AI. ” src=”https://cdn.arstechnica.net/wp-content/uploads/2024/07/GettyImages-109417607-640×427.jpg” width=”640″ height=”427″ srcset=”https://cdn.arstechnica.net/wp-content/uploads/2024/07/GettyImages-109417607-1280×853.jpg 2x”/></a><figcaption class=
Expanding / Even kids can beat world-class athletes go If the AI ​​knows the right algorithm usage strategy,

Getty Images

As a final attempt, the researchers Vision TransformerThis is an attempt to avoid the “bad inductive biases” present in the convolutional neural network that originally trained KataGo. This method also failed, winning only 22 percent of the time against variations of a patrol attack that were “reproducible by human experts,” the researchers wrote.

Will it have any effect?

In all three defense attempts, the opponents who defeated KataGo generally did not represent new heights previously unseen. goInstead, these attack algorithms focused on finding exploitable weaknesses in high-performing AI algorithms, even though they would lose to most human players using simple attack strategies.

These exploitable holes highlight the importance of evaluating the “worst-case” performance of an AI system, even if its “average” performance seems superhuman. On average, KataGo can beat even high-level human players using traditional strategies. But in the worst case, an otherwise “weak” opponent can find a hole in the system and cause it to collapse.

It’s easy to extend this thinking to other types of generative AI systems. Successfully complete several complex creative and referential tasks Maybe not yet When faced with a trivial math problem, they fail completely (or Being “poisoned” by malicious prompts). Visual AI models Explain and analyze complex photographs nevertheless Fail miserably when presented with basic geometric shapes.

If you can solve these kinds of puzzles, you may have better visual reasoning abilities than even the most advanced AI.
Expanding / If you can solve these kinds of puzzles, you may have better visual reasoning abilities than even the most advanced AI.

Improving these “worst case” scenarios is Avoid embarrassing mistakes When releasing AI systems to the public, it is often much quicker and easier for a determined “adversary” to find new holes in an AI algorithm’s performance than it is to improve the algorithm and fix the issues, the new research finds.

And if that’s true goThat may be even more true in uncontrolled environments, in games that are highly complex but have tightly defined rules. “The thing about AI is that these vulnerabilities are hard to eliminate,” said Adam Grieve, CEO of FAR. He told Nature“If you can’t solve the problem in a simple domain, go,If this is the case, there seems to be little prospect of a short-term fix for similar issues like the ChatGPT jailbreak.”

Still, the researchers are not despairing. None of their methods ” [new] Attack is impossible.” goTheir strategy was able to plug previously identified, immutable, “fixed” exploits. This is because go “The AI ​​can be hardened by training it against a large enough set of attacks,” the researchers write, proposing future work that could achieve this.

Either way, this new research shows that making AI systems more robust against worst-case scenarios may be just as valuable as pursuing new, more human/superhuman capabilities.

AIs defending exploits simple Superhuman trouble
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
u1news-staff
u1news-staff
  • Website

Related Posts

Physician offers 8 simple steps to bolster a strong immune system

March 7, 2025

Heart disease could be prevented with this one simple test

February 28, 2025

Apple says Safari protects your privacy. We fact checked those claims.

July 30, 2024

GameStop Dunks On Xbox 360 Store Closing And Gets Savaged

July 30, 2024
Add A Comment

Leave A Reply Cancel Reply

Latest Posts

Worst cities for allergies revealed, along with tips to manage symptoms

May 11, 2025

FDA approves first at-home HPV test to screen for cervical cancer

May 10, 2025

Brain stimulation technology improves Parkinson’s treatment for music conductor

May 10, 2025

Left-handedness linked to autism, schizophrenia in major neurological study

May 10, 2025
Unites States

Biden criticises ‘extreme’ Supreme Court in push for reform

July 30, 2024

FBI details shooter’s search history before Trump assassination attempt

July 30, 2024

Reps. Mike Kelly, Jason Crow to lead task force on Trump rally shooting

July 29, 2024

Subscribe to Updates

Get the latest sports news from SportsSite about soccer, football and tennis.

Copyright ©️ All rights reserved. | U1 News
  • Home
  • About Us
  • Contact
  • Privacy Policy
  • Terms & Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.