This Hacker Team Is Bulletproofing AI Models For Companies Like OpenAI And Anthropic

  • 📰 ForbesTech

Sarah Emerson is a senior writer who reports on technology companies and culture in Silicon Valley. She's broken news about the empires of billionaires such as Eric Schmidt and fallen billionaire Ryan Breslow. Sarah has also followed the trends and ideologies shaping today's AI zeitgeist.

The researchers behind Gray Swan AI started the company after finding a major vulnerability in models from OpenAI, Anthropic, Google and Meta. Now, they build products that help safeguard them.

The breakneck pace at which AI is evolving has created a vast ecosystem of new companies: some create ever more powerful models, while others identify the threats that may accompany them. Gray Swan is among the latter, but it goes a step further by building safety and security measures for some of the issues it identifies. “We can actually provide the mechanisms by which you remove those risks or at least mitigate them,” Kolter said.

Looking forward, Gray Swan is keen on cultivating a community of hackers, and it’s not alone. At last year’s Defcon security conference, more than 2,000 people participated in an AI [...] often enlist internal and external red teamers to assess new models, and have announced official bug bounty programs that reward sleuths for exposing exploits around high-risk domains, such as CBRN (chemical, biological, radiological and nuclear). [...] a vulnerability in Anthropic’s Claude Sonnet-3.5, are also valuable resources for model developers.


Tags: AI Safety, AI Models, Security, OpenAI, Jailbreaking, Ethical Hacking, Anthropic, Zico Kolter, Red Teaming
