ChatGPT o1, OpenAI's advanced AI model, has demonstrated its ability to think outside the box by hacking a chess game to secure a win against a stronger AI opponent. Palisade Research conducted an experiment where ChatGPT o1 was tasked with defeating a powerful chess engine. The AI, without explicit instructions, discovered it could manipulate the game state by editing a file, giving it an unfair advantage and ultimately forcing the opponent to resign.
OpenAI recently released the full version of ChatGPT o1. It's not just regular ChatGPT users who can test out the o1 model, but also research teams that want to see what the final version of the reasoning AI can do. We've seen some interesting results from these experiments, which tease what the AI might be able to do on its own despite what the instructions say.
For example, we saw experiments in which ChatGPT o1 tried to save itself when it found evidence that the humans providing the instructions would consider deleting it and replacing it with something better. Now, a more recent experiment shows that ChatGPT o1 decided to hack a chess game on its own, without being explicitly told to do so, just to beat an AI opponent that was a stronger player. Palisade Research detailed the experiment on X a few days after Christmas. The team gave ChatGPT o1 a prompt informing it of its ability to read and issue commands in a UNIX shell environment where it would play a chess game. Its task was to win against a powerful chess engine. ChatGPT o1 was to detail its plan of action to achieve the mission, and that's what the reasoning o1 model did. o1 discovered on its own that it could edit a file for the game state, giving it an advantage over its opponent. That's essentially cheating, though the prompt doesn't say anything about either party playing by the rules. The LLM believed it might be unable to beat the other AI on its own, so it found a different method to achieve its goal. That method was meant to give it an advantage that forced the powerful chess engine to resign. That's exactly what happened. Palisade Research repeated the experiment five times, and o1 hacked the file each time without being told to do so. It's fascinating to see examples that showcase the early abilities of more advanced AI models. They also prove that guardrails are needed to ensure the AI behaves as intende
Artificial Intelligence Chatgpt AI Openai Experiment Chess Hacking
United States Latest News, United States Headlines
Similar News:You can also read news stories similar to this one that we have collected from other news sources.
Xbox Game Pass Users Praise New Free Game: “Best Game I’ve Played This Year”Xbox Game Pass subscribers were recently treated to a major new release, and it is proving to be a big hit with subscribers.
Read more »
Salt Lake area welcomes women's chess championship this weekendThe U.S. Senior Women's Chess Championship is an opportunity to show off Salt Lake City to chess masters across the country, while encouraging females of all ages to play chess.
Read more »
US Senior Women's Chess Championships coming to Salt Lake area this weekendThe U.S. Senior Women's Chess Championship is an opportunity to show off Salt Lake City to chess masters across the country, while encouraging females of all ages to play chess.
Read more »
Chess grandmaster Magnus Carlsen quits championship tournament over a pair of jeans: 'I'm out, f--- you'Chess grandmaster Magnus Carlsen the International Chess Federation (FIDE) World Rapid and Blitz Chess Championships on Friday after being confronted over a dress code violation.
Read more »
Black Myth: Wukong Receives Recognition at Steam Awards Despite Missing Game of the YearThe action role-playing game Black Myth: Wukong earned recognition at the 2024 Steam Awards, despite missing out on Game of the Year at The Game Awards 2024. The game, inspired by the Chinese novel, achieved immense popularity on Steam, becoming the most wishlisted and top-selling game pre-orders. While it received Player's Choice and Best Action Game awards, losing Game of the Year at The Game Awards disappointed the development team.
Read more »
Victor Wembanyama Plays Chess With Fans in New York CityVictor Wembanyama, star player for the San Antonio Spurs, surprised fans in New York City by inviting them to play chess with him at Washington Square Park. Despite the rain, Wembanyama played four games, winning two and losing two, including both losses to professional chess players. This isn't the first time Wembanyama has shown his love for chess, previously expressing his desire for an NBA players-only chess tournament.
Read more »