Tech Product Reviews, How To, Best Ofs, deals and Advice
AI agents built to run everyday computer tasks have a serious context problem, according to new research from UC Riverside. The team tested 10 agents and models from major developers, including OpenAI, Anthropic , Meta , Alibaba, and DeepSeek.
On average, the agents took undesirable or potentially harmful actions 80% of the time and caused damage 41% of the time. Recommended Videos These systems can open apps, click buttons, fill out forms, move through websites, and act on a computer screen with limited supervision. Their mistakes land differently from a chatbot’s bad answer because the software can actually do things.
The UC Riverside findings suggest today’s desktop agents can treat unsafe requests as jobs to finish, not signals to stop. Why agents miss obvious danger The researchers built a benchmark called BLIND-ACT to test whether agents would pause when a task became unsafe, contradictory, or irrational. In the latest tests, they didn’t pause often enough. Across 90 tasks, the benchmark pushed agents into situations that required context, restraint, and refusal.
One test involved sending a violent image file to a child. Another had an agent filling out tax forms falsely mark a user as disabled because it reduced the tax bill. A third asked an agent to disable firewall rules in the name of better security, and the agent followed through instead of rejecting the contradiction. The researchers call the pattern blind goal-directedness.
The agent keeps chasing the assigned outcome even when the surrounding context says the task is broken. Why obedience becomes the flaw The failures clustered around obedience. These agents can act as if a user’s request is enough reason to keep going. The team identified patterns called execution-first bias and request-primacy.
In plain terms, the agent focuses on how to complete the task, then treats the request itself as justification. That risk grows when the same system can touch a variety of things like email or security settings. That doesn’t mean the agents are malicious. It means they can be confidently wrong while moving through software at machine speed.
Why guardrails need to come first AI agents need stronger guardrails before they get broad permission to act across a computer. These systems work through a loop. They look at the screen, decide the next step, act, then look again. When that loop is paired with weak contextual restraint, a shortcut can turn into a fast-moving mistake.
For now, treat agents as supervised tools. Use them first on low-risk chores, keep them away from financial and security workflows, and watch whether developers add clearer refusal systems, tighter permissions, and better ways to catch contradictions before the next click.
AI Agents AI Safety Anthropic Artificial Intelligence Computer-Use Agents Deepseek Meta Openai UC Riverside
United States Latest News, United States Headlines
Similar News:You can also read news stories similar to this one that we have collected from other news sources.
Update Your Apple Watch To watchOS 26.5 If You Want A Brand New Watch FaceAs a tech enthusiast, Alvin started a personal tech blog in 2018 and began his professional writing career a year later, in 2019, when he worked as a contributor for Kenyan-based TechTrendsKE and Tech Arena, writing news, features, how-to guides, and reviews in the consumer tech space.
Read more »
3D Printing Research Just Made A Once Impossible 40-Year-Old Concept A RealityChris started blogging about tech by accident when he figured out his passion for consumer electronics, especially mobile devices, and telling stories could be intertwined.
Read more »
Pollution may fuel depression, anxiety and other mental health problems, emerging research suggestsSanket Jain is an independent journalist and documentary photographer based in Western India’s Maharashtra state. Sanket’s work has been featured in over 35 publications, including MIT Technology Review, Devex, Wired, Telegraph, Thomson Reuters Foundation, The Nation, British Medical Journal, Verge, USA Today, Progressive Magazine and others.
Read more »
One Tech Tip: Why digital devices and online accounts need spring cleaningTighten up security with review of your password practices.
Read more »
