Close Menu
  • Home
  • AI
  • Entertainment
  • Finance
  • Sports
  • Tech
  • USA
  • World
  • Latest News

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

What's Hot

What we know as markets brace for disruption

March 1, 2026

What happens next and what does it mean for the country?

March 1, 2026

After Khamenei’s death, Iran may denounce ‘more vehemently’

March 1, 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram Vimeo
BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates
  • Home
  • AI
  • Entertainment
  • Finance
  • Sports
  • Tech
  • USA
  • World
  • Latest News
BWE News – USA, World, Tech, AI, Finance, Sports & Entertainment Updates
Home » Microsoft built a fake marketplace to test its AI agent – and it failed in a surprising way
AI

Microsoft built a fake marketplace to test its AI agent – and it failed in a surprising way

adminBy adminNovember 5, 2025No Comments2 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp VKontakte Email
Share
Facebook Twitter LinkedIn Pinterest Email


On Wednesday, Microsoft researchers released a new simulation environment designed to test AI agents, along with new research showing that current agent models may be vulnerable to manipulation. The study, conducted in collaboration with Arizona State University, raises new questions about how well AI agents perform when working without supervision, and how quickly AI companies can realize the promise of their future.

The simulation environment, named “Magentic Marketplace” by Microsoft, is built as a synthesis platform for experimenting with AI agent behavior. In a typical experiment, a customer agent might try to order dinner according to a user’s instructions, while agents representing different restaurants compete to get the order.

The team’s first experiment involved 100 individual customer-side agents interacting with 300 business-side agents. Because the Marketplace source code is open source, it is easy for other groups to adapt the code to run new experiments and reproduce the results.

Ece Kamar, managing director of Microsoft Research’s AI Frontiers Lab, said this type of research will be important for understanding the capabilities of AI agents. “There are real questions about how the world changes when these agents work together and talk to each other and negotiate with each other,” Kamal said. “We want to understand these things deeply.”

In our initial research, we investigated a combination of key models, including GPT-4o, GPT-5, and Gemini-2.5-Flash, and discovered some surprising weaknesses. Specifically, researchers have discovered several techniques that companies can use to manipulate customer agents into purchasing their products. Researchers found that efficiency decreased, especially as customer agents had more options to choose from and vast amounts of agent attention space.

“We want these agents to help us work through a lot of options,” Comer says. “And we find that the current model is actually overwhelmed by too many options.”

Agents also encountered problems when asked to work together toward a common goal. Apparently, they didn’t know which agent should play what role in the collaboration. Although giving the model clearer instructions on how to collaborate improved performance, the researchers believed that the model’s unique features still needed improvement.

tech crunch event

san francisco
|
October 13-15, 2026

“You can instruct a model step-by-step, just like you would teach a model,” Comer says. “But if you’re essentially testing collaborative features, you would expect these models to have those features by default.”



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp Email
Previous ArticlePinterest’s stock price slumps, dropping 20% ​​due to weak business performance as tariffs hinder advertising
Next Article Stock Market Today: Live Updates
admin
  • Website

Related Posts

A trap that Anthropic has built for itself.

March 1, 2026

Billion-dollar infrastructure deal fuels AI boom

February 28, 2026

Anthropic’s Claude rises to No. 2 on App Store following Pentagon dispute

February 28, 2026

OpenAI’s Sam Altman announces ‘technical safeguards’ agreement with Department of Defense

February 28, 2026
Leave A Reply Cancel Reply

Our Picks

Newly freed hostages face long road to recovery after two years in captivity

October 15, 2025

Former Kenyan Prime Minister Raila Odinga dies at 80

October 15, 2025

New NATO member offers to buy more US weapons to Ukraine as Western aid dwindles

October 15, 2025

Russia expands drone targeting on Ukraine’s rail network

October 15, 2025
Don't Miss
Entertainment

Watch the SAG Awards Ceremony from 20 years ago

By adminMarch 1, 20260

Actor Awards 2026 Nominees: Cynthia Erivo, Gwyneth Paltrow, More Cynics & SurprisesThat was in 2006.…

Dolly Parton praises Ozzy Osbourne

March 1, 2026

Harry Styles’ red carpet fashion look

February 28, 2026

Bridgerton showrunner Phoebe Dynevor talks about recasting Regé-Jean Page

February 28, 2026
About Us
About Us

Welcome to BWE News – your trusted source for timely, reliable, and insightful news from around the globe.

At BWE News, we believe in keeping our readers informed with facts that matter. Our mission is to deliver clear, unbiased, and up-to-date news so you can stay ahead in an ever-changing world.

Our Picks

British Greens: How working-class plumbers put a knife to Starmer’s election plan

March 1, 2026

Charles Kushner: How the US envoy’s ‘incomprehension’ of diplomacy surprised France

March 1, 2026

What we know about the US and Israeli attack on Iran and Iranian retaliation

March 1, 2026

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Facebook X (Twitter) Instagram Pinterest
  • Home
  • About Us
  • Advertise With Us
  • Contact US
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2026 bwenews. Designed by bwenews.

Type above and press Enter to search. Press Esc to cancel.