AI Agents Drop the Ball: Struggling with CRM and Confidentiality Concerns

Researchers at Salesforce have revealed that LLM-based AI agents are struggling with CRM tests, achieving only a 58% success rate on single-step tasks and a mere 35% on multi-step tasks. The CRMArena-Pro tool highlights their poor performance and low confidentiality awareness. There’s a significant gap between LLM capabilities and real-world enterprise demands.

3P

Published: June 16, 2025 1:48 pmAdded: June 16, 2025 at 7:06 amAssembled by: The Editor

Pro Dashboard

Hot Take:

Looks like LLM-based AI agents are not the CRM superheroes we were hoping for. Instead of saving the day, they’re fumbling confidential info and tripping over multi-step tasks like a toddler learning to walk. Salesforce might want to rethink that “very high margin opportunity” they’ve been dreaming about. Until then, humans, you’re still the reigning champs of customer service!

Key Points:

LLM agents score only 58% on single-step CRM tasks.
Performance drops to a dismal 35% on multi-step tasks.
LLM agents struggle with handling confidential information.
Benchmark called CRMArena-Pro used for the study.
Salesforce sees AI agents as a potential high-margin opportunity.

Pro Dashboard

Membership Required

You must be a member to access this content.

View Membership Levels

Already a member? Log in here