Salesforce study finds LLM agents flunk CRM and confidentiality tests

Joseph K
Jun 16, 2025
1 min read

A new benchmark developed by academics shows that LLM-based AI agents perform below par on standard CRM tests and fail to understand the need for customer confidentiality.

A team led by Kung-Hsiang Huang, a Salesforce AI researcher, showed that using a new benchmark relying on synthetic data, LLM agents achieve around a 58 percent success rate on tasks that can be completed in a single step without needing follow-up actions or more information.

https://www.theregister.com/2025/06/16/salesforce_llm_agents_benchmark/

Salesforce study finds LLM agents flunk CRM and confidentiality tests

Comments

Recent Posts

How an AI Agent Wiped Out 2.5 Years of Production Data in Minutes

Caylent Names Valerie Henderson CEO to Spearhead "Agentic" AI Delivery on AWS

Inside Amazon's Leaked Playbook: Defending the $50B OpenAI Deal and Navigating Anthropic Tensions

AWS Plans Massive New Quantum Computing Research Center in Pasadena

Amazon Blames AI-Assisted Deployments for Recent AWS Cloud Outages

Get In Touch

Headquarters

Seoul Office