Friday, 26 September 2025
27.9 C
Singapore
26.7 C
Thailand
20.1 C
Indonesia
26.8 C
Philippines

Salesforce advances enterprise AI with new agent simulation, benchmarking and data tools

Salesforce introduces new AI simulation, benchmarking and data tools to help enterprises deploy reliable and sustainable AI agents.

Salesforce AI Research has introduced a series of innovations aimed at helping enterprises prepare AI agents for real-world use. The company unveiled an advanced simulation platform for testing AI in complex business settings, launched new benchmarking tools to measure agent performance, and enhanced its Data Cloud with smarter data unification capabilities.

Preparing AI agents for real-world scenarios

To improve the way AI agents are trained before deployment, Salesforce has released CRMArena-Pro, an enterprise-grade simulation environment. Building on the earlier CRMArena tool, the new platform tests AI performance in multi-turn, multi-agent tasks such as sales forecasting, customer service triage and configure-price-quote processes. The environment uses synthetic data and mimics real business complexity, including API integrations and safeguards for personal information.

By modelling unpredictable business events, CRMArena-Pro enables companies to evaluate an AI agent’s accuracy, efficiency and resilience before it goes live. The system acts as a digital twin of enterprise operations, allowing safe experimentation while preparing AI to handle edge cases like supply chain disruptions or high-pressure customer service scenarios.

Setting benchmarks for AI readiness

Salesforce also introduced the Agentic Benchmark for CRM, the first assessment designed to test AI agents in the specific contexts that businesses care about. It measures five key metrics — accuracy, cost, speed, trust and safety, and sustainability. This approach gives IT leaders a clearer way to compare models and select those that match their operational needs.

Sustainability is a new metric in the evaluation process, reflecting the environmental impact of AI systems. As models become larger and more resource-intensive, the benchmark helps companies balance computing demands with performance needs.

Complementing this, Salesforce released MCP-Eval and MCP-Universe, two additional benchmarking frameworks. MCP-Eval provides scalable synthetic tests across a wide range of systems, while MCP-Universe offers more challenging, real-world scenarios to identify where AI agents might fail. Together, they allow organisations to diagnose weaknesses and fine-tune their agents for reliable enterprise performance.

Improving data for AI-driven decisions

Recognising that quality data is crucial for AI, Salesforce has added a new capability to its Data Cloud called Account Matching. This system uses large and small language models to automatically merge duplicate and inconsistent records across business units. Instead of manual rule-based clean-ups, AI now matches and unifies accounts — for example, linking “The Example Company, Inc.” with “Example Co.” into a single record.

Early customer results show significant impact. One company using Account Matching unified over one million accounts in its first month, achieving a 95% match rate and cutting average handling time by 30 minutes per case. The tool also reduces manual work by sending only the most complex data cases to humans, speeding up sales cycles and improving efficiency.

Moving toward the agentic enterprise

These updates show Salesforce’s push to equip companies with practical tools to adopt AI responsibly and effectively. By combining simulation, reliable benchmarking and clean data, the company aims to help businesses build “agentic enterprises” — organisations where AI works alongside humans to handle everyday tasks, reduce operational friction and support growth.

Hot this week

Huawei co-develops DeepSeek model with stronger censorship tools

Huawei co-develops DeepSeek-R1-Safe AI model with Zhejiang University, boosting censorship tools and outperforming rivals in safety tests.

Tech Week Singapore 2025 to highlight AI’s role in global collaboration

Tech Week Singapore 2025 will gather global leaders to explore AI, cybersecurity, and digital transformation on 8–9 October.

UAE launches BRIDGE Summit in Abu Dhabi to unite global media and entertainment leaders

UAE launches BRIDGE Summit in Abu Dhabi, the world’s largest media and entertainment platform, bringing together over 60,000 participants.

OpenAI rumoured to be developing ChatGPT-powered devices

OpenAI is reportedly developing ChatGPT-powered devices, including smart glasses and wearables, with a launch expected by 2026.

Windows 11 tests new Copilot Vision button in taskbar

Microsoft tests a new Copilot Vision button in Windows 11, letting users share app content with its AI assistant for instant analysis.

SEON enhances AI tools to help fraud and AML teams act faster

SEON launches enhanced AI tools that cut fraud and AML review time by up to 50%, offering faster insights and clear decision-making support.

Waste4Change scales operations across 19 locations

Waste4Change posts a decade of 88% growth, collecting 64.9m kg of waste in Indonesia while expanding operations and cutting carbon emissions.

Meta launches pop-up stores to showcase new Ray-Ban smart glasses

Meta opens pop-up stores to showcase its new Ray-Ban Display smart glasses, which feature built-in augmented reality capabilities.

Lenovo launches AI-ready IT solutions to help small and medium businesses modernise and scale

Lenovo introduces AI-ready IT solutions and flexible pricing models to help small and medium businesses modernise easily and scale efficiently.

Related Articles

Popular Categories