Salesforce is building a "flight simulator" for AI agents because 95% of enterprise pilots will not achieve production

Salesforce He betting that strict tests in simulated business environments are solved by one of the biggest problems of Enterprise Artificial Intelligence: agents working in demonstrations, but fail in the sloppy reality of corporate operations.

The cloud software giant presented three fundamental research initiatives this week, including CRMENA-PROwhat does it call “digital twin“Business operations in which AI agents could be tested before implementation. The commercial appears when enterprises are struggling with universal AI pilot failures and recent security fears after recent violations, which violated lots of of instances of the Salesforce client.

- Advertisement -

“Pilots do not learn to fly in a storm; flight simulators train, which force them to prepare the most extreme challenges,” said Silvio Savarese, chief scientist Salesforce and the head of AI Research during a press conference. “Similarly, AI agents use simulation and training testing, preparing them to deal with the unpredictability of everyday business scenarios before their implementation.”

Push research reflects the growing frustration of enterprises related to the implementation of AI. The latest MIT report showed that 95% of AI generative pilots in corporations will not achieve production, while their very own Salesforce research shows that only large language models themselves achieve only 35% of success indicators in complex business scenarios.

AI scaling hits its limits

Power capitals, the growing costs of the token and inference delay are transforming AI Enterprise. Join our exclusive salon to find how the best teams are:

Changing energy into a strategic advantage
Architect of effective inference regarding real capability profits
Unlocking competitive roi using balanced AI systems

Secure your house to stay ahead: https://bit.ly/4mwgni

Digital Twins for Enterprise AI: How Salesforce simulates real business chaos

CRMENA-PRO It represents the Salesforce sample to fill the gap between the promise of artificial intelligence and performance. Unlike existing reference points that test general capabilities, the platform evaluates agents in the scope of real tasks of enterprises, comparable to customer support escalation, sales forecasting and disruption of supply chain using synthetic, but realistic business data.

“If synthetic data are not carefully generated, they can lead to misleading or above optimistic results about how well your agent actually reaches in the real environment,” he explained Jason WuResearch manager in Salesforce, who managed the development of CRMENNA-PRO.

The platform works relatively in real Salesforce production environments, not toy configuration, using data approved by domain experts with appropriate business experience. It supports each business and business scenarios and can simulate conversations with many abdomen that capture true conversational dynamics.

Salesforce is used as a “zero client” to check these innovations internal. “Before we introduce anything to the market, we will introduce innovations to the hands of our own team to test it,” he said Muralidhar krishnapragidPresident and Cto Salesforce, during a press conference.

Five indicators that determine if your agent AI is ready

In addition to the simulation environment, Salesforce introduced Agentic benchmark for CRMDesigned for the evaluation of AI agents in five critical indicators of the company: accuracy, cost, speed, trust and security and sustainable environmental development.

The sustainable development rate is particularly noteworthy, helping corporations adjust the size of the model with the complexity of the task to cut back environmental impact while maintaining performance. “By crossing the noise of the overload model, the reference point gives companies a clear, based on a given way of pairing appropriate models with appropriate agents,” the company said.

The comparative effort concerns the practical challenge facing IT leaders: with recent AI models issued almost every day, determining which are suitable for specific business applications is becoming more and harder.

Why the sloppy data of the enterprise can make or break the implementation of AI

The third initiative focuses on the basic conditions of reliable AI: pure, unified data. Salesforce’s Account match Insurance uses refined language models for automatic identification and consolidation of duplicate duplicates between systems, considering that “The Education Company, Inc.” and “example Co.” represent the same subject.

Data consolidation works emerged from the partnership between Salesforce research and product teams. “What detail in the cloud of data suggests basically if you think about something as simple as even the user, they have many, many identifiers in many systems in every company,” explained Krishnapragad.

One of the fundamental clients of the cloud suppliers reached 95% of the fitting indicator using technology, saving sellers for half-hour to attach, eliminating the have to manually reference many screens to discover accounts.

Ads appear in connection with increased security problems after the theft campaign, which influenced over 700 organizations of Salesforce clients at the starting of this month. According to Google’s Ground Intelligence Group, Hackers used Oauth tokens From the Salesloft chat agent to access to the Salesforce instance and the theft of certificates for Amazon Web Services, Snowflake and other platforms.

The violation emphasized the gaps in the integration of third parties, on which enterprises rely on the involvement of customers powered by artificial intelligence. Salesforce has had since then Sales ceiling surgery has been removed From the APPEXCHANGE investigating investigation.

The difference between AI demonstrations and the company’s reality is greater than you think

Simulation and comparative initiatives reflect the broader recognition that the implementation of AI Enterprise requires greater than impressive demonstration movies. Real business environments contain older software, inconsistent data formats and complex work flows that may derail even sophisticated AI systems.

“The main aspects we wanted to talk about today are the consistency aspect, so how to make sure that we leave them in an unsatisfactory way, if you just connect LM to the use of the company, in something that will achieve much higher results,” said Savarese during a press conference.

Salesforce’s approach emphasizes the need for reliable work to AI agents in various scenarios, not distinguished narrow tasks. The concept of the company “Enterprise General Intelligence“(EGI) focuses on construction agents, which are each capable and consistent in performing complex business tasks.

Because enterprises are still investing in AI technologies, the success of such platforms CRMENA-PRO He can determine whether the current wave of enthusiasm AI translates into a sustainable business transformation, or whether it becomes one other example of the promise of technology exceeding practical delivery.

Research initiatives will be presented at Dreamforce Salesforce conference in OctoberWhere the company is expected to announce additional AI changes because it strives to take care of its managerial position on the increasingly competitive AI Enterprise market.

Daily observations in matters of business use with VB every day

If you wish to impress your boss, VB Daily is covered by you. We offer you an internal measure about what corporations do with generative artificial intelligence, from regulatory changes to practical implementation, so you’ll be able to share insights for the maximum roi.

Read our Privacy Policy

Thanks for the subscription. Check out more VB newsletter here.

There was a mistake.

Salesforce is building a “flight simulator” for AI agents because 95% of enterprise pilots will not achieve production

Digital Twins for Enterprise AI: How Salesforce simulates real business chaos

Five indicators that determine if your agent AI is ready

Why the sloppy data of the enterprise can make or break the implementation of AI

The difference between AI demonstrations and the company’s reality is greater than you think

Latest Posts

Recomended