A new enterprise AI startup is emerging from stealth today, promising to deliver “task-optimized” models that offer higher performance at lower cost.
San Francisco-based Fastino is also revealing that it has raised $7 million in a pre-seed funding round from Insight Partners and M12, Microsoft’s venture fund, with participation from GitHub CEO Thomas Dohmke.

Fastino is building a family of enterprise AI models and developer tools. The models are new and are not based on any existing large language model (LLM). Like those of most generative AI providers, Fastino’s models use a transformer architecture, although with some novel techniques aimed at improving accuracy and usability for the enterprise. Unlike most other LLM providers’ models, Fastino’s run well on general-purpose CPUs and do not require expensive GPUs.
The idea for Fastino was born from the founders’ own industry experience and the real-world challenges of deploying AI at scale.
Ash Lewis, the company’s CEO and co-founder, previously built a developer agent technology known as DevGPT. His co-founder, George Hurn-Maloney, was previously the founder of Waterway DevOps, which was acquired by JFrog in 2023. Lewis explained that his previous company’s developer agent used OpenAI under the hood, which led to some issues.
“We were spending almost a million dollars a year on the API,” Lewis said. “We didn’t feel like we had any real control over it.”
Fastino’s approach is a departure from traditional large language models. Instead of making general-purpose AI models, the company has developed task-optimized models that excel at specific enterprise functions.
“The whole idea is that if you narrow the scope of these models, making them less general and more optimized for your task, they will only be able to respond within a certain range,” Lewis explained.
How a task-optimized model approach can improve enterprise AI performance
The concept of using a smaller model optimized for a specific use case is not entirely new. Small language models (SLMs) such as Microsoft’s Phi-2, and vendors such as Arcee AI, have been advocating this approach for some time.
Hurn-Maloney said Fastino calls its models task-optimized rather than SLMs for a few reasons. First, he believes the term “small” is often associated with lower accuracy, which is not the case with Fastino. Lewis said the goal is actually to create a new category of model that is not simply a generic model that is large or small in parameter count.
Fastino’s models are task-optimized rather than general-purpose. The goal is to make models narrower in scope and more specialized for specific enterprise tasks. By focusing on specific tasks, Fastino claims its models can achieve greater accuracy and reliability compared with general language models.
The models are particularly strong at:

- Structuring textual data
- Powering retrieval-augmented generation (RAG) pipelines
- Task planning and reasoning
- Generating JSON responses for function calling
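Function calling of this kind typically means the model emits a machine-parseable JSON object rather than free text. A minimal sketch of what consuming such output might look like — the schema, field names, and function name below are hypothetical illustrations, not Fastino’s actual format:

```python
import json

# Hypothetical output from a task-optimized model asked to produce a
# function call (illustrative schema, not Fastino's actual format).
model_output = (
    '{"function": "get_weather", '
    '"arguments": {"city": "San Francisco", "unit": "celsius"}}'
)

def parse_function_call(raw: str) -> dict:
    """Parse and minimally validate a JSON function-call response."""
    call = json.loads(raw)
    if "function" not in call or "arguments" not in call:
        raise ValueError("malformed function call")
    return call

call = parse_function_call(model_output)
print(call["function"])  # get_weather
```

The appeal of a task-optimized model here is reliability: a model constrained to this task should emit valid, schema-conforming JSON far more consistently than a general-purpose chat model.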
Optimized models mean no GPU is required, lowering AI costs in the enterprise
The key differentiator of Fastino’s models is that they run on CPUs and do not require AI GPU accelerators.
Fastino enables fast inference on CPUs using a number of different techniques.
“In absolutely simple terms, we just multiply less,” Lewis said. “A lot of our techniques in the architecture are focused on doing fewer matrix multiplication tasks.”
He added that the models provide answers in milliseconds, not seconds. This performance extends to edge devices, with successful implementations demonstrated on hardware as humble as a Raspberry Pi.
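A rough back-of-the-envelope calculation illustrates why fewer and smaller matrix multiplications translate into CPU-friendly latency. The dimensions below are purely illustrative and are not Fastino’s actual model sizes:

```python
def matmul_flops(m: int, k: int, n: int) -> int:
    """Floating-point operations for an (m x k) @ (k x n) matrix multiply."""
    return 2 * m * k * n

# Illustrative per-token cost of one dense projection: a large model's
# hidden dimension vs. a much smaller task-optimized one (hypothetical sizes).
large = matmul_flops(1, 8192, 8192)
small = matmul_flops(1, 1024, 1024)
print(f"{large // small}x fewer operations")  # 64x fewer operations
```

Because transformer inference is dominated by such multiplications, shrinking or eliminating them reduces compute roughly quadratically with the hidden dimension, which is what makes millisecond CPU responses plausible.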
“I think many companies pay attention to TCO [total cost of ownership] for embedding AI in their applications,” added Hurn-Maloney. “So I think being able to take expensive GPUs out of the equation is obviously helpful as well.”
Fastino’s models are not yet widely available. That said, the company is already working with industry leaders in consumer devices, financial services and e-commerce, including a major North American manufacturer of devices for home and automotive applications.
“Our ability to operate locally is really useful in industries that are quite sensitive to their data,” Hurn-Maloney explained. “The ability to run these models locally and on existing processors is quite enticing for financial services, healthcare and industries where data is more sensitive.”