
Even as large language models and reasoning models remain popular, organizations are increasingly turning to smaller models to run AI pipelines with less energy and lower costs.
While some organizations distill larger models into smaller versions, model providers like Google continue to release small language models (SLMs) as an alternative to large language models (LLMs), which may cost more to run without delivering better performance or accuracy.
With this in mind, Google has released the latest version of its small model, Gemma, which features longer context windows, larger parameter counts and more multimodal reasoning.
Gemma 3, which has the same processing power as the larger Gemini 2.0 models, is still best used on smaller devices such as phones and laptops. The new model comes in four sizes: 1B, 4B, 12B and 27B parameters.
With a larger context window of 128K tokens (by comparison, Gemma 2 had an 80K context window), Gemma 3 can take in more information and handle more complicated requests. Google has updated Gemma 3 to work in 140 languages, analyze images, text and short videos, and support function calling to automate tasks and agentic workflows.
Gemma delivers strong results
To further reduce computing costs, Google has introduced quantized versions of Gemma. Think of quantized models as compressed models. This is achieved by "reducing the precision of the numerical values in the model's weights" without sacrificing accuracy.
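To make the idea concrete, here is a minimal, illustrative sketch of one common quantization scheme (symmetric int8 with a single scale factor). This is a toy example of the general technique, not Google's actual quantization recipe for Gemma:

```python
import numpy as np

def quantize_int8(weights: np.ndarray):
    """Symmetric int8 quantization: store weights as int8 plus one float scale."""
    scale = np.abs(weights).max() / 127.0  # map the largest weight magnitude to 127
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover approximate float32 weights from the int8 representation."""
    return q.astype(np.float32) * scale

# Toy weight tensor standing in for a model layer
rng = np.random.default_rng(0)
w = rng.normal(0, 0.02, size=4096).astype(np.float32)

q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)

print(f"storage: {w.nbytes} bytes -> {q.nbytes} bytes (4x smaller)")
print(f"max reconstruction error: {np.abs(w - w_hat).max():.6f}")
```

Each float32 weight shrinks to a single byte, and the reconstruction error is bounded by half the scale, which is why carefully quantized models lose little accuracy.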
Google said Gemma 3 "delivers state-of-the-art performance for its size" and outperforms leading LLMs such as Llama-405B, DeepSeek-V3 and o3-mini. Gemma 3 27B, in particular, came in second to DeepSeek-R1 in Chatbot Arena Elo scores. It topped DeepSeek's smaller model, DeepSeek V3, as well as OpenAI's o3-mini, Meta's Llama-405B and Mistral Large.
By quantizing Gemma 3, users can improve performance and run the model to build applications "that can fit on a single GPU and tensor processing unit (TPU) host."
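A rough back-of-the-envelope calculation shows why quantization matters for single-accelerator deployment. The figures below cover model weights only (no KV cache, activations or runtime overhead) and are illustrative assumptions, not Google's published numbers:

```python
def weight_gb(params_billion: float, bytes_per_param: float) -> float:
    """Approximate memory footprint of model weights alone, in GB."""
    return params_billion * 1e9 * bytes_per_param / 1e9

# Compare 16-bit weights (2 bytes/param) with 4-bit quantized weights (0.5 bytes/param)
for size in (1, 4, 12, 27):
    bf16 = weight_gb(size, 2.0)
    int4 = weight_gb(size, 0.5)
    print(f"{size}B params: ~{bf16:.0f} GB at bf16 vs ~{int4:.1f} GB at int4")
```

Under these assumptions, the 27B model drops from roughly 54 GB of weights at 16-bit precision to about 13.5 GB at 4-bit, small enough to fit on a single high-memory consumer or datacenter GPU.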
Gemma 3 integrates with developer tools such as Hugging Face Transformers, Ollama, JAX, Keras, PyTorch and others. Users can also access Gemma 3 through Google AI Studio, Hugging Face or Kaggle. Companies and developers can request access to the Gemma 3 API through AI Studio.
Shielding Gemma for safety
Google said it has built safety protocols into Gemma 3, including an image safety checker called ShieldGemma 2.
"While thorough testing of more capable models often informs our assessment of less capable ones, Gemma 3's enhanced STEM performance prompted specific evaluations focused on its potential for misuse in creating harmful substances; their results indicate a low risk level," Google wrote in a blog post.
ShieldGemma 2 is a 4B-parameter image safety checker built on the Gemma 3 foundation. It detects and prevents the model from responding to images containing sexually explicit content, violence and other dangerous material. Users can customize ShieldGemma 2 for their specific needs.
Small models and distillation are on the rise
Since Google released Gemma in February 2024, SLMs have seen an uptick in interest. Other small models, such as Microsoft's Phi-4 and Mistral Small 3, suggest that enterprises want to build applications with models that are as powerful as LLMs, without necessarily needing the full breadth of what an LLM can do.
Enterprises have also begun turning to smaller versions of the LLMs they prefer, created through distillation. To be clear, Gemma is not a distillation of Gemini 2.0; rather, it is trained on the same dataset and architecture. A distilled model learns from a larger model, which Gemma does not.
Organizations often prefer to match particular use cases to a model. Instead of deploying an LLM like o3-mini or Claude 3.7 Sonnet in a simple code editor, a smaller model, whether an SLM or a distilled version, can easily perform those tasks without the excessive processing overhead of a huge model.