OpenAI president Greg Brockman posted from his X account what appears to be the first public image generated using the company’s latest GPT-4o model.
As you can see in the image below, it’s quite convincingly photorealistic: it shows a person in a black T-shirt with the OpenAI logo writing in chalk on a blackboard that reads, “Transfer between modalities. Suppose we directly model P (text, pixels, audio) with one large autoregressive transformer. What are the benefits and disadvantages?”
The latest GPT-4o model, which debuted on Monday, improves on the previous family of GPT-4 models (GPT-4, GPT-4 Vision and GPT-4 Turbo) by being faster, cheaper and better at retaining information from non-text inputs such as audio and images.
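For developers, GPT-4o is already exposed through OpenAI’s chat completions API under the model name “gpt-4o.” Below is a minimal sketch, assuming the official Python SDK, of a request that mixes text with an image input; the image URL is a placeholder:

```python
# A minimal sketch using OpenAI's Python SDK; the image URL below is a
# placeholder and exact parameters may differ.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text",
                 "text": "Describe what is written on the blackboard."},
                {"type": "image_url",
                 "image_url": {"url": "https://example.com/blackboard.jpg"}},
            ],
        }
    ],
)
print(response.choices[0].message.content)
```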
These gains stem from a different approach than OpenAI took with its previous GPT-4 LLMs. While those stitched together multiple separate models and converted media such as audio and images into text and back again, GPT-4o was trained on multimodal tokens from the outset, allowing it to analyze and interpret images and audio directly, without first converting them to text.
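To make that architectural difference concrete, here is a deliberately simplified toy sketch; every function is a stub invented for illustration, and none corresponds to OpenAI’s actual components:

```python
# Toy illustration only: stubs stand in for real models, and nothing here
# corresponds to OpenAI's actual code. Only the data flow matters.

def speech_to_text(audio: bytes) -> str:
    """Stub ASR model: tone, emphasis and speaker identity are discarded."""
    return "transcribed words only"

def text_llm(prompt: str) -> str:
    """Stub text-only LLM."""
    return f"reply to: {prompt}"

def text_to_speech(text: str) -> bytes:
    """Stub TTS model: synthesizes a generic voice from plain text."""
    return text.encode()

def pipeline_respond(audio: bytes) -> bytes:
    # Pre-GPT-4o pipeline: three models chained through lossy text hops.
    return text_to_speech(text_llm(speech_to_text(audio)))

def tokenize(data: bytes, modality: str) -> list[tuple[str, int]]:
    """Stub tokenizer mapping any modality into one shared token space."""
    return [(modality, byte) for byte in data[:4]]

def multimodal_transformer(tokens: list[tuple[str, int]]) -> list[tuple[str, int]]:
    """Stub for a single autoregressive model over interleaved tokens.

    Its output tokens can belong to any modality (text, image patches
    or audio), so no intermediate text conversion is needed.
    """
    return tokens + [("audio", 0)]

def native_respond(audio: bytes) -> list[tuple[str, int]]:
    # GPT-4o-style: audio in, tokens out, with nonverbal signal preserved.
    return multimodal_transformer(tokenize(audio, "audio"))
```

The practical upshot is that signal a transcript would discard, such as laughter or tone of voice, stays available to the model end to end.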
Judging by the image above, the new approach is a noticeable improvement over DALL-E 3, OpenAI’s previous image generation model, which debuted in September 2023. I ran a similar prompt through DALL-E 3 in ChatGPT, and here is the result.
As you can see, the image Brockman shared, created with GPT-4o, is a significant improvement in quality, photorealism and the accuracy of rendered text.
However, GPT-4o’s native image generation capabilities are not yet publicly available. As Brockman alluded to in his X post, “The team is working hard to share them with the world.”