Skip to main content

OpenAI’s latest model creates life like images and readable text, try it free

ChatGPT and OpenAI logos.
OpenAI

OpenAI has introduced its 4o model into ChatGPT to enable native image generation within the chatbot atmosphere. This upgrade makes it so you don’t have to use OpenAI’s Dall-E image generation model as a separate entity, though Dall-E remains available for those as a preference. The AI brand has also enabled its Sora AI video generator within ChatGPT. 

The new features are currently available for ChatGPT free users, as well as for ChatGPT Plus, Team, and Pro users. Availability will be coming to enterprise and education users next week.

Previously, Dall-E 3 was the image generation plug-in for paid ChatGPT subscribers. Meanwhile, those who wanted to try the generator for free could do so through the basic tier of Microsoft Copilot

Recommended Videos

The model has been lauded as one of the top image generators available, particularly in its paid version. Despite the benefit of all ChatGPT users being able to use image generation natively with the 4o model, those using the free tier of ChatGPT should be prepared to run into some limitations, such as maximums for file uploads and data analysis, CNET noted. 

Even so, ChatGPT will benefit from having more realistic images with more legible text after OpenAI spent a year having GPT-4o go through a post-launch training effort called “reinforcement learning from human feedback” (RLHF), according to the Wall Street Journal

After announcing GPT-4o in May 2024, OpenAI had a team of over 100 “human trainers” scouring the model for typos, as well as common errors in hands and faces, the project’s lead researcher, Gabriel Goh told the publication.

The GPT-4o model will also bring to ChatGPT the ability to create transparent backgrounds. This should be a major benefit for business users and creatives, as it will allow them to create logos or other iconography, ChatGPT multimodal product lead, Jackie Shannon also noted to WSJ. 

Despite the improvements that OpenAI has made, the updated GPT-4o model as a whole still has its shortcomings. It still has a propensity toward hallucinations, which is a common AI feature that has yet to be resolved. Maintaining editing consistency remains a challenge within the ChatGPT atmosphere; however, OpenAI has promised rapid updates, as early as next week. 

Another ongoing issue for OpenAI is the matter of ethics and legality. The brand insists its model was trained on “publicly available data,” and through proprietary data it owns via partnerships with brands including Shutterstock, WSJ noted. 

Images generated through ChatGPT based on the 4o model won’t have AI watermarks. However, the brand has indicated images will include C2PA⁠ metadata denoting them as AI-generated. This remains the industry standard.

Fionna Agomuoh
Fionna Agomuoh is a Computing Writer at Digital Trends. She covers a range of topics in the computing space, including…
Meta’s latest open source AI models challenge GPT, Gemini, and Claude
Meta AI widget on Home Screen.

Meta has announced the latest iteration of its open-source AI model family Llama 4, which the brand has developed while competition in the generative AI industry continues to intensify.

The new AI family includes four models, and Meta detailed Llama 4 Scout, Llama 4 Maverick, and Llama 4 Behemoth. Meta detailed on its AI website that the models were trained on “large amounts of unlabeled text, image, and video data.” This indicates that the models will have varied multimodal capabilities.

Read more
OpenAI adjusts AI roadmap for better GPT-5
OpenAI press image

OpenAI is reconfiguring its rollout plan for upcoming AI models. The company’s CEO, Sam Altman shared on social media on Friday that it will delay the launch of its GPT-5 large language model (LLM) in favor of some lighter reasoning models to release first.

The brand will now launch new o3 and o4-mini reasoning models in the coming weeks as an alternative to the GPT-5 launch fans were expecting. In this time, OpenAI will be smoothing out some issues in developing the LLM before a final rollout. The company hasn’t detailed a specific timeline, just indicating that GPT-5 should be available in the coming months.

Read more
Midjourney’s new image generation model announced to take on OpenAI’s GPT-4o
Midjourney logo on web explore feed.

Even though MidJourney set out to be one of the most promising image generation models in the early days of AI, it appears to have fallen behind more accessible, easy to use, and free tools such Gemini, ChatGPT, and Bing. Adding to its woes is the latest update to OpenAI's GPT-4o model which allows exceptionally good image generation with the ability to recreate real photos and produce immaculate text. So to stay relevant -- or perhaps catch the hype train being shunted by the wave of Studio Ghibli-inspired AI art flooding the internet, MidJourney is rolling out an updated model with several improvements.

CEO David Holz announced details of the new V7 model on MidJourney's official Discord server and through a blog post. They said the new model is "smarter with text prompts" and produces images with "noticeably higher" quality and "beautiful textures."

Read more