OpenAI introduces Sora, its text-to-video AI model

It can create videos up to one minute long.

OpenAI’s latest model takes text prompts and turns them into ‘complex scenes with multiple characters, specific types of motion,’ and more.

By Emma Roth, a news writer who covers the streaming wars, consumer tech, crypto, social media, and much more. Previously, she was a writer and editor at MUO.

An AI-generated video showing someone walking on a street in Tokyo
Image: OpenAI

OpenAI is launching a new video-generation model, and it’s called Sora. The AI company says Sora “can create realistic and imaginative scenes from text instructions.” The text-to-video model allows users to create photorealistic videos up to a minute long — all based on prompts they’ve written.

Sora is capable of creating “complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background,” according to OpenAI’s introductory blog post. The company also notes that the model can understand how objects “exist in the physical world,” as well as “accurately interpret props and generate compelling characters that express vibrant emotions.”

The model can also generate a video from a still image, fill in missing frames in an existing video, or extend it. The Sora-generated demos in OpenAI’s blog post include an aerial scene of California during the gold rush, a video that looks as if it were shot from inside a Tokyo train, and others. Many have telltale signs of AI (like a suspiciously moving floor in a video of a museum), and OpenAI says the model “may struggle with accurately simulating the physics of a complex scene,” but the results are overall pretty impressive.

A couple of years ago, it was text-to-image generators like Midjourney that were at the forefront of models’ ability to turn words into images. But recently, video has begun to improve at a remarkable pace: companies like Runway and Pika have shown impressive text-to-video models of their own, and Google’s Lumiere figures to be one of OpenAI’s primary competitors in this space, too. Similar to Sora, Lumiere gives users text-to-video tools and also lets them create videos from a still image.

Sora is currently only available to “red teamers” who are assessing the model for potential harms and risks. OpenAI is also offering access to some visual artists, designers, and filmmakers to get feedback. It notes that the existing model might not accurately simulate the physics of a complex scene and may not properly interpret certain instances of cause and effect.

Earlier this month, OpenAI announced it’s adding watermarks to its text-to-image tool DALL-E 3, but it notes that they can “easily be removed.” As with its other AI products, OpenAI will have to contend with the consequences of fake, photorealistic AI videos being mistaken for the real thing.
