Hot topics close

OpenAI’s newest model Sora can generate videos — and they look decent

OpenAIs newest model Sora can generate videos  and they look decent
OpenAI has been working on a video generation model, Sora. The results aren't perfect -- but they're among the better examples we've seen.

OpenAI, following in the footsteps of startups like Runway and tech giants like Google and Meta, is getting into video generation.

OpenAI today unveiled Sora, a generative AI model that creates video from text. Given a brief — or detailed — description or a still image, Sora can generate 1080p movie-like scenes with multiple characters, different types of motion and background details, OpenAI claims.

Sora can also “extend” existing video clips — doing its best to fill in the missing details.

“Sora has a deep understanding of language, enabling it to accurately interpret prompts and generate compelling characters that express vibrant emotions,” OpenAI writes in a blog post. “The model understands not only what the user has asked for in the prompt, but also how those things exist in the physical world.”

Now, there’s a lot of bombast in OpenAI’s demo page for Sora — the above statement being an example. But the cherry-picked samples from the model do look rather impressive, at least compared to the other text-to-video technologies we’ve seen.

For starters, Sora can generate videos in a range of styles (e.g., photorealistic, animated, black and white) up to a minute long — far longer than most text-to-video models. And these videos maintain reasonable coherence in the sense that they don’t always succumb to what I like to call “AI weirdness,” like objects moving in physically impossible directions.

Check out this tour of an art gallery, all generated by Sora (ignore the graininess — compression from my video-GIF conversion tool):

OpenAI Sora

Image Credits: OpenAI

Or this animation of a flower blooming:

OpenAI Sora

Image Credits: OpenAI

I will say that some of Sora’s videos with a humanoid subject — a robot standing against a cityscape, for example, or a person walking down a snowy path — have a video game-y quality to them, perhaps because there’s not a lot going on in the background. AI weirdness manages to creep into many clips besides, like cars driving in one direction, then suddenly reversing or arms melting into a duvet cover.

OpenAI Sora

Image Credits: OpenAI

OpenAI — for all its superlatives — acknowledges the model isn’t perfect. It writes:

“[Sora] may struggle with accurately simulating the physics of a complex scene, and may not understand specific instances of cause and effect. For example, a person might take a bite out of a cookie, but afterward, the cookie may not have a bite mark. The model may also confuse spatial details of a prompt, for example, mixing up left and right, and may struggle with precise descriptions of events that take place over time, like following a specific camera trajectory.”

OpenAI’s very much positioning Sora as a research preview, revealing little about what data was used to train the model (short of ~10,000 hours of “high-quality” video) and refraining from making Sora generally available. Its rationale is the potential for abuse; OpenAI correctly points out that bad actors could misuse a model like Sora in myriad ways.

OpenAI says it’s working with experts to probe the model for exploits and building tools to detect whether a video was generated by Sora. The company also says that, should it choose to build the model into a public-facing product, it’ll ensure that provenance metadata is included in the generated outputs.

“We’ll be engaging policymakers, educators and artists around the world to understand their concerns and to identify positive use cases for this new technology,” OpenAI writes. “Despite extensive research and testing, we cannot predict all of the beneficial ways people will use our technology, nor all the ways people will abuse it. That’s why we believe that learning from real-world use is a critical component of creating and releasing increasingly safe AI systems over time.”

Similar news
News Archive
  • Mitch Tambo
    Mitch Tambo
    Mitch Tambo delivers plea to Australia on Indigenous issues during Reconciliation Week on Q+A
    27 May 2021
    1
  • Neurosurgery
    Neurosurgery
    Surgical Endoscope for Neurosurgical Market Booming Worldwide With Leading Key Players -B.Braun, Ackermann ...
    6 Feb 2023
    1
  • World Club Challenge
    World Club Challenge
    Matt Peet promises more nights to match Wigan's World Club Challenge triumph
    24 Feb 2024
    24
  • Yuffie Kisaragi
    Yuffie Kisaragi
    Every Playable Character In FF7 Rebirth, Ranked By How Fun They Are
    5 Mar 2024
    5
  • VLine
    V/Line
    V/Line strikes could cause Bad Blood for Swifties | Brimbank & North West
    16 Feb 2024
    7
  • Principal
    Principal
    Principal Asset Management launches Principal US High Conviction Equity Fund
    2 Nov 2024
    2
This week's most popular news