
Air Head creators say OpenAI's Sora is finicky to work with, needing hundreds of prompts and serious VFX work for under 2 minutes of cohesive story

Shy Kids made Air Head in collaboration with OpenAI's Sora video generation model. (Image source: Shy Kids on YouTube)
OpenAI recently showed off an impressive demo reel created by production house Shy Kids using its Sora video generator. As it turns out, Shy Kids poured an enormous amount of work into the post-production of Air Head, despite OpenAI's assertion that Sora makes video production effortless.

When OpenAI announced Sora, its video generation AI, one of the videos used to demonstrate its capabilities was the Shy Kids short titled Air Head. While the clip was initially touted as an impressive show reel for the OpenAI model, a recent FX Guide interview with Shy Kids reveals that far more work went into it than many had assumed.

While what Sora can do is certainly impressive and was nigh impossible to do just a year or two ago, the Shy Kids team still took nearly two weeks to create Air Head — mostly due to the limitations of the AI. One of the biggest hurdles Shy Kids ran into with Sora was its lack of cohesion, which forced the production team to use an unorthodox editing method, not unlike creating a found footage film or documentary.

It was just getting a whole bunch of shots and trying to cut it up in an interesting way to the VO. – Patrick Cederberg, post-production on Air Head

Shy Kids says it had a script for the video, but the team had to be fluid and adapt to the varied output generated by Sora. Shy Kids also had a hard time keeping things consistent from shot to shot, with Sora often generating a different type of head on the balloon. Overall, Cederberg says it took “hundreds of generations” to get just under a minute and a half of edited footage for the video. He also estimates that the ratio of source material to final content was 300:1, meaning Shy Kids only used around 0.33% of the video Sora generated in its final edit.

My math is bad, but I would guess probably 300:1 in terms of the amount of source material to what ended up in the final.
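For context, here is a quick back-of-the-envelope sketch of what a 300:1 ratio implies. The final cut length used below (roughly 80 seconds, i.e. "just under a minute and a half") is an assumption, not a figure from the interview; only the 300:1 estimate comes from Cederberg.

```python
# Rough sanity check of the 300:1 source-to-final ratio quoted above.
final_cut_seconds = 80            # assumed: "just under a minute and a half" of edited footage
source_to_final_ratio = 300       # Cederberg's rough estimate

generated_seconds = final_cut_seconds * source_to_final_ratio
used_fraction = 1 / source_to_final_ratio

print(f"Estimated footage generated: {generated_seconds / 3600:.1f} hours")
print(f"Share of generated footage used: {used_fraction:.2%}")
# -> roughly 6.7 hours of generated footage, of which about 0.33% made the final edit
```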

Working with Sora meant more than just generating hundreds of clips, though. The team also had to perform all the usual post-production tasks manually, including colour grading, retiming, and even VFX work to remove unwanted elements from the frame. In one generated clip, Sora output a balloon with a face imprinted on the front; in others, the balloon would be a different colour or have an unwanted string hanging from the bottom, all of which had to be cleaned up.

More advanced VFX work saw the Shy Kids team removing an entire head that Sora had generated on Sonny, the main character, in place of the balloon. Fixes like these were done in Adobe After Effects to reach the final product.

While Sora and generative AI video have come a long way, it seems they are far from replacing the artists behind the scenes, especially if the content being produced is meant to be coherent or run longer than a few seconds. This likely also explains why all but two of the “unedited” clips OpenAI has posted to its Sora page are on the order of 20 seconds or less.

Julian van der Merwe, 2024-04-27 (Update: 2024-04-27)