How does text-to-video render so fast?


I recently saw examples on Twitter of very well-done text-to-video renders that look as good as Pixar animation, yet these tools apparently work in minutes. How is that possible when Pixar needs supercomputers and a lot of time to render every single frame of its movies?


4 Answers

Anonymous

When Pixar builds a movie, it is creating a detailed 3D space and rendering it out to a specific camera.

When current AI video models make a video, they are just drawing pixels. They don’t have any understanding of 3D space, they have no knowledge of light sources, and they can’t track what is happening off-screen.

This makes them very quick (relatively) because there’s a lot less processing needed. But, at least for now, the Pixar method is much more reliable.

Think of it like the difference between building a model house and drawing a house. If you build the house, you can film it from any angle, move the camera, do whatever you want. But it takes a long time.

If you draw the house, you have the house from that specific perspective in that specific moment, and it’s fast. Nothing is being simulated. You don’t have other angles, and you can’t change the lighting.
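
If it helps to see the model-versus-drawing idea in code, here is a toy Python sketch. It is only an illustration of the analogy, not how Pixar's renderer or any AI video model actually works: the "house" is just the corners of a cube, and `HOUSE`, `render`, and `camera_angle_deg` are names made up for this example.

```python
import math

# A toy "model house": the 8 corners of a unit cube centered at the origin.
# (Hypothetical stand-in for a real 3D scene.)
HOUSE = [(x, y, z) for x in (-0.5, 0.5) for y in (-0.5, 0.5) for z in (-0.5, 0.5)]


def render(points, camera_angle_deg, camera_distance=3.0):
    """Project 3D points to 2D for a camera orbiting around the vertical axis."""
    a = math.radians(camera_angle_deg)
    image = []
    for x, y, z in points:
        # Rotate the scene into the camera's frame (same as orbiting the camera).
        xc = x * math.cos(a) + z * math.sin(a)
        zc = -x * math.sin(a) + z * math.cos(a) + camera_distance
        # Perspective divide: points farther away land closer to the image center.
        image.append((round(xc / zc, 3), round(y / zc, 3)))
    return image


# With the 3D model, any new angle is just another call to render().
front_view = render(HOUSE, 0)
side_view = render(HOUSE, 45)

# A "drawing" is only the finished 2D result. The 3D corners are not stored in it,
# so there is nothing to re-render from a different angle.
drawing = list(front_view)
```

The model costs more up front (you have to build `HOUSE` and do the math for every view), but `render` can be called again with any angle. The `drawing` list only holds flat 2D points, so a new view would have to be drawn from scratch.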
