Usually they would record the audio in advance then synchronize the animation to that voice over. With computers these days they can produce some reference positions for the mouth and lip movements for given language sounds, like a “p” sound is going to require closing the lips, etc.
The computer then runs a program that recognizes those language sounds (perhaps also referencing a script) and can animate the movement of the face between those points automatically. On big budget productions they would then go in and tweak those animations to perfection.
Latest Answers