There are several different ways to do that, as the other posters have already mentioned.
One specific approach is to use software that helps you align phonemes (the basic sounds of speech, each of which roughly corresponds to a mouth shape) onto a timeline.
The open source tool Papagayo-NG (disclaimer: slight self-promotion on my part) is one example:
https://github.com/morevnaproject/papagayo-ng
You load your sound file and type in your text, then you let it break the text down into its smaller building blocks (phrases, words, and phonemes), which you can arrange on a timeline.
The final result can then be exported to many different formats for use in your animation software.
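
To give a rough idea of what the breakdown step does, here is a minimal Python sketch. This is not Papagayo-NG's actual code: the tiny `PHONEMES` dictionary and the `layout_phonemes` function are made up for illustration, and a real tool uses a full pronunciation dictionary and lets you drag the results around by hand afterwards. The sketch simply looks up each word and spreads the phonemes evenly over a range of frames.

```python
# Hypothetical mini pronunciation dictionary (illustration only).
PHONEMES = {
    "hello": ["HH", "AH", "L", "OW"],
    "world": ["W", "ER", "L", "D"],
}


def layout_phonemes(text, start_frame, end_frame):
    """Spread the phonemes of `text` evenly over [start_frame, end_frame)."""
    phonemes = []
    for word in text.lower().split():
        # Unknown words fall back to a neutral "rest" mouth shape.
        phonemes.extend(PHONEMES.get(word, ["rest"]))
    if not phonemes:
        return []
    span = end_frame - start_frame
    # Integer division keeps every phoneme on a whole frame number.
    return [
        (start_frame + i * span // len(phonemes), phoneme)
        for i, phoneme in enumerate(phonemes)
    ]


timeline = layout_phonemes("hello world", 0, 24)
for frame, phoneme in timeline:
    print(frame, phoneme)
```

An even spread like this is only a starting point. In practice you would still nudge each phoneme so it lines up with the audio waveform.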