Basically it uses a few data points to infer the rest.
Like if you had to guess the next numbers in this series. 1, 3, 5, 7, 9, 11, 13, 15… you use the data and you have to guess the rest.
Deepfakes takes actual footage of those people and notes their facial expression, lip movements, voice, etc and then it basically fills in the gaps of whatever you want it to say or do.
For speech, it’s all sound waves and transitions. So you get a sample of their voice and you can basically fill in the gaps. Same thing for facial expressions and lip movements. It’s not always perfect but with feedback it can fine tune itself.
Latest Answers