The tricky part happens when the AI model is first created, or trained if you prefer. All that info you’re talking about is crunched down until it is basically reduced to one really complex math formula. The formula is huge and complicated, but doing math is what computers are best at, so running it is relatively easy.
In the case of generative text models, the result is basically a very advanced version of the predictive text algorithm that runs on your phone when you are texting. That algorithm looks at the words you’ve written so far and tries to guess what you might write next, based on what other people have written. Generative AI models basically do the same thing, except they’re built from a much larger dataset and they can generate much more than a single word at a time. So when you ask the model a question, it isn’t even trying to look up the correct answer. It’s producing an approximation of “what do people normally receive as an answer to a question like this?”
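If it helps to see the phone-style version in code, here is a toy sketch in Python. It is only an illustration: the three-sentence `corpus` and the function names `train`, `predict_next`, and `generate` are made up for this example, and real generative models use a learned formula with a huge number of parameters rather than a simple table of counts.

```python
from collections import Counter, defaultdict

def train(corpus):
    """Count, for each word, which words follow it in the example text."""
    follows = defaultdict(Counter)
    for sentence in corpus:
        words = sentence.lower().split()
        for current, nxt in zip(words, words[1:]):
            follows[current][nxt] += 1
    return follows

def predict_next(follows, word):
    """Return the word that most often followed `word`, or None if unseen."""
    counts = follows.get(word.lower())
    if not counts:
        return None
    return counts.most_common(1)[0][0]

def generate(follows, start, max_words=5):
    """Keep appending the most likely next word, one word at a time."""
    words = [start.lower()]
    for _ in range(max_words - 1):
        nxt = predict_next(follows, words[-1])
        if nxt is None:
            break
        words.append(nxt)
    return " ".join(words)

# Hypothetical training text: "blue" follows "is" more often than "green".
corpus = [
    "the sky is blue",
    "the sky is blue today",
    "the sky is green",
]
model = train(corpus)

print(predict_next(model, "is"))
print(generate(model, "the", max_words=4))
```

Notice that the sketch picks "blue" after "is" only because that pairing shows up most often in the example text. Nothing in it checks what color the sky actually is, which is the same point as above: the output is the most typical continuation, not a looked-up fact.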