How are computers motivated in Q Learning?

738 views

I get that it uses a number going up and down depending on the behavior, but why does the computer/AI try to make the number go up?

In: Technology

5 Answers

Anonymous 0 Comments

There’s a evolution-like algorithm where a program leaves numbers which produced better result only. Later numbers decrease/increase not from some random value, but from these “generations” and then the elimination of worst results gets done again… Of course this is not the only algorithm, this is just an example.

You are viewing 1 out of 5 answers, click here to view all answers.