How are computers motivated in Q Learning?

731 views

I get that it uses a number going up and down depending on the behavior, but why does the computer/AI try to make the number go up?

In: Technology

5 Answers

Anonymous 0 Comments

The goal is to maximize the reward. It’s explicitly told that higher is better.

Nearly all machine learning is centered around maximizing our minimizing some target.

You are viewing 1 out of 5 answers, click here to view all answers.