AnswerCult

Question

1.11K viewsJanuary 3, 2024

Question 100.55K January 9, 2021 0 Comments

I get that it uses a number going up and down depending on the behavior, but why does the computer/AI try to make the number go up?

In: Technology

5 Answers

You are viewing 1 out of 5 answers, click here to view all answers.

Answer 1 · 2021-01-09T04:01:26+00:00

Computers don’t want, or care about, anything until we give them a program that tells them what to want/care about. (I’m anthropomorphizing computers here, obviously, but you get the point.)

In machine learning, we give the computer a “value function” or a “reward function” that tells them how good a job they’re doing, and then we tell them that their goal in life is to maximize that function. Once we tell them that, they go after it with everything they have. That’s simply how programming works; if you write code that gets the machine to come up with some options, work out what the reward would be for each option and then choose the one with the highest reward, then assuming the computer’s working correctly, that’s what it will do.

AnswerCult

How are computers motivated in Q Learning?

5 Answers

Search questions

Popular Questions

Latest Answers