Why, when training ML models, do we not take the model with the highest accuracy?


It's pretty common for an ML model's accuracy to dip at various points during training. But presumably those dips mean the model got worse, so why do we keep the final version instead of the one that had the highest accuracy?


12 Answers

Anonymous

When the model’s accuracy is very high, it’s likely to be “overfit,” meaning it is only accurate on the data you trained it with. In other words, it doesn’t generalize: it can’t deal with data it hasn’t seen before.

There are methods to combat this, such as evaluating the trained model on a held-out set of data that was never used for training. However, if you pick from a set of trained models based on their results on that held-out test data, you’re effectively using the test data as part of the training process, which defeats the purpose entirely.
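To make the idea concrete, here is a minimal sketch (not from the answer above, and with made-up accuracy numbers) of how checkpoint selection is commonly done: track accuracy on a *validation* set during training and keep the best checkpoint, so the test set is only touched once at the very end. The helper function and the accuracy curve are hypothetical illustrations.

```python
def best_checkpoint(val_accuracy_per_epoch):
    """Return (epoch, accuracy) of the checkpoint with the best
    validation accuracy -- the one we'd keep for final testing."""
    best_epoch, best_acc = 0, float("-inf")
    for epoch, acc in enumerate(val_accuracy_per_epoch):
        # Accuracy can dip and recover during training; we keep the
        # best-so-far checkpoint rather than blindly taking the last one.
        if acc > best_acc:
            best_epoch, best_acc = epoch, acc
    return best_epoch, best_acc

# Hypothetical validation curve: rises, dips, peaks, then declines
# as the model starts to overfit.
val_curve = [0.62, 0.71, 0.69, 0.78, 0.76, 0.74]
epoch, acc = best_checkpoint(val_curve)
print(epoch, acc)  # the epoch-3 checkpoint had the highest validation accuracy
```

Because selection happens against the validation set, the separate test set still gives an unbiased estimate of how the chosen model handles unseen data.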
