AnswerCult

Question 1

1.63K viewsJanuary 1, 2024

Question 100.55K December 19, 2022 0 Comments

In ML/AI, GPUs are used to train neural networks of various sizes. They are vastly superior to training on CPUs. Why is this?

In: 679

26 Answers

You are viewing 1 out of 26 answers, click here to view all answers.

Answer · 2022-12-19T14:28:00+00:00

Each CPU core tends to have 1 floating point unit, maybe a very small number of arithmetic units, etc. While each CPU core has many operating modes, lots of features, the amount of calculation it can do is more limited as a result. A lot of the CPU’s actual circuitry is dedicated to things other than actual computation, like instruction processing and event ordering.

A GPU’s equivalent of a CPU core has dozens, maybe hundreds, of floating point units available to it. Basically a single instruction can order all floating point units it controls to simultaneously perform the operation `x += y` or such. However each such core is more limited, and anything that can’t make good use of that bulk of FPUs will seriously hurt performance. Furthermore it has generally fewer features available.

GPUs tend to do best when the job involves more calculation and less decision making along the process.

Question 2

0 viewsJanuary 1, 2024

Question 100.55K December 19, 2022 0 Comments

In ML/AI, GPUs are used to train neural networks of various sizes. They are vastly superior to training on CPUs. Why is this?

In: 679

26 Answers

You are viewing 1 out of 26 answers, click here to view all answers.

Answer · 2022-12-19T14:28:00+00:00

Each CPU core tends to have 1 floating point unit, maybe a very small number of arithmetic units, etc. While each CPU core has many operating modes, lots of features, the amount of calculation it can do is more limited as a result. A lot of the CPU’s actual circuitry is dedicated to things other than actual computation, like instruction processing and event ordering.

A GPU’s equivalent of a CPU core has dozens, maybe hundreds, of floating point units available to it. Basically a single instruction can order all floating point units it controls to simultaneously perform the operation `x += y` or such. However each such core is more limited, and anything that can’t make good use of that bulk of FPUs will seriously hurt performance. Furthermore it has generally fewer features available.

GPUs tend to do best when the job involves more calculation and less decision making along the process.

AnswerCult

What about GPU Architecture makes them superior for training neural networks over CPUs?

26 Answers

26 Answers

Search questions

Popular Questions

Latest Answers