The other commenter mentioned that in normal circumstances, sounds come from different directions and we can focus on sounds from one direction / location.
Partly, we can do this by just turning our heads. The shape of our ears makes it so that we hear sounds coming from directly in front of us the best. So turning to face a sound will let us hear it more clearly.
In addition, we can differentiate sounds at least partly by what direction they came from, WITHOUT turning our head, because the shape of our ears also makes sounds from different directions sound slightly different, as well as the fact that a sound on your right will both be slightly louder in your right ear AND arrive at your right ear a split second before your left. So your brain can focus on “that sound that’s coming from behind us and to the right.”
Lastly, audio in a recording has typically been compressed in a way that may affect characteristics of the sound that we might use to tell different sounds apart. Audio compression especially affects reproduction of frequencies above ~15kHz. I am not enough of an audio expert to know how those frequencies affect our ability to localize sounds or tell them apart, but it may very well be important.
Latest Answers