Modern smartphones have built-in filters that “listen” for the frequencies typical of a person talking, so the rest of the background noise is filtered out of the signal. When recording a video, the phone does not “know” which part is important, so it just records everything the same way.
Edit: Also, when you call someone, everything near the phone is treated as more important, so background noise is filtered more.
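To make the "filter for voice frequencies" idea concrete, here is a toy sketch in Python. It is not how any particular phone does it: the 300–3400 Hz band, the 16 kHz sample rate, and all function names are assumptions chosen for illustration, and real phones likely use far more elaborate processing. It mixes a stand-in "voice" tone with a low hum and a high hiss, then zeroes every frequency outside the assumed voice band.

```python
import cmath
import math

SAMPLE_RATE = 16000           # samples per second (assumed for this sketch)
VOICE_BAND = (300.0, 3400.0)  # assumed voice band in Hz, not a phone's real setting


def dft(samples):
    """Plain O(n^2) discrete Fourier transform (fine for a short demo)."""
    n = len(samples)
    return [
        sum(samples[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def inverse_dft(spectrum):
    """Inverse DFT, returning only the real part of each sample."""
    n = len(spectrum)
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)).real / n
        for t in range(n)
    ]


def voice_band_filter(samples, sample_rate=SAMPLE_RATE, band=VOICE_BAND):
    """Keep only the frequency bins inside `band`; zero everything else."""
    n = len(samples)
    low, high = band
    spectrum = dft(samples)
    for k in range(n):
        # For a real signal, bins k and n - k carry the same frequency.
        freq = min(k, n - k) * sample_rate / n
        if not (low <= freq <= high):
            spectrum[k] = 0
    return inverse_dft(spectrum)


def tone(freq, amplitude, n, sample_rate=SAMPLE_RATE):
    """A pure sine wave of `n` samples."""
    return [amplitude * math.sin(2 * math.pi * freq * t / sample_rate) for t in range(n)]


def rms(samples):
    """Root-mean-square level, a rough measure of loudness."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


n = 320                       # 20 ms of audio at 16 kHz
voice = tone(1000, 1.0, n)    # stand-in for a person talking
hum = tone(50, 0.8, n)        # low background hum
hiss = tone(6000, 0.5, n)     # high background hiss
mixed = [v + h + s for v, h, s in zip(voice, hum, hiss)]

filtered = voice_band_filter(mixed)

print("level before filtering:", round(rms(mixed), 3))
print("level after filtering: ", round(rms(filtered), 3))
print("level of voice alone:  ", round(rms(voice), 3))
```

In this sketch the filtered signal ends up essentially identical to the voice tone alone, because the hum and hiss sit entirely outside the kept band. Real background noise overlaps with voice frequencies, which is why a simple band filter like this one only removes part of it.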