How can two singers sing the same song in the same key still have distinguishable voices?

495 views

This is actually question my daughter posed and I’m pretty stumped. She asked how, if two people with (let’s say) perfect pitch sing a song, how is it possible that we can still tell who is singing when the notes would be identical?

Note: I know absolutely nothing about music, but figured this was the best place to ask for her.

Edit: Wow, many of these answers are incredible! I had no idea this would receive such in depth and thoughtful feedback. I have learned a huge amount. I was not exaggerating above when I said I know nothing about music (I don’t even know what pitch is – just quoted my daughter on that) and I’m grateful to those of you who took the time to help me learn.

In: Biology

21 Answers

Anonymous 0 Comments

Every response I’ve seen is correct, but I’m a music teacher and here’s a great way to visualize it.

Sound comes in the form of waves, and waves are fundamentally the same whether they’re waves on the water or in the air. If you took two waves of water and compared them for visually-identifiable differences, after enough thought you might realize that there’s a few things that can be measured:

1. Height
2. Width
3. “Texture”

Height simply means measuring from a trough to the peak of the wave. Width is measuring from trough to trough (or from peak to peak). The third one is SLIGHTLY more complicated but it would be, for example, the difference between a glassy smooth wave, or a choppy one. It’s disturbances on the surface of the wave and there are millions of slight variations that are possible there. You can have two choppy waves, but their choppiness can be varied.

If you can see those three characteristics in water waves, you can also HEAR them with sound waves! It’s just harder to imagine because we find it easier to think through our eyes.

When you hear a taller sound wave, you perceive that as volume. Taller waves are louder, shorter waves are quieter.

When you hear the width of the waves, you perceive that as pitch. Narrower waves, bunched up closer together are higher in pitch. Broad waves are lower.

That leaves the texture. If you can see it, you can hear it… but it’s an easy one to forget about. If you have two sounds that are at the same pitch and volume, and yet sound distinct… that’s the texture! In music we call it the timbre but it’s the same thing. A choppy wave sounds much different than a smooth one, even if they’re the same height and width.

Examples of textures you can observe:

1. Someone with a really nasal voice, or a deep booming voice.
2. On a piano, the thick gravelly sound of the bottom notes, compared to the clear bell-like sound of the high notes.
3. The pluck of a guitar vs. the pluck of a harp.
4. Every vowel is technically a different timbre too! If you sing every vowel on one pitch, the fact that you can discern the difference between an “a” and an “e” means that they have different timbres!

Anonymous 0 Comments

Interestingly, it’s the beginning and end of notes that hold most of the key in differentiating between instruments. The middle sounds pretty similar: https://youtu.be/thD6TNUoyIk

You might also get good answers at /r/musictheory

Anonymous 0 Comments

Music teacher here.

The same way that you can have dark green and light green and green stripes and green spots and shiny green and matte green (importantly, all without changing the color towards red or blue), you can have a note come out in many different ways without changing the pitch (high-ness or low-ness of a note).

In music, this is called timbre (pronounced TAM-BER for some reason), or “Tone Color”.

As an interesting exercise, have them hit a single note, and move their mouth through the vowels.

Compare the ‘O’ sound with the “EEEE” sound. The O sounds lower, while the EEEE sounds higher, even though the pitch stays the same.
When we make sounds with our mouths, there is one main pitch, and lots of little “sub-pitches” called harmonics, that change the way the main pitch sounds.

There exists a type of note with no extra harmonics, called a Sine Wave, which is only the main pitch and nothing else. YouTube can play it for you.

Anonymous 0 Comments

A piano, guitar, marimba, glockenspiel, flute, harpsichord, harp, and…you get my point, can all play for the most part a set of identical notes, and yet you could easily distinguish them from one another.

Human voices are all different in the same way. We all have differences in our voices that contribute to how our singing voice sounds. Though I will say, the more well trained they are and how perfect their pitch is, you’d find it hard to distinguish 2 female soprano singers singing an E6 or similarly high note. But down in the mid range of your singing voice which comes from a combination of your chest and head voices, you’ll start to hear the differences between 2 singers quite clearly.

Anonymous 0 Comments

It’s not just about the pitch but the timbre of your voice.

When we think of Bob Dylan’s gravely voice, we’re talking about timbre, not pitch. When you hear his duets with Johnny Cash, you can immediately tell who’s who.

I’m not an expert by any means. But I can sing the same note in different ways. Once using a chesty voice, then using a softer, breathier voice. I can also go more nasally, or apply some distortion. Just changing the shape of your mouth can also impact the sound.

Anonymous 0 Comments

A fun example—

If you take recordings of instruments playing the same sound, and you cut off the beginning and end for each instrument, you’ll have trouble identifying the difference.

Anonymous 0 Comments

The difference in our voices is created by the differences in the shape and size and tilt of our voice box, the individual shape, strength and movement of the video cords inside that and the physical differences in the shape and size of our airway, tongue, teeth, mouth and nasal cavities. In other words, subtle differences in physical anatomy generate the difference because sound travels directly in different bodies from the vocal cords all the way through the mouth and nose.

(I’m a speech therapist)

Anonymous 0 Comments

I think there’s a better and more interesting answer than the ones posted here, even though they’re all good explanations.

The “note” a singer, or any other instrument makes, is a frequency. Literally “how **frequently** does the sound oscillate?”

With a guitar, it’s “how frequently does the guitar string oscillate?” Meaning vibrate. If you watched a guitar string in slow-motion, you’d be able to see it vibrating after it was plucked. You can kinda see it even without slo-mo, it’s just a blur.

With your voice, it’s flaps of skin in your throat that are vibratring.

If someone sings an A#, that means their vocal chords are vibrating 466 times per second. Everyone singing an A# at the same time is vibrating their vocal chords 466 times per second.

But sound is MORE than just a frequency, which you know if you think about it. It’s also an “amplitude.” Which means “loudness.” We could both be singing A#, but I might sing louder than you. Same note, two different volumes.

But sound also has a SHAPE! Which is SUPER COOL! Let’s look at the “purest” tone, which is called a [Sine Wave.](https://www.wisc-online.com/assetrepository/getfile?id=3768&getType=view&width=0&height=0)

That is a real simple wave and because it’s so simple it would make a very pure tone if you listened to it. But **pitch** is **just** frequency. A wave with a different **shape** but the same **frequency** would be the same pitch, but could sound very different.

Let’s look at a different kind of wave. What’s called a [Saw Wave.](https://qph.fs.quoracdn.net/main-qimg-41c43d0e532ac48adf72c485e31c5e33-c)

You can see why it’s called a saw wave, right? Looks like the teeth of a saw!

Well, this makes a VERY different sound. It sounds…actually it sorta sound the way it looks! It has an *edge*. It’s not as pure as the sine wave. When you listen to any bowed instrument, the sound you’re hearing is a Saw Wave, because that’s the actual physical motion of the string!

[Watch this!](https://preview.redd.it/5azm0uox03y21.gif?format=mp4&s=ea49e13ee62118531858184d756c3b8047f810a2)

(the preview might not be working)

https://preview.redd.it/5azm0uox03y21.gif?format=mp4&s=ea49e13ee62118531858184d756c3b8047f810a2

You can see it there. The bow is pulled across the string. At first, the friction of the bow catches the string and pulls it smoothly back. That’s the “ramp up” of the saw wave. Eventually the tension in the string overcomes the bow’s friction, and the string ‘snaps’ back. Which is the sharp, straight-down line of the saw wave. But the bow is still pulling, so the string gets caught again and the cycle repeats.

Saw Waves and Sine Waves are still pretty simple though. The waves produced by the human voice look *weird* and *messy.* [Look!](https://i.stack.imgur.com/lhC4Y.gif)

If you look on the graph, everything from the 1 hash, to the 8 mark is ONE cycle. That is a complex wave and it’s still way simpler than the human voice. The human voice looks more like [this.](https://www.researchgate.net/profile/Jody_Kreiman/publication/281119746/figure/fig6/AS:668580781752330@1536413488325/Waveform-showing-extreme-aperiodicity-phrase-finally-by-a-female-English-speaker.png)

THAT is why two people singing the same note are recognizably different. They’re vocal chords are vibrating VERY complexly. So complex, it’s almost unique! When you recognize someone’s voice, you’re recognizing the unique properties of the SHAPE of the wave their vocal chords make. That shape is based on the physical shape of their vocal chords and their throat and even their mouth which is helping shape the sound as it comes out.

The **rate** at which their skin flaps vibrate might be the same, but because their skin is floppy and weird shaped, it doesn’t just go smoothly up and down like a guitar string. It waggles all over WHILE going up and down and that is what singers and musicians call “timbre.” Timbre means “The way your skin flaps waggle around while you vibrate them.”

Anonymous 0 Comments

Oh cool something I know a little about from my past in audio recording.

The top answer is totally right. But interesting thing that happens when you’re recording vocals or any other instrument for that matter. You can duplicate tracks so you have two sound files playing the exact same pitch and timbre. Everything is exactly the same. To the listener, all it will sound like is as if the original track got louder. But take the exact same singer or instrument and record a brand new take playing the same thing, the minute differences, even from the exact same instrument/player/singer is enough to give the listener the perception of layers rather than just being louder.

Also fun fact, if you simply move the second duplicate track off by milliseconds, it doesn’t give it the same “layered” sound of a new take, but instead creates the “chime-y” like sound effect called “chorus” (or swirly sound called “phaser”/“flange” depending on the amount of milliseconds delay).

TL; DR – In theory, if two voices could be so identical in timing, pitch, timbre, and everything, you definitely couldn’t tell them apart. But only computers or recordings can be so precise. So anything performed by humans, there are so many small imperfections in performance that your brain can tell the difference.

Anonymous 0 Comments

[removed]