In musical terms, what properties are varied by the human voice to produce different words / syllables?


Why, for example, does the word "hello" sound completely different to the word "goodbye", or the letter "a" from the letter "b"?



I know it can't be pitch, because all of these words and syllables can be spoken at the same pitch and still sound distinct, and changing the lyrics of a song does not change the pitch.



What musical property is it then that makes words sound different from each other?










asked 3 hours ago by JShorthouse (new contributor)















    en.wikipedia.org/wiki/Timbre – Your Uncle Bob, 3 hours ago















2 Answers

I think you might be best served by linguistics, specifically phonetics.



Pitch is sort of an element, but the specific pitch isn't the concern. Instead, some vocal sounds are "voiced", meaning the vocal cords vibrate (producing pitches). For example, the f in 'fan' is not voiced, but when voiced it becomes v, as in 'van'.



How vowel and consonant sounds are produced is understood in linguistics as a matter of vocal anatomy (the tongue, palate, etc.) and described with terms like fricative, labial, etc. There is a complex mapping of the inside of the mouth in linguistics.



You could describe the actions of the voice in acoustic terms like amplitude, waveform, etc. But linguistics actually has a whole branch devoted to the study of vocal sounds.



By the way, in voice training these topics are called diction.






answered 2 hours ago by Michael Curtis (edited 2 hours ago)




















    Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks! – Michael Curtis, 2 hours ago






    Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support? – Michael Curtis, 2 hours ago



















As Michael Curtis has pointed out, from the linguistic side, the study of phonetics is all about what speech sounds humans make and how they make them. Phonetics doesn't really approach things from a musical perspective, so I thought I might try to make some correlations between phonetics and musical acoustics.



Phonetics divides speech sounds (phonemes) into two broad categories: vowels and consonants. The lines can be a bit blurry there, but vowel sounds always involve the vocal cords and are usually made with the mouth more or less open, while consonants involve specific motions of the teeth, lips, and tongue and may or may not use the vocal cords.



For vowels, we always use our vocal cords, which means vowels always have some pitch. The pitches used during speech generally do not have a typical musical relationship, but sometimes might be "accidentally" musical. For instance, when a child taunts on the playground something like, "Johnny is a chick-en!", they often use a sing-song tone that is a melodic minor third. But that's incidental.



The way we make different vowel sounds is by changing the shape of our mouths, and this changes the timbre of the sound made by our vocal cords. Another musical way to look at it is that we are filtering (like with EQ or a synth filter) the pitch that is created by our vocal cords.
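To make the filter idea a bit more concrete, here is a minimal Python sketch, not a real voice model. It assumes the vocal cords behave like a sawtooth-style source (harmonics falling off as 1/k) and that the mouth behaves like two bell-shaped resonance peaks. The function names and the formant frequencies are illustrative assumptions, not measured values.

```python
def resonance_gain(freq_hz, center_hz, bandwidth_hz):
    """Gain of a simple bell-shaped resonance peak (1.0 at its center)."""
    return 1.0 / (1.0 + ((freq_hz - center_hz) / (bandwidth_hz / 2.0)) ** 2)


def vowel_spectrum(pitch_hz, formants_hz, n_harmonics=30, bandwidth_hz=120.0):
    """Return (frequency, amplitude) pairs for a filtered harmonic source."""
    spectrum = []
    for k in range(1, n_harmonics + 1):
        freq = k * pitch_hz
        source_amp = 1.0 / k  # sawtooth-like source: harmonics fall off as 1/k
        filter_gain = sum(resonance_gain(freq, f, bandwidth_hz) for f in formants_hz)
        spectrum.append((freq, source_amp * filter_gain))
    return spectrum


def loudest_harmonic(spectrum):
    """Frequency of the harmonic with the largest amplitude."""
    return max(spectrum, key=lambda pair: pair[1])[0]


# Rough, illustrative formant values (assumptions for this sketch, not measurements)
FORMANTS = {"ee": (300.0, 2300.0), "ah": (750.0, 1200.0)}

pitch = 110.0  # same sung or spoken pitch for both vowels
ee = vowel_spectrum(pitch, FORMANTS["ee"])
ah = vowel_spectrum(pitch, FORMANTS["ah"])

print("ee loudest harmonic:", loudest_harmonic(ee))
print("ah loudest harmonic:", loudest_harmonic(ah))
```

Both vowels contain exactly the same harmonic frequencies, because the pitch is the same. Only the relative loudness of those harmonics changes, which is the timbre difference described above.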



That entirely covers the musical aspects of vowel sounds. We could talk about loudness and duration (the two other main dimensions of music), but neither of those changes which vowel sound we make or hear.



Consonants are more complicated. Let's divide them into the phonetic categories of voiced (using the vocal cords) and unvoiced (not using the vocal cords).



Unvoiced consonants (like /t/, /p/, /f/, /k/, /s/), from a musical standpoint, are closest to percussion sounds. These are the kinds of sounds we make when we beatbox. Percussion sounds and unvoiced consonants are both musically unpitched, and are instead distinguished solely by timbre. The two main timbral elements of these sounds are the envelope and the formant. The formant is like a filter setting, just as for vowel sounds, but since there is no pitch to filter, what is being filtered instead is noise or unpitched tones. Unpitched tones are groups of frequencies that do not have a harmonic relationship to each other, so we don't hear them as a note. Think of two different cymbals, a "high" one and a "low" one, as examples of noise with two different formants.
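The "filtered noise" idea can be sketched in a few lines of Python. This is only a rough illustration: it assumes white noise as the source, a one-pole low-pass as the "dark" filter, and a first-difference filter as the "bright" one. Real formants are more like band-pass resonances, and all the names here are made up for the example. The zero-crossing rate is used as a crude stand-in for how bright the noise sounds.

```python
import random


def white_noise(n, seed=0):
    """n samples of uniform white noise in [-1, 1], reproducible via the seed."""
    rng = random.Random(seed)
    return [rng.uniform(-1.0, 1.0) for _ in range(n)]


def one_pole_lowpass(samples, alpha):
    """Smooth the signal; a smaller alpha removes more high frequencies."""
    out, y = [], 0.0
    for x in samples:
        y += alpha * (x - y)
        out.append(y)
    return out


def first_difference(samples):
    """Very simple high-pass: each output is the change from the previous sample."""
    return [b - a for a, b in zip(samples, samples[1:])]


def zero_crossing_rate(samples):
    """Fraction of adjacent sample pairs whose signs differ (a brightness proxy)."""
    crossings = sum(1 for a, b in zip(samples, samples[1:]) if (a < 0) != (b < 0))
    return crossings / (len(samples) - 1)


noise = white_noise(20000)
dark = one_pole_lowpass(noise, alpha=0.1)   # "low cymbal" colour
bright = first_difference(noise)            # "high cymbal" colour

for name, signal in (("dark", dark), ("plain", noise), ("bright", bright)):
    print(name, round(zero_crossing_rate(signal), 3))
```

None of the three signals has a pitch, yet they are easy to tell apart by ear, which is the sense in which unpitched sounds are distinguished by timbre alone.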



For unvoiced consonants, there are two subcategories we can talk about: plosives and fricatives. Plosives (/t/, /p/, /k/) have a very short loudness envelope that reaches maximum volume very quickly and then dies away just as quickly. This is most similar to a drum sound. The different sounds of plosives come from their different formants. In this case, it's mainly how much and what kinds of noise are being made along with the plosive sound. A /p/ sound has essentially no noise, like a kick drum, while /t/ and /k/ have two different kinds of noise that are more like a hi-hat and snare drum, respectively. Another thing that makes the /t/ sound different from the /k/ is that the position of the mouth is different, which causes different filtering, just as we see with the vowel sounds.



Fricatives (/f/, /s/, /sh/, /th/) are all bursts of noise that generally last longer than plosives (they have a slower loudness envelope), and they each have their own formant, or filter setting, that changes the character of the noise. Note that /f/ is a fairly even noise sound, while /s/ has more of a sense of some frequencies being louder than others, /sh/ is a more uneven noise sound, and /th/ is a muted noise sound without as much of the upper frequencies.
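As a rough sketch of what "loudness envelope" means here, the following Python compares a drum-like plosive envelope with a slower fricative one. The attack and decay times, the sample rate, and the function names are all assumptions chosen for illustration, not measurements of real speech.

```python
import math

SAMPLE_RATE = 8000  # samples per second (arbitrary choice for this sketch)


def envelope(attack_s, decay_s, total_s=0.3):
    """Linear attack up to full loudness, followed by an exponential decay."""
    env = []
    for n in range(round(total_s * SAMPLE_RATE)):
        t = n / SAMPLE_RATE
        if t < attack_s:
            env.append(t / attack_s)
        else:
            env.append(math.exp(-(t - attack_s) / decay_s))
    return env


def time_above(env, threshold=0.1):
    """Seconds during which the envelope stays above the threshold."""
    return sum(1 for v in env if v > threshold) / SAMPLE_RATE


# Assumed timings: fast in and fast out for a plosive, slower swell for a fricative
plosive = envelope(attack_s=0.002, decay_s=0.010)
fricative = envelope(attack_s=0.030, decay_s=0.080)

print("plosive audible for about", round(time_above(plosive), 3), "s")
print("fricative audible for about", round(time_above(fricative), 3), "s")
```

Multiplying a noise signal by one of these envelopes, sample by sample, gives either a short click-like burst or a longer hiss, before any filtering is applied.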



For the voiced consonants, most of them are essentially the same as the unvoiced ones outlined above, except they also involve the vocal cords, so there is again a pitch of some kind when voiced consonants are spoken. These consonants include /d/ (voiced /t/), /b/ (voiced /p/), /z/ (voiced /s/) and so on. I believe every unvoiced consonant has a voiced version in English (I believe this is also true in Japanese).



There are a few voiced consonants that do not have unvoiced versions and also straddle the line between consonants and vowels. The two closest to being vowels are the /y/ and /w/ sounds. These are basically vowels where the formant or filter is changed while we say them. This is done by changing the position of the tongue or lips while the vocal cords create a pitch.
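A changing formant can be pictured as a filter sweep. The sketch below simply interpolates a pair of formant frequencies from a starting mouth shape to a target vowel. It assumes a /w/ starts from an "oo"-like shape and a /y/ from an "ee"-like shape, and the frequency values and names are illustrative guesses rather than measurements.

```python
def formant_sweep(start_hz, end_hz, steps):
    """Linearly interpolate each formant from start to end (steps must be >= 2)."""
    frames = []
    for i in range(steps):
        t = i / (steps - 1)
        frames.append(tuple(s + t * (e - s) for s, e in zip(start_hz, end_hz)))
    return frames


# Assumed (first formant, second formant) pairs in Hz, for illustration only
OO = (300.0, 900.0)
EE = (300.0, 2300.0)
AH = (750.0, 1200.0)

w_glide = formant_sweep(OO, AH, steps=5)  # roughly "wah"
y_glide = formant_sweep(EE, AH, steps=5)  # roughly "yah"

for frame in w_glide:
    print("w:", frame)
for frame in y_glide:
    print("y:", frame)
```

Both sweeps end on the same vowel, and the pitch never enters into it. What tells the two glides apart is only where the filter starts and how it moves.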



Two others, /m/ and /n/, are made in much the same way as humming. The main way we tell them apart is by how the consonant transitions into the following vowel. During that transition, the difference between /m/ and /n/ is similar to the difference between /w/ and /y/.



Finally, /l/ and /r/ are essentially vowels that have rather extreme formants or filters applied to them. They also sound different when they are approached and left (what you might call their formant/filter envelopes).



If you're really paying attention, you've noticed I have not discussed every English phoneme. I have, however, touched on all the musical aspects of phonemes in all languages. Here's a breakdown of those aspects:



  • Different sound sources, including the vocal cords to make pitches and parts of the mouth that can make noises

  • Different mouth positions to filter the sound sources in different ways to create different formants

  • Different loudness envelopes, or how the loudness changes with time

  • Different formant envelopes, or how the filtering changes with time

Those are the primary elements that distinguish different phonemes, summarized in musical rather than phonetic terms.
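One way to picture those four elements is as fields in a small record, one record per phoneme. The Python below is only a sketch under that assumption. The class name, the field values, and the descriptions are invented labels, loosely based on the descriptions earlier in this answer.

```python
from dataclasses import dataclass, fields


@dataclass(frozen=True)
class PhonemeRecipe:
    source: str             # what makes the sound: pitch, noise, or both
    formant: str            # rough filter colour from the mouth position
    loudness_envelope: str  # how loudness changes over time
    formant_envelope: str   # how the filtering changes over time


# Hypothetical entries; the wording of each value is an informal description
S = PhonemeRecipe("noise", "hissy, bright", "sustained", "static")
Z = PhonemeRecipe("noise + pitch", "hissy, bright", "sustained", "static")
T = PhonemeRecipe("noise", "hi-hat-like", "short burst", "static")


def differing_fields(a, b):
    """Names of the fields in which two recipes differ."""
    return [f.name for f in fields(a) if getattr(a, f.name) != getattr(b, f.name)]


print("/s/ vs /z/:", differing_fields(S, Z))
print("/s/ vs /t/:", differing_fields(S, T))
```

In this picture, a voiced and unvoiced pair such as /s/ and /z/ differs only in the sound source, while /s/ and /t/ share a source but differ in filter and loudness envelope.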




For fun, let's break down "hello" and "goodbye" musically, as if we were going to try to make a synth make these sounds:



"Hello"



  1. /h/ - filtered noise, very muted and fairly quiet

  2. /e/ - filtered pitch, fairly bright formant filter (a kind of bandpass filter)

  3. /l/ - filtered pitch, changing the formant filter dramatically as the consonant develops, along with a dip in the loudness envelope right at the "middle" of the /l/ sound

  4. /o/ - filtered pitch, arriving at a much darker formant than the /e/ sound

"Goodbye"



  1. /g/ - filtered pitch, loudness envelope with short attack, short quiet filtered noise burst, formant filter with an envelope that starts very dark (like an /n/ sound) and then gets bright for a very short time and then quickly settles to the position for the next phoneme

  2. /oo/ - filtered pitch, note this is similar to a /u/ formant

  3. /d/ - filtered pitch continues but the loudness envelope drops to essentially zero for just a short moment and then comes back up to the original loudness with the same formant (like "duh"), possibly with a slight noise burst right when the loudness is coming back up

  4. /b/ - again, loudness drops to zero and then comes up quickly with the same formant and pitch (like "buh") but with no noise burst

  5. /y/ or /ai/ - formant filter sweep from current /oo/ or /u/ position to a much brighter sound like /i/
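To show how a breakdown like this could drive a very crude synth, here is a minimal Python sketch of "hello". It is a toy under loud assumptions: a naive sawtooth stands in for the vocal cords, uniform noise for breath, and a one-pole low-pass with a "brightness" setting stands in for the formant filter (real formants are closer to band-pass resonances). The segment durations, brightness values, and names are all guesses, and the result will not sound like intelligible speech.

```python
import random

SAMPLE_RATE = 8000  # samples per second (arbitrary for this sketch)

# (label, source, brightness from 0 to 1, duration in seconds); all values are guesses
HELLO = [
    ("h", "noise", 0.2, 0.08),  # muted, quiet filtered noise
    ("e", "pitch", 0.8, 0.12),  # fairly bright filtered pitch
    ("l", "pitch", 0.4, 0.08),  # filter moves darker through the consonant
    ("o", "pitch", 0.3, 0.20),  # arrives at a darker formant than /e/
]


def render(segments, pitch_hz=120.0, seed=0):
    """Turn a list of segments into audio samples in the range [-1, 1]."""
    rng = random.Random(seed)
    out, y, phase = [], 0.0, 0.0
    for _label, source, brightness, duration in segments:
        for _ in range(round(duration * SAMPLE_RATE)):
            if source == "noise":
                x = rng.uniform(-1.0, 1.0)
            else:
                phase = (phase + pitch_hz / SAMPLE_RATE) % 1.0
                x = 2.0 * phase - 1.0  # naive sawtooth, rich in harmonics
            y += brightness * (x - y)  # one-pole low-pass as a formant stand-in
            out.append(y)
    return out


samples = render(HELLO)
print(len(samples), "samples,", round(len(samples) / SAMPLE_RATE, 2), "seconds")
```

The pitch stays fixed for the whole word. Everything that distinguishes one segment from the next comes from the choice of source and the filter setting, which is the point of the breakdown above.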





share|improve this answer

























    Your Answer








    StackExchange.ready(function()
    var channelOptions =
    tags: "".split(" "),
    id: "240"
    ;
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function()
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled)
    StackExchange.using("snippets", function()
    createEditor();
    );

    else
    createEditor();

    );

    function createEditor()
    StackExchange.prepareEditor(
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: false,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: null,
    bindNavPrevention: true,
    postfix: "",
    imageUploader:
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    ,
    noCode: true, onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    );



    );






    JShorthouse is a new contributor. Be nice, and check out our Code of Conduct.









    draft saved

    draft discarded


















    StackExchange.ready(
    function ()
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmusic.stackexchange.com%2fquestions%2f82996%2fin-musical-terms-what-properties-are-varied-by-the-human-voice-to-produce-diffe%23new-answer', 'question_page');

    );

    Post as a guest















    Required, but never shown

























    2 Answers
    2






    active

    oldest

    votes








    2 Answers
    2






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    6














    I think you might be best served by linguistics, specifically phonetics.



    Pitch is sort of an element, but specific pitch isn't the concern. Instead, some vocal sounds are "voiced" meaning the vocal chords vibrate (producing pitches.) For example, the f in 'fan is not voiced, but when voiced it becomes v like 'van.'



    How vowel and consonant sounds are produced is understood in linguistics as a matter of vocal anatomy of the tongue, palette, etc. and described with terms like fricative, labial, etc. There is a complex mapping of the inside of the mouth in linguistics.



    You could describe the actions of the voice with acoustics with terms like amplitutde, wave form, etc. But, linguistics actually has a whole branch devoted to the study of vocal sounds.



    By the way, in voice training these topics are called diction.






    share|improve this answer




















    • 1





      Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!

      – Michael Curtis
      2 hours ago






    • 1





      Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?

      – Michael Curtis
      2 hours ago















    6














    I think you might be best served by linguistics, specifically phonetics.



    Pitch is sort of an element, but specific pitch isn't the concern. Instead, some vocal sounds are "voiced" meaning the vocal chords vibrate (producing pitches.) For example, the f in 'fan is not voiced, but when voiced it becomes v like 'van.'



    How vowel and consonant sounds are produced is understood in linguistics as a matter of vocal anatomy of the tongue, palette, etc. and described with terms like fricative, labial, etc. There is a complex mapping of the inside of the mouth in linguistics.



    You could describe the actions of the voice with acoustics with terms like amplitutde, wave form, etc. But, linguistics actually has a whole branch devoted to the study of vocal sounds.



    By the way, in voice training these topics are called diction.






    share|improve this answer




















    • 1





      Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!

      – Michael Curtis
      2 hours ago






    • 1





      Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?

      – Michael Curtis
      2 hours ago













    6












    6








    6







    I think you might be best served by linguistics, specifically phonetics.



    Pitch is sort of an element, but specific pitch isn't the concern. Instead, some vocal sounds are "voiced" meaning the vocal chords vibrate (producing pitches.) For example, the f in 'fan is not voiced, but when voiced it becomes v like 'van.'



    How vowel and consonant sounds are produced is understood in linguistics as a matter of vocal anatomy of the tongue, palette, etc. and described with terms like fricative, labial, etc. There is a complex mapping of the inside of the mouth in linguistics.



    You could describe the actions of the voice with acoustics with terms like amplitutde, wave form, etc. But, linguistics actually has a whole branch devoted to the study of vocal sounds.



    By the way, in voice training these topics are called diction.






    share|improve this answer















    I think you might be best served by linguistics, specifically phonetics.



    Pitch is sort of an element, but specific pitch isn't the concern. Instead, some vocal sounds are "voiced" meaning the vocal chords vibrate (producing pitches.) For example, the f in 'fan is not voiced, but when voiced it becomes v like 'van.'



    How vowel and consonant sounds are produced is understood in linguistics as a matter of vocal anatomy of the tongue, palette, etc. and described with terms like fricative, labial, etc. There is a complex mapping of the inside of the mouth in linguistics.



    You could describe the actions of the voice with acoustics with terms like amplitutde, wave form, etc. But, linguistics actually has a whole branch devoted to the study of vocal sounds.



    By the way, in voice training these topics are called diction.







    share|improve this answer














    share|improve this answer



    share|improve this answer








    edited 2 hours ago

























    answered 2 hours ago









    Michael CurtisMichael Curtis

    12.3k744




    12.3k744







    • 1





      Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!

      – Michael Curtis
      2 hours ago






    • 1





      Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?

      – Michael Curtis
      2 hours ago












    • 1





      Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!

      – Michael Curtis
      2 hours ago






    • 1





      Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?

      – Michael Curtis
      2 hours ago







    1




    1





    Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!

    – Michael Curtis
    2 hours ago





    Phoenix + phonetics = phoenetics! :-) ...I corrected my typo, thanks!

    – Michael Curtis
    2 hours ago




    1




    1





    Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?

    – Michael Curtis
    2 hours ago





    Found another of my own typos: 'best server' instead of 'best served' ...can you tell I work in computer support?

    – Michael Curtis
    2 hours ago











    6














    As Michael Curtis has pointed out, from the linguistic side, the study of phonetics is all about what speech sounds humans make and how they make them. Phonetics doesn't really approach things from a musical perspective, so I thought I might try to make some correlations between phonetics and musical acoustics.



    Phonetics divides speech sounds (phonemes) into two broad categories: vowels and consonants. The lines can be a bit blurry there, but vowel sounds always involve the vocal cords and usually made with the mouth more or less open, while consonants involve specific motions of the teeth, lips, and tongue and may or may not use the vocal cords.



    For vowels, we always use our vocal cords, which means vowels always have some pitch. The pitches used during speech generally do not have a typical musical relationship, but sometimes might be "accidentally" musical. For instance, when a child taunts on the playground something like, "Johnny is a chick-en!", they often us a sing-song tone that is a melodic minor third. But that's incidental.



    The way we make different vowel sounds is by changing the shape of our mouths, and this changes the timbre of the sound made by our vocal cords. Another musical way to look at it is that we are filtering (like with EQ or a synth filter) the pitch that is created by our vocal cords.



    That entirely covers the musical aspects of vowel sounds. We could talk about loudness and duration (the two other main dimensions of music), but neither of those change the vowel sound we make or hear.



    Consonants are more complicated. Let's divide them into the phonetic categories of voiced (using the vocal cords) and unvoiced (not using the vocal cords).



    Unvoiced consonants (like /t/, /p/, /f/, /k/, /s/), from a musical standpoint, are closest to percussion sounds. These are the kinds of sounds we make when we beat box.
    Percussion sounds and unvoiced consonants are both musically unpitched, and instead distinguished solely by timbre. The two main timbral elements of these sounds are the envelope and formant. The formant is like a filter setting, just like for vowel sounds, but since there is no pitch to filter, what is being filtered instead is noise or unpitched tones. Unpitched tones are groups of frequencies that do not have a harmonic relationship to each other, so we don't hear them as a note. Think of two different cymbals, a "high" one and a "low" one as being examples of noise with two different formants.



    For unvoiced consonants, there are two subcategories we can talk about, plosives, fricatives. Plosives (/t/, /p/, /k/) have a very short loudness envelope that reaches maximum volume very quickly and then dies away just as quickly. This is most similar to a drum sound. The different sounds of plosives come from their different formants. In this case, it's mainly how much and what kinds of noise is being made along with the plosive sound. A /p/ sound has essentially no noise, like a kick drum, while /t/ and /k/ have two different kinds of noise that are more like a hi hat and snare drum, respectively. Another thing that makes the /t/ sound different from the /k/ is the position of the mouth is different, which causes different filtering just like we see in the vowel sounds.



    Fricatives (/f/, /s/, /sh/, /th/) are all bursts of noise that generally last longer than plosives (they have a slower loudness envelope), and they each have their own formant, or filter setting, that changes the character of the noise. Note that /f/ is a fairly even noise sound, while /s/ has more of a sense of some frequencies being louder than others, /sh/ is a more uneven noise sound, and /th/ is a muted noise sound without as much of the upper frequencies.



    For the voiced consonants, most of them are essentially the same as the unvoiced ones outlined above, except they also involve the vocal cords, so there is again a pitch of some kind when voiced consonants are spoken. These consonants include /d/ (voiced /t/), /b/ (voiced /p/), /z/ (voiced /s/) and so on. I believe every unvoiced consonant has a voiced version in English (I believe this is also true in Japanese).



    There are a few voiced consonants that do not have unvoiced versions and also straddle the line between consonants and vowels. The two closest to being vowels are the /y/ and /w/ sounds. These are basically vowels where the formant or filter is changed while we say them. This is done by changing the positing of the tongue or lips while the vocal cords create a pitch.



    Two others, /m/ and /n/, are basically made similar to humming, and the main way we tell the difference is by how the consonant changes to a vowel to determine whether it was an /m/ or /n/. During the transition to vowel, the difference between /m/ and /n/ is similar to the difference between /w/ and /y/.



    Finally, /l/ and /r/ are essentially vowels that have rather extreme formants or filters applied do them. They also sound different when they are approached and left (what you might call their formant/filter envelopes).



    If you're really paying attention, you've noticed I have not discussed every English phoneme. I have touched on all the musical aspects of phonemes in all languages. Here's more of a breakdown aspects of phonemes:



    • Different sound sources, including the vocal cords to make pitches and parts of the mouth that can make noises

    • Different mouth positions to filter the sound sources in different ways to create different formants

    • Different loudness envelopes, or how the loudness changes with time

    • Different formant envelopes, or how the filtering changes with time

    Those are the primary elements that distinguish different phonemes summarized with musical, rather than phonetic, terms.




    For fun, let's break down "hello" and "goodbye" musically, as if we were going to try to make a synth make these sounds:



    "Hello"



    1. /h/ - filtered noise, very muted and fairly quiet

    2. /e/ - filtered pitch, fairly bright formant filter (a kind of bandpass filter)

    3. /l/ - filtered pitch, changing the formant filter dramatically as the consonant develops, along with a dip in the loudness envelope right at the "middle" of the /l/ sound

    4. /o/ - filtered pitch, arriving at a much darker formant than the /e/ sound

    "Goodbye"



    1. /g/ - filtered pitch, loudness envelope with short attack, short quiet filtered noise burst, formant filter with an envelope that starts very dark (like an /n/ sound) and then gets bright for a very short time and then quickly settles to the position for the next phoneme

    2. /oo/ - filtered pitch, note this is similar to a /u/ formant

    3. /d/ - filtered pitch continues but the loudness envelope drops to essentially zero for just a short moment and then comes back up to the original loudness with the same formant (like "duh"), possibly with slight noise burst right when the loudness is coming back up

    4. /b/ - again, loudness drops to zero and then comes up quickly with the same formant and pitch (like "buh") but with no noise burst

    5. /y/ or /ai/ - formant filter sweep from current /oo/ or /u/ position to a much brighter sound like /i/





    share|improve this answer





























      6














      As Michael Curtis has pointed out, from the linguistic side, the study of phonetics is all about what speech sounds humans make and how they make them. Phonetics doesn't really approach things from a musical perspective, so I thought I might try to make some correlations between phonetics and musical acoustics.



      Phonetics divides speech sounds (phonemes) into two broad categories: vowels and consonants. The lines can be a bit blurry there, but vowel sounds always involve the vocal cords and usually made with the mouth more or less open, while consonants involve specific motions of the teeth, lips, and tongue and may or may not use the vocal cords.



      For vowels, we always use our vocal cords, which means vowels always have some pitch. The pitches used during speech generally do not have a typical musical relationship, but sometimes might be "accidentally" musical. For instance, when a child taunts on the playground something like, "Johnny is a chick-en!", they often us a sing-song tone that is a melodic minor third. But that's incidental.



      The way we make different vowel sounds is by changing the shape of our mouths, and this changes the timbre of the sound made by our vocal cords. Another musical way to look at it is that we are filtering (like with EQ or a synth filter) the pitch that is created by our vocal cords.



      That entirely covers the musical aspects of vowel sounds. We could talk about loudness and duration (the two other main dimensions of music), but neither of those changes the vowel sound we make or hear.



      Consonants are more complicated. Let's divide them into the phonetic categories of voiced (using the vocal cords) and unvoiced (not using the vocal cords).



      Unvoiced consonants (like /t/, /p/, /f/, /k/, /s/), from a musical standpoint, are closest to percussion sounds. These are the kinds of sounds we make when we beatbox.
      Percussion sounds and unvoiced consonants are both musically unpitched, and instead distinguished solely by timbre. The two main timbral elements of these sounds are the envelope and formant. The formant is like a filter setting, just like for vowel sounds, but since there is no pitch to filter, what is being filtered instead is noise or unpitched tones. Unpitched tones are groups of frequencies that do not have a harmonic relationship to each other, so we don't hear them as a note. Think of two different cymbals, a "high" one and a "low" one as being examples of noise with two different formants.



      For unvoiced consonants, there are two subcategories we can talk about: plosives and fricatives. Plosives (/t/, /p/, /k/) have a very short loudness envelope that reaches maximum volume very quickly and then dies away just as quickly. This is most similar to a drum sound. The different sounds of plosives come from their different formants. In this case, it's mainly how much and what kind of noise is made along with the plosive sound. A /p/ sound has essentially no noise, like a kick drum, while /t/ and /k/ have two different kinds of noise that are more like a hi-hat and snare drum, respectively. Another thing that makes the /t/ sound different from the /k/ is that the position of the mouth is different, which causes different filtering just like we see in the vowel sounds.



      Fricatives (/f/, /s/, /sh/, /th/) are all bursts of noise that generally last longer than plosives (they have a slower loudness envelope), and they each have their own formant, or filter setting, that changes the character of the noise. Note that /f/ is a fairly even noise sound, while /s/ has more of a sense of some frequencies being louder than others, /sh/ is a more uneven noise sound, and /th/ is a muted noise sound without as much of the upper frequencies.
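The envelope difference between plosives and fricatives can be sketched in a few lines of Python. This is only an illustration under assumed values: the source is plain white noise, the attack and decay times are guesses chosen to sound "drum-like" versus "hiss-like", and the formant filtering that separates, say, /s/ from /sh/ is left out entirely.

```python
import math
import random

SAMPLE_RATE = 16000  # samples per second (assumed for this sketch)


def white_noise(seconds, seed=0):
    """Unpitched source: random samples with no harmonic structure."""
    rng = random.Random(seed)
    return [rng.uniform(-1.0, 1.0) for _ in range(int(SAMPLE_RATE * seconds))]


def loudness_envelope(n_samples, attack_s, decay_s):
    """Linear rise to full volume, then exponential decay."""
    attack_n = max(1, int(attack_s * SAMPLE_RATE))
    decay_n = decay_s * SAMPLE_RATE
    env = []
    for i in range(n_samples):
        if i < attack_n:
            env.append(i / attack_n)
        else:
            env.append(math.exp(-(i - attack_n) / decay_n))
    return env


def shaped_noise(seconds, attack_s, decay_s, seed=0):
    """Noise source multiplied by a loudness envelope."""
    source = white_noise(seconds, seed)
    env = loudness_envelope(len(source), attack_s, decay_s)
    return [s * e for s, e in zip(source, env)]


def rms(samples):
    """Root-mean-square level of a run of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


# Assumed timings: a plosive-like burst versus a fricative-like hiss.
plosive = shaped_noise(0.2, attack_s=0.001, decay_s=0.005)
fricative = shaped_noise(0.2, attack_s=0.03, decay_s=0.08)
```

Measuring `rms(...)` on the stretch from 50 ms to 100 ms (samples 800 to 1600) shows the plosive-like burst has already died away there while the fricative-like hiss is still sounding.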



      For the voiced consonants, most of them are essentially the same as the unvoiced ones outlined above, except they also involve the vocal cords, so there is again a pitch of some kind when voiced consonants are spoken. These consonants include /d/ (voiced /t/), /b/ (voiced /p/), /z/ (voiced /s/) and so on. I believe nearly every unvoiced consonant has a voiced version in English, with /h/ as the main exception (I believe this is also largely true in Japanese).



      There are a few voiced consonants that do not have unvoiced versions and also straddle the line between consonants and vowels. The two closest to being vowels are the /y/ and /w/ sounds. These are basically vowels where the formant or filter is changed while we say them. This is done by changing the position of the tongue or lips while the vocal cords create a pitch.



      Two others, /m/ and /n/, are basically made similarly to humming, and the main way we tell an /m/ from an /n/ is by how the consonant transitions into the following vowel. During that transition, the difference between /m/ and /n/ is similar to the difference between /w/ and /y/.



      Finally, /l/ and /r/ are essentially vowels that have rather extreme formants or filters applied to them. They also sound different when they are approached and left (what you might call their formant/filter envelopes).



      If you're really paying attention, you'll have noticed I have not discussed every English phoneme. However, I believe I have touched on the main musical aspects of phonemes in all languages. Here's a breakdown of those aspects:



      • Different sound sources, including the vocal cords to make pitches and parts of the mouth that can make noises

      • Different mouth positions to filter the sound sources in different ways to create different formants

      • Different loudness envelopes, or how the loudness changes with time

      • Different formant envelopes, or how the filtering changes with time

      Those are the primary elements that distinguish different phonemes summarized with musical, rather than phonetic, terms.




      For fun, let's break down "hello" and "goodbye" musically, as if we were going to try to make a synth make these sounds:



      "Hello"



      1. /h/ - filtered noise, very muted and fairly quiet

      2. /e/ - filtered pitch, fairly bright formant filter (a kind of bandpass filter)

      3. /l/ - filtered pitch, changing the formant filter dramatically as the consonant develops, along with a dip in the loudness envelope right at the "middle" of the /l/ sound

      4. /o/ - filtered pitch, arriving at a much darker formant than the /e/ sound

      "Goodbye"



      1. /g/ - filtered pitch, loudness envelope with short attack, short quiet filtered noise burst, formant filter with an envelope that starts very dark (like an /n/ sound) and then gets bright for a very short time and then quickly settles to the position for the next phoneme

      2. /oo/ - filtered pitch, note this is similar to a /u/ formant

      3. /d/ - filtered pitch continues but the loudness envelope drops to essentially zero for just a short moment and then comes back up to the original loudness with the same formant (like "duh"), possibly with a slight noise burst right when the loudness is coming back up

      4. /b/ - again, loudness drops to zero and then comes up quickly with the same formant and pitch (like "buh") but with no noise burst

      5. /y/ or /ai/ - formant filter sweep from current /oo/ or /u/ position to a much brighter sound like /i/
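As a rough illustration of how the last two steps might be handed to a synth, here is a small Python sketch that turns the "bye" part into two control curves: one for loudness and one for the formant filter. The `Segment` class, the frame counts, and the single 900 Hz to 2300 Hz formant value are all hypothetical choices made for this example, not settings from any real synthesizer.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    label: str         # which part of the word this is
    source: str        # "pitch" or "noise"
    frames: int        # duration in control frames (assumed values)
    loudness: tuple    # (start, end) loudness, 0.0 to 1.0
    formant_hz: tuple  # (start, end) filter center frequency


def ramp(start, end, frames):
    """Straight line from start to end over the given number of frames."""
    if frames == 1:
        return [start]
    step = (end - start) / (frames - 1)
    return [start + step * i for i in range(frames)]


def control_curves(segments):
    """Concatenate per-segment ramps into loudness and formant envelopes."""
    loudness, formant = [], []
    for seg in segments:
        loudness += ramp(seg.loudness[0], seg.loudness[1], seg.frames)
        formant += ramp(seg.formant_hz[0], seg.formant_hz[1], seg.frames)
    return loudness, formant


# "bye": loudness dips to zero and recovers (/b/), then the filter sweeps
# from a dark /u/-like setting to a bright /i/-like one (/y/ or /ai/).
BYE = [
    Segment("b closure", "pitch", 5, (1.0, 0.0), (900.0, 900.0)),
    Segment("b release", "pitch", 5, (0.0, 1.0), (900.0, 900.0)),
    Segment("y / ai", "pitch", 20, (1.0, 1.0), (900.0, 2300.0)),
]

loudness, formant = control_curves(BYE)
```

Keeping loudness and formant as separate curves mirrors the breakdown above: the /b/ only touches the loudness envelope, while the final glide only touches the formant envelope.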





        edited 1 hour ago

























        answered 1 hour ago









        Todd Wilcox
