Acoustic Parameters:
May be indicators of neurological disease. Obtaining acoustical data is vital for diagnosis and intervention. The measures provide a basis of comparison for the person’s voice before, during, and after therapy. Measures indicate parameters which should be targeted for therapy. Abnormal acoustic measures may vary across individuals with the same disorder (some may have a high Fo and some a low Fo) and may change in severity with the progression of disease (the parameter is affected most or that contributes the most to speech intelligibility may change with time) . Measures can help determine whether a particular treatment strategy is effective or not.
Frequency variables:
1. Average fundamental frequency: measures the persons speaking fundamental frequency (SFF). Perceptually this corresponds to pitch. It is affected by age, sex, and gender. Fo may be abnormally low and may suggest pathology.
Females have an average Fo of 180-250 Hz; Males have an average Fo between 80 and 150 Hz
2. Frequency Variability: changes in the fundamental frequency.
Fo variability is measured in terms of standard deviations from the average Fo. FoSD in normal conversational speech is around 20-35 Hz. For production of the prolongation of a vowel, the FoSD should be around 3-6 Hz. Sometimes Fo variability is measured in semitones called the pitch sigma. Pitch sigma for normal speakers during conversation should be around 2-4 semitones.
Males lowest 80Hz and highest 700 Hz; Females lowest135Hz and highest over 1000Hz.
4. Jitter or frequency perturbation is the frequency variability between cycles of vibration. Normal jitter is around 1% or less or from .2% to 1%. High jitter levels suggest that something is interfering with normal vocal fold vibration and the mucosal wave. Measuring cycle to cycle variability of vibration can allow us to detect changes in neuromuscular function of the muscles controlling the vocal folds. Jitter is related to periodicity of the vocal folds. When Jitter is elevated, this suggests an inability to maintain periodic vibration or the person’s ability to vibrate folds rhythmically.
5. Formant Frequency Measures: Frequency component amplified by resonator (vocal tract). Acoustic properties that distinguish speech sounds. The Maxima of harmonic energy. Typically measured by LPC (Linear Predictive Coding) or spectrographic analysis.
F1 related to tongue height; F2 related to tongue advancement
F2 transition: change in frequency value of formant over time; reflects change in position of articulators
Amplitude and
Intensity Variables
1. Average
Amplitude Level: the overall
level of amplitude during a speaking task. Perceptually this
corresponds to
loudness. Normal conversational speech
is usually between 65-80dBSPL, with an average around 70dB. Voice amplitude that a person can generate
depends strongly on the vocal Fo.
Clinical
implications: Amplitude may decrease slightly in older individuals. Very low vocal amplitude suggests difficulty
in opening the vocal folds widely enough and closing them.
2. Amplitude Variability is expressed as
standard deviation, measured in dbSPL.
Standard deviation of amplitude for a neutral, unemotional
sentence is
around 10 dB SLP.
3. Dynamic Range relates to the physiological range of the vocal amplitudes that a speaker can generate from the softest above a whisper to the loudest shout. Normal adults should be able to produce a minimum level of around 30dB and a maximum of approximately 115dB. A restricted range may prevent a person from using stress and emphasis patterns appropriately. Dynamic range depends on the Fo produced and tends to be greatest for Fo in the midrange and less for Fo that is much lower or much higher.
Clinical implications for neurological disorders:
PD
-higher than
normal Fo
-lower SD of frequency and amplitude
-decreased phonational and
dynamic
ranges
Neurological
disorders
- less able to use Fo effectively to distinguish between declarative and interrogative sentences.
Perceptual characteristics:
-monopitch, monoloudness, inappropriate stress characteristics
4. Shimmer or amplitude perturbation is the amplitude variability between cycles of vibration. Shimmer values below .5 dB are normal. Measuring cycle to cycle variability of vibration can allow us to detect changes in neuromuscular function of the muscles controlling the vocal folds. Shimmer is related to periodicity of the vocal folds. When Shimmer is elevated, this suggests an inability to maintain periodic vibration or the person’s ability to vibrate folds rhythmically.
Jitter and Shimmer Clinical implications:
Document vocal function for neurological disorders such as ALS, PD
Increased jitter and shimmer in ALS show degeneration in the speech muscles before the degeneration is actually heard as impaired speech. Jitter and Shimmer have been used as outcome measures for different types of treatment for PD. Acoustically PD have higher Fo, high jitter, low intensity, decreased phonational range and dynamic range.
Duration measures:
Multidimensional Visi-Pitch,
VTI (voice turbulence index suggesting incomplete glottic closure, SPI (soft phonation index suggesting weak vocalization or phonation), and DSH (degree of subharmonics) are typical for diplophonic voices and voices with glottal fry.
Application for Motor speech disorders:
Voice
and Tremor
Task “ah”
VFo: Coefficient of variation of Fo which is the Fo variation or SD of period to period calculated Fo for long term variations
Norm: 0.5%- 0.69%
VAm: Coefficient of variation for amplitude or the peak amplitude variation for long- term variation
Norm 1.8% - 6.7%
MFTR magnitude of frequency of tremor
Norm: 0.2%- 0.3%
AFTR magnitude of amplitude of tremor
Norm: .7% - 1.6%
Diadochokinetic Analysis: task Say “puh-puh-puh…”
DDKavr: syllables/second; norm = 5.97 -6.22 (Disordered speech is usually slow)
DDKcvp: Coefficient of variation of DDK Period (DDK Rhythm);
Norm: 5.8% (disordered voices have a higher variation)
DDKjit: Pertubation of the DDK period; norm: 1.002% - 1.161% (disordered high variation)
Frequency and Intensity: task “read MY GRANDFATHER”
Frequency Range: norm: 22.7 semitones, (11-17 semitones is below normal)
Dynamic Range: minimum is 30 dB
Measures of Quality:
1. Harmonics to noise ratio: a measure of the proportion of harmonic sound to noise in the voice measured in decibels. This measure can be useful form making objective quantitative assessments of breathiness, roughness, or hoarseness of a person’s voice. Averages around 15 dB.
2. Voice tremor: refers to a regular variation the fundamental frequency or amplitude of the voice. Usually the variation is about 3-5 Hz about the mean fundamental frequency.
3. Phonation time:
Maximum Phonation time: The maximum time a subject can sustain a tone on one breath. Norms: Adult males: 20 sec, females 15 sec.; children 10 seconds. Short maximum phonation times reflect inefficiency of the phonatory or respiratory system.
S/Z ratio: The maximum sustained phonation time of /s/ divided by the maximum sustained phonation time of /z/. A normal speaker would be expected to sustain both the voiceless/s/ and the voiced /z/ for approximately equal durations, resulting in a ratio of 1. S/Z ratios greater than 1.4 indicate vocal fold pathology.
4. Voice:
Voice stoppages When silences become longer than normally expected or appear unexpectedly during phonation, they call attention to themselves, disrupt intelligibility and are considered abnormal.
Frequency breaks: A shift in fundamental frequency either up or down in frequency.