The VOCAL FOLDS, which are in the larynx (sometimes called the voice box, and located in the
throat), function differently for different sounds. The main differences involve whether they are
together or apart and whether or not they are vibrating. For VOICED sounds, the vocal folds are
close together and vibrating; for VOICELESS sounds, the vocal folds are usually apart and
stationary. There are several other terms for the vocal folds that you may encounter in linguistic
writings: VOCAL BANDS, VOCAL CORDS.
Vocal folds and vibratory movement
Chest voice is characterized by the great amplitude of the vibratory movements of the vocal folds.
The distance between the folds can reach about 3 mm. The vocal folds are thick and have the
appearance of lips.
The vocal folds vibrate over their whole length, from the inner surface of the thyroid cartilage to the
arytenoid vocal process. This produces a great vibrating mass with weak effective tension. The
duration of closure increases when the frequency is lowered. The closure phase lasts longer than the
opening phase.
Normal voice is the result of the vocal folds vibrating in a particular way. For this to happen, as
soon as the speaker has inhaled, the aretynoid cartilages draw the folds (and themselves) together
but not so tightly that they cannot be forced open by the pressure exerted by air which is then
expelled from the lungs. The vocal folds have depth. The aretynoids cause them to close in depth,
like a pair of curtains with the edges touching all the way from top to bottom. When the egressive
pulmonic airstream reaches the bottom, it is strong enough to start pushing the folds apart from the
bottom edge and, gradually, it pushes them apart in depth as it reaches the top of the obstruction. It
is acting against the force being exerted by the aretynoids in doing this remember, they are trying
to hold the folds together, touching each
other. You can follow these stages in Figure 1:
Early voice researchers in the 1950's and 1960's explained vocal fold oscillation with the
myoelastic-aerodynamic theory. According to these theories, Bernoulli forces (negative pressure)
cause the vocal folds to be sucked together, creating a closed airspace below the glottis. Continued
air pressure from the lungs builds up underneath the closed folds. Once this pressure becomes high
enough, the folds are blown outward, thus opening the glottis and releasing a single 'puff' of air.
The lateral movement of the vocal folds continues until the natural elasticity of the tissue takes over,
and the vocal folds move back to their original, closed position. Then, the cycle begins again. Each
cycle produces a single small puff of air; the sound of the human voice is nothing more than tens or
hundreds of these small puffs of air being released every second and filtered by the vocal tract.
Let's further examine the myoelastic-aerodynamic theory. Myo- means muscle; the vocal folds,
after all, are mostly comprised of muscle tissue. The -elastic suffix serves to remind us that the
vocal fold is elastic and that we have active control over its elastic properties. Aerodynamic means
that the theory deals with the motion of air and other gaseous fluids, and with the forces active on
bodies in motion (such as the vocal folds) in relation to such fluids.
A simple rectangular block represents one vocal fold. A spring is useful for portraying the tissue
stiffness or restoring force in the vocal fold. Finally, we've added a damping constant to represent
the viscosity (energy absorption) of the tissue. The damping constant is similar to the shock
absorber on a car or a tubular damper on a screen door.
So, how well does this simple model explain how the vocal folds sustain oscillation? Not well at all,
researchers have found. Bernoulli forces alone cannot account for continual energy conversion from
airstream to tissue. Soon, oscillation would damp out.
The One-Mass Model
A crucial component must be added to our simple model. An acoustic tube (to represent the vocal
tract) is now attached to our model. Why is the acoustic tube needed? For the vocal folds to sustain
oscillation, we know there must be a negative pressure within the glottis. But, pressure from the
lungs cannot be negative; it is always positive. So, how does the air pressure at the level of the
glottis become negative? Make a mental image of the the air from the lungs moving unidirectionally upward. When the glottis is closing, the airflow begins to decrease, but the air that is
above the glottis does not "know" this, so it continues to move with its same speed (because of
inertia). This creates a region just above the vocal folds where the air pressure decreases, because
air is not coming from the bottom through the glottis as fast as it is leaving above. When the vocal
folds are opening, fluid pressure against the walls is greater than when the vocal folds are close
together. Thus, it is the asymmetry of driving force (air) that sustains oscillation.
Although our one-mass model is a closer representation of actual vocal fold oscillation than the
myoelastic-aerodynamic model, some refinements will make the model even more like human
phonation.
An increased use of videostroboscopy in clinics and research labs has allowed vocologists to
observe many sets of vocal folds in slow motion. These observations have shown that vocal folds
rarely move in a uniform block as depicted in our one-mass model. Rather, the vocal folds move in
a wave-like motion from bottom to top, with the bottom edge leading the way. A more sophisticated
version of the model can mimic the motion.
The Three-Mass Model
So, let's continue building our model. In order to model the shape of the vocal folds more
accurately, we add two small masses (depicted as m1 and m2), one on top of the other, to represent
the cover of the vocal fold. A large mass (depicted as m) represents the thyroarytenoid muscle.
Although independent of one another in movement, all three masses are connected by springs and
damping constants. Here is how the model looks and moves now:
Note that at some points in the cycle, the bottom of the vocal folds are farther apart than the upper
part of the folds. We call this a convergent shape because the airflow is converging. On the other
hand, the airflow diverges when the lowermost parts of the vocal folds are closer together; this is a
divergent glottal shape. Average air pressures within the glottis tend to be larger in the convergent
glottal configuration than in the divergent shape, resulting in the asymmetry of air pressures needed
to sustain oscillation.
As technical capabilities in computer software and hardware increase and as improved imaging
techniques allow researchers to study vocal folds in motion, models are increasingly becoming
more realistic. Dr. Ingo Titze and his colleagues at The University of Iowa routinely use 16-mass
models in their studies.
Source: http://www.ncvs.org/ncvs/tutorials/voiceprod/tutorial/model.html (check for animated versions
of pictures
See also Tutorial on VOT in plosives, with audio files of contrasts, from University College
London
http://www.phon.ucl.ac.uk/home/johnm/siphtra/plostut2/plostut2.htm