The sound that heard by the air (also called audio) is analog
in nature and is a continuous waveform.
Bell Labs produced the first digital audio synthesis in the
1950s.
A computer needs to transfer the analog sound wave into its
digital representation, consisting of discrete number
A microphone converts the sound waves into electrical
signal.
This signal is then amplified, filtered, and sent to an analog-
to-digital converter. Output this data as sound, the
stream of data is sent to the speakers via a digital-to
analog converter, a reconstruction filter, and the audio is
amplified. This produces the analog sound wave that we
hear.
Sampling
The audio input from a source is sampled several thousand
times per second. Each sample is a snapshot of the original
signal at a particular time.
Sampling rate
When sampling a sound, the computer processes snapshots
of the waveform. The frequency of these snapshots is called
the sampling rate. The rate can vary typically from 5000-
90,000 sampling per second.
Sampling rate is an important (though not the only) factor in
determining how accurately the digitized sound represents
the original analog sound.
Digitization
Digitization is the process of assigning a discrete value
to each of the sampled values. It is performed by an
Integrated Chip (IC) called an A to D Converter. In the
case of 8-bit digitization, this between 0 and 255 (or –
128 and 127). In 16-bit digitization, this value is
between 0 and 65,535 (or – 32,768 and 32,767).
The process of digitization that a digitized signals. This
is related to the number of bits per sample. A higher
number of bits used to store the sampled value leads to
a more accurate sample, with less noise.
Fidelity
Fidelity is defined as the closeness of the recorded
version to the original sound. In the case of digital
speech, it depends upon the number of bits per sample
and the sampling rate. A rally high-fidelity (hi-fi)
recording takes up a lot of memory space (176.4Kb for
every second of audio of stereo quality sampled at 16
bit, 44.1 kHz per channel). Fortunately for most
computer multimedia applications, it is not necessary
to have very high fidelity sound.
NYQUIST THEOREM
The sampling frequency determines the limit of audio
frequencies that can be reproduced digitally.
According to SyQuest theorem, a minimum of two
samples (per cycle) is necessary to represent a given
sound at a minimum rate of 880 samples per second.
Therefore, Sampling rate are = 2 x Highest frequency.
A distortion knows as “aliasing” occurs and it cannot
be removed by post processing the digitized audio
signal. So frequencies that are above half the sampling
rate are filtered out prior to sampling to remove any
aliasing affects.
Sound formats and settings
Recording at high sampling rates produces a more
accurate capture of high frequency content of the
sound. Another aspect to consider is the “ BIT-
RESOLUTION.”
Stereo recording are made by recording on two channel,
and are lifelike and realistic and not Mono Sound are
less realistic, flat, and not a dramatic, but they have a
smaller file size.
Stereo sounds require twice the space as compared at
mono recording. To calculate the storage space
required, the following formula are used:
Mono Recording:
File size = Sampling rate x duration of recording in seconds x
(bits per sample/8)x1
Stereo Recording:
Sound format are standard in most audio editing
software. Sampling rates of 8, 11, 22, and 44 kHz are
used more often.