Pulse clarity characterizations are drawn from various 2) Articulation. Another possible non-periodical account of
description of the dynamic curves presented in paragraph A. pulse clarity is related to articulation (ART variable), i.e., the
Most of these descriptions concern the periodicity of performance characterization oriented along the
particular events, as detailed in paragraphs C and D. This staccato/legato polarity. The Average Silence Ratio (ASR) is
paragraph does, however, not focus on periodic pulsation, but defined for the RMS curve, and indicates the percentage of
rather on the characterization of individual pulses. These frames that have an RMS energy significantly lower than the
pulses are extracted from the dynamic curves through a mean RMS energy of all frames (Feng, Zhuang & Pan, 2003).
detection of the major peaks, i.e. local maxima that show The ASR is similar to the low-energy rate (Burred & Lerch,
significant contrast. 2003), except the use of a different energy threshold: the ASR
1) Attack Characterization. The clarity of each pulse might is meant to characterize significantly silent frames.
be correlated with the sharpness of the corresponding note 3) Variability. Finally, a variability factor (VAR) sums the
attack. The attack phase of each note can be tentatively amplitude difference between successive local extrema of the
detected using the amplitude envelope: the peaks, as shown in onset detection curve, and another factor (PEAK) stores the
Figure 7, actually correspond to the end of each attack phase. average magnitude of the peaks in the amplitude envelope.
The starting point of each phase can be estimated as well, by
extracting the nearest local minimum preceding each peak. To C. Statistical description of the autocorrelation function
be more precise, the local minimum is actually computed Compared to non-periodic descriptors of individual pulses,
from a smoothed version of the amplitude envelope, in order global characterization of periodic pulsations may be
to ignore any spurious burst of energy not directly related to supposed to significantly contribute to the perception of pulse
the attack phase itself. clarity. Rhythmic periodicity is commonly estimated via the
computation of the autocorrelation function of the amplitude
Figure 7 shows the onset detection of the beginning of a
envelope (Brown, 1993), or more preferably of the time
performance of Schumann's Kinderszene II, with
derivative of the amplitude envelope (Figure 9). This
determination of the attack phases.
autocorrelation function, as shown in Figure 10, indicates the
presence of periodicities for a given range of possible periods.
The peaks indicate the most probable periods.
From the resulting autocorrelation function, various
statistical characterizations have been chosen in order to
indicate the prominence of the pulsations.
1) Global Maximum. The most evident descriptor is the
amplitude of the main peak (MAX variable), i.e., the global
maximum of the curve. By property, the autocorrelation
function reaches its peak at the origin, i.e. for a period of 0 s,
meaning that the correlation due to periodic repetitions in the
Figure 9. Time derivation of the amplitude envelope of ragtime signal are all necessarily inferior to the correlation of the signal
(cf. Figure 1). with itself. This has no use in our study, except that the
maximum at the origin can be used as a reference in order to
normalize the autocorrelation function. In this way, the actual
values shown in the autocorrelation function correspond
uniquely to periodic repetitions, and are not influenced by the
global intensity of the total signal. The global maximum will
be extracted within a part of the autocorrelation function
corresponding to perceptible rhythmic periodicities, i.e. for the
range of tempi between 40 and 200 BPM.
l0 l1 l2