Page 34 - Summer 2006
P. 34
1 W. Koenig, H. K. Dunn, and L. Y. Lacy, “The sound spectro- graph,” J. Acoust. Soc. Am. 18(1), 19-49 (1946).
2 G. A. Kopp and H. C. Green, “Basic phonetic principles of visi- ble speech,” J. Acoust. Soc. Am. 18(1), 74-89 (1946).
3 L. K. Montgomery and I. S. Reed, “A generalization of the Gabor-Helstrom transform,” IEEE Trans. Information Theory IT-13, 344-345 (1967).
4 D. Gabor, “Theory of Communication,” J. Inst. Elect. Eng. Part III 93(26), 429-457 (1946).
5 J. R. Carson, “Notes of the theory of modulation,” Proc. Inst. Radio Eng. 10(1), 57-64 (1922).
6 J. R. Carson and T. C. Fry, “Variable frequency electric circuit theory with application to the theory of frequency-modulation,” Bell Syst. Tech. J. 16(4), 513-540 (1937).
7 K. Kodera, C. de Villedary, and R. Gendrin, “A new method for the numerical analysis of non-stationary signals,” Phys. Earth and Planetary Interiors 12, 142-150 (1976).
8 K. Kodera, R. Gendrin, and C. de Villedary, “Analysis of time- varying signals with small BT values,” IEEE Trans. Acoust., Speech, and Sig. Proc. ASSP-26(1), 64-76 (1978).
9 A. W. Rihaczek, “Signal energy distribution in time and frequency,” IEEE Trans. Information Theory IT-14(3), 369-374 (1968).
10 D. J. Nelson, “Cross-spectral methods for processing speech,” J. Acoust. Soc. Am. 110(5), 2575-2592 (2001).
11 T. Nakatani and T. Irino, “Robust and accurate fundamental fre- quency estimation based on dominant harmonic components,” J. Acoust. Soc. Am. 116(6), 3690-3700 (2004).
12 F. Auger and P. Flandrin, “Improving the readability of time-fre- quency and time-scale representations by the reassignment method,” IEEE Trans. Sig. Proc. 43(5), 1068-1089 (1995).
13 S. A. Fulop and K. Fitz, “Algorithms for computing the time-cor- rected instantaneous frequency (reassigned) spectrogram, with applications,” J. Acoust. Soc. Am. 119(1), 360-371 (2006).
14 J. L. Larimer and J. A. MacDonald, “Sensory feedback from elec- troreceptors to electromotor pacemaker centers in gymnotids,” Am. J. Physiol. 214(6), 1253-1261 (1968).
15 K. Fitz and L. Haken, “On the use of time-frequency reassign- ment in additive sound modeling,” J. Audio Eng. Soc. 50(11), 879-893 (2002).
16 D. J. Nelson, “Instantaneous higher order phase derivatives,” Digital Sig. Proc. 12, 416-428 (2002).
Fig. 10. A few milliseconds from vowel [ae] “had,” comparing the visibility of reso- nance components in a conventional spectrogram (upper) against a reassigned spectrogram plotted using Nelson’s component isolation technique. Both are com- puted using 5.9 ms analysis frames.
work—have each recognized that, for many applications, it is the components of a signal that contain the most sought information, not the overall energy distribution. Going hand in hand with this break from tradition, by focusing entirely on instantaneous frequencies the method sets aside the “clas- sical” Fourier spectrum which decomposes a signal (or its successive frames, in the spectrogram) into sine waves of infinite extent. From the perspective of signal modeling, we already know that real signals are never so constituted, so it is not exactly a leap of faith to stop representing them that way. It is our hope that the future will bring production-qual- ity software that everyone can access, which allows a choice between conventional and reassigned spectrograms. For many applied acousticians, the choice will be clear.AT
References for further reading:
32 Acoustics Today, July 2006