Page 45 - Summer 2007
P. 45

 Standards News
 CALIBRATION OF SPEECH IN THE REAL WORLD A REQUEST FROM INDUSTRY
Robert S. Schlauch
Department of Speech-Language-Hearing Sciences, University of Minnesota Minneapolis, Minnesota 55455
 Awidely used American National
Standards Institute (ANSI) stan-
dard is the one titled “Method for
measuring the intelligibility of speech
over communication systems,” ANSI S3.2
(1989). Long after this standard was pub-
lished in its current form, I became the
chair of its working group (WG), primari-
ly through attrition. Shortly after becoming the WG chair, it was brought to my attention that ANSI S3.2 refers to an ANSI standard for guidelines regarding the calibration of speech. This document does not exist. Apparently, the ANSI committee working on the speech calibration standard was unable to reach a consensus on this part. The goal of this short note is to find persons in industry who are willing to work on a description of a method for speech calibration for inclusion in the standard for measuring the intelligibility of speech over communication systems. Laboratory scientists are well represented on the working group, but persons familiar with real-world applications must be included in order to develop a standard that is acceptable for industry. Below, I describe some of the issues.
Speech is a time varying stimulus, and that raises a prob- lem for its calibration. Physical measures of the dynamic range of intensity for phonemes, the smallest segments of speech, are 50 dB or more (Boothroyd, 1994). Perceptual studies confirm that the effective dynamic range of speech is as high as 43 dB (Studebaker et al., 1999).
Before the widespread use of computers, speech calibra- tion of recorded materials was often done using a volume unit meter or the slow response setting on a sound level meter. The level of the speech was taken from the level of the peaks read on the meter, a method that is dependent on the device’s damping characteristics. This method is often based on the level measured in a coupler attached to a sound level meter, and it is still used today (e.g., Warren et al., 2005; Healy and Bacon, 2007; Hornsby and Ricketts, 2006). An informal survey of current issues of the Journal of the Acoustical Society of America revealed another popular method for speech calibration. This alternative method equates stimuli (e.g., sentences or words) for their root- mean-square (rms) level over their entire duration (Bradlow and Alexander, 2007; Erwin, et al., 2007; Kramer et al., 2007; Humes et al., 2006; Levi et al., 2007; Sommers and Barcroft, 2006; Van Engen and Bradlow, 2007). This latter approach is a trivial calculation for digitally stored stimuli. Use of a tone or other sound source with a root-mean-square (rms) ampli-
 “Speech is a time varying stimulus, and that raises a problem
for its calibration.”
 tude equivalent to the speech stimuli is a practical method for daily calibration of such stimuli.
A frequent goal of studies involving speech stimuli is speech understanding, and these measurements are often con- ducted in the presence of competing sounds. The current standard (ANSI S3.2)
does not specify the characteristics of a competing stimulus yet it recognizes that one may be present. In the laboratory, a typical masker for these situations is a noise filtered to have the same spectrum as the long-term spectrum of the speech stimuli used in the study. This type of shaped noise, as well as noise with other characteristics (e.g., pink noise or white noise), is easy to generate using a computer and has the ben- efit of being replicable. Would a standardized competing stimulus be appropriate for evaluating and comparing com- munication systems in industry?
The working group for the standard needs input regarding the applications in industry. We need to know whether the methods described above are appropriate for calibration for a wide variety of industrial applications. Our suspicion is that they will not be broadly accepted. At a meeting of the Acoustical Society of America, a colleague described to me a hypothetical example for the evaluation of a communication system in a tank, a tracked, armored combat vehicle. In this sit- uation, the “talker” would produce words using live voice with background noise varying, depending on the terrain and the status of the vehicle’s engine. The listener, whose responses are used to evaluate the effectiveness of the communication sys- tem, may be in a similar vehicle. So, now, the stimulus pre- sented to the listener would be potentially affected by the noise from both vehicles. What would be an appropriate method for calibrating the speech and documenting the noise in this situ- ation? If the speech and noise in this environment were record- ed separately, using a probe-tube microphone at the listener’s ear, the digital evaluations described above could be used for calibration. This method might be flexible but places addition- al demands on users of the standard. Also, if the noise differs across applications, one might question whether the results represent a standardized exam.
Send me an email (schla001@umn.edu) or leave me a phone message (612-624-7001), if you want to contribute to this process. I am seeking committee members, but any- one willing to share applications of the standard or other advice with our working group should also feel free to con- tact me. AT
Standards News 43












































































   43   44   45   46   47