Aim of this multimedia demo
The aim of this multimedia demo is to illustrate the changes in voice fundamental frequency (also called ‘voice pitch’) that occur as a function of age and talker sex. The recordings are taken from two speech corpora recorded at UCL: kidLUCID and elderLUCID. The kidLUCID corpus is accessible at the OSCAAR archive based at Northwestern University, and the elderLUCID corpus will be made available by mid 2017.
Using the diapix task to elicit spontaneous speech in interaction
‘Diapix’ (van Engen et al., 2011; Baker and Hazan, 2011) is a problem-solving ‘spot the difference’ picture task used for eliciting spontaneous speech interactions between two participants. Each participant has one version of a picture and, without seeing their partner’s version, the two must work out the 12 differences between their pictures. The pictures are carefully designed to elicit certain vocabulary items. This task has been used with children aged 8 years and above as well as with adults aged up to 85 years. To get a ‘clean’ recording for each participant, the diapix task can be done with each participant seated in a separate sound-treated booth and communicating via headphones.
Examples of spontaneous speech for different age groups
The following examples illustrate the changes in the fundamental frequency of the voice (‘voice pitch’) for female and male talkers at different ages. They are extracted from recordings made while participants were carrying out the diapix task with their conversational partners in good listening conditions. All talkers are discussing the shack on the top right-hand side of the pictures above. Median fundamental frequency (f0) values were calculated for each talker over the whole recording of the diapix task, which lasted 5-6 minutes on average.
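As a rough illustration of how a median f0 value like those quoted below can be obtained, the sketch here takes a sequence of per-frame f0 estimates and returns the median over voiced frames only. It is a minimal sketch under stated assumptions, not the procedure used for these corpora: the pitch-tracking step is not described here, the function name `median_f0` and the convention that unvoiced frames are stored as 0.0 are hypothetical, and the example values are made up.

```python
from statistics import median


def median_f0(f0_track_hz, unvoiced_value=0.0):
    """Return the median f0 (Hz) over voiced frames only.

    Assumption: frames without voicing are stored as `unvoiced_value`
    (or None) by whatever pitch tracker produced the track.
    """
    voiced = [f for f in f0_track_hz if f is not None and f > unvoiced_value]
    if not voiced:
        raise ValueError("no voiced frames in f0 track")
    return median(voiced)


# Made-up per-frame f0 estimates in Hz; 0.0 marks unvoiced frames.
example_track = [0.0, 251.0, 258.0, 262.0, 0.0, 0.0, 249.0, 270.0, 0.0]
print(median_f0(example_track))  # prints 258.0
```

The median is less sensitive than the mean to occasional pitch-tracking errors such as octave jumps, which is one common reason for preferring it when summarising a long recording.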
9-10 year olds
Note how similar f0 values are for male and female talkers at this age. It is quite difficult to reliably tell the sex of the talker from the voice.
F39, 10 year old girl (median f0: 258 Hz)
M30, 10 year old boy (median f0: 271 Hz)
11-12 year olds
These two 11 year old talkers still have a high median f0.
F46, 11 year old girl (median f0: 251 Hz)
M17, 11 year old boy (median f0: 261 Hz)
13-14 year olds
Here, note that the median f0 for the 13 year old girl is lower than that measured for the 10 and 11 year old girls. The first male talker, 14 year old M21, still shows a high f0, while M07, also aged 14, is more advanced in puberty and shows a dramatic reduction in median f0.
F06, 13 year old girl (median f0: 214 Hz)
M21, 14 year old boy (median f0: 216 Hz)
M07, 14 year old boy (median f0: 85 Hz)
Young adults
These two young adults display typical average f0 values for their age and sex.
YAF09, 22 year old woman (median f0: 223 Hz)
YAM01, 23 year old man (median f0: 105 Hz)
Older adults (65-85 yrs)
In this age range, the difference in perceived pitch between the sexes tends to be reduced, as fundamental frequency tends to decrease in women and increase in men. In fact, the two talkers here, also chosen at random, have virtually identical median fundamental frequency.
OAF35, 72 year old woman (median f0: 149 Hz)
OAM61, 69 year old man (median f0: 151 Hz)
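To put the female and male values listed above on a scale closer to perceived pitch, the gap in each age group can be expressed in semitones, using 12 times the base-2 logarithm of the frequency ratio. The sketch below is illustrative only and is not part of the original analysis: the function name `semitone_gap` is hypothetical, the input values are the median f0 figures quoted in the examples, and for the 13-14 year olds the pairing of F06 with M07 is an arbitrary choice.

```python
import math


def semitone_gap(f0_a_hz, f0_b_hz):
    """Signed interval from b up to a, in semitones (12 per octave)."""
    if f0_a_hz <= 0 or f0_b_hz <= 0:
        raise ValueError("f0 values must be positive")
    return 12.0 * math.log2(f0_a_hz / f0_b_hz)


# Median f0 values (Hz) quoted in the examples: (female, male).
pairs = {
    "10 year olds (F39, M30)": (258.0, 271.0),
    "11 year olds (F46, M17)": (251.0, 261.0),
    "13-14 year olds (F06, M07)": (214.0, 85.0),
    "young adults (YAF09, YAM01)": (223.0, 105.0),
    "older adults (OAF35, OAM61)": (149.0, 151.0),
}

for label, (female_hz, male_hz) in pairs.items():
    gap = semitone_gap(female_hz, male_hz)
    print(f"{label}: {gap:+.1f} semitones")
```

A positive value means the female talker's median f0 is the higher of the two. On this scale the young adult pair is more than an octave (12 semitones) apart, while the older adult pair differs by less than half a semitone.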
The research reported here was carried out by Valerie Hazan and Outi Tuomainen (Speech, Hearing and Phonetic Sciences, UCL), and was funded by two grants from the UK Economic and Social Research Council (RES-062-23-3106 and ES/L007002/1). We acknowledge the contribution of: Michèle Pettinato as postdoctoral researcher on the project involving child speakers, Jeesun Kim and Chris Davis (MARCS Institute, Australia) as co-investigators and Doug Brungart and Ben Sheffield (Walter Reed, USA) as collaborators on the project involving older adults. All individuals included in this demonstration have given consent for their recordings to be used without restriction.