Sensory Modality and Speech Perception
Future research can be designed to test additional aspects of the theory. Fortunately, the theory makes some very specific predictions. For example, if multisensory perception is actually a consequence of common supramodal information contained in both light and sound, then “integration” functionally occurs at the level of the stimulus input. If this is true, evidence of integration should be observed at the earliest stages. Potentially, early integration is already evidenced by (1) visual modulation of early auditory brain areas and (2) crossmodal influences of low-level speech features, such as the voice onset timing distinguishing “p” from “b.” However, other researchers have argued that the modalities stay separate up through the determination of words (Bernstein et al., 2004a). Future research will need to examine additional evidence for early versus later integration of the channels.
Relatedly, if, as the supramodal approach claims, integration is a function of the input itself, then integration should be “impenetrable” to other cognitive influences (e.g., higher level linguistics, attention). However, a number of studies have shown higher level lexical influences on the strength of the McGurk effect (e.g., Brancazio, 2004), contrary to the prediction of the supramodal account. As intimated above, however, the McGurk effect is not a straightforward tool for measuring integration. Very recent research suggests that lexical influences may actually bear on postintegration categorization of segments (Dorsi, 2019). Still, more research is needed to determine the degree to which multisensory integration is impenetrable to outside cognition.
Finally, although work has been conducted to discover supramodal information across audio and visual channels, similar principles may apply to the haptic channel as well. As discussed, the haptic channel seems to induce the same perceptual and neurophysiological cross-sensory modulations as audio and visual speech. It is less clear how an informational form in the haptic stream could be supramodal with the other channels (but see Turvey and Fonseca, 2014). Future research can address this question to explain the miraculous abilities of Rick Joy to provide his speech brain with articulatory information from a most surprising source.
