Neural responses in human superior temporal cortex support coding of voice representations |
| |
Authors: | Kyle Rupp Jasmine L. Hect Madison Remick Avniel Ghuman Bharath Chandrasekaran Lori L. Holt Taylor J. Abel |
| |
Affiliation: | 1. Department of Neurological Surgery, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America;2. Department of Communication Science and Disorders, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America;3. Department of Psychology, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America;4. Department of Bioengineering, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America; New York University, UNITED STATES |
| |
Abstract: | The ability to recognize abstract features of voice during auditory perception is an intricate feat of human audition. For the listener, this occurs in near-automatic fashion to seamlessly extract complex cues from a highly variable auditory signal. Voice perception depends on specialized regions of auditory cortex, including superior temporal gyrus (STG) and superior temporal sulcus (STS). However, the nature of voice encoding at the cortical level remains poorly understood. We leverage intracerebral recordings across human auditory cortex during presentation of voice and nonvoice acoustic stimuli to examine voice encoding at the cortical level in 8 patient-participants undergoing epilepsy surgery evaluation. We show that voice selectivity increases along the auditory hierarchy from supratemporal plane (STP) to the STG and STS. Results show accurate decoding of vocalizations from human auditory cortical activity even in the complete absence of linguistic content. These findings show an early, less-selective temporal window of neural activity in the STG and STS followed by a sustained, strongly voice-selective window. Encoding models demonstrate divergence in the encoding of acoustic features along the auditory hierarchy, wherein STG/STS responses are best explained by voice category and acoustics, as opposed to acoustic features of voice stimuli alone. This is in contrast to neural activity recorded from STP, in which responses were accounted for by acoustic features. These findings support a model of voice perception that engages categorical encoding mechanisms within STG and STS to facilitate feature extraction.Voice perception occurs via specialized networks in higher order auditory cortex, but how voice features are encoded remains a central unanswered question. Using human intracerebral recordings of auditory cortex, this study provides evidence for categorical encoding of voice. |
| |
Keywords: | |
|
|