Publications

Displaying 1 - 6 of 6
  • Cos, F., Bujok, R., & Bosker, H. R. (in press). Test-retest reliability of audiovisual lexical stress perception after >1.5 years. In Proceedings of the 12th International Conference on Speech Prosody.

    Abstract

    In natural communication, we typically both see and hear our conversation partner. Speech comprehension thus requires the integration of auditory and visual information from the speech signal. This is for instance evidenced by the Manual McGurk effect, where the perception of lexical stress is biased towards the syllable that has a beat gesture aligned to it. However, there is considerable individual variation in how heavily gestural timing is weighed as a cue to stress. To assess within-individualconsistency, this study investigated the test-retest reliability of the Manual McGurk effect. We reran an earlier Manual McGurk experiment with the same participants, over 1.5 years later. At the group level, we successfully replicated the Manual McGurk effect with a similar effect size. However, a correlation of the by-participant effect sizes in the two identical experiments indicated that there was only a weak correlation between both tests, suggesting that the weighing of gestural information in the perception of lexical stress is stable at the group level, but less so in individuals. Findings are discussed in comparison to other measures of audiovisual integration in speech perception. Index Terms: Audiovisual integration, beat gestures, lexical stress, test-retest reliability
  • Ghaleb, E., Burenko, I., Rasenberg, M., Pouw, W., Uhrig, P., Holler, J., Toni, I., Ozyurek, A., & Fernandez, R. (in press). Cospeech gesture detection through multi-phase sequence labeling. In IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).
  • Matteo, M., & Bosker, H. R. (in press). How to test gesture-speech integration in ten minutes. In Proceedings of the 12th International Conference on Speech Prosody.
  • Rohrer, P. L., Hong, Y., & Bosker, H. R. (in press). Gestures time to vowel onset and change the acoustics of the word in Mandarin. In Proceedings of the 12th International Conference on Speech Prosody.
  • Rohrer, P. L., Bujok, R., Van Maastricht, L., & Bosker, H. R. (in press). The timing of beat gestures affects lexical stress perception in Spanish. In Proceedings of the 12th International Conference on Speech Prosody.
  • Uluşahin, O., Bosker, H. R., McQueen, J. M., & Meyer, A. S. (in press). Knowledge of a talker’s f0 affects subsequent perception of voiceless fricatives. In Proceedings of the 12th International Conference on Speech Prosody.

    Abstract

    The human brain deals with the infinite variability of speech through multiple mechanisms. Some of them rely solely on information in the speech input (i.e., signal-driven) whereas some rely on linguistic or real-world knowledge (i.e., knowledge-driven). Many signal-driven perceptual processes rely on the enhancement of acoustic differences between incoming speech sounds, producing contrastive adjustments. For instance, when an ambiguous voiceless fricative is preceded by a high fundamental frequency (f0) sentence, the fricative is perceived as having lower a spectral center of gravity (CoG). However, it is not clear whether knowledge of a talker’s typical f0 can lead to similar contrastive effects. This study investigated a possible talker f0 effect on fricative CoG perception. In the exposure phase, two groups of participants (N=16 each) heard the same talker at high or low f0 for 20 minutes. Later, in the test phase, participants rated fixed-f0 /?ɔk/ tokens as being /sɔk/ (i.e., high CoG) or /ʃɔk/ (i.e., low CoG), where /?/ represents a fricative from a 5-step /s/-/ʃ/ continuum. Surprisingly, the data revealed the opposite of our contrastive hypothesis, whereby hearing high f0 instead biased perception towards high CoG. Thus, we demonstrated that talker f0 information affects fricative CoG perception.

Share this page