Displaying 101 - 112 of 112
-
Siahaan, P., & Wijaya Rajeg, G. P. (2023). Multimodal language use in Indonesian: Recurrent gestures associated with negation. In W. Pouw, J. Trujillo, H. R. Bosker, L. Drijvers, M. Hoetjes, J. Holler, S. Kadava, L. Van Maastricht, E. Mamus, & A. Ozyurek (
Eds. ), Gesture and Speech in Interaction (GeSpIn) Conference. doi:10.17617/2.3527196.Abstract
This paper presents research findings on manual gestures
associated with negation in Indonesian, utilizing data sourced
from talk shows available on YouTube. The study reveals that
Indonesian speakers employ six recurrent negation gestures,
which have been observed in various languages worldwide.
This suggests that gestures exhibiting a stable form-meaning
relationship and recurring frequently in relation to negation are
prevalent around the globe, although their distribution may
differ across cultures and languages. Furthermore, the paper
demonstrates that negation gestures are not strictly tied to
verbal negation. Overall, the aim of this paper is to contribute
to a deeper understanding of the conventional usage and cross-
linguistic distribution of recurrent gestures. -
Sidnell, J., & Stivers, T. (
Eds. ). (2005). Multimodal Interaction [Special Issue]. Semiotica, 156. -
Sprenger, S. A., & Van Rijn, H. (2005). Clock time naming: Complexities of a simple task. In B. G. Bara, L. Barsalou, & M. Bucciarelli (
Eds. ), Proceedings of the 27th Annual Meeting of the Cognitive Science Society (pp. 2062-2067). -
Stern, G. (2023). On embodied use of recognitional demonstratives. In W. Pouw, J. Trujillo, H. R. Bosker, L. Drijvers, M. Hoetjes, J. Holler, S. Kadava, L. Van Maastricht, E. Mamus, & A. Ozyurek (
Eds. ), Gesture and Speech in Interaction (GeSpIn) Conference. doi:10.17617/2.3527204.Abstract
This study focuses on embodied uses of recognitional
demonstratives. While multimodal conversation analytic
studies have shown how gesture and speech interact in the
elaboration of exophoric references, little attention has been
given to the multimodal configuration of other types of
referential actions. Based on a video-recorded corpus of
professional meetings held in French, this qualitative study
shows that a subtype of deictic references, namely recognitional
references, are frequently associated with iconic gestures, thus
challenging the traditional distinction between exophoric and
endophoric uses of deixis. -
ten Bosch, L., & Scharenborg, O. (2005). ASR decoding in a computational model of human word recognition. In Interspeech'2005 - Eurospeech, 9th European Conference on Speech Communication and Technology (pp. 1241-1244). ISCA Archive.
Abstract
This paper investigates the interaction between acoustic scores and symbolic mismatch penalties in multi-pass speech decoding techniques that are based on the creation of a segment graph followed by a lexical search. The interaction between acoustic and symbolic mismatches determines to a large extent the structure of the search space of these multipass approaches. The background of this study is a recently developed computational model of human word recognition, called SpeM. SpeM is able to simulate human word recognition data and is built as a multi-pass speech decoder. Here, we focus on unravelling the structure of the search space that is used in SpeM and similar decoding strategies. Finally, we elaborate on the close relation between distances in this search space, and distance measures in search spaces that are based on a combination of acoustic and phonetic features. -
Uhrig, P., Payne, E., Pavlova, I., Burenko, I., Dykes, N., Baltazani, M., Burrows, E., Hale, S., Torr, P., & Wilson, A. (2023). Studying time conceptualisation via speech, prosody, and hand gesture: Interweaving manual and computational methods of analysis. In W. Pouw, J. Trujillo, H. R. Bosker, L. Drijvers, M. Hoetjes, J. Holler, S. Kadava, L. Van Maastricht, E. Mamus, & A. Ozyurek (
Eds. ), Gesture and Speech in Interaction (GeSpIn) Conference. doi:10.17617/2.3527220.Abstract
This paper presents a new interdisciplinary methodology for the
analysis of future conceptualisations in big messy media data.
More specifically, it focuses on the depictions of post-Covid
futures by RT during the pandemic, i.e. on data which are of
interest not just from the perspective of academic research but
also of policy engagement. The methodology has been
developed to support the scaling up of fine-grained data-driven
analysis of discourse utterances larger than individual lexical
units which are centred around ‘will’ + the infinitive. It relies
on the true integration of manual analytical and computational
methods and tools in researching three modalities – textual,
prosodic1, and gestural. The paper describes the process of
building a computational infrastructure for the collection and
processing of video data, which aims to empower the manual
analysis. It also shows how manual analysis can motivate the
development of computational tools. The paper presents
individual computational tools to demonstrate how the
combination of human and machine approaches to analysis can
reveal new manifestations of cohesion between gesture and
prosody. To illustrate the latter, the paper shows how the
boundaries of prosodic units can work to help determine the
boundaries of gestural units for future conceptualisations. -
Uluşahin, O., Bosker, H. R., McQueen, J. M., & Meyer, A. S. (2023). No evidence for convergence to sub-phonemic F2 shifts in shadowing. In R. Skarnitzl, & J. Volín (
Eds. ), Proceedings of the 20th International Congress of the Phonetic Sciences (ICPhS 2023) (pp. 96-100). Prague: Guarant International.Abstract
Over the course of a conversation, interlocutors sound more and more like each other in a process called convergence. However, the automaticity and grain size of convergence are not well established. This study therefore examined whether female native Dutch speakers converge to large yet sub-phonemic shifts in the F2 of the vowel /e/. Participants first performed a short reading task to establish baseline F2s for the vowel /e/, then shadowed 120 target words (alongside 360 fillers) which contained one instance of a manipulated vowel /e/ where the F2 had been shifted down to that of the vowel /ø/. Consistent exposure to large (sub-phonemic) downward shifts in F2 did not result in convergence. The results raise issues for theories which view convergence as a product of automatic integration between perception and production. -
Vogel, C., Koutsombogera, M., Murat, A. C., Khosrobeigi, Z., & Ma, X. (2023). Gestural linguistic context vectors encode gesture meaning. In W. Pouw, J. Trujillo, H. R. Bosker, L. Drijvers, M. Hoetjes, J. Holler, S. Kadava, L. Van Maastricht, E. Mamus, & A. Ozyurek (
Eds. ), Gesture and Speech in Interaction (GeSpIn) Conference. doi:10.17617/2.3527176.Abstract
Linguistic context vectors are adapted for measuring the linguistic contexts that accompany gestures and comparable co-linguistic behaviours. Focusing on gestural semiotic types, it is demonstrated that gestural linguistic context vectors carry information associated with gesture. It is suggested that these may be used to approximate gesture meaning in a similar manner to the approximation of word meaning by context vectors. -
Wagner, A., & Braun, A. (2003). Is voice quality language-dependent? Acoustic analyses based on speakers of three different languages. In Proceedings of the 15th International Congress of Phonetic Sciences (ICPhS 2003) (pp. 651-654). Adelaide: Causal Productions.
-
Weber, A., & Smits, R. (2003). Consonant and vowel confusion patterns by American English listeners. In M. J. Solé, D. Recasens, & J. Romero (
Eds. ), Proceedings of the 15th International Congress of Phonetic Sciences.Abstract
This study investigated the perception of American English phonemes by native listeners. Listeners identified either the consonant or the vowel in all possible English CV and VC syllables. The syllables were embedded in multispeaker babble at three signal-to-noise ratios (0 dB, 8 dB, and 16 dB). Effects of syllable position, signal-to-noise ratio, and articulatory features on vowel and consonant identification are discussed. The results constitute the largest source of data that is currently available on phoneme confusion patterns of American English phonemes by native listeners. -
Weber, A., & Smits, R. (2003). Consonant and vowel confusion patterns by American English listeners. In Proceedings of the 15th International Congress of Phonetic Sciences (ICPhS 2003) (pp. 1437-1440). Adelaide: Causal Productions.
Abstract
This study investigated the perception of American English phonemes by native listeners. Listeners identified either the consonant or the vowel in all possible English CV and VC syllables. The syllables were embedded in multispeaker babble at three signalto-noise ratios (0 dB, 8 dB, and 16 dB). Effects of syllable position, signal-to-noise ratio, and articulatory features on vowel and consonant identification are discussed. The results constitute the largest source of data that is currently available on phoneme confusion patterns of American English phonemes by native listeners. -
Witteman, J., Karaseva, E., Schiller, N. O., & McQueen, J. M. (2023). What does successful L2 vowel acquisition depend on? A conceptual replication. In R. Skarnitzl, & J. Volín (
Eds. ), Proceedings of the 20th International Congress of the Phonetic Sciences (ICPhS 2023) (pp. 928-931). Prague: Guarant International.Abstract
It has been suggested that individual variation in vowel compactness of the native language (L1) and the distance between L1 vowels and vowels in the second language (L2) predict successful L2 vowel acquisition. Moreover, general articulatory skills have been proposed to account for variation in vowel compactness. In the present work, we conceptually replicate a previous study to test these hypotheses with a large sample size, a new language pair and a
new vowel pair. We find evidence that individual variation in L1 vowel compactness has opposing effects for two different vowels. We do not find evidence that individual variation in L1 compactness
is explained by general articulatory skills. We conclude that the results found previously might be specific to sub-groups of L2 learners and/or specific sub-sets of vowel pairs.
Share this page