Personal tools

Mechanisms and representations in comprehending speech -

MaRCS Subprojects

Research in the MaRCS project

is divided in two complementary subprojects,called "Speech" and "Speech+". In the Speech subproject, we focus on every detail of the speech-comprehension process to elucidate the inner workings of speech comprehension. This subproject leads to small-scale models of how speech comprehension might work.

In the other subproject, Speech+, we put these models to the test in various large-scale domains, such as spontaneous conversation or speech comprehension in the elderly.

 

Speech

This subproject projects focuses on the inner workings of speech comprehension. In very careful experimental approaches, each detail of the speech comprehension process is examined.

•    The speech signal carries information on different time scales. In time steps of 10-2s, information comes in on which segments are produced (/p/, /b/, /t/ ,... ?); in time steps of 10-1s, information comes in on speech prosody (intonation, rhythm, loudness). How are these segmental and suprasegmental properties used and integrated?

•    Comprehending speech is more than audition. When we perceive speech, we draw on prior knowledge about the words and the most likely combination of words and phonemes in our native language. Additionally, a speaker produces not only an acoustic signal but also a visual signal. How are these sources of information combined? Do lexical and visual information actually change how we perceive the acoustic signal?

•    Different speakers may pronounce the same word rather differently. How do we deal with this variability? Previous work suggests that we store instances of speech sounds and words from individual speakers in memory. How exactly is this episodic knowledge stored and how does it influence online processing?

•    Speech comprehension is not an end in itself. We recognize words to get at the intended meaning. How does this interface work? Does the recognition of a word form automatically activate all conceptual information? Conceptual information is multi-faceted and contains information about shapes, colours, smells, categories, etc. Do all these aspects become available automatically, or does access depend on the situational demands and the intentions of the listener?

Speech+

The detailed approach of the first subproject is supplemented with a second subproject in which speech comprehension as a whole is considered in a variety of domains. This also evaluates the ecological validity of our models.

•    Speech is unlike print in that it contains reduced variants of the same word (just yesterday becomes "jus yesday"). Such variations are considered as erroneous in print but occur regularly in speech. How do the different aspects of speech comprehension contribute to the recognition of such forms in natural conversation?

•    The apparent effortlessness of speech comprehension easily breaks down. How can we explain the problems in speech comprehension in a second language? What is the contribution of hearing loss and cognitive decline to speech perception problems in elderly listeners?

•    Is speech perception special? Are there commonalities with other perceptual processes in vision and audition? Is speech a cultural artefact that has adapted in a memetic evolution to our perceptual system? Or have in fact our perceptual systems evolved genetically to support the perception of speech?

•    How does speech perception relate to speech production? How are they integrated to support efficient communication in dialogue settings?

 

 

Last checked 2012-05-16 by Holger Mitterer
Max Planck Institute for Psycholinguistics


Street address

Wundtlaan 1
6525 XD Nijmegen
The Netherlands

Mailing address
P.O. Box 310
6500 AH Nijmegen
The Netherlands

Phone:  +31-24-3521911
Fax:      +31-24-3521213

Image right

Scrabble stones 081202