Application of audio and video processing methods for language research
Annotations of media recordings are the grounds for linguistic research. Since creating those annotations is a very laborious task,
reaching 100 times longer than the length of the annotated media, innovative audio and video processing algorithms are needed, in
order to improve the efficiency and quality of annotation process. The AVATecH project, started by the Max-Planck Institute for
Psycholinguistics (MPI) and the Fraunhofer institutes HHI and IAIS, aims at significantly speeding up the process of creating
annotations of audio-visual data for humanities research. In order for this to be achieved a range of state-of-the-art audio and video
pattern recognition algorithms have been developed and integrated into widely used ELAN annotation tool. To address the problem of
heterogeneous annotation tasks and recordings we provide modular components extended by adaptation and feedback mechanisms to
achieve competitive annotation quality within significantly less annotation time.
Share this page