Quality and Usability

Speech communication

Integrated Event
(Möller, 4 SWS/6 LP, each WiSe)
LV-Number: 0434 L 900
Language: English

Topics

Speech signals and speech sounds; human speech production; auditory perception; speech signal transmission and coding; speech recognition and speaker recognition; speech synthesis; spoken dialogue systems; multimodal dialogue systems.

Overview

Speech is the most important means of human communication, and more and more it develops into an important modality for human-computer interaction. Already systems work by speech recognition, interpretation of linguistic content, control of dialoge flow, generation  of responses or production of speech signals. Beyond that, the efficient transmission of speech is of utmost importance, both in conventional transmission networks as well as in networks with paket switching (eg. Voice over IP).
In the course of this lecture the basis for unterstanding and designing  communication technology systems based on speech will be provided. Starting with the production and perception of natural human speech will shed light on many important characteristics of speech signals and requirements for their processing. Essential means for representing speech signals in the time and frequency domains will be laid out. On this basis, the functioning of important components of systems of speech technology will be explicated. Apart from efficient coding of speech, speech recognition, speech synthesis, as well as interaction withspeech processing systems (spoken dialogue systems, alternative term: voice user interfaces) will be central. Finally, improvement strategies for the smoother adaption of such systems to human communicative needs via multimodal means of input and output will be presented (multimodal dialogue systems).

Target Group

The lecture has been developed with a focus on students of electrical engineering, computer engineering, as well as computer science. Above these, students from linguistics, communication sciences, engineering acoustics, sociology, human factors, as well as other departments are very welcome. Previous knowledge in speech signal processing or linguistics is not required.

 

current semester

Timing:

Lecture: Mondays 10–12h, H0106, from 23.10.2023
Exercise: Mondays 14-16h, Zoom, from 23.10.2023

Moses registration deadline: November 27th, 2023
 

Please visit the ISIS-Course* to participate.

*Please note that the ISIS-Course might be not available yet
 

Portfolio exam

  • 70 points:   Two written tests with each 35 points
  • 20 points:   Two Matlab programming assignments with each 10 points
  • 10 points:   Project with a spoken dialogue system