The technology to allow humans to communicate with machines by speech and the technology to enable machines to understand when humans communicate with each other is rapidly maturing. This course provides an introduction to the theoretical background as well as the experimental practice that has made the field what it is today. We will cover theoretical foundations, essential algorithms and approaches, and experimental strategies. We will treat current research, state-of-the-art research, and commercial systems and will introduce the participants to ongoing work in representation, algorithms and interface design. This course is primarily for graduate students in LTI, CS, Robotics, ECE, HCI, Psychology, or Computational Linguistics. Others by prior permission of instructor. No prior experience with speech recognition is necessary, but a solid background in mathematics, computer science, or signal processing will help. The course is suitable for graduate students with some background in computer science, electrical engineering, Human-computer interaction or natural language processing, as well as for advanced undergraduates. The course involves written and programming assignments. Some reading of papers may also be required. This course is also listed in ECE as 18-781. Instructor: Florian Metze (mailto:fmetze@cs.cmu.edu) TA: Wend-Huu (Roger) Hsiao (mailto:wrhsiao@cs.cmu.edu) Time: Monday and Wednesday, 4:30 – 6:00 Room: WEH5310 |