TY - BOOK AU - Renals,Steve TI - Multimodal signal processing: human interactions in meetings SN - 9781107022294 (hbk.) U1 - 621.3822 22 PY - 2012/// CY - Cambridge, New York PB - Cambridge University Press KW - Signal processing KW - Digital techniques KW - Interactive multimedia KW - Computer input-output equipment KW - Computer conferencing KW - Technological innovations N1 - Includes bibliographical references and index N2 - "Bringing together experts in multimodal signal processing, this book provides a detailed introduction to the area, with a focus on the analysis, recognition and interpretation of human communication. The technology described has powerful applications. For instance, automatic analysis of the outputs of cameras and microphones in a meeting can make sense of what is happening - who spoke, what they said, whether there was an active discussion and who was dominant in it. These analyses are layered to move from basic interpretations of the signals to richer semantic information. The book covers the necessary analyses in a tutorial manner, going from basic ideas to recent research results. It includes chapters on advanced speech processing and computer vision technologies, language understanding, interaction modeling and abstraction, as well as meeting support technology. This guide connects fundamental research with a wide range of prototype applications to support and analyze group interactions in meetings"--; "This book is an introduction to multimodal signal processing. In it, we use the goal of building applications that can understand meetings as a way to focus and motivate the processing we describe. Multimodal signal processing takes the outputs of capture devices running at the same time - primarily cameras and microphones, but also electronic whiteboards and pens - and automatically analyses them to make sense of what is happening in the space being recorded. For instance, these analyses might indicate who spoke, what was said, whether there was an active discussion, and who was dominant in it. These analyses require the capture of multimodal data using a range of signals,followed by a low-level automatic annotation of them, gradually layering up annotation until information that relates to user requirements is extracted"-- UR - http://assets.cambridge.org/97811070/22294/cover/9781107022294.jpg ER -