rent-now

Rent More, Save More! Use code: ECRENTAL

5% off 1 book, 7% off 2 books, 10% off 3+ books

9783540245094

Machine Learning for Multimodal Interaction

by ;
  • ISBN13:

    9783540245094

  • ISBN10:

    354024509X

  • Format: Paperback
  • Copyright: 2005-03-24
  • Publisher: Springer-Verlag New York Inc
  • Purchase Benefits
  • Free Shipping Icon Free Shipping On Orders Over $35!
    Your order must be $35 or more to qualify for free economy shipping. Bulk sales, PO's, Marketplace items, eBooks and apparel do not qualify for this offer.
  • eCampus.com Logo Get Rewarded for Ordering Your Textbooks! Enroll Now
List Price: $99.99

Summary

This book constitutes the thoroughly refereed post-proceedings of the First International Workshop on Machine Learning for Multimodal Interaction, MLMI 2004, held in Martigny, Switzerland in June 2004. The 30 revised full papers presented were carefully selected during two rounds of reviewing and revision. The papers are organized in topical sections on HCI and applications, structuring and interaction, multimodal processing, speech processing, dialogue management, and vision and emotion.

Table of Contents

I HCI and Applications
Accessing Multimodal Meeting Data: Systems, Problems and Possibilities
Simon Tucker, Steve Whittaker
1(11)
Browsing Recorded Meetings with Ferret
Pierre Wellner, Mike Flynn, Maël Guillemot
12(10)
Meeting Modelling in the Context of Multimodal Research
Dennis Reidsma, Rutger Rienks, Natasa Jovanovic
22(14)
Artificial Companions
Yorick Wilks
36(10)
Zakim – A Multimodal Software System for Large-Scale Teleconferencing
Max Froumentin
46(10)
II Structuring and Interaction
Towards Computer Understanding of Human Interactions
Iain McCowan, Daniel Gatica-Perez, Samy Bengio, Darren Moore, Hervé Bourlard
56(20)
Multistream Dynamic Bayesian Network for Meeting Segmentation
Alfred Dielmann, Steve Renals
76(11)
Using Static Documents as Structured and Thematic Interfaces to Multimedia Meeting Archives
Denis Lalanne, Rolf Ingold, Didier von Rotz, Ardhendu Behera, Dalila Mekhaldi, Andrei Popescu-Belis
87(14)
An Integrated Framework for the Management of Video Collection
Nicolas Maënne-Loccoz, Bruno Janvier, Stéphane Marchand-Maillet, Eric Bruno
101(10)
The NITE XML Toolkit Meets the ICSI Meeting Corpus: Import, Annotation, and Browsing
Jean Carletta, Jonathan Kilgour
111(11)
III Multimodal Processing
S-SEER: Selective Perception in a Multimodal Office Activity Recognition System
Nuria Oliver, Eric Horvitz
122(14)
Mapping from Speech to Images Using Continuous State Space Models
Tue Lehn-Schiøler, Lars Kai Hansen, Jan Larsen
136(10)
An Online Algorithm for Hierarchical Phoneme Classification
Ofer Dekel, Joseph Keshet, Yoram Singer
146(13)
Towards Predicting Optimal Fusion Candidates: A Case Study on Biometric Authentication Tasks
Norman Poh, Samy Bengio
159(14)
Mixture of SVMs for Face Class Modeling
Julien Meynet, Vlad Popovici, Jean Philippe Thiran
173(9)
AV16.3: An Audio-Visual Corpus for Speaker Localization and Tracking
Guillaume Lathoud Jean-Marc Odobez, Daniel Gatica-Perez
182(14)
IV Speech Processing
The 2004 ICSI-SRI-UW Meeting Recognition System
Chuck Wooters, Nikki Mirghafori, Andreas Stolcke, Tuormo Pirinen, Ivan Bulyko, Dave Gelbart, Martin Graciarena, Scott Otterson, Barbara Peskin, Mari Ostendorf
196(13)
On the Adequacy of Baseforin Pronunciations and Pronunciation Variants
Mathew Magimai-Doss, Hervé Bourlard
209(14)
Tandem Connectionist Feature Extraction for Conversational Speech Recognition
Qifeng Zhu, Barry Chen, Nelson Morgan, Andreas Stolcke
223(9)
Long-Term Temporal Features for Conversational Speech Recognition
Barry Chen, Qifeng Zhu, Nelson Morgan
232(11)
Speaker Indexing in Audio Archives Using Gaussian Mixture Scoring Simulation
Hagai Aronowitz, David Burshtein, Amihood Amir
243(10)
Speech Transcription and Spoken Document Retrieval in Finnish
Mikko Kurimo, Ville Turunen, Inger Ekman
253(10)
A Mixed-Lingual Phonological Component Which Drives the Statistical Prosody Control of a Polyglot TTS Synthesis System
Harald Romsdorfer, Beat Pfister, René Beutler
263(14)
V Dialogue Management
Shallow Dialogue Processing Using Machine Learning Algorithms (or Not)
Andrei Popescu-Belis, Alexander Clark, Maria Georgescul, Denis Lalanne, Sandrine Zufferey
277(14)
ARCHIVUS: A System for Accessing the Content of Recorded Multimodal Meetings
Agnes Lisowska, Martin Rajman, Trung H. Bui
291(14)
VI Vision and Emotion
Piecing Together the Emotion Jigsaw
Roddy Cowie, Marc Schröder
305(13)
Emotion Analysis in Man-Machine Interaction Systems
T. Balomenos, A. Raouzaiou, S. Ioannou, A. Drosopoulos, K. Karpouzis, S. Kollias
318(11)
A Hierarchical System for Recognition, Tracking and Pose Estimation
Philipp Zehnder, Esther Koller-Meier, Luc Van Gool
329(12)
Automatic Pedestrian Tracking Using Discrete Choice Models and Image Correlation Techniques
Santiago Venegas-Martinez, Gianluca Antonini, Jean Philippe Thiran, Michel Bierlaire
341(8)
A Shape Based, Viewpoint Invariant Local Descriptor
Mihai Osian, Tinne Tuytelaars, Luc Van Gool
349(12)
Author Index 361

Supplemental Materials

What is included with this book?

The New copy of this book will include any supplemental materials advertised. Please check the title of the book to determine if it should include any access cards, study guides, lab manuals, CDs, etc.

The Used, Rental and eBook copies of this book are not guaranteed to include any supplemental materials. Typically, only the book itself is included. This is true even if the title states it includes any access cards, study guides, lab manuals, CDs, etc.

Rewards Program