Machine Learning for Multimodal... | Buy

This book constitutes the thoroughly refereed post-proceedings of the First International Workshop on Machine Learning for Multimodal Interaction, MLMI 2004, held in Martigny, Switzerland in June 2004. The 30 revised full papers presented were carefully selected during two rounds of reviewing and revision. The papers are organized in topical sections on HCI and applications, structuring and interaction, multimodal processing, speech processing, dialogue management, and vision and emotion.

I HCI and Applications

Accessing Multimodal Meeting Data: Systems, Problems and Possibilities

Simon Tucker, Steve Whittaker

1

(11)

Browsing Recorded Meetings with Ferret

Pierre Wellner, Mike Flynn, Maël Guillemot

12

(10)

Meeting Modelling in the Context of Multimodal Research

Dennis Reidsma, Rutger Rienks, Natasa Jovanovic

22

(14)

Artificial Companions

Yorick Wilks

36

(10)

Zakim – A Multimodal Software System for Large-Scale Teleconferencing

Max Froumentin

46

(10)

II Structuring and Interaction

Towards Computer Understanding of Human Interactions

Iain McCowan, Daniel Gatica-Perez, Samy Bengio, Darren Moore, Hervé Bourlard

56

(20)

Multistream Dynamic Bayesian Network for Meeting Segmentation

Alfred Dielmann, Steve Renals

76

(11)

Using Static Documents as Structured and Thematic Interfaces to Multimedia Meeting Archives

Denis Lalanne, Rolf Ingold, Didier von Rotz, Ardhendu Behera, Dalila Mekhaldi, Andrei Popescu-Belis

87

(14)

An Integrated Framework for the Management of Video Collection

Nicolas Maënne-Loccoz, Bruno Janvier, Stéphane Marchand-Maillet, Eric Bruno

101

(10)

The NITE XML Toolkit Meets the ICSI Meeting Corpus: Import, Annotation, and Browsing

Jean Carletta, Jonathan Kilgour

111

(11)

III Multimodal Processing

S-SEER: Selective Perception in a Multimodal Office Activity Recognition System

Nuria Oliver, Eric Horvitz

122

(14)

Mapping from Speech to Images Using Continuous State Space Models

Tue Lehn-Schiøler, Lars Kai Hansen, Jan Larsen

136

(10)

An Online Algorithm for Hierarchical Phoneme Classification

Ofer Dekel, Joseph Keshet, Yoram Singer

146

(13)

Towards Predicting Optimal Fusion Candidates: A Case Study on Biometric Authentication Tasks

Norman Poh, Samy Bengio

159

(14)

Mixture of SVMs for Face Class Modeling

Julien Meynet, Vlad Popovici, Jean Philippe Thiran

173

(9)

AV16.3: An Audio-Visual Corpus for Speaker Localization and Tracking

Guillaume Lathoud Jean-Marc Odobez, Daniel Gatica-Perez

182

(14)

IV Speech Processing

The 2004 ICSI-SRI-UW Meeting Recognition System

Chuck Wooters, Nikki Mirghafori, Andreas Stolcke, Tuormo Pirinen, Ivan Bulyko, Dave Gelbart, Martin Graciarena, Scott Otterson, Barbara Peskin, Mari Ostendorf

196

(13)

On the Adequacy of Baseforin Pronunciations and Pronunciation Variants

Mathew Magimai-Doss, Hervé Bourlard

209

(14)

Tandem Connectionist Feature Extraction for Conversational Speech Recognition

Qifeng Zhu, Barry Chen, Nelson Morgan, Andreas Stolcke

223

(9)

Long-Term Temporal Features for Conversational Speech Recognition

Barry Chen, Qifeng Zhu, Nelson Morgan

232

(11)

Speaker Indexing in Audio Archives Using Gaussian Mixture Scoring Simulation

Hagai Aronowitz, David Burshtein, Amihood Amir

243

(10)

Speech Transcription and Spoken Document Retrieval in Finnish

Mikko Kurimo, Ville Turunen, Inger Ekman

253

(10)

A Mixed-Lingual Phonological Component Which Drives the Statistical Prosody Control of a Polyglot TTS Synthesis System

Harald Romsdorfer, Beat Pfister, René Beutler

263

(14)

V Dialogue Management

Shallow Dialogue Processing Using Machine Learning Algorithms (or Not)

Andrei Popescu-Belis, Alexander Clark, Maria Georgescul, Denis Lalanne, Sandrine Zufferey

277

(14)

ARCHIVUS: A System for Accessing the Content of Recorded Multimodal Meetings

Agnes Lisowska, Martin Rajman, Trung H. Bui

291

(14)

VI Vision and Emotion

Piecing Together the Emotion Jigsaw

Roddy Cowie, Marc Schröder

305

(13)

Emotion Analysis in Man-Machine Interaction Systems

T. Balomenos, A. Raouzaiou, S. Ioannou, A. Drosopoulos, K. Karpouzis, S. Kollias

318

(11)

A Hierarchical System for Recognition, Tracking and Pose Estimation

Philipp Zehnder, Esther Koller-Meier, Luc Van Gool

329

(12)

Automatic Pedestrian Tracking Using Discrete Choice Models and Image Correlation Techniques

Santiago Venegas-Martinez, Gianluca Antonini, Jean Philippe Thiran, Michel Bierlaire

341

(8)

A Shape Based, Viewpoint Invariant Local Descriptor

Mihai Osian, Tinne Tuytelaars, Luc Van Gool

349

(12)

Author Index

361

What is included with this book?

The New copy of this book will include any supplemental materials advertised. Please check the title of the book to determine if it should include any access cards, study guides, lab manuals, CDs, etc.

The Used, Rental and eBook copies of this book are not guaranteed to include any supplemental materials. Typically, only the book itself is included. This is true even if the title states it includes any access cards, study guides, lab manuals, CDs, etc.