(0) items

Note: Supplemental materials are not guaranteed with Rental or Used book purchases.
Crowdsourcing for Speech Processing Applications to Data Collection, Transcription and Assessment,9781118358696
This item qualifies for

Your order must be $59 or more, you must select US Postal Service Shipping as your shipping preference, and the "Group my items into as few shipments as possible" option when you place your order.

Bulk sales, PO's, Marketplace Items, eBooks, Apparel, and DVDs not included.

Crowdsourcing for Speech Processing Applications to Data Collection, Transcription and Assessment

by ; ; ; ;


Pub. Date:
List Price: $129.00

Buy New Textbook

Usually Ships in 3-4 Business Days

Rent Textbook

We're Sorry
Sold Out

Used Textbook

We're Sorry
Sold Out


We're Sorry
Not Available

More New and Used
from Private Sellers
Starting at $54.07

Questions About This Book?

What version or edition is this?
This is the 1st edition with a publication date of 4/29/2013.
What is included with this book?
  • The New copy of this book will include any supplemental materials advertised. Please check the title of the book to determine if it should include any CDs, lab manuals, study guides, etc.


Provides an insightful and practical introduction to crowdsourcing as a means of rapidly processing speech data

Intended for those who want to get started in the domain and  learn how to set up a task, what interfaces are available, how to assess the work, etc. as well as for those who already have used crowdsourcing and want to create better tasks and obtain better assessments of the work of the crowd. It will include screenshots to show examples of good and poor interfaces; examples of case studies in speech processing tasks, going through the task creation process, reviewing options in the interface, in the choice of medium (MTurk or other) and explaining choices, etc.

  • Provides an insightful and practical introduction to crowdsourcing as a means of rapidly processing speech data.
  • Addresses important aspects of this new technique that should be mastered before attempting a crowdsourcing application.
  • Offers speech researchers the hope that they can spend much less time dealing with the data gathering/annotation bottleneck, leaving them to focus on the scientific issues. 
  • Readers will directly benefit from the book’s successful examples of how crowd- sourcing was implemented for speech processing, discussions of interface and processing choices that worked and  choices that didn’t, and guidelines on how to play and record speech over the internet, how to design tasks, and how to assess workers.

Essential reading for researchers and practitioners in speech research groups involved in speech processing

Table of Contents

Contributors vii

Preface ix

1 An Overview

1.1 Growing Needs for Speech Data

1.1.1 Origins of Crowdsourcing

1.1.2 Operational Definition of Crowdsourcing

1.1.3 Functional Definition of Crowdsourcing

1.2 Some Issues

1.3 Some Terminology

1.4 Acknowledgements


2 The Basics

2.1 An Overview of the Literature on Crowdsourcing for Speech Processing

2.1.1 Evolution of the Use of Crowdsourcing for Speech

2.1.2 Geographic Locations of Crowdsourcing for Speech

2.1.3 Specific Areas of Research

2.2 Alternate Solutions

2.3 Some Ready-Made Platforms for Crowdsourcing

2.4 Making Task Creation Easier

2.5 Getting Down to Brass Tacks

2.5.1 Hearing and Being Heard Over the Web

2.5.2 Prequalification

2.5.3 Native Language of the Workers

2.5.4 Payment

2.5.5 Choice of Platform in the Literature

2.5.6 The Complexity of the Task

2.6 Quality Control

2.6.1 Was that Worker a Bot?

2.6.2 Quality Control in the Literature

2.7 Judging the Quality of the Literature

2.8 Some Quick Tips


13 Collecting Speech from Crowds

13.1 A Short History of Speech Collection

13.1.1 Speech Corpora

13.1.2 Spoken Language Systems

13.1.3 User-Configured Recording Environments

13.2 Technology for Web-based Audio Collection

13.2.1 Silverlight

13.2.2 Java

13.2.3 Flash

13.2.4 HTML and JavaScript

13.3 Example:WAMI Recorder

13.3.1 The JavaScript API

13.3.2 Audio Formats

13.4 Example: The WAMI Server

13.4.1 PHP Script

13.4.2 Google App Engine

13.4.3 Server Configuration Details

13.5 Example: Speech Collection on Amazon Mechanical Turk

13.5.1 Server Setup

13.5.2 Deploying to Amazon Mechanical Turk

13.5.3 The Command Line Interface

13.6 Using the Platform Purely for Payment

13.7 Advanced Methods of Crowdsourced Audio Collection

13.7.1 Collecting Dialogue Interactions

13.7.2 Human Computation

13.8 Summary

13.9 Acknowledgements



Please wait while the item is added to your cart...