Search Engines Information Retrieval in Practice

by ; ;
  • ISBN13:


  • ISBN10:


  • Edition: 1st
  • Format: Hardcover
  • Copyright: 2/6/2009
  • Publisher: Pearson

Note: Supplemental materials are not guaranteed with Rental or Used book purchases.

  • Free Shipping On Orders Over $59!
    Your order must be $59 or more to qualify for free economy shipping. Bulk sales, PO's, Marketplace items, eBooks and apparel do not qualify for this offer.
  • Get Rewarded for Ordering Your Textbooks! Enroll Now
  • We Buy This Book Back!
    In-Store Credit: $13.65
    Check/Direct Deposit: $13.00

Supplemental Materials

What is included with this book?

  • The New copy of this book will include any supplemental materials advertised. Please check the title of the book to determine if it should include any access cards, study guides, lab manuals, CDs, etc.
  • The Rental copy of this book is not guaranteed to include any supplemental materials. Typically, only the book itself is included. This is true even if the title states it includes any access cards, study guides, lab manuals, CDs, etc.


KEY BENEFIT : Written by a leader in the field of information retrieval, this text provides the background and tools needed to evaluate, compare and modify search engines. KEY TOPICS : Coverage of the underlying IR and mathematical models reinforce key concepts. Numerous programming exercises make extensive use of Galago, a Java-based open source search engine. MARKET : A valuable tool for search engine and information retrieval professionals.

Author Biography

W. Bruce Croft is a Distinguished Professor in the Department of Computer Science at the University of Massachusetts, Amherst, which he joined in 1979. In 1992, he became the Director of the Center for Intelligent Information Retrieval (CIIR), which combines basic research with technology transfer to a variety of government and industry partners. He has published more than 180 articles related to information retrieval. Dr. Croft was elected a Fellow of ACM in 1997, received the Research Award from the American Society for Information Science and Technology in 2000, and received the Gerard Salton Award from the ACM Special Interest Group in Information Retrieval (SIGIR) in 2003.

Donald Metzler is a Research Scientist in the Search and Computational Advertising group at Yahoo! Research in Santa Clara, CA. He obtained his Ph.D. from the University of Massachusetts in 2007. During his graduate studies he was awarded a Microsoft Live Labs Graduate Fellowship. His research interests include formal information retrieval models, web search, advertising, and machine learning.

Trevor Strohman is a software engineer in the Google search quality division. His Ph.D., from the University of Massachusetts Amherst, focused on high-performance text retrieval systems that are easily adaptable to fit specific retrieval applications. He has published papers and presented a tutorial at the top information retrieval conference, SIGIR. He is the creator of the Galago search engine, and the primary developer of the Indri search engine (www.lemurproject.org/indri). He has ten years of professional software development experience, including desktop, server, and web applications.

Table of Contents

Search Engines and Information Retrievalp. 1
What is Information Retrieval?p. 1
Search Enginesp. 6
Search Engineersp. 9
Book Overviewp. 10
Architecture of a Search Enginep. 15
What is an Architecture?p. 15
Basic Building Blocksp. 16
Breaking It Downp. 19
Text Acquisitionp. 19
Text Transformationp. 21
Index Creationp. 24
User Interactionp. 25
Rankingp. 27
Evaluationp. 29
How Does It Really Work?p. 30
Crawls and Feedsp. 33
Deciding what to searchp. 33
Crawling the Webp. 33
Directory Crawlingp. 34
Document Feedsp. 34
The Conversion Problemp. 34
Storing the Documentsp. 35
Detecting Duplicatesp. 36
Removing Noisep. 39
Processing Text
Table of Contents provided by Publisher. All Rights Reserved.

Rewards Program

Write a Review