Information Retrieval

by ; ;
  • ISBN13: 9780262026512
  • ISBN10: 0262026511

  • Format: Hardcover
  • Copyright: 2010-07-23
  • Publisher: MIT Press
List Price: $68.00

Summary

Information retrieval is the foundation for modern search engines. This text offers an introduction to the core topics underlying modern search technologies, including algorithms, data structures, indexing, retrieval, and evaluation. The emphasis is on implementation and experimentation; each chapter includes exercises and suggestions for student projects. Wumpus, a multiuser open-source information-retrieval system developed by one of the authors and available online, provides model implementations and a basis for student work. The modular structure of the book allows instructors to use it in a variety of graduate-level courses, including courses taught from a database systems perspective, traditional information retrieval courses with a focus on IR theory, and courses covering the basics of Web retrieval. After an introduction to the basics of information retrieval, the text covers three major topic areas (indexing, retrieval, and evaluation) in self-contained parts. The final part of the book draws on and extends the general material in the earlier parts, treating such specific applications as parallel search engines, Web search, and XML retrieval. End-of-chapter references point to further reading; exercises range from pencil-and-paper problems to substantial programming projects. In addition to its classroom use, Information Retrieval will be a valuable reference for professionals in computer science, computer engineering, and software engineering.

Table of Contents

Foreword
Preface
Notation
Foundations
  Introduction
    What Is Information Retrieval?
    Information Retrieval Systems
    Working with Electronic Text
    Test Collections
    Open-Source IR Systems
    Further Reading
    Exercises
    Bibliography
  Basic Techniques
    Inverted Indices
    Retrieval and Ranking
    Evaluation
    Summary
    Further Reading
    Exercises
    Bibliography
  Tokens and Terms
    English
    Characters
    Character N-Grams
    European Languages
    CJK Languages
    Further Reading
    Exercises
    Bibliography
Indexing
  Static Inverted Indices
    Index Components and Index Life Cycle
    The Dictionary
    Postings Lists
    Interleaving Dictionary and Postings Lists
    Index Construction
    Other Types of Indices
    Summary
    Further Reading
    Exercises
    Bibliography
  Query Processing
    Query Processing for Ranked Retrieval
    Lightweight Structure
    Further Reading
    Exercises
    Bibliography
  Index Compression
    General-Purpose Data Compression
    Symbolwise Data Compression
    Compressing Postings Lists
    Compressing the Dictionary
    Summary
    Further Reading
    Exercises
    Bibliography
  Dynamic Inverted Indices
    Batch Updates
    Incremental Index Updates
    Document Deletions
    Document Modifications
    Discussion and Further Reading
    Exercises
    Bibliography
Retrieval and Ranking
  Probabilistic Retrieval
    Modeling Relevance
    The Binary Independence Model
    The Robertson/Sparck Jones Weighting Formula
    Term Frequency
    Document Length: BM25
    Relevance Feedback
    Field Weights: BM25F
    Experimental Comparison
    Further Reading
    Exercises
    Bibliography
  Language Modeling and Related Methods
    Generating Queries from Documents
    Language Models and Smoothing
    Ranking with Language Models
    Kullback-Leibler Divergence
    Divergence from Randomness
    Passage Retrieval and Ranking
    Experimental Comparison
    Further Reading
    Exercises
    Bibliography
  Categorization and Filtering
    Detailed Examples
    Classification
    Probabilistic Classifiers
    Linear Classifiers
    Similarity-Based Classifiers
    Generalized Linear Models
    Information-Theoretic Models
    Experimental Comparison
    Further Reading
    Exercises
    Bibliography
  Fusion and Metalearning
    Search-Result Fusion
    Stacking Adaptive Filters
    Stacking Batch Classifiers
    Bagging
    Boosting
    Learning to Rank
    Further Reading
    Exercises
    Bibliography
Evaluation
  Measuring Effectiveness
    Traditional Effectiveness Measures
    The Text REtrieval Conference (TREC)
    Using Statistics in Evaluation
    Minimizing Adjudication Effort
    Nontraditional Effectiveness Measures
    Further Reading
    Exercises
    Bibliography
  Measuring Efficiency
    Efficiency Criteria
    Queueing Theory
    Query Scheduling
    Caching
    Further Reading
    Exercises
    Bibliography
Applications and Extensions
  Parallel Information Retrieval
    Parallel Query Processing
    MapReduce
    Further Reading
    Exercises
    Bibliography
  Web Search
    The Structure of the Web
    Queries and Users
    Static Ranking
    Dynamic Ranking
    Evaluating Web Search
    Web Crawlers
    Summary
    Further Reading
    Exercises
    Bibliography
  XML Retrieval
    The Essence of XML
    Paths, Trees, and FLWORs
    Indexing and Query Processing
    Ranked Retrieval
    Evaluation
    Further Reading
    Exercises
    Bibliography
Appendix
  Computer Performance
    Sequential Versus Random Access on Disk
    Sequential Versus Random Access in RAM
    Pipelined Execution and Branch Prediction
Index
Table of Contents provided by Publisher. All Rights Reserved.
