rent-now

Rent More, Save More! Use code: ECRENTAL

5% off 1 book, 7% off 2 books, 10% off 3+ books

9781441977373

Inductive Databases and Constraint-based Data Mining

by Dzeroski, Saso; Goethals, Bart; Panov, Pance
  • ISBN13:

    9781441977373

  • ISBN10:

    1441977376

  • eBook ISBN(s):

    9781441977380

  • Format: Hardcover
  • Copyright: 2010-11-02
  • Publisher: Springer-Verlag New York Inc
  • Purchase Benefits
  • Free Shipping Icon Free Shipping On Orders Over $35!
    Your order must be $35 or more to qualify for free economy shipping. Bulk sales, PO's, Marketplace items, eBooks and apparel do not qualify for this offer.
  • eCampus.com Logo Get Rewarded for Ordering Your Textbooks! Enroll Now
List Price: $169.99 Save up to $134.35
  • Digital
    $77.22*
    Add to Cart

    DURATION
    PRICE
    *To support the delivery of the digital material to you, a digital delivery fee of $3.99 will be charged on each digital item.

Summary

This book presents inductive databases and constraint-based data mining, emerging research topics lying at the intersection of data mining and database research. The book provides an overview of the state-of-the art in this novel research area. Of special interest are the recent methods for constraint-based mining of global models for prediction and clustering, the unification of pattern mining approaches through constraint programming, the clarification of the relationship between mining local patterns and global models, and the proposed integrative frameworks and approaches for inductive databases. On the application side, applications to practically relevant problems from bioinformatics are presented to attract additional attention from a wider audience.The primary audience consists of scientists and graduate students in computer science and bio-informatics. Potential readers are likely to attend conferences on databases, data mining/ machine learning, and bio-informatics.

Table of Contents

Introduction
Inductive Databases and Constraint-based Data Mining: Introduction and Overviewp. 3
Inductive Databasesp. 3
Constraint-based Data Miningp. 7
Types of Constraintsp. 9
Functions Used in Constraintsp. 12
KDD Scenariosp. 14
A Brief Review of Literature Resourcesp. 15
The IQ (Inductive Queries for Mining Patterns and Models) Projectp. 17
What's in this Bookp. 22
Representing Entities in the OntoDM Data Mining Ontologyp. 27
Introductionp. 27
Design Principles for the OntoDM ontologyp. 29
OntoDM Structure and Implementationp. 33
Identification of Data Mining Entitiesp. 38
Representing Data Mining Enitities in OntoDM
Related Workp. 52
Conclusionp. 54
A Practical Comparative Study Of Data Mining Query Languagesp. 59
Introductionp. 60
Data Mining Tasksp. 61
Comparison of Data Mining Query Languagesp. 62
Summary of the Resultsp. 74
Conclusionsp. 76
A Theory of Inductive Query Answeringp. 79
Introductionp. 80
Boolean Inductive Queriesp. 81
Generalized Version Spacesp. 88
Query Decompositionp. 90
Normal Formsp. 98
Conclusionsp. 100
Constraint-based Mining: Selected Techniques
Generalizing Itemset Mining in a Constraint Programming Settingp. 107
Introductionp. 107
General Conceptsp. 109
Specialized Approachesp. 111
A Generalized Algorithmp. 114
A Dedicated Solverp. 116
Using Constraint Programming Systemsp. 120
Conclusionsp. 124
From Local Patterns to Classification Modelsp. 127
Introductionp. 127
Preliminariesp. 131
Correlated Patternsp. 132
Finding Pattern Setsp. 137
Direct Predictions from Patternsp. 142
Integrated Pattern Miningp. 146
Conclusionsp. 152
Constrained Predictive Clusteringp. 155
Introductionp. 155
Predictive Clustering Treesp. 156
Constrained Predictive Clustering Trees and Constraint Typesp. 161
A Search Space of (Predictive) Clustering Treesp. 165
Algorithms for Enforcing Constraintsp. 167
Conclusionp. 173
Finding Segmentations of Sequencesp. 177
Introductionp. 177
Efficient Algorithms for Segmentationp. 182
Dimensionality Reductionp. 183
Recurrent Modelsp. 185
Unimodal Segmentationp. 188
Rearranging the Input Data Pointsp. 189
Aggregate Segmentationp. 190
Evaluating the Quality of a Segmentation: Randomizationp. 191
Model Selection by BIC and Cross-validationp. 193
Bursty Sequencesp. 193
Conclusionp. 194
Mining Constrained Cross-Graph Cliques in Dynamic Networksp. 199
Introductionp. 199
Problem Settingp. 201
DATA-PEELERp. 205
Extracting ¿-Contiguous Closed 3-Setsp. 208
Constraining the Enumeration to Extract 3-Cliquesp. 212
Experimental Resultsp. 217
Related Workp. 224
Conclusionp. 226
Probabilistic Inductive Querying Using ProbLogp. 229
Introductionp. 229
ProbLog: Probabilistic Prologp. 233
Probabilistic Inferencep. 234
Implementationp. 238
Probabilistic Explanation Based Learningp. 243
Local Pattern Miningp. 245
Theory Compressionp. 249
Parameter Estimationp. 252
Applicationp. 255
Related Work in Statistical Relational Learningp. 258
Conclusionsp. 259
Inductive Databases: Integration Approaches
Inductive Querying with Virtual Mining Viewsp. 265
Introductionp. 266
The Mining Views Frameworkp. 267
An Illustrative Scenariop. 277
Conclusions and Future Workp. 285
SINDBAD and SiQL: Overview, Applications and Future Developmentsp. 289
Introductionp. 289
SiQLp. 291
Example Applicationsp. 296
A Web Service Interface for SINDBADp. 303
Future Developmentsp. 305
Conclusionp. 307
Patterns on Queriesp. 311
Introductionp. 311
Preliminariesp. 313
Frequent Item Set Miningp. 319
Transforming KRIMPp. 323
Comparing the two Approachesp. 331
Conclusions and Prospects for Further Researchp. 333
Experiment Databasesp. 335
Introductionp. 336
Motivationp. 337
Related Workp. 341
A Pilot Experiment Databasep. 343
Learning from the Pastp. 350
Conclusionsp. 358
Applications
Predicting Gene Function using Predictive Clustering Treesp. 365
Introductionp. 366
Related Workp. 367
Predictive Clustering Tree Approaches for HMCp. 369
Evaluation Measurep. 374
Datasetsp. 375
Comparison of Clus-HMC/SC/HSCp. 378
Comparison of (Ensembles of) CLUS-HMC to State-of-the-art Methodsp. 380
Conclusionsp. 384
Analyzing Gene Expression Data with Predictive Clustering Treesp. 389
Introductionp. 389
Datasetsp. 391
Predicting Multiple Clinical Parametersp. 392
Evaluating Gene Importance with Ensembles of PCTsp. 394
Constrained Clustering of Gene Expression Datap. 397
Clustering gene expression time series datap. 400
Conclusionsp. 403
Using a Solver Over the String Pattern Domain to Analyze Gene Promoter Sequencesp. 407
Introductionp. 407
A Promoter Sequence Analysis Scenariop. 409
The Marguerite Solverp. 412
Tuning the Extraction Parametersp. 413
An Objective Interestingness Measurep. 415
Execution of the Scenariop. 418
Conclusionp. 422
Inductive Queries for a Drug Designing Robot Scientistp. 425
Introductionp. 425
The Robot Scientist Evep. 427
Representations of Molecular Datap. 430
Selecting Compounds for a Drug Screening Libraryp. 444
Active learningp. 446
Conclusionsp. 448
Appendixp. 452
Author indexp. 455
Table of Contents provided by Ingram. All Rights Reserved.

Supplemental Materials

What is included with this book?

The New copy of this book will include any supplemental materials advertised. Please check the title of the book to determine if it should include any access cards, study guides, lab manuals, CDs, etc.

The Used, Rental and eBook copies of this book are not guaranteed to include any supplemental materials. Typically, only the book itself is included. This is true even if the title states it includes any access cards, study guides, lab manuals, CDs, etc.

Rewards Program