SLIDE 13
HABITAT Modular System Architecture
[Figure: architecture diagram. Labels recovered from the slide, in extraction order; repeated labels are listed once per group.]
Top-level blocks: Ingest; Core Library: Extractors, Measures, Algorithms; User-Driven Processing; Storage & Query Layer
Processing and storage: Cataloging, Extraction, Indexing (partial ETL); Data content; User profiles; Periodic Content Processing; New source discovery / upload
Interaction: Query Formulator; Interactive UI; Online learner; Query; User updates & feedback; Ranked query results; Search / task selection
Learning: Offline learning; Workload & Provenance; Training Data; Feature Weights; Task Prioritizer; Offline learner; Query Formulator; Data View; Online learner
Evaluation: Evaluation Management; Alternate Configs; User Selection; Design Analytics; Survey Feedback; Timing & Usage; Event Bus
Service groups: Support Services; Task Services; Evaluation Services
Services: Sampling / Profiling; Entity resolution; Feature extraction; Schema alignment; Indexing; Clustering; Info extraction; Cleaning
Extension points: Alternative query interfaces; Alternative data presentation & feedback UIs