keyword based search over environmental datasets
play

Keyword-Based Search over Environmental Datasets Jos R.R. Viqueira - PowerPoint PPT Presentation

1st International KEYSTONE Conference IKC 2015 Coimbra Portugal, 8-9 September, 2015 Keyword-Based Search over Environmental Datasets Jos R.R. Viqueira Alberto Bugarn Joaqun Trianes Jaime Martnez-Urtaza Outline MOTIVATIONAL


  1. 1st International KEYSTONE Conference IKC 2015 Coimbra Portugal, 8-9 September, 2015 Keyword-Based Search over Environmental Datasets José R.R. Viqueira Alberto Bugarín Joaquín Triñanes Jaime Martínez-Urtaza

  2. Outline  MOTIVATIONAL EXAMPLES  ENVIRONMENTAL DATASETS  KEYWORD-BASED SEARCH  PROPOSED METHODOLOGY Keyword-Based Search over Environmental Datasets

  3. MOTIVATIONAL EXAMPLES  Public health and water microbiology (Cholera risk) Motivational Examples  High water temperatures and rainfall close to sea level during monsoon season Environmental Datasets KeyWord- Based Search Proposed Methodology Keyword-Based Search over Environmental Datasets

  4. MOTIVATIONAL EXAMPLES  Public health and water microbiology (Cholera risk) Motivational Examples  High water temperatures and rainfall close to sea level during monsoon season  Environmental properties Many datasets Environmental  Sea Surface Temperature (SST) [NOAA] Datasets even for the  Rainfall (TRMM) [NASA]  Geographic Coverages (fields, grids, etc.) same property  Very large scientific data multidimensional arrays KeyWord-  Dims: Lat., Lon., Time, Depth, etc. Based Search Proposed Methodology Keyword-Based Search over Environmental Datasets

  5. MOTIVATIONAL EXAMPLES  Public health and water microbiology (Cholera risk) Motivational Examples  High water temperatures and rainfall close to sea level during monsoon season Meaning depends  Fuzzy linguistic value Environmental on the context of  Relative to the mean of the same month during Datasets application, geo the last 7 years for the specific location  location, time World Ocean Atlas (WOA). Statistical mean of temperature period, … KeyWord- Based Search Proposed Methodology NOAA. cwblendednightSST. 22/07/2015 WOA. Stat. mean temperature. July Keyword-Based Search over Environmental Datasets

  6. MOTIVATIONAL EXAMPLES  Public health and water microbiology (Cholera risk) Motivational Examples  High water temperatures and rainfall close to sea level during monsoon season  Fuzzy geographic restriction Environmental  Close to the coast line Datasets  Geographic Entity (E/R data)  Elevation of about 0 meters above sea level  Geographic Coverage KeyWord- Based Search Coast line. NOAA GSSHG Proposed Methodology Elevation. NOAA ETOPO1 Keyword-Based Search over Environmental Datasets

  7. MOTIVATIONAL EXAMPLES  Public health and water microbiology (Cholera risk) Motivational Examples  High water temperatures and rainfall close to sea level during monsoon season  Fuzzy temporal restriction Environmental  Monsoon season (Malaysia) Datasets  Northeast Monsoon from November to March (more rainfall)  Southwest Monsoon from May to September (less rainfall) KeyWord- Based Search Proposed Methodology NOAA Blended Sea Winds. January NOAA Blended Sea Winds. July Keyword-Based Search over Environmental Datasets

  8. MOTIVATIONAL EXAMPLES  Severe weather reporting Motivational Examples  Hurricane of high category close to highly populated areas  Strong winds and heavy swell on marine traffic routes  Hailstorm on the highway Environmental Datasets  Tourism  Beach protected from wind, with warm water and air temperature, KeyWord- and few waves Based Search Proposed Methodology Keyword-Based Search over Environmental Datasets

  9. ENVIRONMENTAL DATASETS  Do no always fit E/R model Motivational Examples  Geographic Entities  Entities with geometric properties Environmental  Relational data Datasets  Rivers, Meteostations, Municipalities, etc. KeyWord- Based Search  Geographic Coverages  Mappings with geographic Proposed domain Methodology  Multidimensional array data  Temperature, rainfall, elevation, salinity, etc. Keyword-Based Search over Environmental Datasets

  10. ENVIRONMENTAL DATASETS  Many large or very large datasets Motivational Examples  Stations, buoys, satellites, radars, etc.  Blended Sea Winds  Over 300 gigabytes Environmental  Prediction, reanalysis, etc. Datasets  Climate Forecast System Version 2 (CFSv2)  Over 500 terabytes KeyWord- Based Search Proposed Methodology Keyword-Based Search over Environmental Datasets

  11. ENVIRONMENTAL DATASETS  Many large or very large datasets Motivational Examples  Stations, buoys, satellites, radars, etc.  Blended Sea Winds  Over 300 gigabytes Environmental  Prediction, reanalysis, etc. Datasets  Climate Forecast System Version 2 (CFSv2)  Over 500 terabytes KeyWord- Based Search  Highly heterogeneous  Formats and encodings Proposed Methodology  Shape Files, GML, GeoJSON, etc.  GeoTiff, ASC, NetCDF, HDF, GRIB, etc.  Semantics  Temperature, Sea Temperature, Sea Surface Temperature, air temperature, etc. Keyword-Based Search over Environmental Datasets

  12. ENVIRONMENTAL DATASETS  Open Data Motivational Examples  Spatial Data Infrastructures (SDIs)  Global Spatial Data Infrastructure (GSDI)  INSPIRE Environmental  NSDI Datasets  Unidata  OpenDAP KeyWord- Based Search Proposed Methodology Keyword-Based Search over Environmental Datasets

  13. ENVIRONMENTAL DATASETS  Open Data Motivational Examples  Spatial Data Infrastructures (SDIs)  Global Spatial Data Infrastructure (GSDI)  INSPIRE Environmental  NSDI Datasets  Unidata  OpenDAP KeyWord- Based Search  Data models, Vocabularies, Ontologies  Observations and Measurements (O&M) Proposed  Sensor Model Language (SensorML) Methodology  Semantic Web for Earth and Environmental Terminology (SWEET)  CF Conventions and Metadata  World Meteorological Organization Manual on Codes  SeaDataNet Sensor Vocabularies  ETC. Keyword-Based Search over Environmental Datasets

  14. KEYWORD-BASED SEARCH  Non structured data sources Motivational Examples  Information Retrieval (IR)  Text data  Multimedia Information Retrieval (MIR) Environmental  Audio, images, video. Datasets KeyWord- Based Search Proposed Methodology Keyword-Based Search over Environmental Datasets

  15. KEYWORD-BASED SEARCH  Non structured data sources Motivational Examples  Information Retrieval (IR)  Text data  Multimedia Information Retrieval (MIR) Environmental  Audio, images, video. Datasets KeyWord-  Based Search Structured data sources  Relational DBMSs  Linked Data Proposed  E/R based data! Methodology Keyword-Based Search over Environmental Datasets

  16. PROPOSED METHODOLOGY Motivational Language/grammar Examples Fuzzy Spatio- temporal Keyword-based extension in Fuzzy Spatio-temporal Search each dataset extension of each Environmental keyword Datasets Search Engi ne in each dataset Spatio-temporal KeyWord- Keyword-Based Machine Learning Based Search Index Data to Text Systems Fuzzy-based approaches Crawling, Annotation, Proposed Vocabulary Indexing, ETL? Computing with Methodology Words Metadata Entity-Based Coverage-Based Repositories Datasets Datasets Keyword-Based Search over Environmental Datasets

  17. 1st International KEYSTONE Conference IKC 2015 Coimbra Portugal, 8-9 September, 2015 Keyword-Based Search over Environmental Datasets José R.R. Viqueira (jrr.viqueira@usc.es) Alberto Bugarín (alberto.bugarin@usc.es) Joaquín Triñanes (joaquin.trinanes@usc.es) Jaime Martínez-Urtaza (j.l.martinez-urtaza@bath.ac.uk)

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend