SHARQ Guide: Finding relevant biological data and queries in a Peer Data Management System



SLIDE 1

07/20/2006 DILS 2006 - SHARQ Guide

SHARQ Guide: Finding relevant biological data and queries in a Peer Data Management System

Sarah Cohen-Boulakia, Olivier Biton, Shirley Cohen, Zachary Ives, Val Tannen, Susan Davidson
Database Group, University of Pennsylvania

SLIDE 2

Biological peer data sharing

  • Collaboration: Peer network
      • Mappings between data sources
  • Intermittent participation is possible
  • Peers may disagree

"I would like to share part of my data."
"I want to be free to leave the network at any time."
"I wish to integrate RefSeq and UniGene data in my local database, but when these sources disagree I always trust RefSeq!"

[Figure: peer network whose peers hold sequencing data (genes, BACs, contigs), microarray data, 3D structures, human disease information, data related to malaria, and proteomic domains]

SLIDE 3

Biological queries

"Which proteins contain an erythrocyte domain? Give me the name of these proteins, any annotations, and, if available, their sequence."

  • Explorative
      • Composed of biological entities, keywords
      • Unspecified schema
  • Posed over a network of resources
      • Intricate and highly complementary

"SwissProt and PFAM are my preferred resources!"

SLIDE 4

Solutions for Peer networks

  • Querying with Piazza [Halevy et al., 04]
      • Queries are posed to a given peer and rewritten over the schemas of other peers
      • Certain answers are provided
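The rewriting idea can be illustrated with a deliberately small sketch. It assumes mappings are plain attribute renamings between pairs of peers, which is far simpler than the mappings a real peer data management system supports, and it does not reproduce Piazza's reformulation algorithm or its certain-answer semantics. The peer names, attribute names, and the `rewrite` function are all hypothetical.

```python
from collections import deque
from typing import Dict, List, Tuple

# Hypothetical mapping format (an assumption, not Piazza's):
# (source peer, target peer) -> {attribute at source: attribute at target}
Mappings = Dict[Tuple[str, str], Dict[str, str]]


def rewrite(query: List[str], start: str, mappings: Mappings) -> Dict[str, List[str]]:
    """Rewrite a list of requested attributes over every reachable peer.

    Mappings are followed transitively (breadth-first). A peer is only
    included when every requested attribute can be translated.
    """
    rewritten = {start: query}
    queue = deque([start])
    while queue:
        peer = queue.popleft()
        for (src, dst), attr_map in mappings.items():
            if src != peer or dst in rewritten:
                continue
            current = rewritten[peer]
            if all(attr in attr_map for attr in current):
                rewritten[dst] = [attr_map[attr] for attr in current]
                queue.append(dst)
    return rewritten


# Illustrative peers and schemas, invented for this sketch.
mappings = {
    ("lab", "peer_b"): {"protein_name": "name", "sequence": "seq"},
    ("peer_b", "peer_c"): {"name": "prot_label", "seq": "aa_sequence"},
}
result = rewrite(["protein_name", "sequence"], "lab", mappings)
print(result)
```

The user only writes the query against the local `lab` schema; the other peers are reached through the chain of mappings rather than through a global schema.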

  • Querying and Updating with Orchestra [Ives et al., 06]
      • Builds upon concepts from Piazza
      • Allows data exchange / update propagation among peers
      • Uses policies to quickly and automatically manage disagreement (conflicting data)
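As an illustration of policy-based conflict handling, here is a minimal sketch of the "when these sources disagree I always trust RefSeq" preference from the data sharing slide. It is not Orchestra's reconciliation mechanism: the record format, the numeric priorities, and the `reconcile` function are assumptions made for this example, and the record values are placeholders.

```python
from typing import Dict, List, Tuple

# Hypothetical record format: (source, key, value).
Record = Tuple[str, str, str]


def reconcile(records: List[Record], trust: Dict[str, int]) -> Dict[str, str]:
    """Keep, for each key, the value from the most trusted source.

    `trust` maps a source name to a priority (higher wins). Sources
    missing from `trust` get priority 0. Ties keep the first value seen.
    """
    best: Dict[str, Tuple[int, str]] = {}
    for source, key, value in records:
        priority = trust.get(source, 0)
        if key not in best or priority > best[key][0]:
            best[key] = (priority, value)
    return {key: value for key, (_, value) in best.items()}


# Placeholder data: two sources disagree on gene_1.
records = [
    ("UniGene", "gene_1", "description from UniGene"),
    ("RefSeq", "gene_1", "description from RefSeq"),
    ("UniGene", "gene_2", "only UniGene has this gene"),
]
policy = {"RefSeq": 2, "UniGene": 1}  # this peer always prefers RefSeq
resolved = reconcile(records, policy)
print(resolved)
```

Because the policy is local to each peer, another peer could apply a different priority table to the same records and keep a different value for `gene_1`.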

SLIDE 5

Remaining Problems…

"I want to join the peer network: What should I do? How do I specify links between my data and the data of other peers? How can my data be found by users?"

"What kind of information can I find in this network? How do I express my queries? Has anybody ever asked a similar query?"

Need for a Guide!

SLIDE 6

SHARQ - Overview

  • SHARQ: Sharing Heterogeneous and Autonomous Resources and Queries
  • Collaborative project
      • Database group at the University of Pennsylvania
      • Penn Center for Bioinformatics
      • Children's Hospital of Philadelphia
  • Goal
      • Develop generic tools and technologies for creating / maintaining confederations of peers
  • SHARQ is composed of two main modules
      • Orchestra: Core engine
      • SHARQ Guide: Help in querying and administering the biological peer network

SLIDE 7

More about SHARQ Guide?

Visit Poster #14!