Data Integration using the Distributed Annotation System (DAS)



slide-1
SLIDE 1

Data Integration using the Distributed Annotation System (DAS)

Andreas Prlić, Ewan Birney, Tony Cox, Thomas A. Down, Rob Finn, Stefan Gräf, David Jackson, Andreas Kähäri, Eugene Kulesha, Roger Pettett, James Smith, Jim Stalker, Tim J. P. Hubbard

slide-2
SLIDE 2
  • what is DAS
  • what do we do with it
  • DAS registration server
  • latest developments
slide-3
SLIDE 3

Integration of personal data into bioinformatics resources

slide-4
SLIDE 4
  • Integration of annotations from external sources into local applications

slide-5
SLIDE 5
  • online access to most recent data versions
  • no need for local installations
slide-6
SLIDE 6

DAS, how it works

Dowell, Jokerst, Allen, Eddy, Stein, BMC Bioinformatics 2001

[Diagram: a client sends http:// requests ("get sequence", "get features") to several DAS servers; each server answers with an XML response.]
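The request/response cycle on this slide can be sketched in a few lines of Python. This is an illustration only, assuming the usual DAS/1 URL layout (`/das/<source>/features?segment=<id>:<start>,<stop>`). The server name `das.example.org`, the source name `demo`, and the sample XML are made up for the example, and no network call is made.

```python
from urllib.parse import quote
import xml.etree.ElementTree as ET


def features_url(server, source, segment, start, stop):
    """Build a DAS/1-style 'features' request URL for one segment range."""
    base = server.rstrip("/")
    return f"{base}/das/{quote(source)}/features?segment={quote(str(segment))}:{start},{stop}"


# Made-up response in the general shape of a DAS features document.
# A real server returns more elements and attributes than shown here.
SAMPLE_RESPONSE = """<DASGFF>
  <GFF version="1.0" href="http://das.example.org/das/demo/features">
    <SEGMENT id="P12345" start="1" stop="120">
      <FEATURE id="f1" label="helix 1">
        <TYPE id="helix">helix</TYPE>
        <METHOD id="demo">demo</METHOD>
        <START>10</START>
        <END>25</END>
      </FEATURE>
      <FEATURE id="f2" label="variant 1">
        <TYPE id="snp">snp</TYPE>
        <METHOD id="demo">demo</METHOD>
        <START>42</START>
        <END>42</END>
      </FEATURE>
    </SEGMENT>
  </GFF>
</DASGFF>
"""


def parse_features(xml_text):
    """Extract (segment, id, type, start, end) records from a features response."""
    root = ET.fromstring(xml_text)
    features = []
    for segment in root.iter("SEGMENT"):
        for feature in segment.iter("FEATURE"):
            features.append({
                "segment": segment.get("id"),
                "id": feature.get("id"),
                "type": feature.findtext("TYPE"),
                "start": int(feature.findtext("START")),
                "end": int(feature.findtext("END")),
            })
    return features


if __name__ == "__main__":
    print(features_url("http://das.example.org", "demo", "P12345", 1, 120))
    for record in parse_features(SAMPLE_RESPONSE):
        print(record)
```

A real client would fetch the URL over HTTP and pass the response body to `parse_features`; the parsing step stays the same.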

slide-7
SLIDE 7

a few principles...

  • Clients are “intelligent” (few)
  • Servers are simple and easy to set up (many)
  • (most of) data is precalculated
  • libraries for server and client
  • multiple programming languages
slide-8
SLIDE 8

http://www.ensembl.org

slide-9
SLIDE 9
  • > 20 vertebrates / model organisms
  • 5 million page impressions / week
  • 100 mirrors/internal installations worldwide
  • open source
  • used for other species as well
  • MySQL
  • 5-10 GB per species + 100 GB of multi-species data
slide-10
SLIDE 10

Add your own (uses the Registry)

slide-11
SLIDE 11

Linking protein structure to the e! (Ensembl) Peptide view

slide-12
SLIDE 12

SPICE browser: http://www.efamily.org.uk/software/dasclients/spice

slide-13
SLIDE 13

Click: see exon structure mapped onto 3D
slide-14
SLIDE 14

Show SNPs

slide-15
SLIDE 15

Zoom RASMOL commands

interact with Menu & RASMOL

slide-16
SLIDE 16
slide-17
SLIDE 17
slide-18
SLIDE 18

DAS commands: Structure, Sequence, Features, Alignment

slide-19
SLIDE 19

[Diagram: SPICE, the DAS registry, and Java Web Start: auto-install latest version, send arguments]

slide-20
SLIDE 20

[Diagram: SPICE and the DAS registry: meta information about DAS servers]

slide-21
SLIDE 21

The DAS registration server

http://das.sanger.ac.uk/registry/

slide-22
SLIDE 22

DAS registration server

  • allows users to “publish” DAS servers and share them with the community
  • communicates with clients
  • regularly checks servers and sends notifications
slide-23
SLIDE 23

What is the glue?

  • “Coordinate Systems”
  • Authority
  • Type of data
  • Version
  • Organism (optional)
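To make the "glue" concrete, here is a minimal sketch of how a client might model a coordinate system and use it to pick compatible DAS sources. The class names, field names, and example values are hypothetical and are not taken from the registry's actual data model; only the four components (authority, type of data, version, optional organism) come from the slide.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class CoordinateSystem:
    """The four components listed on the slide; only organism is optional."""
    authority: str
    data_type: str
    version: str
    organism: Optional[str] = None


@dataclass(frozen=True)
class DasSource:
    """Hypothetical record describing one registered DAS source."""
    name: str
    url: str
    coordinate_systems: Tuple[CoordinateSystem, ...]


def compatible_sources(sources, wanted):
    """Return the sources that serve data in exactly the wanted coordinate system."""
    return [s for s in sources if wanted in s.coordinate_systems]


# Illustrative values only.
genome = CoordinateSystem("ExampleAuthority", "Chromosome", "36", "Homo sapiens")
protein = CoordinateSystem("ExampleProteinDB", "Protein Sequence", "1")

sources = [
    DasSource("genes", "http://das.example.org/das/genes", (genome,)),
    DasSource("domains", "http://das.example.org/das/domains", (protein,)),
    DasSource("mapping", "http://das.example.org/das/mapping", (genome, protein)),
]

if __name__ == "__main__":
    for source in compatible_sources(sources, protein):
        print(source.name)
```

Because the dataclass is frozen, two coordinate systems compare equal only when every component matches, which is the property that lets annotations from different servers be overlaid safely.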
slide-24
SLIDE 24
Clients and Coordinate Systems

  • Ensembl - most of the views can display DAS sources from multiple coordinate systems (CS)
  • SPICE - PDB, UniProt, Ensp
  • Dasty - UniProt

slide-25
SLIDE 25

The DAS SOA

[Diagram: the DAS registration server, a DAS source, and DAS clients, e.g. Ensembl, SPICE]

slide-26
SLIDE 26

111 DAS sources, 26 institutions, 12 countries + others

slide-27
SLIDE 27

DAS - issues

  • inconsistent implementations
  • no consistent use of annotation types
  • error handling
  • searches not possible - in DAS/1
  • open sharing of data - low security
slide-28
SLIDE 28

http://sisyphus.mrc-cpe.cam.ac.uk

slide-29
SLIDE 29
slide-30
SLIDE 30
slide-31
SLIDE 31
  • Alignment DAS:
  • rotation matrices, shift vectors
  • range information (optional)
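As an illustration of what a client can do with a rotation matrix and shift vector, the sketch below superimposes coordinates by computing x' = R·x + t. The convention (rotate first, then shift, with column vectors) is an assumption made for this example; the slide does not state which convention the alignment service uses, and the function names are hypothetical.

```python
def apply_transform(rotation, shift, point):
    """Return rotation * point + shift for a 3x3 matrix and 3-component vectors."""
    return tuple(
        sum(rotation[i][j] * point[j] for j in range(3)) + shift[i]
        for i in range(3)
    )


def superimpose(rotation, shift, coords):
    """Apply the same rigid-body transform to every coordinate in a list."""
    return [apply_transform(rotation, shift, p) for p in coords]


if __name__ == "__main__":
    # 90-degree rotation about the z axis, followed by a shift of +1 along x.
    rotation = [
        [0, -1, 0],
        [1, 0, 0],
        [0, 0, 1],
    ]
    shift = (1, 0, 0)
    atoms = [(1, 0, 0), (0, 1, 0), (0, 0, 2)]
    print(superimpose(rotation, shift, atoms))
```

The optional range information would then restrict which residues the transform is applied to, for example by filtering `coords` before calling `superimpose`.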
slide-32
SLIDE 32

Jalview: http://www.jalview.org (A. Waterhouse, J. Procter, G. Barton)

slide-33
SLIDE 33
Acknowledgments

  • T. Down, T. Hubbard
  • Web Team, E. Kulesha, R. Pettett, T. Cox
  • eFamily Project
  • S. Gräf, A. Kähäri, BioSapiens
  • A. Murzin, A. Andreeva
  • R. Finn, H. Hotz, A. Ahmed
  • Jmol, Biojava, MSD, everybody who sets up DAS servers