BiBiServ: Practices and Problems. WebServices and Workflows on the Bielefeld University Bioinformatics Server. Sven Hartmeier, Faculty of Technology, Bielefeld University. NETTAB 2007, 13.6.2007, Pisa, Italy.


slide-1
SLIDE 1

BiBiServ: Practices and Problems

WebServices and Workflows on the Bielefeld University Bioinformatics Server

Sven Hartmeier

Faculty of Technology - Bielefeld University

NETTAB 2007 - 13.6.2007 - Pisa, Italy

slide-2
SLIDE 2

BiBiServ-Basics...

  • Bielefeld University Bioinformatics Server
  • founded in 1996
  • hosts various BioInformatics Tools developed within several research groups at Bielefeld University

slide-3
SLIDE 3

BiBiServ-Tools

  • currently around 30 tools in the areas of:
  • RNA Structure Research
  • Alignment Computation
  • Genome Comparison
  • Primer Design
  • Evolutionary Relationships
  • Miscellaneous Other Applications
slide-4
SLIDE 4

Used Technologies

  • 1996: HTML/CGI
  • 2003: WebServices using XML-RPC
  • 2004: WebServices using SOAP / WSDL

HTML Interface transformed to „just another WebService Client“
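
To make the 2003 step more concrete, here is a minimal sketch of what an XML-RPC request body looks like, built with Python's standard `xmlrpc.client` module. The method name `tool.request` and the FASTA input are assumptions for illustration; the slides do not name the actual BiBiServ methods.

```python
import xmlrpc.client

# Hypothetical method name and input; the real BiBiServ method
# names are not given in the slides.
fasta = ">id\nagtc\nuact"
payload = xmlrpc.client.dumps((fasta,), methodname="tool.request")

# A server would decode the same XML back into parameters and a method name.
params, method = xmlrpc.client.loads(payload)

print(payload)
print(method, params)
```

With SOAP / WSDL the idea stays the same (an XML message names an operation and carries its parameters), but the WSDL file additionally describes the available operations and their types in machine-readable form.
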

slide-5
SLIDE 5

Architecture

  • WebServer „BiBiServ“
  • WebService Server „BiBiWSServ“
  • SGE Computation Grid
  • Database Server

slide-6
SLIDE 6

Usage (HTML Interface)

Request

slide-7
SLIDE 7

Usage (HTML Interface)

request

slide-8
SLIDE 8

Usage (HTML Interface)

response: Job-ID

slide-9
SLIDE 9

Usage (HTML Interface)

polling

slide-10
SLIDE 10

Usage (HTML Interface)

polling

slide-11
SLIDE 11

Usage (HTML Interface)

Result

slide-12
SLIDE 12

Usage (HTML Interface)

Result
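
The request / Job-ID / polling / result sequence on the preceding slides can be sketched as a small client loop. This is only an illustration under stated assumptions: `FakeJobService` is an in-memory stand-in, and the names `request`, `response` and `run_job` are hypothetical, not the BiBiServ interface.

```python
import itertools
import time


class FakeJobService:
    """In-memory stand-in for a job-based web service (hypothetical)."""

    def __init__(self, polls_until_done=2):
        self._ids = itertools.count(1)
        self._jobs = {}
        self._polls_until_done = polls_until_done

    def request(self, data):
        # Accept the job and answer immediately with a job ID.
        job_id = f"job-{next(self._ids)}"
        self._jobs[job_id] = {"data": data, "polls": 0}
        return job_id

    def response(self, job_id):
        # Return None while the job is still "running", then the result.
        job = self._jobs[job_id]
        job["polls"] += 1
        if job["polls"] <= self._polls_until_done:
            return None
        return job["data"].upper()


def run_job(service, data, interval=0.01, max_polls=10):
    """Submit data, then poll until a result arrives or max_polls is reached."""
    job_id = service.request(data)
    for _ in range(max_polls):
        result = service.response(job_id)
        if result is not None:
            return job_id, result
        time.sleep(interval)
    raise TimeoutError(f"{job_id} did not finish after {max_polls} polls")


job_id, result = run_job(FakeJobService(), "agtcuact")
print(job_id, result)
```

Returning a job ID right away keeps the HTTP request short even when the computation on the grid takes a long time; the client decides how often to poll and when to give up.
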

slide-13
SLIDE 13

Usage: other clients

  • Clients: Perl Client, ??? Client, C# Client, Java Client
  • Server side: WebService Server „BiBiWSServ“, SGE Computation Grid, Database Server

slide-14
SLIDE 14

HTML Interface Usage

[Chart: HTML interface usage by month, 04/2005 to 04/2007; vertical axis from 5,000 to 50,000]

slide-15
SLIDE 15

WebService Interface Usage

[Chart: WebService interface usage by month, 04/2005 to 04/2007; vertical axis from 1,125 to 4,500]

slide-16
SLIDE 16

HOBIT

  • Helmholtz Open BioInformatics Technology
  • cooperation of 11 German universities and Helmholtz Institutes as core of a network linking bioinformatics centres together
  • Goal: „To connect bioinformatics applications and resources in a uniform way to provide an efficient communication tier for resource access.“
  • http://hobit.sourceforge.net/
slide-17
SLIDE 17

Interoperation Obstacles

  • necessary prerequisite: common communication protocols & common data formats
  • most common bioinformatics data formats:
  • tool specific ASCII or binary files (but often used for other purposes)
  • no explicit structure
  • no or only implicit semantics (e.g. lower vs. upper case letters)
  • not easily extendable
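
As a small illustration of the "implicit semantics" point: if a format uses lower-case letters to mark certain regions of a sequence, a consumer has to know that convention and recover the regions itself. The sketch below makes such regions explicit; what lower case actually means is tool-specific and assumed here only for the example, and `lowercase_regions` is a hypothetical helper.

```python
import re


def lowercase_regions(sequence):
    """Return (start, end) half-open index pairs of lower-case runs."""
    return [match.span() for match in re.finditer(r"[a-z]+", sequence)]


seq = "ACGTacgtTTGAcc"
regions = lowercase_regions(seq)
print(regions)  # [(4, 8), (12, 14)]
```

In a structured format the same information would be stored as explicit annotations next to the sequence instead of being hidden in the letter case.
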
slide-18
SLIDE 18

Hobit Schemas

  • extendable XML Schema definitions for bioinformatics data
  • Basic building blocks for bioinformatics data types
  • Several higher-level types:
  • SequenceML
  • AlignmentML
  • RNAStructML
  • http://bioschemas.sourceforge.net/
slide-19
SLIDE 19

BioDOM

  • Java Library for creation of data in HOBIT XML formats + conversion of existing legacy data types
  • easy adaptation of existing tools via:
  • BioDOM Library calls
  • usage of BioDOM WebService
  • http://biodom.sourceforge.net/
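
The kind of conversion BioDOM performs (legacy format in, XML out) can be sketched in a few lines. This is not the BioDOM API, which is a Java library; the Python functions below are hypothetical, and the element names `sequences`, `sequence` and `residues` are made up for illustration and do not follow the real SequenceML schema.

```python
import xml.etree.ElementTree as ET


def parse_fasta(text):
    """Yield (identifier, sequence) pairs from FASTA-formatted text."""
    identifier, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if identifier is not None:
                yield identifier, "".join(chunks)
            identifier, chunks = line[1:].strip(), []
        elif identifier is not None:
            chunks.append(line)
    if identifier is not None:
        yield identifier, "".join(chunks)


def fasta_to_xml(text):
    # Element and attribute names are illustrative only,
    # not the real SequenceML schema.
    root = ET.Element("sequences")
    for identifier, residues in parse_fasta(text):
        entry = ET.SubElement(root, "sequence", {"id": identifier})
        ET.SubElement(entry, "residues").text = residues
    return ET.tostring(root, encoding="unicode")


xml_text = fasta_to_xml(">id\nagtc\nuact\n")
print(xml_text)
```

Once the data is in XML, identifier and residues are separate, named fields, so further elements can be added without breaking existing consumers.
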
slide-20
SLIDE 20

Workflow / Formats Example

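[Workflow diagram; the arrows are not recoverable from the transcript, the elements shown are:]

  • existing sequence formats: FASTA (>id agtc uact), EMBLxml, TinySeq
  • conversion steps: BioDOM
  • HOBIT formats: SequenceML, RNAStructML, RNAStructAlignmentML
  • tools: mfold, RNAshapes, RNAfold, RNAforester, RNAMovies, 4SALE
  • graphical output: PNG/JPG/SVG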

slide-21
SLIDE 21

BiBiServices as components

  • hand-coded workflows
  • WSDL-Description -> Taverna, Wildfire
  • jABC / Bio-jETI
  • Virtual BiBiServ
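
A hand-coded workflow in its simplest form is a chain of steps where each step's output format must match the next step's input format. The sketch below checks that while running the chain. The step names and the lambda bodies are placeholders, and the assumption that a folding step turns SequenceML into RNAStructML and an alignment step turns RNAStructML into RNAStructAlignmentML is inferred from the format names, not stated in the slides.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Step:
    name: str
    consumes: str  # name of the expected input format
    produces: str  # name of the output format
    run: Callable[[str], str]


def run_workflow(steps: List[Step], data: str, data_format: str) -> str:
    """Run the steps in order, refusing to pass data in the wrong format."""
    for step in steps:
        if step.consumes != data_format:
            raise ValueError(
                f"{step.name} expects {step.consumes}, got {data_format}"
            )
        data = step.run(data)
        data_format = step.produces
    return data


# Placeholder steps; real steps would call the corresponding web services.
steps = [
    Step("fold", "SequenceML", "RNAStructML", lambda d: f"folded({d})"),
    Step("align", "RNAStructML", "RNAStructAlignmentML", lambda d: f"aligned({d})"),
]

result = run_workflow(steps, "seq", "SequenceML")
print(result)  # aligned(folded(seq))
```

Workflow tools such as Taverna take over this wiring once each service describes its inputs and outputs, which is why the service description matters.
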
slide-22
SLIDE 22

The (near) Future

  • Better description of services (WSDL is mostly OK, but not enough)
  • Semantics Layer
  • Integration with other Frameworks (e.g. BioMoby-S)

slide-23
SLIDE 23

Why we are here...

We are currently looking for generally accepted best practices and open standards for:

  • adding a semantic layer
  • better service descriptions
  • automated tool discovery methods
  • component interface requirements for different workflow design tools

slide-24
SLIDE 24

Sven Hartmeier

Faculty of Technology - Bielefeld University

shartmei@techfak.uni-bielefeld.de

http://bibiserv.techfak.uni-bielefeld.de/

Thanks for your attention :-)!