WikiPathways Tutorial Mining biological pathways and more Thomas - - PowerPoint PPT Presentation

wikipathways tutorial
SMART_READER_LITE
LIVE PREVIEW

WikiPathways Tutorial Mining biological pathways and more Thomas - - PowerPoint PPT Presentation

WikiPathways Tutorial Mining biological pathways and more Thomas Kelder www.wikipathways.org Wiki for biological pathways Free and open pathway resource Share, curate and discuss! Topics How to Find and filter pathway


slide-1
SLIDE 1

WikiPathways Tutorial

Mining biological pathways and more…

Thomas Kelder

slide-2
SLIDE 2

www.wikipathways.org

  • Wiki for biological pathways
  • Free and open pathway resource
  • Share, curate and discuss!
slide-3
SLIDE 3

Topics

  • How to…

– Find and filter pathway information – Download pathway information – Use it in your research – Integrate it with other resources

slide-4
SLIDE 4

Topics

  • Pathway information and curation
  • WikiPathways design & tools
  • Web service
  • Real life examples
slide-5
SLIDE 5

Pathway information and curation

slide-6
SLIDE 6

Biological pathways

  • Organize our knowledge about biology
  • Using graphics:

– Intuitive representation of complex systems – Facilitates communication and discussion – Visualization of experimental data

slide-7
SLIDE 7

Organize knowledge

Insulin inactivates liver phosphorylase, the principal enzyme that causes liver glycogen to split into glucose. This prevents breakdown of the glycogen that has been stored in the liver.

slide-8
SLIDE 8
slide-9
SLIDE 9

How pathway information is used

Knowledgebase Data visualization Enrichment analysis Network analysis

slide-10
SLIDE 10

Challenges of curating pathways

  • Not just an image

– Relations – Annotations – Literature references

slide-11
SLIDE 11

Challenges of wikifying pathways

  • Most wiki engines are designed for textual

information

  • Our pathways are stored as XML and rendered

using Java libraries

slide-12
SLIDE 12

WikiPathways design & pathway tools

slide-13
SLIDE 13
slide-14
SLIDE 14
slide-15
SLIDE 15
slide-16
SLIDE 16
slide-17
SLIDE 17

WikiPathways data model

  • Pathway identifiers

– Unique & stable – WP1, WP43, WP1373

  • Curation tags provide quality annotations
  • All other information is stored in GPML
slide-18
SLIDE 18

Working with the GPML format

slide-19
SLIDE 19

Working with the GPML format

  • XML format, platform independent
  • PathVisio Java library
  • rg.pathvisio.model

Pathway A GPML pathway

  • readFrom/writeToXml

Read/write GPML

  • writeToSvg

Convert to image

  • getDataObjects

Get the pathway elements

  • add/remove

Add/remove elements PathwayElement An element on the pathway

  • getObjectType

DataNode, Line, Shape

  • get/setTextLabel

Get or set GPML properties

  • rg.pathvisio.view

Render the pathway in your own application!

slide-20
SLIDE 20
  • DataNodes are annotated with an Xref
  • Which DataSource to use is up to the user

Pathway 2 Pathway 1

Working with annotations

DataSource Identifier Ensembl

ENSG00000139618

Entrez Gene 675 PubChem 8422 Ensembl

ENSG00000139618

Entrez Gene 675

?

slide-21
SLIDE 21

Working with annotations

  • WikiPathways provides functions where the

identifiers have been mapped for you

  • Use BridgeDB library for solving your own

mapping problems http://www.bridgedb.org

slide-22
SLIDE 22

WikiPathways PHP layer MediaWiki extensions

Putting it all together…

MediaWiki DB

Gene + metabolite mappings

MediaWiki Page rendering Validation Pathway rendering Metadata cache Web service API Pathway model Java components Editor applet PathVisio Lucene indexer BridgeDb

slide-23
SLIDE 23

WikiPathways Web service

Website Web service

slide-24
SLIDE 24

Web service

slide-25
SLIDE 25

SOAP

  • Simple Object Access Protocol
  • How your code ‘talks’ to WikiPathways
  • XML-based
  • Platform independent
  • Language independent
slide-26
SLIDE 26

Response Request

slide-27
SLIDE 27

WSDL

  • Web Services Description Language
  • Defines:
  • Function signatures
  • Data structures
  • Automatic client method/class generation
slide-28
SLIDE 28
slide-29
SLIDE 29

REST

  • Representational State Transfer
  • URL based, HTTP requests
  • Not bound to XML
  • Platform independent
  • Language independent
slide-30
SLIDE 30

http://www.wikipathways.org/wpi/webservice/webservice.php/listOrganisms

slide-31
SLIDE 31

Finding pathways

  • Get all pathways / organisms

listPathways listOrganisms

  • Pathways with text “apoptosis”

findPathwaysByText

  • Pathways containing Glucose (CHEBI:17634)

findPathwaysByXref

  • Pathways citing Lakin et al., Oncogene 1999

findPathwaysByLiterature

slide-32
SLIDE 32

Downloading pathways

  • Get the pathway title, species, last revision

getPathwayInfo

  • Get the GPML

getPathway

  • Other formats (SVG, PDF, PNG)

getPathwayAs

  • Mapped identifier list

getXrefList

  • Color gene boxes

getColoredPathway

slide-33
SLIDE 33

Wiki information

  • Get the revision history

getPathwayHistory

  • Recently changed pathways

getRecentChanges

  • Curation tags (e.g. quality annotations)

getCurationTags

slide-34
SLIDE 34

Alternatives to the web service

  • Download all pathways as GPML or image
  • Text file with info on all pathways
slide-35
SLIDE 35

Basic examples

slide-36
SLIDE 36
  • Use the wikipathways-client library
  • rg.pathvisio.wikipathways.WikiPathwaysClient

Java example

slide-37
SLIDE 37

Find pathways

slide-38
SLIDE 38
  • org.pathvisio.model.Pathway
  • Automatic GPML parsing

Extract pathway information

slide-39
SLIDE 39

List genes, proteins and metabolites

slide-40
SLIDE 40

Advantages of using Java

  • We provide a high-level API

– No need to deal with SOAP/WSDL

  • Compatible with PathVisio

– Easier GPML handling

  • Compatible with BridgeDb

– Faster and customizable identifier mapping

slide-41
SLIDE 41
  • SSOAP

http://www.omegahat.org/SSOAP/

R Example

slide-42
SLIDE 42

List available organisms

slide-43
SLIDE 43

Real life examples

slide-44
SLIDE 44

Demo: Cytoscape

listOrganisms findPathwaysByText getPathway

slide-45
SLIDE 45

Automate curation tasks

  • Propose to clean up test/tutorial pathways
slide-46
SLIDE 46

Automate curation tasks

  • List all tutorial pathways

getCurationTagsByName

  • Check if they have recently been edited

getPathwayHistory

  • If not, add proposed deletion tag

saveCurationTag

slide-47
SLIDE 47

Enrichment analysis in R

  • Get all human pathways as mapped gene lists

getXrefList

  • Download a GEO dataset
  • Perform Parametric Gene Set Enrichment
  • ~75 LOC (including comments)
slide-48
SLIDE 48

Step 1a: List all human pathways

slide-49
SLIDE 49

Step 1b: List all human pathways

slide-50
SLIDE 50

Step 2: Create gene sets

slide-51
SLIDE 51

Step 3: Enrichment analysis

slide-52
SLIDE 52
slide-53
SLIDE 53

SNPLogic

  • Provide biological context for SNPs
  • SNPs -> Genes -> Pathways
  • Uses functions:

– findPathwayByXref – getColoredPathway

slide-54
SLIDE 54
slide-55
SLIDE 55
slide-56
SLIDE 56
slide-57
SLIDE 57

Wikipedia integration

slide-58
SLIDE 58

Link to Wikipedia gene page No gene page exists, link to create new page Static image Interactive image!

slide-59
SLIDE 59
slide-60
SLIDE 60

<div style="display:block; width:60px; height:0px;

  • verflow:hidden; position:relative; left:502.0px; top:181.3px;

background:transparent; border-top:3px blue solid"></div>

Each blue line is a <div> display:block

slide-61
SLIDE 61

Each blue line is a <div> display:block

{{Annotation|0|0|[[<div>]]}}

which fits into an Annotation template

slide-62
SLIDE 62

{{Annotation|0|0|[[<div>]]}} {{Annotation|0|0|[[<div>]]}} {{Annotation|0|0|[[<div>]]}} … {{#switch:{{{highlight}}} |PDHA1={{Annotation|0|0|[[<div>]]}} |PDHA2={{Annotation|0|0|[[<div>]]}} |PDHB={{Annotation|0|0|[[<div>]]}} … }}

Each blue line is a <div> display:block which fits into an Annotation template There is one for each line, plus a special switch statement to highlight one gene per article.

slide-63
SLIDE 63

Each blue line is a <div> display:block which fits into an Annotation template There is one for each line, plus a special switch statement to highlight one gene per article. All of this is added to an Annotated Image template along with the image map itself.

{{Annotated image |||<image map>|{{{{[[<div>]]}}}}}}

slide-64
SLIDE 64

Each blue line is a <div> display:block which fits into an Annotation template There is one for each line, plus a special switch statement to highlight one gene per article. All of this is added to an Annotated Image template along with the image map itself.

{{Preview crop |Image={{Annotated image |<>|{{{{[[<>]]}}}}}} |cWidth={{#switch:{{{highlight}}}|…}} |cHeight={{#switch:{{{highlight}}}|…}} |oTop={{#switch:{{{highlight}}}|…}} |oLeft=={{#switch:{{{highlight}}}|…}} |Description}}

The Annotated Image is added to a Preview Crop template with its

  • wn switch cases.
slide-65
SLIDE 65

Each blue line is a <div> display:block which fits into an Annotation template There is one for each line, plus a special switch statement to highlight one gene per article. All of this is added to an Annotated Image template along with the image map itself. The Annotated Image is added to a Preview Crop template with its

  • wn switch cases.

Finally, all of this is put into a <div> tag to control full-size width and height.

<div style=“”>{{Preview crop |{{<>|{{{{[[<>]]}}}}}}}}</div>

slide-66
SLIDE 66

Template usage

slide-67
SLIDE 67

Demo: Taverna workflows

  • Include pathway information in Taverna workflows.
  • Get basic workflows you can use as building blocks at:

http://www.myexperiment.org/packs/40

slide-68
SLIDE 68

http://www.wikipathways.org/

Help -> Web service

  • Example applications and source code
  • API Documentation
  • Links to useful libraries

Also see: http://www.pathvisio.org PathVisio library for handling GPML http://www.bridgedb.org BridgeDb library for identifier mapping