SLIDE 1
1
STRING/Cytoscape Exercises
We are going to use the stringApp for Cytoscape (http://apps.cytoscape.org/apps/stringapp) to query the DISEASES database (http://diseases.jensenlab.org) to retrieve a network of proteins that are associated with diabetes from the STRING database (http://string-db.org). We will investigate this network in Cytoscape and make some observations about the proteins and processes involved in the disease as well as compare it to another related disease (cardiovascular system disease).
- 1. Retrieve network for diabetes mellitus
Open a new session in Cytoscape from the menu File -> New -> Session and go to the menu File -> Import -> Network -> Public Databases. In the import dialog, choose ‘STRING: disease query’ as the ‘Data Source’ and insert ‘diabetes mellitus’ into the ‘Enter disease term’ field. The next dialog shows all the matches that the stringApp finds for your query and selects the first one. Make sure to select the right one and continue with the import by pressing the Import button.
- 2. Browse the node attributes table
Note that the retrieved network contains a lot of additional information associated with the nodes and edges, such as the protein sequence, tissue expression data, disease score as well as the confidence scores for the different interaction evidences. In the following, we will explore these data using Cytoscape. Browse through the node attributes table and find the disease score column. Sort it by descending values to see the highest disease scores. You can highlight the corresponding nodes by selecting the rows in the table, bringing up the context menu (right-click the selected rows) and choosing the ‘Select nodes from selected rows’ option. Give an example for a node with a disease score of 5 and one with a disease score below 4. Rename the ‘disease score’ column to ‘diabetes score’ by right-clicking the name and choosing the ‘Rename column’ option. We will need this column later in the exercise.
- 3. Inspect tissue expression data
The stringApp automatically retrieves Information about which tissues the proteins are expressed from the TISSUES database, which we will take a look at first to better understand the data. Go to http://tissues.jensenlab.org/ and enter insulin (gene name: INS) into the search box. The resulting page will show you an overall representation of where in the body insulin is located, and below you can see tables containing the specific lines of evidence that contribute to the
- verall score.