drawing data one genome four samples
play

drawing data one genome, four samples SESSION 2 MARTIN KRZYWINSKI - PowerPoint PPT Presentation

drawing data one genome, four samples SESSION 2 MARTIN KRZYWINSKI Genome Sciences Center BC Cancer Agency Vancouver, Canada EMBO GLOBAL EXCHANGE LECTURE COURSE: HIGH-THROUGHPUT NEXT GENERATION SEQUENCING APPLIED TO INFECTIOUS DISEASES


  1. drawing data— one genome, four samples SESSION 2 MARTIN KRZYWINSKI Genome Sciences Center BC Cancer Agency Vancouver, Canada EMBO GLOBAL EXCHANGE LECTURE COURSE: HIGH-THROUGHPUT NEXT GENERATION SEQUENCING APPLIED TO INFECTIOUS DISEASES Institut Pasteur de Tunis, Tunis, Tunesia Sep 15–25, 2014 GENOME VISUALIZATION WITH CIRCOS v20140922

  2. SESSION FINAL IMAGE drawing and spacing ideograms relative ideogram spacing changing ideogram scale ideogram selection ideogram order drawing ideogram regions chromosome breaks ordering ideogram regions cytogenetic bands drawing multiple genomes ideogram progression and orientation relative and absolute ticks This is the image you will create during this session. It contains chrs 1 & 2 from human and mouse genomes. Each chromosome occupies 1/4 of the figure. 2 GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

  3. highlights LESSON 1 GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

  4. GENE ANNOTATION INPUT FILE > cat ../../../data/lm.gene.txt LmxM.07 425833 429915 LmxM.07.0880 type=gene,strand=+,ID=LmxM. 07.0880,Name=LmxM.07.0880,description=protein+kinase%2C +putative,size=4083,web_id=LmxM.07.0880,locus_tag=LmxM. 07.0880,size=4083,Alias=322488462,401415487,LmxM07.0880,LmxM. 07.0880,LmxM07.0880.1,LmxM07.0880.1:pep LmxM.07 441328 442218 LmxM.07.0900 type=gene,strand=+,ID=LmxM. 07.0900,Name=LmxM.07.0900,description=serine%2Fthreonine+kinase%2C +putative%2Cprotein+kinase%2C+putative,size=891,web_id=LmxM. 07.0900,locus_tag=LmxM. 07.0900,size=891,Alias=322488464,401415491,LmxM07.0900,LmxM. 07.0900,LmxM07.0900.1,LmxM07.0900.1:pep GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

  5. GENES AS HIGHLIGHTS # 2/1/etc/circos.conf chromosomes_units = 1000 chromosomes = -/00/ chromosomes_color = /./=white <plots> <plot> type = highlight file = conf(datadir)/lm.gene.txt fill_color = black r1 = dims(ideogram,radius_outer) r0 = dims(ideogram,radius_inner) minsize = 5u stroke_thickness = undef </plot> </plots> . GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

  6. TRANSPARENCY # 2/1/etc/circos.conf chromosomes_units = 1000 chromosomes = -/00/ chromosomes_color = /./=white <plots> <plot> type = highlight file = conf(datadir)/lm.gene.txt fill_color = black_a4 r1 = dims(ideogram,radius_outer) r0 = dims(ideogram,radius_inner) minsize = 5u stroke_thickness = undef </plot> </plots> . GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

  7. RULES TO SHOW PUTATIVE PROTEINS # 2/1/etc/circos.conf chromosomes_units = 1000 chromosomes = -/00/ chromosomes_color = /./=white <plots> <plot> type = highlight file = conf(datadir)/lm.gene.txt fill_color = black_a4 r1 = dims(ideogram,radius_outer) r0 = dims(ideogram,radius_inner) minsize = 5u stroke_thickness = undef <rules> <rule> condition = var(description) =~ /putative/ fill_color = red z = 5 </rule> </rules> </plot> </plots> . GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

  8. heatmaps LESSON 2 GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

  9. HEATMAPS, 4 SAMPLES # 2/2/etc/circos.conf <plot> type = heatmap r1 = 0.80r r0 = 0.76r file = conf(datadir)/lm.exp.ah063.txt color = reds-8-seq scale_log_base = 0.5 minsize = 25u <<include rules.heatmap.conf>> </plot> <plot> type = heatmap r1 = 0.75r r0 = 0.71r file = conf(datadir)/lm.exp.ah064.txt color = reds-8-seq scale_log_base = 0.5 minsize = 25u <<include rules.heatmap.conf>> </plot> ... . GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

  10. HISTOGRAM # etc/rules.heatmap.conf <rules> <rule> use = yes condition = var(value) < 1000 show = no </rule> <rule> condition = 1 z = eval(var(value)) </rule> </rules> . GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

  11. IMPORTING COMMON TRACK SETTINGS # etc/heatmap.conf The heatmap blocks had many common parameters type = heatmap <plot> file = conf(datadir)/lm.exp.ah063.txt type = heatmap color = reds-8-seq r1 = 0.80r scale_log_base = 0.5 r0 = 0.76r minsize = 25u file = <<include rules.heatmap.conf>> conf(datadir)/lm.exp.ah063.txt color = reds-8-seq scale_log_base = 0.5 minsize = 25u # 2/2/etc/circos.conf <<include rules.heatmap.conf>> </plot> <plot> <<include heatmap.conf>> <plot> r1 = 0.80r type = heatmap r0 = 0.76r r1 = 0.75r </plot> r0 = 0.71r file = <plot> conf(datadir)/lm.exp.ah064.txt <<include heatmap.conf>> color = reds-8-seq r1 = 0.75r scale_log_base = 0.5 r0 = 0.71r minsize = 25u </plot> <<include rules.heatmap.conf>> </plot> ... 11 GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

  12. floating histograms LESSON 3 GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

  13. IMPORTING COMMON TRACK SETTINGS LmxM.01 14846 16843 LmxM.01.0050 exp_ah063=64,exp_ah064=195,exp_ah065=154,exp_ah066=240, exp_min=64,exp_max=240,exp_avg=163.25,exp_range=176,exp_minah=ah063,exp_maxah=ah066 LmxM.01 27727 28314 LmxM.01.0110 exp_ah063=72,exp_ah064=117,exp_ah065=127,exp_ah066=212, exp_min=72,exp_max=212,exp_avg=132,exp_range=140,exp_minah=ah063,exp_maxah=ah066 The value of each data point is the gene name, e.g. LmxM.01.0050. We can change the value by using a rule <rule> condition = 1 value = eval(var(exp_ah063)) </rule> 13 GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

  14. ABSOLUTE IDEOGRAM SCALE # 2/3/etc/circos.conf <plot> sample = ah063 r1 = 0.78r r0 = 0.74r <<include heatmap.conf>> </plot> ... # etc/heatmap.conf type = heatmap color = reds-8-seq scale_log_base = 0.5 minsize = 25u <<include rules.heatmap.conf>> # etc/rules.heatmap.conf <rules> <rule> condition = 1 value = eval(var(exp_conf(.,sample))) flow = continue </rule> <rule> condition = var(value) < 1000 show = no </rule> <rule> condition = 1 z = eval(var(value)) </rule> . </rules> GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

  15. FLOATING HISTOGRAM # 2/3/etc/circos.conf <plot> type = histogram float = yes r0 = 0.80r r1 = 0.90r fill_color = black stroke_thickness = undef min = 0 max = 300 minsize = 20u <<include etc/axes.conf>> <rules> <rule> condition = 1 value = eval(sqrt(var(exp_max))) valuebase = eval(sqrt(var(exp_min))) #flow = continue </rule> <rule> condition = var(exp_range) < 1000 fill_color = grey z = -10 </rule> <rule> condition = var(exp_range) > 10000 fill_color = red z = 10 </rule> </rules> </plot> . </plots> GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

  16. FLOATING HISTOGRAM # 2/3/etc/circos.conf <backgrounds> use = yes <background> y1 = 50 color = vvlgrey </background> <background> y0 = 150 color = vvlred </background> </backgrounds> . GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

  17. FLOATING HISTOGRAM # 2/3/etc/circos.conf <rules> <rule> condition = 1 value = eval(sqrt(var(exp_max))) valuebase = eval(sqrt(var(exp_min))) flow = continue </rule> <rule> condition = var(exp_range) < 1000 fill_color = grey z = -10 </rule> <rule> condition = var(exp_range) > 10000 fill_color = red z = 10 </rule> </rules> . GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

  18. scatter plot LESSON 4 GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

  19. IMPORTING COMMON TRACK SETTINGS LmxM.01 14846 16843 LmxM.01.0050 exp_ah063=64,exp_ah064=195,exp_ah065=154,exp_ah066=240, exp_min=64,exp_max=240,exp_avg=163.25,exp_range=176,exp_minah=ah063,exp_maxah=ah066 The value of each data point is the gene name, e.g. LmxM.01.0050. We can use the name of the sample at which expression was minimum to draw/hide a point <rule> condition = var(exp_minah) ne “ah063” show = no </rule> As before, we can change the value by using a rule <rule> condition = 1 value = eval(var(exp_ah063)) </rule> 19 GENOME VISUALIZATION WITH CIRCOS · Session 2 · Drawing data—one genome, four samples

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend