ANGSD: Analysis of Next Generation Sequencing Data



slide-1
SLIDE 1

ANGSD

Analysis of Next Generation Sequencing Data

Anders Albrechtsen

slide-2
SLIDE 2

Why ANGSD?

Focus: to perform population or medical genetic analysis on NGS data while taking uncertainty into account, even for low-depth data

  • At the time, no other software existed
  • Most other NGS software focuses on genotype calling
  • Useful as a research development tool
  • Somewhat useful for others (not Anders/Thorfinn)
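
To illustrate why uncertainty matters at low depth, here is a minimal sketch of genotype likelihoods under a simple independent-error model. It is not ANGSD's implementation; the function names, the two-allele restriction, and the uniform e/3 miscall rate are assumptions made for illustration.

```python
def genotype_likelihoods(bases, quals, alleles=("A", "C")):
    """Return P(reads | genotype) for the three genotypes of a biallelic site.

    Assumed model (illustrative only): reads are independent, each read
    samples one of the two chromosomes with probability 1/2, and a base
    with Phred quality q is miscalled with probability e = 10^(-q/10),
    spread evenly over the three other bases.
    """
    a, b = alleles
    likelihoods = {}
    for genotype in ((a, a), (a, b), (b, b)):
        like = 1.0
        for base, q in zip(bases, quals):
            e = 10 ** (-q / 10)
            p = 0.0
            for allele in genotype:
                p += 0.5 * ((1 - e) if base == allele else e / 3)
            like *= p
        likelihoods["".join(genotype)] = like
    return likelihoods


def normalise(likelihoods):
    """Rescale likelihoods so they sum to one (flat prior over genotypes)."""
    total = sum(likelihoods.values())
    return {g: v / total for g, v in likelihoods.items()}


# Depth 2: both reads show A at base quality 20.
gl = normalise(genotype_likelihoods("AA", [20, 20]))
# AA is the most likely genotype, but the heterozygote AC keeps
# roughly a 20% share, so a hard genotype call would hide real uncertainty.
print(gl)
```

Carrying the three values forward, instead of calling "AA", is what lets downstream analyses account for the uncertainty.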
slide-3
SLIDE 3

Great reviews from the scientific community

Twitter: "They actually make a wrapper for ANGSD" (https://github.com/mojaveazure/angsd-wrapper)

slide-4
SLIDE 4

Input and output

slide-5
SLIDE 5

Input formats

Sequencing data

  • BAM
  • CRAM
  • mpileup

Genotype likelihoods

  • Beagle
  • glfV3
  • tglf
  • others

Genotype (posterior) probability

  • Beagle

Example: BAM -> ANGSD -> BEAGLE -> ANGSD -> Association
Example: MSMS -> mpileup -> ANGSD -> SFS -> ∂a∂i
Example: CRAM -> SNPtools -> GL -> ANGSD -> NGSadmix
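
As a rough illustration of what passes between tools in pipelines like these, the sketch below reads a Beagle-style genotype likelihood file. It assumes the common layout of a header line followed by one line per site with a marker name, two allele columns, and three likelihood columns per individual. The function name and the example values are hypothetical.

```python
def parse_beagle_gl(lines):
    """Parse Beagle-style genotype likelihood lines into a list of sites.

    Assumed layout: 'marker allele1 allele2' followed by three columns per
    individual, with each individual's name repeated three times in the header.
    """
    it = iter(lines)
    header = next(it).split()
    individuals = header[3::3]  # every third column starts a new individual
    sites = []
    for line in it:
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        marker, allele1, allele2 = fields[:3]
        values = [float(x) for x in fields[3:]]
        if len(values) != 3 * len(individuals):
            raise ValueError(f"unexpected number of columns at {marker}")
        triples = [tuple(values[i:i + 3]) for i in range(0, len(values), 3)]
        sites.append({
            "marker": marker,
            "alleles": (allele1, allele2),
            "gl": dict(zip(individuals, triples)),
        })
    return sites


# Made-up example input: one site, two individuals.
example = [
    "marker\tallele1\tallele2\tInd0\tInd0\tInd0\tInd1\tInd1\tInd1",
    "chr1_1000\t0\t2\t0.80\t0.20\t0.00\t0.33\t0.33\t0.34",
]
sites = parse_beagle_gl(example)
print(sites[0]["gl"]["Ind0"])
```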

slide-6
SLIDE 6

Analysis

slide-7
SLIDE 7

Where ANGSD does less well

  • freeBayes/GATK/Samtools are better at SNP calling and genotype calling
  • ANGSD does not include indels in ANY analysis

It's not bad; there are just better options

slide-8
SLIDE 8

Common use - NGSadmix

[1] Raghavan et al., Nature 2014

slide-9
SLIDE 9

Common use - D-stat/ABBABABA
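
For readers unfamiliar with the statistic, here is a minimal sketch of the ABBA-BABA count and the resulting D value. It assumes one sampled allele per group per site in the order (H1, H2, H3, outgroup) and uses the convention D = (ABBA - BABA) / (ABBA + BABA); sign conventions differ between tools. The function name and the toy sites are hypothetical, and no significance test (such as a block jackknife) is included.

```python
def d_statistic(sites):
    """Count ABBA and BABA patterns and return (abba, baba, D).

    Each site is a tuple (h1, h2, h3, outgroup) holding one allele per group.
    The outgroup allele is treated as ancestral ("A"), any other as derived ("B").
    """
    abba = baba = 0
    for h1, h2, h3, out in sites:
        if h1 == out and h2 == h3 and h2 != out:
            abba += 1  # pattern A B B A
        elif h2 == out and h1 == h3 and h1 != out:
            baba += 1  # pattern B A B A
    if abba + baba == 0:
        raise ValueError("no informative sites")
    return abba, baba, (abba - baba) / (abba + baba)


# Toy data: three ABBA sites, one BABA site, one uninformative site.
toy_sites = [
    ("A", "G", "G", "A"),
    ("A", "G", "G", "A"),
    ("G", "A", "G", "A"),
    ("A", "A", "A", "A"),
    ("C", "T", "T", "C"),
]
abba, baba, d = d_statistic(toy_sites)
print(abba, baba, d)  # 3 1 0.5
```

Under this convention, an excess of ABBA sites (D > 0) points to more allele sharing between H2 and H3 than between H1 and H3.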

slide-10
SLIDE 10

Common use - MDS/PCA

[Figure: PCA plot of Greenlanders + 50 Danes; axes PC1 (4.14%) and PC2 (0.78%); points labelled by sampling location (Denmark, Location12 to Location28, LocationMissing)]

slide-11
SLIDE 11

Common use - SFS

slide-12
SLIDE 12

SFS - selection scans - theta/Tajima's D/Fst/PBS
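
The statistics named here can all be derived from site frequency spectra. The sketch below uses the textbook formulas for Watterson's theta, nucleotide diversity, and Tajima's D on an unfolded SFS of plain counts, plus the population branch statistic from three pairwise Fst values. ANGSD itself works from per-site posterior probabilities rather than counts, so this is only an illustration; the function names and inputs are hypothetical.

```python
import math


def theta_and_tajima(sfs):
    """Return (theta_W, theta_pi, Tajima's D) from an unfolded SFS.

    sfs[i - 1] is the number of sites where the derived allele is seen
    i times among n chromosomes, for i = 1 .. n - 1.
    """
    n = len(sfs) + 1
    S = sum(sfs)  # number of segregating sites
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    theta_w = S / a1
    pairs = n * (n - 1) / 2
    theta_pi = sum(x * i * (n - i) for i, x in enumerate(sfs, start=1)) / pairs
    # Variance terms from Tajima (1989).
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n * n + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    var = e1 * S + e2 * S * (S - 1)
    d = (theta_pi - theta_w) / math.sqrt(var) if var > 0 else float("nan")
    return theta_w, theta_pi, d


def pbs(fst_ab, fst_ac, fst_bc):
    """Population branch statistic for population A, given pairwise Fst values."""
    t_ab, t_ac, t_bc = (-math.log(1.0 - f) for f in (fst_ab, fst_ac, fst_bc))
    return (t_ab + t_ac - t_bc) / 2.0


# A spectrum proportional to 1/i (the neutral expectation) gives D close to 0.
neutral = theta_and_tajima([12, 6, 4, 3])
# An excess of singletons gives a negative D.
singletons = theta_and_tajima([10, 0, 0, 0])
print(neutral, singletons, pbs(0.2, 0.2, 0.0))
```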

slide-13
SLIDE 13

SFS - test for continuation

Conclusion: the ancient Clovis Native American is a direct ancestor of most modern Native Americans

[2] Rasmussen et al., Nature 2014

slide-14
SLIDE 14

Common use - Error rate estimation

slide-15
SLIDE 15

Common use - contamination

slide-16
SLIDE 16

Common use - relatedness