Inferring sites with recent or ongoing selection for NGS - - PowerPoint PPT Presentation

inferring sites with recent or ongoing selection for ngs
SMART_READER_LITE
LIVE PREVIEW

Inferring sites with recent or ongoing selection for NGS - - PowerPoint PPT Presentation

Inferring sites with recent or ongoing selection for NGS data(+admixture/population structure) http://popgen.dk/albrecht/BAG2018/web/ Anders Albrechtsen Introduction Signatures of recent/ongoing selection Variability and SFS Sequencing types


slide-1
SLIDE 1

Inferring sites with recent or ongoing selection for NGS data(+admixture/population structure) http://popgen.dk/albrecht/BAG2018/web/

Anders Albrechtsen

slide-2
SLIDE 2

Introduction Signatures of recent/ongoing selection Variability and SFS

Sequencing types

slide-3
SLIDE 3

Introduction Signatures of recent/ongoing selection Variability and SFS

What is low depth sequencing - my take on it

medium/high depth vs. ultra low depth medium/low

  • Depth lower than 10X
  • Often a financial choice
  • Ancient DNA

Ultra low sequencing

  • Depth lower than 1X
  • by product of capture data
slide-4
SLIDE 4

Introduction Signatures of recent/ongoing selection Variability and SFS

This morning

Short intro to recent selection SFS for NGS EHH 2D SFS, Fst and PBS for NGS

slide-5
SLIDE 5

Introduction Signatures of recent/ongoing selection Variability and SFS

Afternoon - locus on low depth sequencing

Admixture proportions Individual allele frequencies (PCA)

thHan(40,919) alHan(80,714) SouthHan(20,969)

PCA

slide-6
SLIDE 6

Introduction Signatures of recent/ongoing selection Variability and SFS

Recent selection

within species / using shared variation

slide-7
SLIDE 7

Introduction Signatures of recent/ongoing selection Variability and SFS

Sorry about the Human-centric talk

Good candidates for genes under recent selection

slide-8
SLIDE 8

Introduction Signatures of recent/ongoing selection Variability and SFS

Methods is applicable for most organisms

Examples of organisms with DNA

slide-9
SLIDE 9

Introduction Signatures of recent/ongoing selection Variability and SFS

Neutral selection

Alleles can be removed,polymorphic or fixed figure from Matteo Fumagalli

slide-10
SLIDE 10

Introduction Signatures of recent/ongoing selection Variability and SFS

strong negative selection

alleles can be removed or be polymorphic

slide-11
SLIDE 11

Introduction Signatures of recent/ongoing selection Variability and SFS

Strong positive selection

Alleles can be removed, polymorphic or fixed

slide-12
SLIDE 12

Introduction Signatures of recent/ongoing selection Variability and SFS

Balancing selection

Alleles can be removed, polymorphic or fixed

slide-13
SLIDE 13

Introduction Signatures of recent/ongoing selection Variability and SFS

Summary of allele frequency changes

selections effect on alleles Neutral/weak removed, polymorphic or fixed Strong negative removed or polymorphic Strong positive removed, polymorphic or fixed Balacing removed, polymorphic or fixed Strong selection Depends on the population size Conclusion Allele frequency is (almost always) not enough to determine selection

slide-14
SLIDE 14

Introduction Signatures of recent/ongoing selection Variability and SFS

Need for additional information

Option 1 use information from the genomic region Option 2 Use information from mulitple species/populations Options 3 selection experiments External information

  • Candidate genes/biological knowledge
  • Functional categories
  • Association to phenotypes
slide-15
SLIDE 15

Introduction Signatures of recent/ongoing selection Variability and SFS

Common methods used to detect selection

slide-16
SLIDE 16

Introduction Signatures of recent/ongoing selection Variability and SFS

Signature of selection

  • Neutral locus
  • Lots of variability
slide-17
SLIDE 17

Introduction Signatures of recent/ongoing selection Variability and SFS

Signature of selection

  • Mutation enters the

population

slide-18
SLIDE 18

Introduction Signatures of recent/ongoing selection Variability and SFS

Signature of selection

  • Negative selection

removed the allele

slide-19
SLIDE 19

Introduction Signatures of recent/ongoing selection Variability and SFS

Signature of selection

  • Mutation enters the

population

slide-20
SLIDE 20

Introduction Signatures of recent/ongoing selection Variability and SFS

Signature of selection

  • Mutation enters the

population

  • Mutation increases in

frequency due to positive selection

slide-21
SLIDE 21

Introduction Signatures of recent/ongoing selection Variability and SFS

Signature of selection

  • Increases LD
  • Affects the variability
slide-22
SLIDE 22

Introduction Signatures of recent/ongoing selection Variability and SFS

Signature of selection

  • Increases haplotype similarity
slide-23
SLIDE 23

Introduction Signatures of recent/ongoing selection Variability and SFS

Signature of selection

  • Increases differences with
  • ther populations in the

whole region

slide-24
SLIDE 24

Introduction Signatures of recent/ongoing selection Variability and SFS

What is the site frequency spectrum

Ind 11 T C G T C T C A A T 12 T C G T C T C C A G 21 A G G T C G C C A T 22 A C G T G G T C A T 31 A C T A G G C C T T 32 A C T A G G T C A T # Minor 2 1 2 2 3 2 2 1 1 1 Number of minor alleles (folded) η = (0.4, 0.5, 0.1)

1 2 3 Number of minor alleles Density 0.0 0.2 0.4

slide-25
SLIDE 25

Introduction Signatures of recent/ongoing selection Variability and SFS

What is the site frequency spectrum

Ind 11 T C G T C T C A A T 12 T C G T C T C C A G 21 A G G T C G C C A T 22 A C G T G G T C A T 31 A C T A G G C C T T 32 A C T A G G T C A T Outgroup A C T T C T C C A G # Derived 2 1 4 2 3 4 2 1 1 5 polarized SFS (unfolded) η = (0.3, 0.3, 0.1, 0.2, 0.1)

1 2 3 4 5 Number of minor alleles Density 0.00 0.10 0.20 0.30

slide-26
SLIDE 26

Introduction Signatures of recent/ongoing selection Variability and SFS

Frequency spectrum gives information about selection and demography

slide-27
SLIDE 27

Introduction Signatures of recent/ongoing selection Variability and SFS

Thetas are based on the frequency spectrum

Watterson θW = a−1 n−1

i=1 ηi, where a = n−1 i=1 1/i

Tajima θT = n

2

−1 n−1

i=1 i(n − i)ηi

Tajima’s D D =

θT −θW

Var(θT −θW ) under a neutral model* θT = θW

slide-28
SLIDE 28

Introduction Signatures of recent/ongoing selection Variability and SFS

Theta are based on the frequency spectrum

Watterson θW = a−1 n−1

i=1 ηi, where a = n−1 i=1 1/i

Tajima θT = n

2

−1 n−1

i=1 i(n − i)ηi

4 diploid individuals - excluding non-variable sites

0.0 0.2 0.4 0.6 η Ση i(n−i) = 0.39 0.19 0.13 0.1 0.08 0.06 0.06 0.25 0.43 0.54 0.57 0.54 0.43 0.25 1 1 1 1 1 1 1 watterson 0.39 tajimas 0.39 watterson 0.39 tajimas 0.39

slide-29
SLIDE 29

Introduction Signatures of recent/ongoing selection Variability and SFS

Theta are based on the frequency spectrum

Watterson θW = a−1 n−1

i=1 ηi, where a = n−1 i=1 1/i

Tajima π = θT = n

2

−1 n−1

i=1 i(n − i)ηi

4 diploid individuals

0.0 0.2 0.4 0.6 η η Ση i(n−i) = 0.66 0.17 0.07 0.04 0.03 0.02 0.01 0.39 0.19 0.13 0.1 0.08 0.06 0.06 0.25 0.43 0.54 0.57 0.54 0.43 0.25 1 1 1 1 1 1 1 watterson 0.39 tajimas 0.39 watterson 0.39 tajimas 0.39 watterson 0.39 tajimas 0.32 watterson 0.39 tajimas 0.32

slide-30
SLIDE 30

Introduction Signatures of recent/ongoing selection Variability and SFS

Thetas are based on the frequency spectrum

Watterson θW = a−1 n−1

i=1 ηi, where a = n−1 i=1 1/i

Tajima π = θT = n

2

−1 n−1

i=1 i(n − i)ηi

Fu & Li θFL = η1 Fay & Wu θH = n

2

−1 n−1

i=1 i2ηi

Zeng, Fu,Shi and Wu θL =

1 n−1

n−1

i=1 iηi

general ˆ θ = n

i=0 αiηi

Test statistics D =

θ1−θ2

Var(θ1−θ2) under a neutral model* θ1 = θ2

Difference weighting schemes for the SFS

slide-31
SLIDE 31

Introduction Signatures of recent/ongoing selection Variability and SFS

Why does selection affect the SFS

slide-32
SLIDE 32

Introduction Signatures of recent/ongoing selection Variability and SFS

Frequency spectrum gives information about selection and demography

slide-33
SLIDE 33

Introduction Signatures of recent/ongoing selection Variability and SFS

How to assess significance

slides stolen from Matteo Fumagalli

slide-34
SLIDE 34

Introduction Signatures of recent/ongoing selection Variability and SFS

How to assess significance

slide-35
SLIDE 35

Introduction Signatures of recent/ongoing selection Variability and SFS

How to assess significance

slide-36
SLIDE 36

Introduction Signatures of recent/ongoing selection Variability and SFS

How to assess significance

slide-37
SLIDE 37

Introduction Signatures of recent/ongoing selection Variability and SFS

How to assess significance

slide-38
SLIDE 38

Introduction Signatures of recent/ongoing selection Variability and SFS

How to assess significance

slide-39
SLIDE 39

Introduction Signatures of recent/ongoing selection Variability and SFS

How to assess significance

slide-40
SLIDE 40

Introduction Signatures of recent/ongoing selection Variability and SFS

How to assess significance

slide-41
SLIDE 41

Introduction Signatures of recent/ongoing selection Variability and SFS

How to assess significance

slide-42
SLIDE 42

Introduction Signatures of recent/ongoing selection Variability and SFS

Exercises

Let see how variability π and Tajimas D performs on famous examples of human adaptation. go to http://popgen.dk/albrecht/BAG2018/web/ Graphics When you will run analysis on the server you will need graphic (see above link)