SLIDE 1

Identification of associated transcription factors in promoters and their related enhancer regions

Cornelia Meckbach

Institute of Bioinformatics

March 8, 2018

Cornelia Meckbach (Inst. of Bioinf.) Associated TFs in enhancer and promoters March 8, 2018 1 / 20

SLIDE 2

Motivation

Transcription factors (TFs) on paired enhancer and promoter regions are associated if they are involved in the pairing process.
⇒ Identify associated TFs on enhancers and their related promoters based on their transcription factor binding sites (TFBSs).

Inspired by: Wong, KC (2017). MotifHyades: expectation maximization for de novo DNA motif pair discovery on paired sequences. Bioinformatics.

SLIDE 3

Mutual information

Identification of associated TFs of promoter-enhancer pairings (PEPs) using mutual information (MI)
→ Two TFs are associated with each other if their binding behaviors depend on each other.
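
For reference, the standard definition of mutual information between two discrete variables is given below, written for the binding of an enhancer TFBS T_E and a promoter TFBS T_P. The probability notation p(...) and the choice of base 2 (bits) are assumptions, not taken from the slides.

```latex
I(T_E; T_P) \;=\; \sum_{t_E} \sum_{t_P} p(t_E, t_P)\,
  \log_2 \frac{p(t_E, t_P)}{p(t_E)\, p(t_P)}
```

The value is zero exactly when the two variables are independent and grows as knowing one of them tells more about the other.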

SLIDE 4

Mutual information

Consider a set of experimentally validated PEPs for a cell line (e.g. from ChIA-PET)
Predict all TFBSs in the underlying sequences
Calculate the MI for a TFBS pair TE (enhancer) and TP (promoter)
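
The last step can be sketched as a plug-in estimate of MI from observed frequencies. This is only an illustration: the function name, the binary presence/absence encoding, and the toy data are assumptions, and the slides do not state which estimator was used.

```python
from collections import Counter
from math import log2

def mutual_information(xs, ys):
    """Plug-in estimate of I(X;Y) in bits from paired discrete observations."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px, py = Counter(xs), Counter(ys)
    # p(x,y) * log2( p(x,y) / (p(x) p(y)) ), written with raw counts
    return sum((c / n) * log2(c * n / (px[x] * py[y]))
               for (x, y), c in joint.items())

# Hypothetical data, one entry per PEP: 1 = TFBS predicted in the region, 0 = absent
te       = [1, 1, 0, 0, 1, 1, 0, 0]
tp_same  = [1, 1, 0, 0, 1, 1, 0, 0]   # always co-occurs with te
tp_indep = [1, 0, 1, 0, 1, 0, 1, 0]   # statistically independent of te

mi_same = mutual_information(te, tp_same)
mi_indep = mutual_information(te, tp_indep)
```

In this toy data the co-occurring pair yields 1 bit and the independent pair yields 0 bits.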

SLIDE 5

Multivariate mutual information

How much information does TFBS TE contain about TP when the interaction type (label L) is taken into account?
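
One common definition of multivariate mutual information (also called interaction information or co-information) for three variables is shown below. Sign conventions differ in the literature, so the convention used here is an assumption; the slides do not state which one was applied.

```latex
I(T_E; T_P; L) \;=\; I(T_E; T_P) \;-\; I(T_E; T_P \mid L)
```

Under this convention the measure is large when the dependence between T_E and T_P is explained by the label, and it can become negative when conditioning on the label increases their dependence.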

SLIDE 6

Workflow

SLIDE 7

Information theoretic measures

Multivariate mutual information I(TE;TP;L)

SLIDE 8

Information theoretic measures

Multivariate mutual information I(TE;TP;L)
Mutual information of joint TE, TP with L: I(TE,TP;L)

SLIDE 9

Information theoretic measures

Multivariate mutual information I(TE;TP;L)
Mutual information of joint TE, TP with L: I(TE,TP;L)
Conditional mutual information I(TE;TP|L)

SLIDE 10

Information theoretic measures

Multivariate mutual information I(TE;TP;L)
Mutual information of joint TE, TP with L: I(TE,TP;L)
Conditional mutual information I(TE;TP|L)
Dual total correlation DTC(TE;TP;L)
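
The four measures can all be written in terms of joint entropies, which the following sketch uses. It is an illustration under assumptions: the helper names and toy data are hypothetical, probabilities are estimated from raw frequencies, and the sign convention I(TE;TP;L) = I(TE;TP) - I(TE;TP|L) is assumed rather than taken from the slides.

```python
from collections import Counter
from math import log2

def entropy(*columns):
    """Joint Shannon entropy in bits of one or more aligned discrete columns."""
    rows = list(zip(*columns))
    n = len(rows)
    return -sum((c / n) * log2(c / n) for c in Counter(rows).values())

def measures(te, tp, label):
    """Return the four measures for one TFBS pair, estimated from frequencies."""
    h_e, h_p, h_l = entropy(te), entropy(tp), entropy(label)
    h_ep, h_el, h_pl = entropy(te, tp), entropy(te, label), entropy(tp, label)
    h_epl = entropy(te, tp, label)
    cond_mi = h_el + h_pl - h_l - h_epl       # I(TE;TP|L)
    joint_mi = h_ep + h_l - h_epl             # I(TE,TP;L)
    multi_mi = (h_e + h_p - h_ep) - cond_mi   # I(TE;TP;L), assumed sign convention
    dtc = h_ep + h_el + h_pl - 2 * h_epl      # H(joint) minus each H(one | other two)
    return {"I(TE;TP;L)": multi_mi, "I(TE,TP;L)": joint_mi,
            "I(TE;TP|L)": cond_mi, "DTC": dtc}

# Hypothetical toy case: TE, TP and the label coincide in every PEP
col = [1, 1, 0, 0]
toy = measures(col, col, col)

# Second hypothetical data set, used to check DTC = I(TE,TP;L) + I(TE;TP|L)
mixed = measures([0, 1, 1, 0, 1, 0, 1, 1],
                 [1, 1, 0, 0, 1, 0, 0, 1],
                 [1, 1, 1, 1, 0, 0, 0, 0])
```

When all three columns coincide, the sketch returns 1 bit for I(TE;TP;L), I(TE,TP;L) and DTC, and 0 for I(TE;TP|L). For three variables the dual total correlation always equals I(TE,TP;L) + I(TE;TP|L), which the second data set illustrates.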

SLIDE 11

Synthetic example I

Synthetic TFBS-sequence matrix:
An entry f_ij in the matrix is the frequency of TFBS T_j in sequence i.
One row corresponds to a PEP.
The label column indicates the pairing type (true/false pair).
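
A matrix of this shape could be assembled as in the sketch below. The function name, the "E:"/"P:" prefixes that separate enhancer from promoter TFBSs, and the hit lists are all hypothetical and serve only to show the layout.

```python
from collections import Counter

def tfbs_matrix(hits_per_pep, labels, tfbs_names):
    """Return one row [f_i1, ..., f_ik, label_i] per PEP."""
    rows = []
    for hits, label in zip(hits_per_pep, labels):
        counts = Counter(hits)  # missing TFBSs count as 0
        rows.append([counts[name] for name in tfbs_names] + [label])
    return rows

# Hypothetical predicted hits for three PEPs
tfbs_names = ["E:T1", "E:T2", "P:T1", "P:T2"]
hits_per_pep = [
    ["E:T1", "E:T1", "P:T1"],
    ["E:T2", "P:T2", "P:T2"],
    ["E:T1", "P:T2"],
]
labels = [1, 1, 0]  # 1 = true pair, 0 = false pair

matrix = tfbs_matrix(hits_per_pep, labels, tfbs_names)
```

Each measure is then computed from two TFBS columns of this matrix together with the label column.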

SLIDE 12

Synthetic example I

1. Perfectly associated TFBS pair

SLIDE 13

Synthetic example I

1. Perfectly associated TFBS pair
2. Associated TFBS pair in true PEPs

SLIDE 14

Synthetic example I

1. Perfectly associated TFBS pair
2. Associated TFBS pair in true PEPs
3. Non-associated TFBS pair

SLIDE 15

Synthetic example I: Result

1. Perfectly associated TFBS pair
2. Associated TFBS pair in true PEPs
3. Non-associated TFBS pair

Table: Results of the different measures for synthetic example I.

TFBS of enhancer   TFBS of promoter   I(TE;TP;L)   I(TE,TP;L)   I(TE;TP|L)   DTC(TE,TP,L)
TE1                TP1                1.0          1.0          -            1.0
TE2                TP2                0.43         0.43         0.43         0.86
TE3                TP3                -            0.33         0.66         1.0

SLIDE 16

Synthetic example II

Use a given library of 166 PWMs to predict potential TFBSs in the sequences
True PEPs: TFBSs V$IRF1_01 and V$USF_01 are randomly inserted 1 to 10 times in enhancer and promoter sequences
False PEPs: Shuffled enhancer and promoter sequences
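
The construction of such a data set might look like the sketch below. Several points are assumptions: the motif strings are placeholders rather than the actual IRF1 or USF matrices, background sequences are uniform random DNA, only one motif is planted per region for simplicity, and "shuffled" is read as permuting the nucleotides of each sequence (the slide could also mean re-pairing enhancers and promoters).

```python
import random

# Placeholder motif strings for illustration only, not the real PWMs
MOTIF_A = "GAAAGTGAAA"
MOTIF_B = "CACGTGAC"

def random_sequence(length, rng):
    """Uniform random DNA sequence of the given length."""
    return "".join(rng.choice("ACGT") for _ in range(length))

def plant_motif(seq, motif, rng, min_copies=1, max_copies=10):
    """Overwrite random positions of seq with 1 to 10 copies of motif.

    Positions are drawn independently, so copies may overlap each other.
    """
    chars = list(seq)
    for _ in range(rng.randint(min_copies, max_copies)):
        start = rng.randrange(len(chars) - len(motif) + 1)
        chars[start:start + len(motif)] = motif
    return "".join(chars)

def shuffle_sequence(seq, rng):
    """Permute the nucleotides, keeping the base composition."""
    chars = list(seq)
    rng.shuffle(chars)
    return "".join(chars)

rng = random.Random(0)  # fixed seed so the sketch is reproducible
true_enhancer = plant_motif(random_sequence(200, rng), MOTIF_A, rng)
true_promoter = plant_motif(random_sequence(200, rng), MOTIF_B, rng)
false_enhancer = shuffle_sequence(true_enhancer, rng)
false_promoter = shuffle_sequence(true_promoter, rng)
```

The true pair carries the planted motifs, while the shuffled pair keeps the same base composition but no longer preserves them by construction.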

SLIDE 17

Synthetic example II: Results

In total there are 27556 TFBS pairs → Ranks of the inserted pairs?

Table: Ranking position of the inserted pairs.

TFBS of enhancer   TFBS of promoter   I(TE;TP;L)   I(TE,TP;L)   I(TE;TP|L)   DTC(TE,TP,L)
V$IRF1_01          V$USF_01           1            160          5563         125
V$IRF1_01          V$IRF1_01          2            161          6848         215
V$USF_01           V$IRF1_01          3            408          4309         60
V$USF_01           V$USF_01           4            396          3524         23
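
The total of 27556 pairs is consistent with combining each of the 166 PWMs on the enhancer side with each on the promoter side (166 × 166 = 27556). The sketch below shows that count and how a rank can be read off a score table; the PWM names and scores are made up for illustration.

```python
from itertools import product

# Hypothetical PWM names standing in for the 166-matrix library
pwm_names = [f"PWM{i:03d}" for i in range(166)]
all_pairs = list(product(pwm_names, repeat=2))  # (enhancer TFBS, promoter TFBS)
n_pairs = len(all_pairs)

def rank_of(pair, scores):
    """1-based rank of pair when all pairs are sorted by descending score."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    return ordered.index(pair) + 1

# Made-up scores for three pairs, only to show the ranking step
toy_scores = {
    ("PWM000", "PWM001"): 0.9,
    ("PWM000", "PWM000"): 0.5,
    ("PWM001", "PWM000"): 0.1,
}
toy_rank = rank_of(("PWM000", "PWM000"), toy_scores)
```

A measure that separates the inserted pairs well places them at the top of this ordering, which is what the ranking table reports for each measure.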

SLIDE 18
Biological application: K562 cell line

TFBS enhancer (TE)   TFBS promoter (TP)   I(TE;TP;L)
V$E2F_Q6_01          V$CREB1_Q6           0.0144
V$CREB1_Q6           V$CREB1_Q6_01        0.0122
V$CREB1_Q6           V$CREB1_Q6           0.0115
V$E2F_Q6_01          V$HOMEZ_01           0.0098
V$IK_Q5              V$IK_Q5              0.0095
V$IK_Q5              V$HOMEZ_01           0.0094
V$E2F_Q6_01          V$FAC1_01            0.0087
V$E2F_Q6_01          V$HIF1A_Q6           0.0084
V$CREB1_Q6           V$HOMEZ_01           0.0081
V$HIF1A_Q6           V$E2F_Q6_01          0.0075

[Logo plots of the enhancer and promoter motifs were shown next to each row on the slide.]

SLIDE 19

Summary

Workflow to detect associated TFs on enhancer and promoter regions based on their binding sites
Compared four different information theoretic measures on synthetic data sets
Multivariate mutual information I(TE;TP;L) performs best on both sets

SLIDE 20

Thanks

People
Edgar Wingender, Mehmet Gültas, Martin Haubrock, Sebastian Zeidler, Jürgen Dönitz, Rayan Daou, Darius Wlochowitz, Halima Alachram, Doris Waldmann, Torsten Schöps, Malte Sahrhage
