Dataset NLS / NES Thomas Heinzinger, Ron Fechtner, Verena Burger, - - PowerPoint PPT Presentation

dataset nls nes
SMART_READER_LITE
LIVE PREVIEW

Dataset NLS / NES Thomas Heinzinger, Ron Fechtner, Verena Burger, - - PowerPoint PPT Presentation

Dataset NLS / NES Thomas Heinzinger, Ron Fechtner, Verena Burger, Paul Hager, Lena Maria Hackl Technische Universitt Mnchen Fakultt fr Bioinformatik Rostlab Mnchen, 28.11.2019 image taken from


slide-1
SLIDE 1

Thomas Heinzinger, Ron Fechtner, Verena Burger, Paul Hager, Lena Maria Hackl Technische Universität München Fakultät für Bioinformatik Rostlab München, 28.11.2019

Dataset NLS / NES

image taken from http://jonlieffmd.com/blog/non-immune-cells-also-combat-microbes

1

slide-2
SLIDE 2

Rostlab (TUM) | Dataset NLS / NES 2

Total Number of Sequences: 787

slide-3
SLIDE 3

3 Rostlab (TUM) | Dataset NLS / NES

The protein seq. originate from various species

slide-4
SLIDE 4

4 Rostlab (TUM) | Dataset NLS / NES

Protein Length Distribution

average length: 675.8 median length: 515

slide-5
SLIDE 5

5 Rostlab (TUM) | Dataset NLS / NES

Signal Occurrence

slide-6
SLIDE 6

6 Rostlab (TUM) | Dataset NLS / NES

Signal Size

DNA-binding protein involved in cell cycle control Nucleolar protein that regulates RNAP I

slide-7
SLIDE 7

NLS exhibits no preferred location

7

slide-8
SLIDE 8

8 Rostlab (TUM) | Dataset NLS / NES

Signal motifs influence the aa distribution

slide-9
SLIDE 9

The signals exhibit no preferred location

9

slide-10
SLIDE 10

10 Rostlab (TUM) | Dataset NLS / NES

Data Source

Bernhofer et. al, NLSdb—major update for database of nuclear localization signals and nuclear export signals, Nucleic Acids Research

  • ValidNES NES entries
  • NESBase NES entries
  • Manually curated NES and NLS

SwissProt entries

  • SeqNLS NLS entries
slide-11
SLIDE 11

11 Rostlab (TUM) | Dataset NLS / NES

Data Source

Bernhofer et. al, NLSdb—major update for database of nuclear localization signals and nuclear export signals, Nucleic Acids Research

  • Manually curated NES and NLS tagged UniProt entries
  • MMseqs2 redundancy reduced
slide-12
SLIDE 12

12 Rostlab (TUM) | Dataset NLS / NES

Protein Length Distribution, split up by NES/ NLS

NES median: 376 NES mean: 507.58 NLS median: 520.5 (even number

  • f proteins)

NLS mean: 681.42 Result of two-sample Kolmogorov-Smirnov Test: statistic=0.1838, pvalue=0.0032