1
BISC/CS303: Bioinformatics Spring 2008
Instructors: Brian Tjaden and Brett Pellock Meeting: Wednesdays, 6:30 - 9:00pm FirstClass Conference: BISC/CS303-S08
BISC/CS303: Bioinformatics Spring 2008 Administrivia Instructors: - - PDF document
BISC/CS303: Bioinformatics Spring 2008 Administrivia Instructors: Brian Tjaden and Brett Pellock Meeting: Wednesdays, 6:30 - 9:00pm FirstClass Conference: BISC/CS303-S08 1 Course Materials http://cs.wellesley.edu/~cs303 Textbooks
1
Instructors: Brian Tjaden and Brett Pellock Meeting: Wednesdays, 6:30 - 9:00pm FirstClass Conference: BISC/CS303-S08
2
http://cs.wellesley.edu/~cs303
Bioinformatics and Functional Genomics by Jonathan Pevzner John Wiley & Sons, Inc., 2003 Functional Concepts of Bioinformatics by D.E. Krane and M.L. Raymer Benjamin Cummings, 2003
3
60%
30%
10%
Focus is on how bioinformatic tools work
Computational tools Biological data Bioinformatics
4
5
Gene Protein DNA DNA: “program” for cell processes Proteins: execute cell processes
phosphate backbone
6
(anti-parallel strands)
function as a template (complementary strands)
C A T A G G A C T C T G
Nucleic acids Amino acids
transcription
translation
7
STOP
translation
M
methionine
G
glycine
S
serine
transcription mRNA GAACGCUAUGCUUGGGUGCUCUAAGUAAGCUAG GCTGACTTGCGATACGAACCCACGAGATTCATTCGATCATTT DNA gene CGACTGAACGCTATGCTTGGGTGCTCTAAGTAAGCTAGTAAA
L
leucine
polypeptide chain
K
lysine
C
cysteine
8
Amino acid tRNA (adapter molecule) mRNA GAACGCUAUGCUUGGGUGCUCUAAGUAAGCUAG codons anti-codon
9
efforts
enviromental factors
Normal RBCs Sickle cell anemia
10
A T Single base substitutions Insertions and Deletions Amplifications Inversions Translocations
9 22 CML
A genome is the total DNA in a cell Species Genome size # of Genes
Epstein-Barr virus 172 Thousand 80 Escherichia coli 4.6 Million 4,400 Drosophila melanogaster 122 Million ~14,000 Homo sapiens 3.3 Billion <25,000 Psilotum nudum (fern) 250 Billion ?
11
Growth in DNA Sequencing 1,000 10,000 100,000 1,000,000 10,000,000 100,000,000 1982 1984 1986 1988 1990 1992 1994 1996 1998 2000 2002 2004
Millions of DNA BasePairs
12
CCGTCAACAA TGCACGTATGACA ATGACAGCTTTAG CAGCTTTAGACCA ACTGAGCTCAAGGTTGCA GTCAACAACTGAG GAGCTCA TTGCACG TTTAGAC ATGACAGCTTTAG CAGCTTTAGACCA ACTGAGCTCAAGGTTGCA GTCAACAACTGAG CCGTCAACAA GAGCTCA TGCACGTATGACA TTGCACG TTTAGAC CCGTCAACAACTGAGCTCAAGGTTGCACGTATGACAGCTTTAGACCA
13
Comparative genomics involves understanding the relationships between the genomes of different species.
homology to known genes
14
(What are the genes?)
interact with each other?
in other words... How does the cell’s DNA “program” work?
generating
genomes can be very useful
generated by bioinformatic tools