Bioinformatics Outline
- What is bioinformatics?
- Who are bioinformaticians?
- Hardware
- Software
What is bioinformatics?
– Someone to analyze my data
– Someone to help me think about my data
– A person who writes complex algorithms
– A person who knows what an HMM is
– That bloke who fixes my computer
– Someone who builds websites
– People sitting in a dark room analyzing data
– The boring stuff I do between experiments
– perl, python, R, linux, java, C++, bash, ruby, HTML
Who are bioinformaticians?
publish papers, train students
data
Hardware
Torrent Server Recommended
– Processors: two six-core processors
– RAM: 48 GB
– HDD capacity: eight 2 TB hard drives in RAID 5 with 12 TB usable
– Network: quad-port gigabit NIC
– GPU: NVIDIA graphics processing unit
– Chassis: Dell Precision T7500 tower; no rack mount available
– Monitor/keyboard: not included
– File access available via SSH or web service
$12,500
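As a rough check on the storage figure, the sketch below computes nominal usable capacity for RAID 5 and RAID 6 arrays. The function name `raid_usable_tb` and the `hot_spares` parameter are illustrative assumptions, not vendor documentation. Eight 2 TB drives in plain RAID 5 would nominally give 14 TB. The quoted 12 TB would match, for example, one drive held back as a hot spare, but the slide does not say how the array is configured.

```python
def raid_usable_tb(level: int, drives: int, drive_tb: float, hot_spares: int = 0) -> float:
    """Nominal usable capacity in TB for a RAID 5 or RAID 6 array.

    Ignores filesystem and formatting overhead. `hot_spares` is a
    hypothetical parameter for drives kept out of the active array.
    """
    parity_drives = {5: 1, 6: 2}  # RAID 5 loses one drive to parity, RAID 6 two
    if level not in parity_drives:
        raise ValueError("only RAID 5 and RAID 6 are handled in this sketch")
    active = drives - hot_spares
    if active < parity_drives[level] + 2:
        raise ValueError("not enough active drives for this RAID level")
    return float((active - parity_drives[level]) * drive_tb)


print(raid_usable_tb(5, 8, 2))                # 14.0 (all eight drives active)
print(raid_usable_tb(5, 8, 2, hot_spares=1))  # 12.0 (assumed: one hot spare)
```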
Computers
– 51 node cluster
– most nodes: 16 CPUs, 8 cores each, 132 GB RAM, 1 TB local storage (/usr/data), InfiniBand interconnects
– (6,528 cores; 6,732 GB RAM; 50 TB scratch storage)
– connected to most nodes via InfiniBand
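The core and RAM totals follow from the per-node figures if every one of the 51 nodes has the "most nodes" specification. That uniformity is an assumption, and the names `NodeSpec` and `cluster_totals` are illustrative only. A minimal sketch of the arithmetic:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NodeSpec:
    cpus: int           # CPUs per node
    cores_per_cpu: int  # cores per CPU
    ram_gb: int         # RAM per node, in GB


def cluster_totals(nodes: int, spec: NodeSpec) -> tuple[int, int]:
    """Return (total cores, total RAM in GB), assuming identical nodes."""
    total_cores = nodes * spec.cpus * spec.cores_per_cpu
    total_ram_gb = nodes * spec.ram_gb
    return total_cores, total_ram_gb


cores, ram_gb = cluster_totals(51, NodeSpec(cpus=16, cores_per_cpu=8, ram_gb=132))
print(f"{cores:,} cores; {ram_gb:,} GB RAM")  # 6,528 cores; 6,732 GB RAM
```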
Computers
– 24 processors with 6 cores each; 198 MB RAM
– lab web server: 24 processors, 6 cores each; 50M RAM; 19 TB RAID 6 storage (18 TB used)
Computers
– 4 secret servers!
– 48 TB backups and archival storage
Software
Local Software
Metagenomics Processing
– Binning reads
– Contamination removal
– Contig clustering
– Functional assignments
– Gene prediction
– Merge paired end reads
– Preprocessing
– Taxonomic assignments
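To make the preprocessing step concrete, here is a minimal sketch of read quality filtering on FASTQ records. It is not the algorithm of Prinseq or any other tool named in this deck. The length and mean-quality thresholds, the Phred+33 encoding, and the function names are all assumptions chosen for illustration.

```python
from typing import Iterable, Iterator, Tuple

Read = Tuple[str, str, str]  # (header, sequence, quality string)


def parse_fastq(lines: Iterable[str]) -> Iterator[Read]:
    """Yield reads from well-formed four-line FASTQ records."""
    it = iter(lines)
    for header in it:
        header = header.rstrip("\n")
        if not header:
            continue  # skip blank lines between records
        seq = next(it).rstrip("\n")
        next(it)  # the '+' separator line is not needed
        qual = next(it).rstrip("\n")
        yield header, seq, qual


def mean_phred(qual: str, offset: int = 33) -> float:
    """Mean Phred score of a quality string (Phred+33 assumed)."""
    if not qual:
        return 0.0
    return sum(ord(c) - offset for c in qual) / len(qual)


def filter_reads(reads: Iterable[Read], min_len: int = 50,
                 min_mean_q: float = 20.0) -> Iterator[Read]:
    """Keep reads that are long enough and have a high enough mean quality."""
    for header, seq, qual in reads:
        if len(seq) >= min_len and mean_phred(qual) >= min_mean_q:
            yield header, seq, qual


records = [
    "@r1", "ACGTACGT", "+", "IIIIIIII",  # mean quality 40: kept
    "@r2", "ACGTACGT", "+", "########",  # mean quality 2: dropped
    "@r3", "ACG", "+", "III",            # too short: dropped
]
kept = [h for h, _, _ in filter_reads(parse_fastq(records), min_len=5)]
print(kept)  # ['@r1']
```

Real preprocessing tools typically also trim adapters and low-quality ends and remove duplicate reads, none of which this sketch attempts.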
Metagenomics
– Prinseq
– FOCUS
– Real time metagenomics
– mg-rast
– Super FOCUS
– STAMP
– crAss
– metabat
– ContigClustering
Metagenomics Processing
– Contig clustering: AbundanceBin, CompostBin, concoct, crAss, tetra
– Gene prediction: FragGeneScan, GlimmerMG, MetaGeneAnnotator, MetaGeneMark, MetaGun, Orphelia, Prodigal
– Preprocessing: FASTQC, FastX Toolkit, fitGCP, NGS QC Toolkit, Non-pareil, Prinseq, QC-Chain, Streaming Trim
– Taxonomic assignment: CARMA, myTaxa, FOCUS, PhylopythiaS, KRAKEN, phymmbl, LMAT, RAIphy, MEGAN, TACOA, Metaplan, Taxy
– Functional assignment: CLAMS, Sequedex, DiScRIBinATE, SORT-ITEMS, genometa, SPANNER, GSMer, SPHINX, PPLACER, TaxSOM, RTMg, Treephyler