SLIDE 14 Measuring Term specificity - Information Content (IC)
Information Content (IC) is based on the number of annotations involving a term and its descendants ICt(x) = −log( nx nroot ) nx being the number of annotations involving term x and its descendants
GO:0006139 Nucleobase, nucleoside, nucleotide and nucleic acid metabolism GO:0043283 Biopolymer metabolism GO:0044237 Cellular metabolism GO:0043170 Macromolecule metabolism GO:0044238 Primary metabolism GO:0009987 Cellular process GO:0008150 Biological process
5 15 15
GO:0008152 metabolism nroot = 56 nGO:0008512 = 46 IC(GO:0008512) = -log(46/56)
3 5 3 8 2 Marco Mina (University of Padova) Investigating bias in SS measures September 13, 2011 14 / 17