SLIDE 1
Final Project/Presentation, STATC141 Spring 2007
How to grade the project?
To receive a grade of C: The presentation should be of 10-15 minutes long, and shows your understanding of the problem. The presentation should contain at least two parts. One long part is on the introduction and background of the problem (why this problem is interesting and how important the problem is in current bioinformatics research). This part will be the key to determine a grade between C-, C and C+. The other part can be short, which briefly reviews the current available methods for this problem. To receive a grade of B: The short part mentioned above on current available methods should be expanded. There should be a more comprehensive review included in the presentation. The quality of the introduction and review will be the key to determine a grade between B-, B and B+. A short and brief discussion of the advantages and disadvantages of the current methods should also be added. The total length of the presentation should be of 15-20 minutes. To receive a grade of A: The short discussion (mentioned above) on the advantages and disadvantages of the current methods should be expanded. In more detail, you should evaluate and compare the current available methods, and also comment
- n the difficulties of the problem. It is important that you show your own understandings on the methods. The
quality of the more detailed discussion will be the key to determine a grade of A- or A. Hopefully the discussion could provide some guidance on how to solve the problem. The total length of the presentation should be of 15-20 minutes. To receive a grade A+: The presentation is very nicely done and provides the guidance on how to solve the problem. A reasonable method may be proposed to solve the problem.
Topics
Project I. Evaluating the similarity of two sequences under an evolutionary context Assume that we have three sequences X, Y, Z from human, mouse and fugu respectively. Since human is closely related to mouse and distantly related to fugu, we would expect that X is more likely to be similar to Y rather than
- Z. In other words, if we observe that the sequence similarity between X and Y is at the same level with the
sequence similarity between X and Z, the similarity between X and Z should be more significant considering the distant relationship between human and fugu. Could you provide a statistical framework for this question, and suggest a reasonable solution for it? (for this project, how to frame the question is the most challenging part. Once the problem is appropriately framed with statistical language, it may not be difficult to solve it. You can consider defining your own test statistics for sequence similarity evaluation. Note that above grading scheme can not be 100% applied here. To receive a grade
- f A, you need frame the question in some way.)