CSCI 2570 Introduction to Nanocomputing DNA Computing John E - PowerPoint PPT Presentation

CSCI 2570 Introduction to Nanocomputing DNA Computing John E Savage

DNA (Deoxyribonucleic Acid) � DNA is double-stranded helix of nucleotides, nitrogen-containing molecules. � It carries genetic information of cell, encodes information for proteins & can self-replicate. � Base elements form rungs on double helix. � They occur in pairs: A-T (adenine-thymine), C-G (cytosine-guanine). � Sugars and phosphates form sides of helix. DNA Computing CSCI 2570 @John E Savage 2

RNA (Ribonucleic Acid) � RNA synthesized from DNA. � Genetic information carried from DNA via RNA. � RNA is a constituent of cells and viruses � RNA consists of a long, single stranded chain of phosphate and ribose units of bases. � Bases are adenine, guanine, cytosine and uracil. � Determines protein synthesis and transmission of genetic information. � RNA can also replicate. DNA Computing CSCI 2570 @John E Savage 3

DNA Hybridization � We assume that only Watson-Crick complementary strings combine. � Form oligonucleotides (2 to 20 nucleotides). � General framework for computing with DNA: � Mix oligonucleotides in solution. � Heat up solution. � Cool down slowly to allow structures to form � We show that DNA is as powerful as a Turing machine! DNA Computing CSCI 2570 @John E Savage 4

DNA is a Form of Nanotechnology � Double helix diameter = 2.0 nanometers. � Helical pitch (dist. between rungs) = .34 nms. � Ten base pairs per helical turn. � ~3 x 10 9 base pairs in human genome DNA Computing CSCI 2570 @John E Savage 5

Computing with DNA � Prepare oligonucleotides (“program them”) � Prepare solution with multiple strings. � Only complementary substrings q and q combine, e.g. q = CAG and q = GTC GCTCAG � E.g. GCTCAG + GTCTAT = GTCTAT � 1D & 2D crystalline structures self-assemble DNA Computing CSCI 2570 @John E Savage 6

Hamiltonian Path (HP) Problem � Directed graph G = (V,E) � Determine if there is a path beginning at v in & ending at v out that enters each vertex once. 4 3 1 0 6 2 5 � This graph has HP from v in = 0 to v out = 6 DNA Computing CSCI 2570 @John E Savage 7

Why is Hamiltonian Path Problem Hard? � Intuitively, the number of paths that must be explored grows exponentially with the size of the graph. � Finding a Hamiltonian path using a naïve search algorithm requires exponential search time. � Formally, it has been shown that the Hamiltonian Problem is NP-hard. DNA Computing CSCI 2570 @John E Savage 8

HP Problem is NP-Hard � NP is a class of important languages. � A problem Q (a set of instances ) is in NP if for every “Yes” instance of the problem there is a witness to membership in Q whose validity can be established in polynomial time in the instance size. � The hardest problems in NP are NP -complete. � For a problem Q to be NP -complete, Q must be in NP and every problem in NP must be reducible to Q in polynomial time. (Each problem can be solved by translating it to Q .) � If any NP -complete problem is in P (or EXP ), so is every other NP -complete problem. DNA Computing CSCI 2570 @John E Savage 9

Adleman’s Algorithm Generate random paths through the graph. 1. Keep paths starting with v in & ending with v out 2. If the path has n vertices, keep only paths with n 3. vertices. Keep all paths that enter each vertex at least once. 4. If any paths remain, say “Yes”. Otherwise say “No.” 5. DNA Computing CSCI 2570 @John E Savage 10

Hybridization to Create Paths � Adleman † denotes vertex v by DNA string (or strand ) p v q v . Strands must long enough that they are unique. � Edge ( u , v ) is denoted by q’ u p’ v where p’ and q’ are the Watson-Crick complements of p and q � Mix many copies of edge and vertex strands are put into solution along with copies of p’ in and q’ out . � Adleman used 20-mers in his experiments, |pq| = 20. †"Molecular Computation of Solutions To Combinatorial Problem," Science, 266: 1021-1024, (Nov. 11) 1994. DNA Computing CSCI 2570 @John E Savage 11

Generating Random Paths Through the Graph � Edge strings q’ u p’ v combine with vertex strings p v q v to form duplexes , shown below. q’ u p’ v q’ v p’ w GTATATCCGAGCTATTCGAGCTTAAAGCTAGGCTAGGTAC CGATAAGCTCGAATTTCGAT CCGATCCATGTTAGCACCGT p v q v p w q w � Each duplex has two sticky ends that can combine with another duplex or strand � For starting and ending vertices p v q v and p w q w add p’ v and q’ w so that duplexes with sticky ends q v and p w are produced. DNA Computing CSCI 2570 @John E Savage 12

Implementing the Algorithm � Use PCR to amplify strings starting with vertex v 0 and ending with v 6 . DNA Computing CSCI 2570 @John E Savage 13

Polymerase Chain Reaction (PCR) for String Amplification α β δ 5’ 3’ Separate double β ’ δ ’ α ’ 5’ Strand of DNA 3’ γ 2 γ 1 5’ α κ δ 3’ Identify short Substrings γ ’ 2 γ ’ 1 3’ α κ ’ δ ’ 5’ γ 2 γ 1 5’ α κ δ 3’ Denature and γ ’ 1 bind complements of short strings. γ ’ 2 γ ’ 1 3’ α κ ’ δ ’ 5’ γ 2 DNA Computing CSCI 2570 @John E Savage 14

More on PCR � Polymerase is large molecule that splits double stranded DNA and replicates from 5’ to 3’ starting it at double stranded section. γ 2 γ 1 Hybridize γ ’ 1 with 5’ α κ δ 3’ one strand, γ 2 γ ’ 1 with other γ 2 γ 1 5’ κ δ 3’ γ ’ 2 γ ’ 1 Shortened strand κ ’ δ ’ 5’ clipped at γ 1 . 3’ γ ’ 1 γ ’ 2 Shorten at γ ’ 2 κ ’ and replicate. DNA Computing CSCI 2570 @John E Savage 15

Chain Reaction � Clip DNA subsequence at both ends � Use polymerase to replicate between γ 1 & γ 2 . � Replication doubles substring on every step. � Volume of targeted substring grows exponentially. DNA Computing CSCI 2570 @John E Savage 16

Implementing the Algorithm � Use gel electropheris to find strings denoting paths of seven vertices. DNA Computing CSCI 2570 @John E Savage 17

Setup for Gel Electrophoresis � Figure provided by Wikipedia DNA Computing CSCI 2570 @John E Savage 18

Gel Electrophoresis � Separates RNA, DNA and oligonucleotides. � Nucleic acids are mixed with porous gel. � Electric field moves charged molecules in gel. � Distance a molecule moves is approximately proportional to inverse of logarithm of its size. � Molecules can be seen through staining or other methods. � Electrophoresis purifies molecules. DNA Computing CSCI 2570 @John E Savage 19

Adleman’s Algorithm Generate random paths through the graph. 1. Keep paths starting with v in & ending with v out 2. If the path has n vertices, keep only paths with n 3. vertices. Keep all paths that enter each vertex at least once. 4. If any paths remain, say “Yes”. Otherwise say “No.” 5. DNA Computing CSCI 2570 @John E Savage 20

Implementing the Algorithm � Separate double helix into single stands. � Separate out strings containing v 0 by attaching one copy of p 0 that has a magnetic bead attached to it. � Of those that remain, repeat with p i for i = 1, 2, …, 6. � The result are strings of length 7 that contain each of the vertices. � Amplify the final set of strings using PCR. Use gel electrophoresis to determine if there are any solutions. DNA Computing CSCI 2570 @John E Savage 21

Comments on Adleman’s Method � Long strings {p v } needed to make unlikely that p v combines with a string other than p v . � Twenty base elements per string suffice � Adleman’s experiment required 7 days in lab. � String amplification, gel electrophoresis � Exponential volume of material needed to do tests. � Method exploits parallelism � Nature has lots of parallelism. � Unfortunately reaction times are long (secs). DNA Computing CSCI 2570 @John E Savage 22

Extending DNA Computing to Satisfiability � SAT is defined by clauses: � A set of clauses is “satisfied” if exist values for variables s.t. each clause has value “True”. � Create a double helix for each path (binary string) as in Adleman’s problem. DNA Computing CSCI 2570 @John E Savage 23

Illustration of Lipton’s Method � SAT is defined by clauses: � Lipton † generates all “binary” strings in test tube t 0 . Filter them according to clauses. � Extract strings with x = 1. � Extract strings with x = 0 and y = 1. � Combine the two sets in test tube t 1 . � Repeat with tube t 1 on second clause, i.e. on x’ = 1, y’ = 1. � If any strings survive, it’s a “Yes” instance of SAT. †“DNA Solution of Hard Computational Problems,” R.J. Lipton, Science, vol 268, p542545m 1995 DNA Computing CSCI 2570 @John E Savage 24

Lipton’s General Method for Computing Satisfiability � Create many copies of all paths in G binary below. Paths correspond to all binary strings � For first clause produce test tube containing paths satisfying all of its literals. � Repeat with the second and subsequent clauses. � If all clauses can be satisfied, it will be discovered with high probability . DNA Computing CSCI 2570 @John E Savage 25

Conclusion � DNA-based computing offers interesting possibilities � Most likely to be useful for nano fabrication � However, high error rates may preclude its use DNA Computing CSCI 2570 @John E Savage 26

CSCI 2570 Introduction to Nanocomputing DNA Computing John E - PowerPoint PPT Presentation

CSCI 2570 Introduction to Nanocomputing DNA Computing John E Savage DNA (Deoxyribonucleic Acid) DNA is double-stranded helix of nucleotides, nitrogen-containing molecules. It carries genetic information of cell, encodes information

CSCI 2570 Introduction to Nanocomputing Reconfigurable Computing John E Savage Overview

CSCI 2570 Introduction to Nanocomputing Synthetic Biology John E Savage What is Synthetic

CSCI 2570 Introduction to Nanocomputing The Emergence of Nanotechnology John E Savage Purpose

CSCI 2570 Introduction to Nanocomputing Information Theory John E Savage What is Information

CSCI 2570 Introduction to Nanocomputing Historical Context for Computing John E Savage A

CSCI 2570 Introduction to Nanocomputing Probability Theory John E Savage The Role of

CSCI 2570 Introduction to Nanocomputing DNA Tiling John E Savage Computing with DNA

CSCI 2570 Introduction to Nanocomputing Errors in Crossbars John E Savage Lecture Outline

CSCI 2570 Introduction to Nanocomputing Discrete Quantum Computation John E Savage November 27,

CS257 Introduction to Nanocomputing Overview of Crossbar-Based Computing John E Savage

Soft Error Rate Trends 4th Workshop on Dependable and Secure Nanocomputing (WDSN-10) Alan Wood

Fourth Workshop on Dependable and Secure Nanocomputing Organizers: Jean Arlat, LAAS- CNRS

CSCI 2133 Rapid Programming Techniques for Innovation UI Design CSS Grid and Flexbox

CSCI 5582 Artificial Intelligence Lecture 23 Jim Martin CSCI 5582 Fall 2006 Today 11/30

CSCI 2133 Rapid Programming Techniques for Innovation CSS, CSS3, SASS/SCSS CSCI 2133 2

CSCI 5582 Artificial Intelligence Lecture 11 Jim Martin CSCI 5582 Fall 2006 Today 10/5

Sequencing and decoding genomes C. Victor Jongeneel, PhD Ludwig Institute for Cancer Research

Structural Bioinformatics Davide Ba Staff Scientist Genome Biology Group (CNAG) Structural

Biotechnology & Art HC 177, spring 09 The discoverers of the DNA structure, James Watson, at

Evolutionary Systems Companion slides for the book Bio-Inspired Artificial Intelligence: Theories,

How drugs interact and stabilize themselves in the vicinity of DNA? A computational quest Anwesh

Engineering Genetic Circuits Chris J. Myers Lecture 1: An Engineers Guide to Biology and

There have been 289 post-conviction DNA exonerations in the United States. The frst DNA

In-vitro Molecular Computing Based on DNA Strands An unconventional computing concept inspired by

Sambuz

Useful Links

Newsletter

Mail Us

CSCI 2570 Introduction to Nanocomputing DNA Computing John E - PowerPoint PPT Presentation

CSCI 2570 Introduction to Nanocomputing DNA Computing John E Savage DNA (Deoxyribonucleic Acid) DNA is double-stranded helix of nucleotides, nitrogen-containing molecules. It carries genetic information of cell, encodes information

CSCI 2570 Introduction to Nanocomputing Reconfigurable Computing John E Savage Overview

CSCI 2570 Introduction to Nanocomputing Synthetic Biology John E Savage What is Synthetic

CSCI 2570 Introduction to Nanocomputing The Emergence of Nanotechnology John E Savage Purpose

CSCI 2570 Introduction to Nanocomputing Information Theory John E Savage What is Information

CSCI 2570 Introduction to Nanocomputing Historical Context for Computing John E Savage A

CSCI 2570 Introduction to Nanocomputing Probability Theory John E Savage The Role of

CSCI 2570 Introduction to Nanocomputing DNA Tiling John E Savage Computing with DNA

CSCI 2570 Introduction to Nanocomputing Errors in Crossbars John E Savage Lecture Outline

CSCI 2570 Introduction to Nanocomputing Discrete Quantum Computation John E Savage November 27,

CS257 Introduction to Nanocomputing Overview of Crossbar-Based Computing John E Savage

Soft Error Rate Trends 4th Workshop on Dependable and Secure Nanocomputing (WDSN-10) Alan Wood

Fourth Workshop on Dependable and Secure Nanocomputing Organizers: Jean Arlat, LAAS- CNRS

CSCI 2133 Rapid Programming Techniques for Innovation UI Design CSS Grid and Flexbox

CSCI 5582 Artificial Intelligence Lecture 23 Jim Martin CSCI 5582 Fall 2006 Today 11/30

CSCI 2133 Rapid Programming Techniques for Innovation CSS, CSS3, SASS/SCSS CSCI 2133 2

CSCI 5582 Artificial Intelligence Lecture 11 Jim Martin CSCI 5582 Fall 2006 Today 10/5

Sequencing and decoding genomes C. Victor Jongeneel, PhD Ludwig Institute for Cancer Research

Structural Bioinformatics Davide Ba Staff Scientist Genome Biology Group (CNAG) Structural

Biotechnology &amp; Art HC 177, spring 09 The discoverers of the DNA structure, James Watson, at

Evolutionary Systems Companion slides for the book Bio-Inspired Artificial Intelligence: Theories,

How drugs interact and stabilize themselves in the vicinity of DNA? A computational quest Anwesh

Engineering Genetic Circuits Chris J. Myers Lecture 1: An Engineers Guide to Biology and

There have been 289 post-conviction DNA exonerations in the United States. The frst DNA

In-vitro Molecular Computing Based on DNA Strands An unconventional computing concept inspired by

Sambuz

Useful Links

Newsletter

Mail Us

Biotechnology & Art HC 177, spring 09 The discoverers of the DNA structure, James Watson, at