Markov Checkout Markov project from SVN Wednesday, 11:30 1:30 in a - PowerPoint PPT Presentation

Markov Checkout Markov project from SVN

 Wednesday, 11:30 – 1:30 in a Kahn Room (Union) ◦ Sign up for a 15-min time slot where your whole team can be there ◦ You’ll demo on a projector; anyone can watch  Each person will ◦ talk for ~1 minute about a technical facet of the program to which they contributed ◦ be prepared to answer questions about the project  Be professional! ◦ Be prepared ◦ Dress nicely

 Due to Wednesday’s presentations, tomorrow’s class will be optional  But for those who are here, it will be a great time to work on the Markov project, especially if you are working with a partner

Details

 Input: a text file the skunk jumped over the stump the stump jumped over the skunk the skunk said the stump stunk and the stump said the skunk stunk  Output: a randomly generated list of words that is “like” the original input in a well-defined way

 Gather statistics on word patterns by building an appropriate data structure  Use the data structure to generate random text that follows the discovered patterns

Prefix Suffix ffixes  Input: a text file NONWORD the the e skunk nk jumped mped ove ver the stump the skunk (4), the e stump p jumped mped ove ver the skunk nk stump (4) the e skunk nk said the stump mp stunk k jumped, said, skunk and d th the stu tump mp said th the skunk nk stu tunk nk stunk, the jumped over (2) over the (2) stump jumped, said, stunk, the said the (2) and, stunk NONWORD and the

 Input: a text file Prefix Suffix ffixes the e skunk nk jumped mped ove ver the stump NW NW the the e stump p jumped mped ove ver the skunk nk NW the skunk the e skunk nk said the stump mp stunk k jumped, the skunk and d th the stu tump mp said th the skunk nk stu tunk nk said, the, stunk skunk jumped over jumped over the stump, over the skunk the, jumped, the stump stunk, said …

 n=2:  n=1: the skunk said the the skunk the skunk stump stunk and the jumped over the stump jumped over skunk stunk the skunk jumped over the skunk stunk  Note: it’s also the skunk stunk possible to hit the max before you hit the last nonword.

 For the prefixes? Prefix Suffix ffixes NW NW the  For the set of suffixes? NW the skunk jumped, the skunk said, the,  To relate them? stunk skunk jumped over jumped over the stump, over the skunk the, jumped, the stump stunk, said …

 FixedLengthQueue: a specialized data structure, useful for Markov problem  Check out FixedLengthQueue  Working alone? See your individual repo.  Working with a partner? See your new Markov repo.  Work to implement it in the next 25 minutes or so  When you finish, read the (long) Markov description and start coding  We will only do milestone 1 (so no text justification)

Review HW description, Work on Markov for rest of class

Arrow w shows the point at which h next t to a add da data a  Example to the left shows the queue as elements are added a b ◦ We’ll only add , no remove  What do you need to implement this? ◦ Ar Array whose length is the capacity of the FLQ a b c ◦ Index at which to add the next element to the FLQ a b c d  This index increases by 1 as you add elements, but “wraps” back to 0 when it reaches the capacity of the FLQ ◦ Current size of the FLQ a b c d e  As opposed to the capacity of the FLQ f b c d e

Prefi fix x (n = 2) Suffi fix Blessed NONWORD NONWORD NONWORD Blessed are Blessed are the the meek NONWORD are the poor peacemakers the poor for Input: poor for they Blessed are the poor for for they will will will they will be Blessed are the they will be find peacemakers for they will will be Blessed Blessed find Blessed are meek for they will be Blessed are be Blessed are are the peacemakers for Inspired by Matthew 5:3-9 peacemakers for they will find Blessed To generate a new phrase, find Blessed are start with NONWORD NONWORD and “follow the chain”, but are meek for choose at random from meek for they eligible suffixes are NONWORD NONWORD

Prefi fix x (n = 2) Suffi fix Blessed NONWORD NONWORD NONWORD Blessed are Blessed are the the meek NONWORD are the poor peacemakers the poor for Use a Fixed-Length Queue poor for they whose length is n for they will will will they will be find Use a MultiSet • Stores each word with its will be Blessed Blessed multiplicity be Blessed are are • Has: the peacemakers for • size() peacemakers for they • findKth(int k) will find Blessed • To “pick at random” from a find Blessed are MultiSet, generate a random number, k , between 0 and are meek for size() , then call meek for they findKth(k) to get the are NONWORD NONWORD random word

This mapping is what we want to generate new data from the existing data, using a Markov Chain W k-4 W k-3 W k-2 W k-1 w k w k+1 k+1 • When building the map: t the word that follows s the given prefi fix • When generati ting g from the Implement as a map: random but according g to Fixed-Length Queue the data distributi tion on whose length is n Implement the mapping as a HashMap<String, MultiSet> Implement by choosing where the String is the at random from the concatenation of the words in the mapped MultiSet Fixed-Length Queue, and the MultiSet is the set of words that follow that String in the input Do you see why these are good data structures for this problem?

Initially, the FLQ contains NONWORD at all indices and w k+1 is the first word of the input FLQ: Q: W k-4 W k-3 W k-2 W k-1 W k add w k+1 (the next toString word in the String ng input file) to W k-4 W k-3 W k-2 W k-1 w k the FLQ (key): get the MultiSet from the The loop ends HashMap<String, MultiSet>, when the input Previous using this key file is empty. Follow the MultiSet loop by putting NONWORD as w k+1 n If the MultiSet is null, construct the times. MultiSet and put it into the HashMap. In any case, add w k+1 to the MultiSet Previous MultiSet plus w k+1

Initially, the FLQ contains NONWORD at all indices FLQ: Q: W k-4 W k-3 W k-2 W k-1 W k add w k+1 (the toString generated String ng word) to W k-4 W k-3 W k-2 W k-1 w k the FLQ (key): get the MultiSet from the The loop ends HashMap<String, MultiSet>, when using this key NONWORD is MultiSet generated or you get to the Choose w k+1 randomly from maximum the MultiSet, using number of words. findKth(random number between 0 and size of the MultiSet) W k+1

 Scanner scanner = new Scanner( new BufferedReader( new FileReader( this.pathToInputFile))); while (scanner.hasNext()) { String word = scanner.next(); ... }

Markov Checkout Markov project from SVN Wednesday, 11:30 1:30 in a - PowerPoint PPT Presentation

Markov Checkout Markov project from SVN Wednesday, 11:30 1:30 in a Kahn Room (Union) Sign up for a 15-min time slot where your whole team can be there Youll demo on a projector; anyone can watch Each person will talk for

Markov Chains Markov Processes Discrete-time Markov Chains Continuous-time Markov Chains Dr

Hidden Markov Models Discrete Markov Processes 1 Hidden Markov Models Hidden Markov Models 2

Markov chains and Hidden Markov Models 9000 Markov chains and HMMs We will discuss: Markov

CSCE 471/871 Lecture 3: Markov Chains Markov Chains and and Hidden Markov Models Hidden

Stochastic Processes Markov Processes Hamid R. Rabiee 1 Overview o Markov Property o Markov

Model Repair for Markov Decision Model Repair for Markov Decision Model Repair for Markov

Imprecise Markov chains From basic theory to applications II prof. Jasper De Bock Imprecise

Markov Chains and Hidden Markov Models COMP 571 Luay Nakhleh, Rice University Markov Chains and

Markov Chains and Hidden Markov Models COMP 571 Luay Nakhleh, Rice University 2 Markov Chains

Discrete Time Markov Chains Discrete-Time Markov Chains Books - Introduction to Stochastic

Hidden Markov Models Steven J Zeil Old Dominion Univ. Fall 2010 1 Discrete Markov Processes

Overview Motivation Verifying Continuous-Time Markov Chains 1 Lecture 1+2: Discrete-Time Markov

Markov processes (Markov chains) Construct a Bayes net from these variables: parents? Markov

Markov Logic Markov Logic Probability First-Order Logic Propositional Logic Markov Logic

Markov Models Kunsch, H.R., State Space and Hidden Markov Models . ETH- Zurich, Zurich;

The Hidden Markov The Hidden Markov Model (HMM) Model (HMM) 1 Lecture Outline Lecture Outline

John Hughes, Simon Peyton Jones, Philip Wadler December 2012 A functional language

An Efficient Posterior Regularized Latent Variable Model for Interactive Sound Source

First European LLVM Developers Conference London, 12 th -13 th April 2012 Opening remarks by

FAA reviewed 46 loss of control incidents 734 voluntary reports 9000 observed

Data Structures Practice Introduction to Markov Chaining Checkout DataStructures project from SVN

Infants -Dr. Renee Baillargeon, Rose M. Scott, Zijing He - Kuldeep Yadav (10358) Lets start

INNOVATION COLONIES: Disrupting the Fortune 500 Dominic Holt Dominic Holt Slide 3 ACM

Surgical Approach to Treatment Fetal period Dx- HLHS, intact atrial septum of Pulmonary