Sequential Importance Sampling for Counting Linear Extensions - PowerPoint PPT Presentation

Sequential Importance Sampling for Counting Linear Extensions Isabel Beichl NIST, Applied & Computational Mathematics Division work done in collaboration with Francis Sullivan Center for Computing Sciences Alathea Jensen George Mason University

Counting Linear Extensions Counting Linear Extensions of a Poset How many ways to order the vertices of a poset or DAG consistent with the order.

Counting Linear Extensions of a Poset ◮ How many ways to order the vertices of a poset or DAG consistent with the order. ◮ = the number of topological sorts of the directed acyclic graph ◮ poset = partially ordered set, DAG = directed acyclic graph. ◮ For this problem, number of extensions of a poset = number of topological sorts of a DAG = number of orderings of vertices that preserves DAG structure. ◮ As we start by taking the transitive closure of the DAG, these are the same for our purposes.

Why approximation is important ◮ Scheduling - Processor, construction ◮ A measure of the number of choices remaining = A measure of degrees of freedom ◮ Brightwell Winkler, proved NP hard to do exactly ◮ Karzanov Kachiyan MCMC approximation, high degree polynomial Not practical

Relation to linear algebra ◮ Sometimes want a matrix upper triangular with row and column permutations. ◮ v → w iff a ( v , w ) � = 0 ◮ Can do iff when thought of as a DAG, can get a linear extension. This method gives a fast way to evaluate the number of ways to make upper triangular with row & column interchanges. For optimization in scheduling, want to know that there are many possibilities to choose from.

We investigated Sequential Importance Sampling (SIS) as an approximation method.

Recall Classical Topological Sort Choose any vertex with no predecessors, put it in the list. Then delete that vertex.

What is the Knuth method ◮ Used to find the size of a tree you don’t have explicitly ◮ Take a sample from the root to the leaves: At each node make a choice of which child to take and note the number of possibilities ◮ The estimate is the product of the possibilities

Knuth method applied to our problem The tree is made by all possible partial extensions.

Problem with the Knuth method Variance can be large

Sequential Importance Sampling ◮ A variance reduction technique Suppose we wish to estimate a sum: N � F ( N ) = f ( σ j ) j =1 A basic approach is to take a sample of size M << N M F ( N ) ≈ N � f ( σ j ) M j =1 where each σ j is uniformly generated Note that 1 / N is the probability of selecting σ j so     M M f ( σ j ) N  = 1 f ( sigma j ) � � F ( N ) ≈    M M p ( σ j ) j =1 j =1

Sequential Importance Sampling Importance sampling says you can select σ j NON-uniformly if you divide by p ( σ j ). M f ( σ j ) 1 f ( σ ) � � � M → p ( σ ) p ( σ ) = f ( σ ) = F ( N ) p ( σ j ) j =1 σ σ because in the long run σ will be chosen p ( σ ) ∗ M times. This limit holds for any probability distribution p ( σ ). ◮ But how to choose p ()?

Sequential Importance Sampling The ideal choice of importance function is p ( σ ) = f ( σ ) F ( N ) i.e. the weight assigned to σ is its relative contribution to the desired sum. SO   M � 2 var = 1 � f ( σ j ) �  − F 2  M p ( σ j ) j =1 f 2 ( σ ) f 2 ( σ ) p 2 ( σ ) p ( σ ) − F 2 = p ( σ ) − F 2 = F 2 − F 2 = 0 . � � → σ σ But this requires knowledge of the answer!!!

Sequential Importnace Sampling But sometimes we DO know something about the tree In our case the path through the tree is made by one extension.

An Importance Function for Counting Extensions The number of descendants of a node + 1. Does not change during the course of the computation. The row sums of the adjacency matrix + 1.

How does our SIS work?

EXACT for G = a tree! All samples are the same

Variance for SIS Let s be a random variable. X a poset. Want to know: � s 2 � � s � 2 Our s is 1 / p sampled with probability p . 1 � � � � s � p = s i ∗ p i = ∗ p i = 1 = L ( X ) p i alli i L ( X ) � 1 1 = L ( X ) � 1 � � � s 2 � p = s 2 i p i = ∗ p i = p � u p 2 p i i i i where � . � u is the uniform average. These look the same but they are not because the frequencies are different.

Sequential Importance Sampling For the special case of linear extensions 1 x p = � k r k where x is the product of all those numerators. So in our case L ( X ) ∗ � x � x � r � u � r � u � s 2 � p � x � u = = L ( X ) = � rL ( X ) � s � 2 L ( X ) 2 p

Recursion for better Variance For a disjoint union of subgraphs X = Y ∪ Z , there is a formula � s + t � L ( X ) = L ( Y ) ∗ L ( Z ) s where Y has a size s and Z has size t . Recursion using connected components also reduces variance.

Summary for Successors Imnportance Function ◮ Reduces variance over uniform importance ◮ Can prove it does trees exactly ◮ Can prove recursion reduces variance

Another importance function Prefer to choose vertices that have smaller “open slots” Taking sqrt is even better. (Why?) 1 √ spaces . remaining − √ # successors p =

Sequential Importance Sampling for Counting Linear Extensions - PowerPoint PPT Presentation

Sequential Importance Sampling for Counting Linear Extensions Isabel Beichl NIST, Applied & Computational Mathematics Division work done in collaboration with Francis Sullivan Center for Computing Sciences Alathea Jensen George Mason

{Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code}

Sampling Methods Oliver Schulte - CMPT 419/726 Bishop PRML Ch. 11 Sampling Rejection Sampling

Multiple importance sampling Slides for CS6630 lecture 6 sampling the BRDF sampling the

Random Sampling Florian Schoppmann August 24, 2010 Non-Sequential Sequential Sequential with

Chapter 7. Sampling Chapter 7. Sampling methods? methods? Two types of sampling methods Two

What is the strengths and weakness of these sampling methods? Sampling Strengths /

Sampling Methods CMSC 678 UMBC Outline Recap Monte Carlo methods Sampling Techniques Uniform

Double, Multiple, and Sequential Sampling Double-sampling In a double-sampling plan, a first

Hardware Design with VHDL Sequential Stmts ECE 443 Sequential Statements This slide set covers

Sequential Files : Outline ! Overview ! Ordered vs. Unordered ! Physical sequential Files !

Sampling Overview R toy sampling Non-probability sampling Probability Methods (AKA random)

Sampling Sediment and Sampling Sediment and Sampling Sediment and Porewater Sampling Sediment

Sequential Importance Sampling for Rare Event Estimation with Computer Experiments Brian

On Local Distributed Sampling and Counting Yitong Yin Nanjing University Joint work with W

CS 7616 Pattern Recognition Linear, Linear, Linear Aaron Bobick School of Interactive

Overview 1. Probabilistic Reasoning/Graphical models 2. Importance Sampling 3. Markov Chain

Case Study: How to Avoid Failures in the Design and Installation of Permeable Reactive Barriers

Investor Presentation November 2019 January 2020 Disclaimer This document may contain

Vertex Resource Group Ltd. Investor Presentation August 2019 www.vertex.ca | TSXV:VTX Click to

11/02/2016 UKUROVA UNIVERSITY Outline DEPARTMENT OF ELECTRICAL AND ELECTRONICS ENGINEERING

Spin ice dynamics : generic vertex models Leticia F. Cugliandolo Universit Pierre et Marie

Conflict Directed Clause Learning for the Maximum Weighted Clique Problem Emmanuel Hebrard 1 and

Search for supersymmetry with displaced dileptons at the ATLAS experiment Dominik Krauss

Polygon decomposition into monotone polygons Vertex types START vertex (2 edges on the right and