confidence sets for persistent diagrams
play

Confidence Sets for Persistent Diagrams Aleksandr Popov Eindhoven - PowerPoint PPT Presentation

Confidence Sets for Persistent Diagrams Aleksandr Popov Eindhoven University of Technology 12th June 2018 1 / 31 What to expect 1. Introduction and motivation 2. Formal definition 3. Computation methods 2 / 31 Introduction and motivation


  1. Confidence Sets for Persistent Diagrams Aleksandr Popov Eindhoven University of Technology 12th June 2018 1 / 31

  2. What to expect 1. Introduction and motivation 2. Formal definition 3. Computation methods 2 / 31

  3. Introduction and motivation 3 / 31

  4. Back to the origins Main idea of TDA: determine topology of underlying space, based on a point cloud. Point cloud can be viewed as a sample of the true space. Random sample of 10 points. 4 / 31

  5. Back to the origins Main idea of TDA: determine topology of underlying space, based on a point cloud. Point cloud can be viewed as a sample of the true space. Random sample of 10 points. 4 / 31

  6. Čech complex Čech complex on the 10 points. 5 / 31

  7. Čech complex Čech complex on the 10 points. 5 / 31

  8. Čech complex Čech complex on the 10 points. 5 / 31

  9. Čech complex Čech complex on the 10 points. 5 / 31

  10. Čech complex Čech complex on the 10 points. 5 / 31

  11. Homology Betui numbers: These will change if we take difgerent radius of balls! 6 / 31 β 0 = β 1 =

  12. Homology Betui numbers: These will change if we take difgerent radius of balls! 6 / 31 β 0 = 1 β 1 = 1

  13. Homology Betui numbers: These will change if we take difgerent radius of balls! 6 / 31 β 0 = 1 β 1 = 1

  14. A difgerent Čech complex 7 / 31 A difgerent Čech complex on the 10 points: β 0 = 6 , β 1 = 0 .

  15. A difgerent Čech complex 7 / 31 A difgerent Čech complex on the 10 points: β 0 = 6 , β 1 = 0 .

  16. A difgerent Čech complex 7 / 31 A difgerent Čech complex on the 10 points: β 0 = 6 , β 1 = 0 .

  17. A difgerent Čech complex 7 / 31 A difgerent Čech complex on the 10 points: β 0 = 6 , β 1 = 0 .

  18. A difgerent Čech complex 7 / 31 A difgerent Čech complex on the 10 points: β 0 = 6 , β 1 = 0 .

  19. Persistent homology Persistence diagram from Vietoris–Rips complex for the 10 points. 8 / 31 4 3 Death 2 1 component loop 0 0 1 2 3 4 Birth

  20. So what’s wrong with this? What here is ‘noise’ and what is ‘signal’? How well do we represent the homology of the space of interest? 9 / 31

  21. What do we want? Persistence diagram from Vietoris–Rips complex for the 10 points with the confidence band. 10 / 31 4 3 Death 2 1 component loop 0 0 1 2 3 4 Birth

  22. Formal definition 11 / 31

  23. X n following Statistical model and basic definitions Sample set S n X distribution P . P is concentrated on . Persistence diagram—multiset of points in plane. Denote by . 12 / 31 Space of interest: manifold M . Manifold M .

  24. Statistical model and basic definitions distribution P . Persistence diagram—multiset of points in plane. Denote by . 12 / 31 Space of interest: manifold M . Sample set S n = { X 1 , . . . , X n } following P is concentrated on M . M with points sampled from P .

  25. Statistical model and basic definitions distribution P . Persistence diagram—multiset of points in plane. Denote by . Sampled points S n . 12 / 31 Space of interest: manifold M . Sample set S n = { X 1 , . . . , X n } following P is concentrated on M .

  26. Statistical model and basic definitions Persistence diagram—multiset of points in 12 / 31 distribution P . Space of interest: manifold M . 4 Sample set S n = { X 1 , . . . , X n } following 3 Death 2 P is concentrated on M . 1 component loop 0 0 1 2 3 4 Birth plane. Denote by P . Persistence diagram P for the example.

  27. d A x y A y d S n x Čech filtrations; distance set filtration of distance function: Čech filtration. . increasing from 0 to for x i.e. the distance to the closest point in A . Čech filtration corresponds to lower level Čech filtration—sequence of Čech x inf D : Distance function for a set A radius. complexes with gradually increasing 13 / 31

  28. d A x y A y d S n x Čech filtrations; distance set filtration of distance function: Čech filtration. . increasing from 0 to for x i.e. the distance to the closest point in A . Čech filtration corresponds to lower level Čech filtration—sequence of Čech x inf D : Distance function for a set A radius. complexes with gradually increasing 13 / 31

  29. d S n x Čech filtrations; distance x Distance from p to A . p A . increasing from 0 to for set filtration of distance function: Čech filtration—sequence of Čech Čech filtration corresponds to lower level i.e. the distance to the closest point in A . radius. complexes with gradually increasing 13 / 31 Distance function for a set A ⊂ R D : d A ( x ) = inf y ∈ A ∥ y − x ∥ 2 ,

  30. Čech filtrations; distance Čech filtration—sequence of Čech complexes with gradually increasing radius. i.e. the distance to the closest point in A . Čech filtration corresponds to lower level set filtration of distance function: 13 / 31 Distance function for a set A ⊂ R D : d A ( x ) = inf y ∈ A ∥ y − x ∥ 2 , Regions for increasing ε . { x : d S n ( x ) ≤ ε } , for ε increasing from 0 to ∞ .

  31. Čech filtrations; distance Čech filtration—sequence of Čech complexes with gradually increasing radius. i.e. the distance to the closest point in A . Čech filtration corresponds to lower level set filtration of distance function: 13 / 31 Distance function for a set A ⊂ R D : d A ( x ) = inf y ∈ A ∥ y − x ∥ 2 , Regions for increasing ε . { x : d S n ( x ) ≤ ε } , for ε increasing from 0 to ∞ .

  32. Čech filtrations; distance Čech filtration—sequence of Čech complexes with gradually increasing radius. i.e. the distance to the closest point in A . Čech filtration corresponds to lower level set filtration of distance function: 13 / 31 Distance function for a set A ⊂ R D : d A ( x ) = inf y ∈ A ∥ y − x ∥ 2 , Regions for increasing ε . { x : d S n ( x ) ≤ ε } , for ε increasing from 0 to ∞ .

  33. What are we trying to get? Can we guarantee with high probability that they are close for all the points? We need to express ‘closeness’ somehow. 14 / 31 We can construct the persistence diagram of S n , ˆ P . We use it to estimate the persistence diagram of M , P .

  34. What are we trying to get? Can we guarantee with high probability that they are close for all the points? We need to express ‘closeness’ somehow. 14 / 31 We can construct the persistence diagram of S n , ˆ P . We use it to estimate the persistence diagram of M , P .

  35. M d Botuleneck distance max b a maximum pairwise cost. Botuleneck distance minimises the a b Perfect matching M of sets A and B a b M min A B W Botuleneck distance: (bijection). 15 / 31 Cost between points a ∈ A , b ∈ B : d ∞ ( a , b ) = max ( | a x − b x | , | a y − b y | ) . 2 3 d ∞ ( a , b ) = 3 .

  36. Botuleneck distance M maximum pairwise cost. Perfect matching M of sets A and B max Botuleneck distance minimises the Botuleneck distance: (bijection). 15 / 31 (2 , 4) (2 . 8 , 4) Cost between points a ∈ A , b ∈ B : (4 . 8 , 2 . 8) d ∞ ( a , b ) = max ( | a x − b x | , | a y − b y | ) . (4 , 2) (0 , 0 . 8) W ∞ ( A , B ) = min ( a , b ) ∈ M d ∞ ( a , b ) . (0 , 0) W ∞ ( A , B ) =

  37. Botuleneck distance M maximum pairwise cost. Perfect matching M of sets A and B max Botuleneck distance minimises the Botuleneck distance: (bijection). 15 / 31 (2 , 4) (2 . 8 , 4) Cost between points a ∈ A , b ∈ B : (4 . 8 , 2 . 8) d ∞ ( a , b ) = max ( | a x − b x | , | a y − b y | ) . (4 , 2) (0 , 0 . 8) W ∞ ( A , B ) = min ( a , b ) ∈ M d ∞ ( a , b ) . (0 , 0) W ∞ ( A , B ) =

  38. Botuleneck distance M maximum pairwise cost. Perfect matching M of sets A and B max Botuleneck distance minimises the Botuleneck distance: (bijection). 15 / 31 (2 , 4) (2 . 8 , 4) Cost between points a ∈ A , b ∈ B : (4 . 8 , 2 . 8) d ∞ ( a , b ) = max ( | a x − b x | , | a y − b y | ) . (4 , 2) (0 , 0 . 8) W ∞ ( A , B ) = min ( a , b ) ∈ M d ∞ ( a , b ) . (0 , 0) W ∞ ( A , B ) = 0 . 8 .

  39. , find c n such that is further than c n from the diagonal, then with probability at least What are we trying to get: take 2 Can we guarantee with high probability that they are close for all the points? Given confidence lim sup n W c n If the point in it is also not on the diagonal in , so it is significant . 16 / 31 We can construct the persistence diagram of S n , ˆ P . We use it to estimate the persistence diagram of M , P .

  40. is further than c n from the diagonal, then with probability at least What are we trying to get: take 2 Can we guarantee with high probability that they are close for all the points? lim sup If the point in it is also not on the diagonal in , so it is significant . 16 / 31 We can construct the persistence diagram of S n , ˆ P . We use it to estimate the persistence diagram of M , P . Given confidence α ∈ (0 , 1) , find c n such that P ( W ∞ ( ˆ P , P ) > c n ) ≤ α . n →∞

  41. What are we trying to get: take 2 Can we guarantee with high probability that they are close for all the points? lim sup 16 / 31 We can construct the persistence diagram of S n , ˆ P . We use it to estimate the persistence diagram of M , P . Given confidence α ∈ (0 , 1) , find c n such that P ( W ∞ ( ˆ P , P ) > c n ) ≤ α . n →∞ If the point in ˆ P is further than c n from the diagonal, then with probability at least 1 − α it is also not on the diagonal in P , so it is significant .

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend