Clustering by contrast Cyril CHHUN Tlcom Paris Advisor: Jean-Louis - PowerPoint PPT Presentation

Clustering by contrast Cyril CHHUN Télécom Paris Advisor: Jean-Louis DESSALLES June 20, 2019

Outline 1 Introduction 2 The algorithm 3 Test results 4 Conclusion

Introduction views on Youtube” Clustering by contrast Cyril CHHUN • learn from a single example: a “Siamese cat” • detect anomalies: a talking cat • produce negations and explanations: “she is not a writer” without going through the set of “small” objects The algorithm Design a clustering algorithm able to: What are the end goals of contrast learning? Introduction Conclusion Test results 3 / 17 • understand the meaning of “small bacteria” and “small galaxy” • produce relevant descriptions: “it’s a singer who has ten million

Introduction properties. Clustering by contrast Cyril CHHUN Information Processing Systems 15 , 2002. 1 Jon Kleinberg. An impossibility theorem for clustering. Advances in Neural functions • Solution: forsake one of those properties or use non-metric function-based clustering algorithm which verifjes those three The algorithm • Scale-invariance, richness, consistency Which properties would we expect of a clustering algorithm? Impossibility theorem Conclusion Test results 4 / 17 • Kleinberg (2002) 1 : it is impossible to design a distance

Introduction The algorithm Test results Conclusion Vocabulary • Object: observed instance • Prototype: mental representation of a group as a basic object • Contrast: “difgerence” between two objects • Weight: number of times a prototype has been recalled to its prototype • Order: real-life observations are fjrst-order objects, contrasts are second-order objects, etc. Cyril CHHUN Clustering by contrast 5 / 17 • Deviation: acceptable range of an object’s properties compared

Introduction . Clustering by contrast Cyril CHHUN w . . . The algorithm . . 6 / 17 Mean Test results Weight Conclusion Deviation Design Prototypes How to represent prototypes?     µ 1 σ 1         µ m σ m

Introduction m Clustering by contrast Cyril CHHUN • Problem: many prototypes can verify the smallest distance. verifjes scale-invariance along any axis . j The algorithm 7 / 17 Finding the clusters Test results • Dimension-agnostic, scale-invariant, not density-based. Conclusion • The prametric function Design Given object b , how to fjnd the best prototype a of deviation a ′ ? � � | a j − b j | � d ( a , b ) �→ ✶ > θ j a ′ j =1 • It is not a distance, as none of the three properties are verifjed!

Introduction The algorithm Clustering by contrast Cyril CHHUN • Deviations are not used in this step so as to avoid hubs. • Using this rule, we make a tournament and pick the winner. dimensions. The other cluster is eliminated. one whose mean is closer to the object along the most avoid the hub? reasonable: how to cluster seems more Figure: The smaller Comparing the clusters Design Conclusion Test results 8 / 17 • We simply take the best prototypes two by two and choose the

Introduction • The winning cluster is updated as follows: Clustering by contrast Cyril CHHUN improve effjciency. Unused prototypes are forgotten fjrst. • We enforce a limited memory to cope with initial errors and The algorithm 9 / 17 • The object is added as a prototype no matter what, with a How to stock the new information in the memory? Updating the memory Design Conclusion Test results deviation equal to ε times itself and a weight of 1 mean = weight ∗ prototype + object weight + 1 deviation = weight ∗ deviation + | prototype − object | weight + 1 weight = weight + 1

Introduction The algorithm Test results Conclusion Design Skeleton def feed_data_online(data): for obj in data: closest_clusters = find_closest_clusters(obj) winner = cluster_battles(obj, closest_clusters) update_memory(obj, winner) Cyril CHHUN Clustering by contrast 10 / 17 • Clustering: simple loop with complexity O ( mem _ size × n ) → Online learning

Introduction The algorithm Test results Conclusion Understanding results • Softer clustering than k-means; difgerent ways to classify when seeing a new object: – Find the closest prototype to the object (by tournament for example) Cyril CHHUN Clustering by contrast 11 / 17 – Assign object b to prototype a if d ( a , b ) = 0

Introduction The algorithm Test results Conclusion Live demonstration Cyril CHHUN Clustering by contrast 12 / 17

Introduction contrast c such that Clustering by contrast Cyril CHHUN contrast. • Example: seeing a black tomato would give a “red-to-black” j The algorithm 13 / 17 • Given an object b and its closest prototype a , we extract the low-dimensional and applicable between similar objects. • The contrast features should be meaningful, i.e. How to extract relevant contrasts? What about contrasts? Conclusion Test results � � | a j − b j | c j = ( a j − b j ) · ✶ > θ j a ′

Introduction The algorithm Test results Conclusion What about contrasts? How to stock the contrasts in memory? deviation and weight. Then, how to refjne the contrasts? • We can use the same procedure ! Cyril CHHUN Clustering by contrast 14 / 17 • We can use the same principle! Contrast-prototypes with mean,

Introduction The algorithm Test results Conclusion Second demonstration Cyril CHHUN Clustering by contrast 15 / 17

Introduction The algorithm Test results Conclusion Feedback on the checklist without going through the set of “small” objects views on Youtube” Cyril CHHUN Clustering by contrast 16 / 17 ✓ understand the meaning of “small bacteria” and “small galaxy” ✗ produce relevant descriptions: “it’s a singer who has ten million ✗ produce negations and explanations: “she is not a writer” ✓ detect anomalies: a talking cat ✓ learn from a single example: a “Siamese cat”

Introduction The algorithm Test results Conclusion Conclusion • The algorithm is dimension-agnostic and verifjes scale-invariance • It learns on-the-fmy and has a reasonable complexity (linear on average) • Designed to be used on relatively high-level datasets • Contrasts still need testing: some inconsistent results can appear Cyril CHHUN Clustering by contrast 17 / 17

Clustering by contrast Cyril CHHUN Tlcom Paris Advisor: Jean-Louis - PowerPoint PPT Presentation

Clustering by contrast Cyril CHHUN Tlcom Paris Advisor: Jean-Louis DESSALLES June 20, 2019 Outline 1 Introduction 2 The algorithm 3 Test results 4 Conclusion Introduction views on Youtube Clustering by contrast Cyril CHHUN

CHEVREUL Simultaneous Contrast Successive Contrast Successive Contrast Mixed Contrast look

Graph Clustering Graph Clustering What is clustering? What is clustering? Finding patterns

Subspace Clustering Ensemble Clustering Subspace Clustering, Ensemble Clustering, Alternative

Contrast Echocardiography Echocardiography Contrast Contrast Echocardiography Jean- -Louis J.

Evolutionary Clustering Presenter: Lei Tang Evolutionary Clustering Evolutionary Clustering

Clustering A Categorization of Major Clustering Methods Partitioning Methods

BIBLICAL SURVEY I Samuel A Study in Contra s ts Its a polar bear in a snow storm! Its

STATUS REPORT ON IN-FOCUS PHASE CONTRAST Bob Glaeser A THE TULIP APERTURE IS A

Trust based Clustering for Group Trust based Clustering for Group Trust based Clustering for

Finding Clusters Types of Clustering Approaches: Linkage Based, e.g. Hierarchical Clustering

Clustering Hierarchical clustering and k-mean clustering Genome 373 Genomic Informatics

Cl Clustering t i A Categorization of Major Clustering Methods Partitioning Methods

Clustering Hierarchical clustering, k-mean clustering Genome 559: Introduction to Statistical and

CSCE 478/878 Lecture 8: Stephen Scott Clustering Introduction Outline Clustering Stephen

Clustering and Dimensionality Reduction Preview Clustering K -means clustering

Clustering kMeans, Expectation Maximization, Self-Organizing Maps Outline K-means

Introduction to statistics: Linear models Shravan Vasishth Universit at Potsdam

Contour integration methods for self adjoint operators OTKR December 1922 2019 TU Vienna

Astrophysical and Dark Matter Origin of the IceCube High-energy Neutrino Events B HUPAL D EV

Introduction to Optimization Amy Langville SAMSI Undergraduate Workshop N.C. State University

Flexible linear models John Blischak Instructor DataCamp Differential Expression Analysis with

Practical Interpretation With a quantitative factor, like power in the etch-rate example, typically

A cross-linguistic comparison of the L2 acquisition of VOT contrasts Katherine Zhang Carnegie

A learning strategy for contrast-agnostic segmentation of brain MRI scans Benjamin Billot Billot