K -Medoids for K -Means Seeding James Newling & Franc ois - PowerPoint PPT Presentation

Oct 29, 2022 •1.81k likes •1.96k views

K -Medoids for K -Means Seeding James Newling & Franc ois Fleuret Machine Learning Group, Idiap Research Institute & EPFL December 5th, 2017 COLE POLYTECHNIQUE FDRALE DE LAUSANNE The standard K -means pipeline First: Seeding.

K -Medoids for K -Means Seeding James Newling & Franc ¸ois Fleuret Machine Learning Group, Idiap Research Institute & EPFL December 5th, 2017 ÉCOLE POLYTECHNIQUE FÉDÉRALE DE LAUSANNE
The standard K -means pipeline First: Seeding. Second: Lloyd’s (a.k.a. K -means) algorithm. simulated data K = 12 2 , N = 25 K uniform K -means++ LLOYD LLOYD E = 0 . 105 E = 0 . 072 1 / 3
The standard K -means pipeline (+CLARANS) simulated data K = 12 2 , N = 25 K uniform K -means++ CLARANS CLARANS LLOYD LLOYD LLOYD LLOYD E = 0 . 032 E = 0 . 105 E = 0 . 072 E = 0 . 032 1 / 3
CLARANS of Ng and Han (1994) 1: while not converged do randomly choose 1 center and 1 non-center 2: if swapping them decreases E then 3: implement the swap 4: end if 5: 6: end while 2 / 3
CLARANS of Ng and Han (1994) 1: while not converged do randomly choose 1 center and 1 non-center 2: if swapping them decreases E then 3: implement the swap 4: end if 5: 6: end while Avoids local minima of LLOYD by, • long-range swaps • updating centers and samples simultanously . 2 / 3
CLARANS of Ng and Han (1994) 1: while not converged do randomly choose 1 center and 1 non-center 2: if swapping them decreases E then 3: implement the swap 4: end if 5: 6: end while Avoids local minima of LLOYD by, • long-range swaps • updating centers and samples simultanously . We present algorithmic improvements, where • computing new E is O ( N / K ) • implementing swap is O ( N ) . 2 / 3
Results • RNA dataset, d = 8 , N = 16 × 10 4 , K = 400 • 50 runs without CLARANS (red), 24 runs with (blue). 1 . 8 1 . 6 E 1 . 4 1 . 2 1 . 0 0 . 0 0 . 5 1 . 0 1 . 5 2 . 0 2 . 5 3 . 0 3 . 5 time [s] K -means++ LLOYD K -means++ CLARANS LLOYD • On 16 datasets, geometric mean improvement is 3 % . CLARANS with Levenshtein metric for sequence data, l 0 , l 1 , . . . , l ∞ for sparse/dense vectors, many others, on github. 3 / 3
The end james.newling@idiap.ch

Recommend

The K - Medoids Clustering Method Find representative objects, called medoids, in clusters PAM

The K - Medoids Clustering Method Find representative objects, called medoids, in clusters PAM (Partitioning Around Medoids, 1987) starts from an initial set of medoids and iteratively replaces one of the medoids by one of the

289 views • 28 slides

REVEGETATION REVEGETATION REVEGETATION REVEGETATION SEEDING SEEDING SEEDING SEEDING Or Or

REVEGETATION REVEGETATION REVEGETATION REVEGETATION SEEDING SEEDING SEEDING SEEDING Or Or The Art of Greening Up Your Work The Art of Greening Up Your Work Bill Awmack P.Ag . Why Re Why Re - -Seed? Seed? Seed? Why Re Why Re Seed?

586 views • 46 slides

Soybean Seeding Trend Analysis February 2020 The Story ry of f Soybean Seeding Rates

Atlantic Grains Council Soybean Seeding Trend Analysis February 2020 The Story ry of f Soybean Seeding Rates Introduction Chapter 1: Plant population Chapter 2: Yield Chapter 3: Previous crop Chapter 4: Seeding equipment

713 views • 27 slides

Tahoe-Truckee Cloud Seeding Project Water Year 2018 1 DRI Cloud seeding generator: on (Sierra

Tahoe-Truckee Cloud Seeding Project Water Year 2018 1 DRI Cloud seeding generator: on (Sierra Crest Tahoe Winter Snowfall - 2015-2018 Lake Tahoe Basin Snow Water Equivalent by Water Year WY2017 WY2016 median WY2018 74% median WY2015

420 views • 23 slides

Tahoe-Truckee Cloud Seeding Project Water Year 2018 1 DRI Cloud seeding generator: on (Sierra

Tahoe-Truckee Cloud Seeding Project Water Year 2018 1 DRI Cloud seeding generator: on (Sierra Crest Tahoe Winter 2017-2018 (WY2018) Snow Water Equivalent by Month 74% Average WY2018(74%) - Nov (11%), (9%) - Dec (25%),

343 views • 17 slides

Impacts of Changing Seeding Impacts of Changing Seeding Rates in Soybean Rates in Soybean Shawn

Impacts of Changing Seeding Impacts of Changing Seeding Rates in Soybean Rates in Soybean Shawn P. Conley Shawn P. Conley Soybean Extension Specialist Soybean Extension Specialist Purdue University Purdue University Should Yield

443 views • 7 slides

k -means++ seeding Have seen that the k -means algorithm can output arbitrarily poor solutions, if

k -means++ seeding Have seen that the k -means algorithm can output arbitrarily poor solutions, if started with a bad set of initial centroids k -means++ is a simple, probabilistic algorithm to compute initial centroids These centroids are

745 views • 8 slides

Research Update February 2020 In Introduction Soybeans: Seeding Rate, Fungicide and

Atlantic Grains Council Research Update February 2020 In Introduction Soybeans: Seeding Rate, Fungicide and Fertility trials Peas: Gypsum and Seeding rate trials Oats: Nitrogen and Fungicide trials Corn: Boron/Sulfur trials in

689 views • 45 slides

Idaho Power Companys 2009 Cloud Seeding Program Summary Shaun Parkinson, Ph.D, P.E.

An I DACORP Company Idaho Power Companys 2009 Cloud Seeding Program Summary Shaun Parkinson, Ph.D, P.E. Engineering Leader Idaho Powers Cloud Seeding Projects Payette Upper Snake in cooperation with E. Idaho - HCRC&D ldahoPo

265 views • 24 slides

Idaho Power Companys 2009 Cloud Seeding Program Summary Kevin Wade Meteorological Information

An IDACORP Company Idaho Power Companys 2009 Cloud Seeding Program Summary Kevin Wade Meteorological Information Systems Specialist Idaho Powers Idaho Power s Cloud Seeding Projects Payette Payette Upper Snake in cooperation with E

565 views • 18 slides

USING PAST SEEDING TREATMENTS TO INFORM FUTURE SOURCING IN THE COLORADO PLATEAU ANDREA T.

USING PAST SEEDING TREATMENTS TO INFORM FUTURE SOURCING IN THE COLORADO PLATEAU ANDREA T. KRAMER, SHANNON STILL, NORA TALKINGTON, TROY WOOD NATIONAL NATIVE SEED CONFERENCE FEBRUARY 15, 2017 MANY THINGS INFLUENCE SEEDING OUTCOMES Management

385 views • 24 slides

Improved Hunt Seeding with Specific Anomaly Scoring Brenden Bishop January 8, 2019 1/21

Introduction Finding Anomalies Example Conclusion References Improved Hunt Seeding with Specific Anomaly Scoring Brenden Bishop January 8, 2019 1/21 Brenden Bishop Improved Hunt Seeding withSpecific Anomaly Scoring Introduction Finding

813 views • 63 slides

Tahoe-Truckee Cloud Seeding Project Preliminary Results Water Year 2018 April 4, 2018 1 DRI

Tahoe-Truckee Cloud Seeding Project Preliminary Results Water Year 2018 April 4, 2018 1 DRI Cloud seeding generator: on (Sierra Crest Tahoe Winter 2017-2018 Target area Mt Rose SWE (in) Echo Pk SWE (in) 8,800 MSL 7,653 MSL Control

261 views • 13 slides

High Repetition Rate mJ-level Few-Cycle Pulse Laser Amplifier for XUV-FEL seeding . Laser

High Repetition Rate mJ-level Few-Cycle Pulse Laser Amplifier for XUV-FEL seeding . Laser amplifier development: applications at high repetition rate FELs The FLASH-II FEL seeding project Requirements for an XUV seed source and for

603 views • 29 slides

How to Optimize Gower Distance Weights for the k-Medoids Clustering Algorithm to Obtain Mobility

How to Optimize Gower Distance Weights for the k-Medoids Clustering Algorithm to Obtain Mobility Profiles of the Swiss Population Alperen Bektas and Ren Schumann HES-SO Valais / Wallis The 6th Swiss Conference on Data Science Bern, 14 th of

265 views • 15 slides

K-means++: The Advantages of Careful Seeding Sergei Vassilvitskii David Arthur (Stanford

K-means++: The Advantages of Careful Seeding Sergei Vassilvitskii David Arthur (Stanford university) Clustering R d Given points in split them into similar groups. k n Clustering R d Given points in split them into

898 views • 43 slides

Semidefinite programming converse bounds for quantum communication arXiv:1709.00200 Kun Fang

Semidefinite programming converse bounds for quantum communication arXiv:1709.00200 Kun Fang Joint work with Xin Wang, Runyao Duan Centre for Quantum Software and Information U niversity of T echnology S ydney Quantum communication A 1 B 1 A

273 views • 24 slides

Scalable Precision Tuning of Numerical Software Cindy Rubio-Gonzlez Department of Computer

Scalable Precision Tuning of Numerical Software Cindy Rubio-Gonzlez Department of Computer Science University of California, Davis Best Practices for HPC Software Developers Webinar, October 14 th , 2020 Floating-Point Precision Tuning

590 views • 34 slides

Quantum Algorithms for Systems of Linear Equations Rolando Somma Theoretical Division Los Alamos

Quantum Algorithms for Systems of Linear Equations Rolando Somma Theoretical Division Los Alamos National Laboratory Joint work with Andrew Childs Robin Kothari Yigit Subasi Davide Orsucci Maryland Microsoft Vienna Los Alamos Workshop

1.37k views • 101 slides

Learning Unitaries with gradient descent optimization Reevu Maity (Oxford) In progress with

Learning Unitaries with gradient descent optimization Reevu Maity (Oxford) In progress with Bobak Kiani (MIT), Zi-Wen Liu (Perimeter), Seth Lloyd (MIT) & Milad Marvian(MIT) It from Qubit 2019 June 13 Outline We consider classical

658 views • 14 slides

Today More on Closed World Assumption & Negation as Failure. Clark completion

1 Today More on Closed World Assumption & Negation as Failure. Clark completion Lloyd-Topor transformation Alan Smaill Logic Programming Nov 8, 2010 2 Reminder: Negation by failure Prolog does not distinguish between being

710 views • 23 slides

Deep RL + K-Means Matt Gormley Lecture 25 Apr. 17, 2019 1 Reminders Homework 8:

10-601 Introduction to Machine Learning Machine Learning Department School of Computer Science Carnegie Mellon University Deep RL + K-Means Matt Gormley Lecture 25 Apr. 17, 2019 1 Reminders Homework 8: Reinforcement Learning Out:

579 views • 39 slides

Reassessing the Role of Heterogeneity to Understand Business Cycles Jos Vctor Ros Rull

Reassessing the Role of Heterogeneity to Understand Business Cycles Jos Vctor Ros Rull With material developed jointly with Zhen Huo and by Dirk Krueger University of Pennsylvania, UCL, CEPR, and CAERP EEA-ESEM Lisbon 2017 1

375 views • 33 slides

A Brief Introduction to Matching Theory and Its Applications pek zkal Sanver Bilgi

A Brief Introduction to Matching Theory and Its Applications pek zkal Sanver Bilgi University Bilgi University Summer School, PACD2019 Universite de Caen Basse Normandie August 29, 2019 Outline Basic Concepts and Results on Matching

590 views • 27 slides

K -Medoids for K -Means Seeding James Newling & Franc ois - PowerPoint PPT Presentation

K -Medoids for K -Means Seeding James Newling & Franc ois Fleuret Machine Learning Group, Idiap Research Institute & EPFL December 5th, 2017 COLE POLYTECHNIQUE FDRALE DE LAUSANNE The standard K -means pipeline First: Seeding.

The K - Medoids Clustering Method Find representative objects, called medoids, in clusters PAM

REVEGETATION REVEGETATION REVEGETATION REVEGETATION SEEDING SEEDING SEEDING SEEDING Or Or

Soybean Seeding Trend Analysis February 2020 The Story ry of f Soybean Seeding Rates

Tahoe-Truckee Cloud Seeding Project Water Year 2018 1 DRI Cloud seeding generator: on (Sierra

Tahoe-Truckee Cloud Seeding Project Water Year 2018 1 DRI Cloud seeding generator: on (Sierra

Impacts of Changing Seeding Impacts of Changing Seeding Rates in Soybean Rates in Soybean Shawn

k -means++ seeding Have seen that the k -means algorithm can output arbitrarily poor solutions, if

Research Update February 2020 In Introduction Soybeans: Seeding Rate, Fungicide and

Idaho Power Companys 2009 Cloud Seeding Program Summary Shaun Parkinson, Ph.D, P.E.

Idaho Power Companys 2009 Cloud Seeding Program Summary Kevin Wade Meteorological Information

USING PAST SEEDING TREATMENTS TO INFORM FUTURE SOURCING IN THE COLORADO PLATEAU ANDREA T.

Improved Hunt Seeding with Specific Anomaly Scoring Brenden Bishop January 8, 2019 1/21

Tahoe-Truckee Cloud Seeding Project Preliminary Results Water Year 2018 April 4, 2018 1 DRI

High Repetition Rate mJ-level Few-Cycle Pulse Laser Amplifier for XUV-FEL seeding . Laser

How to Optimize Gower Distance Weights for the k-Medoids Clustering Algorithm to Obtain Mobility

K-means++: The Advantages of Careful Seeding Sergei Vassilvitskii David Arthur (Stanford

Semidefinite programming converse bounds for quantum communication arXiv:1709.00200 Kun Fang

Scalable Precision Tuning of Numerical Software Cindy Rubio-Gonzlez Department of Computer

Quantum Algorithms for Systems of Linear Equations Rolando Somma Theoretical Division Los Alamos

Learning Unitaries with gradient descent optimization Reevu Maity (Oxford) In progress with

Today More on Closed World Assumption &amp; Negation as Failure. Clark completion

Deep RL + K-Means Matt Gormley Lecture 25 Apr. 17, 2019 1 Reminders Homework 8:

Reassessing the Role of Heterogeneity to Understand Business Cycles Jos Vctor Ros Rull

A Brief Introduction to Matching Theory and Its Applications pek zkal Sanver Bilgi

Today More on Closed World Assumption & Negation as Failure. Clark completion