Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal
CSE, IIT Delhi
June 16, 2015
[Joint work with Anup Bhattacharya (IITD) and Amit Kumar (IITD)] Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Faster Algorithms for the Constrained k -means Problem Ragesh - - PowerPoint PPT Presentation
Faster Algorithms for the Constrained k -means Problem Ragesh Jaiswal CSE, IIT Delhi June 16, 2015 [Joint work with Anup Bhattacharya (IITD) and Amit Kumar (IITD)] Ragesh Jaiswal Faster Algorithms for the Constrained k -means Problem k -means
[Joint work with Anup Bhattacharya (IITD) and Amit Kumar (IITD)] Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
c∈C
Example: k = 4, d = 2
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
l fraction of its
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
l fraction of its
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
l fraction of its points
k
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
k
k
x∈X ||x − Γ(X)||2 + |X| · ||Γ(X) − p||2. Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
c∈C
k
k
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
k
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
k
Constraint: Each cluster should have at least r points. Figure : Partition algorithm: Minimum cost circulation.
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
k
k
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
k
k
O(k/ǫ).
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
O(k/ǫ).
O(k/ǫ)
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
k
i=1
k
i=1
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
k
i=1
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
k
i=1
i=1
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
k
i=1
i=1
Lower bound: Ω
˜ Ω
√ǫ
Upper bound: O
˜ O( k
ǫ)
.
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
˜ Ω
√ǫ
˜ O( k
ǫ)
Ω(k/√ǫ).
O(k/ǫ)
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
the cost w.r.t. the centroid (green triangle) of a few randomly chosen points (green dots).
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
ǫO(1)
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Pranjal Awasthi, Moses Charikar, Ravishankar Krishnaswamy, and Ali Kemal Sinop, The hardness of approximation of euclidean k-means, CoRR abs/1502.03316 (2015). David Arthur and Sergei Vassilvitskii, k-means++: the advantages of careful seeding, Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms (Philadelphia, PA, USA), SODA ’07, Society for Industrial and Applied Mathematics, 2007, pp. 1027–1035. Sanjoy Dasgupta, The hardness of k-means clustering, Tech. Report CS2008-0916, Department of Computer Science and Engineering, University of California San Diego, 2008. Hu Ding and Jinhui Xu, A unified framework for clustering constrained data without locality property, SODA’15, pp. 1471–1490, 2015. Dan Feldman, Morteza Monemizadeh, and Christian Sohler, A PTAS for k-means clustering based on weak coresets, Proceedings of the twenty-third annual symposium on Computational geometry (New York, NY, USA), SCG ’07, ACM, 2007, pp. 11–18. Mary Inaba, Naoki Katoh, and Hiroshi Imai, Applications of weighted voronoi diagrams and randomization to variance-based k-clustering: (extended abstract), Proceedings of the tenth annual symposium on Computational geometry (New York, NY, USA), SCG ’94, ACM, 1994, pp. 332–339. Ragesh Jaiswal, Mehul Kumar, and Pulkit Yadav, Improved analysis of D2-sampling based PTAS for k-means and other clustering problems, Information Processing Letters 115 (2015), no. 2. Tapas Kanungo, David M. Mount, Nathan S. Netanyahu, Christine D. Piatko, Ruth Silverman, and Angela Y. Wu, A local search approximation algorithm for k-means clustering, Proc. 18th Annual Symposium on Computational Geometry, 2002, pp. 10–18. Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Amit Kumar, Yogish Sabharwal, and Sandeep Sen, Linear-time approximation schemes for clustering problems in any dimensions, J. ACM 57 (2010), no. 2, 5:1–5:32. Meena Mahajan, Prajakta Nimbhorkar, and Kasturi Varadarajan, The planar k-means problem is NP-hard, Theoretical Computer Science 442 (2012), no. 0, 13 – 21, Special Issue on the Workshop on Algorithms and Computation (WALCOM 2009). Andrea Vattani, The hardness of k-means clustering in the plane, Tech. report, Department of Computer Science and Engineering, University of California San Diego, 2009. Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem
Ragesh Jaiswal Faster Algorithms for the Constrained k-means Problem