SLIDE 1
OptimizationoverZonotopes andTrainingSupportVectorMachines - - PowerPoint PPT Presentation
OptimizationoverZonotopes andTrainingSupportVectorMachines - - PowerPoint PPT Presentation
ZonotopesandSVM D.Eppstein,UCIrvine,WADS2001 OptimizationoverZonotopes andTrainingSupportVectorMachines MarshallBern XeroxPaloAltoResearchCtr. DavidEppstein
SLIDE 2
SLIDE 3
ZonotopesandSVM D.Eppstein,UCIrvine,WADS2001
SupportVectorMachines(SVM)
Machinelearningtechniqueforclassifjcationproblems i.e.givenalargenumberoflabeledyes/noinstances, predictyes/novalueofadditionalinstances Liftdatavaluestomoderate-orhigh-dimensionalEuclideanspace maybeimplicit,using“kernelfunctions”toreplacedotproducts Findhyperplaneseparatingliftedyesandnoinstances dependingononlyfew“supportvectors” Predictfuturevaluesbyliftingandusingsamehyperplane
Mathematicaloptimizationproblem Usinglinearorconvexprogrammingalgorithms
SLIDE 4
ZonotopesandSVM D.Eppstein,UCIrvine,WADS2001
DirectionsofSVMResearch
ApplySVMtechniquestomachinelearningapplications CompareSVMtechniquestootherclassifjers ModifySVMtoproducebetterclassifjers DeriveeffjcientpracticalalgorithmsforSVMoptimization Dotheoreticalanalysisofhyperplaneseparationalgorithms
SLIDE 5
ZonotopesandSVM D.Eppstein,UCIrvine,WADS2001
DirectionsofSVMResearch
ApplySVMtechniquestomachinelearningapplications CompareSVMtechniquestootherclassifjers ModifySVMtoproducebetterclassifjers DeriveeffjcientpracticalalgorithmsforSVMoptimization Dotheoreticalanalysisofhyperplaneseparationalgorithms
Ourinterests
SLIDE 6
ZonotopesandSVM D.Eppstein,UCIrvine,WADS2001
DirectionsofSVMResearch
ApplySVMtechniquestomachinelearningapplications CompareSVMtechniquestootherclassifjers ModifySVMtoproducebetterclassifjers DeriveeffjcientpracticalalgorithmsforSVMoptimization Dotheoreticalanalysisofhyperplaneseparationalgorithms
Thistalk
SLIDE 7
ZonotopesandSVM D.Eppstein,UCIrvine,WADS2001
Isn’titjustlinearprogramming?
fjndv,cdefjningseparatinghyperplanev·x+c=0 satisfyingconstraintsv·Yi+c≥0,foryes-instances, v·Ni+c≤0forno-instances FromcomputationalgeometryweknowLPiseffjcientwhenn>>d
No,because...
Manyfeasiblesolutions,needtochooseone “maximummarginclassifjer”leadstoquadraticprogram,stillnotsohard Use“softmarginclassifjer”toavoiddependenceonoutliers blowsupdimensionfromdton+difexpressedasLP sowantalgorithmsthatstayinlowdimension
SLIDE 8
ZonotopesandSVM D.Eppstein,UCIrvine,WADS2001
Maximummarginclassifjer
Choosehyperplaneatmaximumdistancefrombothconvexhulls Workswell(butsodomanyotherchoices)whensetswell-separated Whensetsoverlap,distancefromhullsisnegative Maximummarginunpopularinthiscase duetosensitivedependenceonthemostextremepoints(outliers)
SLIDE 9
ZonotopesandSVM D.Eppstein,UCIrvine,WADS2001
SoftConvexHull
Idea:shrinkthetwoconvexhullssotheyarewellseparated Usualhull:sumaipi,0≤ai≤1,sumai=1 Centroid:sumaipi,0≤ai≤1/n,sumai=1 Softconvexhull:sumaipi,0≤ai≤µ,sumai=1 Chooseparameter1/n≤µ≤1toshrinkhulltowardscentroid Resultisa“centroidpolytope”[Bernetal.,ESA‘95]: weightedaverageofpointswhereweightsvaryininterval[0,µ] Formedbyintersectingzonotopesumaipi,0≤ai≤µ withhyperplanesumai=1
SLIDE 10
ZonotopesandSVM D.Eppstein,UCIrvine,WADS2001
SoftConvexHulls
µ = µ = µ = µ =
5/12 1/3 1/2 3/4
x1 x2 x3
SLIDE 11
ZonotopesandSVM D.Eppstein,UCIrvine,WADS2001
SoftMarginClassifjers
Ifµislarge,optimalseparatinghyperplanedependsonlyonfew“supportvectors” ratherthanonentiredataset Ifµissmall,softhullswillbewellseparated Chooseµautomaticallytolargestvalueforwhichhullsareseparated Geometrically:fjndlowestpointinintersectionoftwozonotopes
- r...
Chooseµempirically(e.g.bycross-validatingtofjndbestclassifjer) Performmaximummarginclassifjcationforchosenvalue Geometrically:fjndclosestpointsintwozonotopecross-sections Ourtechniquesapplytobothproblems
SLIDE 12
ZonotopesandSVM D.Eppstein,UCIrvine,WADS2001
Zonotopes:Minkowskisumsoflinesegments
Chooseonepointfromeachsegment,addthecoordinates TypicallyΘ(nd–1)facets correspondingtohyperplanearrangementind–1dimensions
SLIDE 13
ZonotopesandSVM D.Eppstein,UCIrvine,WADS2001
OptimizationoverZonotopes
Givenacollectionofzonotopesgeneratedbynlinesegments andgivenalinearobjectivefunctionf fjndthepointxintheintersectionofthezonotopesminimizingf(x) Likelinearprogrammingwithzonotopeinsteadofhalfspaceconstraints CouldbeturnedintoanexplicitLPbutnumberofconstraintsblowsup Thissolvesautomaticchoiceofµ,fjxed-µvariantissimilar
Goals:
scalablealgorithm(linearornear-linearinn) lowdependenceond—typicalCGalg.isexponential,wepreferpolynomial
SLIDE 14
ZonotopesandSVM D.Eppstein,UCIrvine,WADS2001
Optimizationoveronezonotope
Givenzonotopeandlinearfunction,whatisbestvertex? Veryeasy:optimizeindependentlyovereachlinesegment Zonotopeintersecthyperplanealmostaseasy:fractionalknapsack (solvedbyagreedyalgorithm) Buthowtoextendtomorethanonezonotope?
SLIDE 15
ZonotopesandSVM D.Eppstein,UCIrvine,WADS2001
EllipsoidMethod
Generaltechniqueforlinearorconvexoptimization Notverypractical
Convertsseparationintooptimization
Needsasinputa“separationoracle” thattestsifapointisinfeasibleregion, ifnotfjndshyperplaneseparatingitfromfeasibleregion
Dually,convertsoptimizationintoseparation
Separationonaconvexset=optimizationonitspolar,viceversa Cansolveseparationproblemusingasinputan“optimizationoracle“ thatfjndsextremevertexforalinearobjectivefunction
SLIDE 16
ZonotopesandSVM D.Eppstein,UCIrvine,WADS2001
Zonotopeoptimizationalgorithm
Useellipsoidtoconvertsingle-zonotopeoptimizationtoseparation Multi-zonotopeseparationsolvedbytestingeachzonotopeindependently Useellipsoidagaintoconvertseparationtomulti-zonotopeoptimization
Analysis:
Twolevelsofrecursivecallsinellipsoidmethods Eachlevelmultipliestimebypoly(d,precision) Requiredprecisioncanbeshowntobesmall:polylog(n)timesinitialprecision Noblowupindependenceonn
Totaltime:O(npoly(d,logn,precision))
SLIDE 17