Big Data - Lecture 1: Optimization reminders
S. Gadat
Toulouse, October 2014
Outline:
1. Introduction: Major issues; Examples; Mathematics
2. Standard Convex optimisation procedures: Convex functions; Example of convex functions; Gradient descent method
3. Constrained Convex optimisation: Definition; Equality constraint; Inequality constraint; Lagrangian in general settings; KKT Conditions
4. Conclusion
Introduction
Data science: extract knowledge from data for industrial or academic exploitation. Taken as a whole, the questions it raises are at the core of Artificial Intelligence and may also be referred to as Computer Science problems. In this lecture we address some of these issues, and practical examples are provided each time. Most of our motivation comes from the Big Data world, encountered in image processing, finance, genetics and many other fields where knowledge extraction is needed when facing many observations described by many variables. Notation: n is the number of observations, p the number of variables per observation, with p ≫ n ≫ O(1).
From a set of labelled messages (spam or not), build a classifier for automatic spam rejection. Which words are the meaningful elements? Can the classification be automated?
One measures micro-array datasets built from a huge number of gene expression profiles. Diagnostic help: healthy or ill? Which genes are the meaningful elements? Can the classification be automated?
A set of individual electrical consumptions for clients in Medellin (Colombia). Each client provides a monthly electrical consumption, and the electrical firm measures the total consumption of important hubs (each formed by a set of clients). We want to detect possible fraud. Problems: missing data (how do we complete the table?); noise in the measurements (how does it degrade fraud detection?); can we identify distinct monthly consumption behaviours among the clients?
Recommendation problems: what kind of database? Reliable recommendations for clients? Online strategy?
Build an indicator (Q score) from a dataset for the probability of interest in a financial product (Visa Premier credit card):
1. Define a model, a question.
2. Use a supervised classification algorithm to rank the best clients.
3. Use logistic regression to provide a score.
Various mathematical fields we will talk about:
- Analysis: convex optimization, approximation theory
- Statistics: penalized procedures and their reliability
- Probabilistic methods: concentration inequalities, stochastic processes, stochastic approximations

Famous keywords: Lasso, Boosting, convex relaxation, supervised classification, Support Vector Machine, aggregation rules, gradient descent, stochastic gradient descent, sequential prediction, bandit games, minimax policies, matrix completion.
Standard Convex optimisation procedures
We recall some background material necessary for a clear understanding of how some machine learning procedures work, covering basic relationships between convexity, positive semidefiniteness, and local and global minimizers.

Definition (Convex sets, convex functions). A set D is convex if for any (x1, x2) ∈ D² and all α ∈ [0, 1], x = αx1 + (1 − α)x2 ∈ D. A function f is convex if its domain D is convex and, for all (x1, x2) ∈ D² and α ∈ [0, 1], f(αx1 + (1 − α)x2) ≤ αf(x1) + (1 − α)f(x2).

Definition (Positive Semi-Definite (PSD) matrix). A p × p matrix H is PSD if for all p × 1 vectors z, we have zᵗHz ≥ 0.

There is a strong link between PSD matrices and convex functions, given by the following proposition.

Proposition. A smooth C²(D) function f is convex if and only if its Hessian D²f is PSD at every point of D. The proof follows easily from a second-order Taylor expansion.
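The PSD criterion of the proposition can be illustrated numerically (a sketch; the matrix P is an arbitrary example, not from the lecture):

```python
import numpy as np

# Sketch: check the PSD criterion for convexity on a quadratic form
# f(theta) = theta^t P theta, whose Hessian is the constant matrix 2P.
P = np.array([[2.0, 1.0],
              [1.0, 2.0]])      # symmetric, eigenvalues 1 and 3
hessian = 2 * P

# f is convex iff the Hessian is PSD, i.e. all eigenvalues are >= 0.
eigenvalues = np.linalg.eigvalsh(hessian)
is_convex = bool(np.all(eigenvalues >= 0))
print(is_convex)  # True: this quadratic form is convex
```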
Examples of convex functions:
- Exponential function: θ ∈ R ↦ exp(aθ), for any a ∈ R.
- Affine function: θ ∈ Rᵈ ↦ aᵗθ + b.
- Negative entropy: θ ∈ R₊ ↦ θ log(θ).
- p-norm (p ≥ 1): θ ∈ Rᵈ ↦ ‖θ‖ₚ := (Σᵢ₌₁ᵈ |θᵢ|ᵖ)^(1/p).
- Quadratic form: θ ∈ Rᵈ ↦ θᵗPθ + 2qᵗθ + r, where P is symmetric and positive semidefinite.
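The convexity inequality can be tested empirically for, e.g., the p-norm (a sketch; the dimension, the choice p = 3, and the sample count are arbitrary):

```python
import numpy as np

# Sketch: empirically test the convexity inequality
#   f(a*x1 + (1-a)*x2) <= a*f(x1) + (1-a)*f(x2)
# for the p-norm with p = 3 on random points of R^5.
rng = np.random.default_rng(0)

def f(theta, p=3):
    return np.sum(np.abs(theta) ** p) ** (1.0 / p)  # ||theta||_p

ok = True
for _ in range(1000):
    x1, x2 = rng.normal(size=5), rng.normal(size=5)
    a = rng.uniform()
    lhs = f(a * x1 + (1 - a) * x2)
    rhs = a * f(x1) + (1 - a) * f(x2)
    ok &= bool(lhs <= rhs + 1e-12)  # small tolerance for floating point
print(ok)  # True: no violation found
```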
From external motivations: many problems in machine learning come from the minimization of a convex criterion and provide meaningful results for the initial statistical task. Many optimization problems admit a convex reformulation (SVM classification or regression, LASSO regression, ridge regression, permutation recovery, . . . ). From a numerical point of view: local minimizer = global minimizer. This is a powerful fact since, in general, descent methods involve ∇f(x) (or something related to it), which is local information on f. x is a local (hence global) minimizer of f if and only if 0 ∈ ∂f(x). Many fast algorithms for the optimization of convex functions exist, sometimes independent of the dimension d of the original space.
Two kinds of optimization problems. On the left: a non-convex optimization problem, which calls for global exploration methods of Travelling-Salesman type (simulated annealing, genetic algorithms). On the right: a convex optimization problem, which allows local descent methods with gradients or subgradients.

Definition (Subgradient, for possibly nonsmooth functions). For any function f : Rᵈ → R and any x ∈ Rᵈ, the subdifferential ∂f(x) is the set of all vectors g such that, for all y, f(x) − f(y) ≤ ⟨g, x − y⟩. This set of subgradients may be empty; fortunately, this is not the case for convex functions.

Proposition. f : Rᵈ → R is convex if and only if ∂f(x) ≠ ∅ for every x ∈ Rᵈ.
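The subgradient definition can be illustrated on f(x) = |x| at x = 0, where the subdifferential is the whole interval [−1, 1] (a sketch; the grid of test points is arbitrary):

```python
import numpy as np

# Sketch: for f(x) = |x|, any g in [-1, 1] is a subgradient at x = 0,
# i.e. f(0) - f(y) <= g * (0 - y) for all y, which rewrites |y| >= g * y.
ys = np.linspace(-5, 5, 1001)
for g in (-1.0, -0.3, 0.0, 0.7, 1.0):
    assert np.all(np.abs(ys) >= g * ys)   # subgradient inequality holds
# ... while g = 1.5 is NOT a subgradient: the inequality fails for y > 0
assert not np.all(np.abs(ys) >= 1.5 * ys)
print("subgradient checks passed")
```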
In either constrained or unconstrained problems, descent methods are powerful for convex functions. The projected subgradient method relies on

yₜ₊₁ = xₜ − η gₜ with gₜ ∈ ∂f(xₜ), and xₜ₊₁ = Π_X(yₜ₊₁),

where η > 0 is a fixed step-size parameter and Π_X is the Euclidean projection onto X.

Theorem (Convergence of the projected subgradient method, fixed step size). If f is convex over X with X ⊂ B(0, R) and the subgradients are bounded, ‖∂f‖∞ ≤ L, then the choice η = R/(L√t) leads to

f( (1/t) Σₛ₌₁ᵗ xₛ ) − min_X f ≤ R L / √t.
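The projected subgradient iteration can be sketched on a toy problem (an illustration with assumed values R = 1, L = 1 and objective f(x) = |x − 0.5| over X = [−1, 1]; the function and constants are not from the slides):

```python
import numpy as np

# Toy sketch of the projected subgradient method:
#   minimize f(x) = |x - 0.5| over X = [-1, 1], so R = 1 and L = 1.
R, L, t = 1.0, 1.0, 10000
eta = R / (L * np.sqrt(t))        # fixed step size from the theorem

x = 0.0                           # starting point in X
iterates = []
for _ in range(t):
    g = np.sign(x - 0.5)          # a subgradient of |x - 0.5| at x
    y = x - eta * g               # subgradient step
    x = float(np.clip(y, -R, R))  # projection onto X = [-1, 1]
    iterates.append(x)

x_bar = np.mean(iterates)         # averaged iterate from the theorem
gap = abs(x_bar - 0.5)            # f(x_bar) - min_X f
print(gap <= R * L / np.sqrt(t))  # True: the theoretical bound holds here
```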
Results can be seriously improved for smooth functions with bounded second derivatives.

Definition. f is β-smooth if ∇f is β-Lipschitz: ‖∇f(x) − ∇f(y)‖ ≤ β‖x − y‖.

Standard gradient descent over Rᵈ becomes xₜ₊₁ = xₜ − η∇f(xₜ).

Theorem (Convergence of the gradient descent method, β-smooth function). If f is a convex and β-smooth function, then the choice η = 1/β leads to

f(xₜ) − min f ≤ 2β‖x₁ − x⋆‖² / (t − 1).

Remark. Note that the two past results do not depend on the dimension d of the state space. The last result can be extended to the constrained situation.
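For a β-smooth function, the step size 1/β can be illustrated on a quadratic (a toy sketch; the matrix A and vector b are illustrative choices, not from the slides):

```python
import numpy as np

# Sketch: gradient descent with step 1/beta on the beta-smooth convex
# quadratic f(x) = 0.5 x^t A x - b^t x (A symmetric PSD, beta = lambda_max(A)).
A = np.array([[3.0, 1.0],
              [1.0, 2.0]])
b = np.array([1.0, 1.0])
beta = np.linalg.eigvalsh(A).max()    # smoothness constant of f
eta = 1.0 / beta

x = np.zeros(2)
for _ in range(500):
    grad = A @ x - b                  # gradient of f at x
    x = x - eta * grad                # gradient step

x_star = np.linalg.solve(A, b)        # exact minimizer of f
print(np.allclose(x, x_star, atol=1e-6))  # True: converged to the optimum
```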
Constrained Convex optimisation
Elements of the problem: θ, an unknown vector of Rᵈ to be recovered; J : Rᵈ → R, the function to be minimized; fᵢ and gⱼ, differentiable functions defining a set of constraints.

Definition of the problem: min over θ ∈ Rᵈ of J(θ) such that fᵢ(θ) = 0, ∀i = 1, . . . , n, and gⱼ(θ) ≤ 0, ∀j = 1, . . . , m.

Set of admissible vectors: Ω := {θ ∈ Rᵈ : fᵢ(θ) = 0, ∀i = 1, . . . , n; gⱼ(θ) ≤ 0, ∀j = 1, . . . , m}.
Typical situation: Ω is the circle of radius √2; the optimal solution is θ⋆ = (−1, −1)ᵗ, with J(θ⋆) = −2. Important restriction: we will restrict our study to convex functions J.
A constrained optimization problem is "convex" if:
- J is a convex function,
- the fᵢ are linear or affine functions,
- the gⱼ are convex functions.
Equality constraint: min over θ of J(θ) such that aᵗθ − b = 0.

Descent direction h: ∇J(θ)ᵗh < 0. Admissible direction h: aᵗ(θ + h) − b = 0 ⟺ aᵗh = 0.

Optimality: θ∗ is optimal if there is no admissible descent direction starting from θ∗. The only possible case is when ∇J(θ∗) and a are linearly dependent: ∃λ ∈ R, ∇J(θ∗) + λa = 0.

In the slide's worked example, the optimal value is reached for θ1 = 1/2, with J(θ∗) = −15/4.
min over θ of J(θ) such that f(θ) := aᵗθ − b = 0. We have seen the important role of the scalar value λ above.

Definition (Lagrangian function). L(λ, θ) = J(θ) + λf(θ); λ is the Lagrange multiplier.

The optimal choice (θ∗, λ∗) corresponds to ∇θL(λ∗, θ∗) = 0 and ∇λL(λ∗, θ∗) = 0.

Argument: θ∗ is optimal if there is no admissible descent direction h; hence ∇J and ∇f are linearly dependent. As a consequence, there exists λ∗ such that ∇θL(λ∗, θ∗) = ∇J(θ∗) + λ∗∇f(θ∗) = 0 (dual equation). Since θ∗ must be admissible, we have ∇λL(λ∗, θ∗) = f(θ∗) = 0 (primal equation).
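The dual and primal equations form a linear system when J is quadratic and the constraint is affine. A sketch on a toy example (the objective J(θ) = ½‖θ‖², the vector a and scalar b are assumed for illustration, not from the slides):

```python
import numpy as np

# Toy sketch: minimize J(theta) = 0.5 * ||theta||^2 s.t. a^t theta - b = 0,
# by solving the dual equation  theta + lambda * a = 0
# together with the primal equation  a^t theta = b.
a = np.array([1.0, 2.0])
b = 5.0

# Assemble the linear system in the unknowns (theta_1, theta_2, lambda).
K = np.block([[np.eye(2), a.reshape(2, 1)],
              [a.reshape(1, 2), np.zeros((1, 1))]])
rhs = np.array([0.0, 0.0, b])
theta1, theta2, lam = np.linalg.solve(K, rhs)

print(theta1, theta2)   # the projection of the origin onto the line a^t theta = b
```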
min over θ of J(θ) such that g(θ) ≤ 0.

Descent direction h: ∇J(θ)ᵗh < 0. Admissible direction h: ∇g(θ)ᵗh ≤ 0 guarantees that g(θ + αh) is decreasing in α.

Optimality: θ∗ is optimal if there is no admissible descent direction starting from θ∗. The only possible case is when ∇J(θ∗) and ∇g(θ∗) are linearly dependent with opposite directions: ∃µ ≥ 0, ∇J(θ∗) = −µ∇g(θ∗). We can check that θ∗ = (−1, −1) in the earlier example.
We consider the minimization problem: min over θ of J(θ) such that gⱼ(θ) ≤ 0, ∀j = 1, . . . , m, and fᵢ(θ) = 0, ∀i = 1, . . . , n.

Definition (Lagrangian function). We associate to this problem the Lagrange multipliers (λ, µ) = (λ1, . . . , λn, µ1, . . . , µm) and

L(θ, λ, µ) = J(θ) + Σᵢ₌₁ⁿ λᵢ fᵢ(θ) + Σⱼ₌₁ᵐ µⱼ gⱼ(θ),

with θ the primal variables and (λ, µ) the dual variables.
Definition (KKT Conditions). If J, f and g are smooth, we define the Karush-Kuhn-Tucker (KKT) conditions as:
- Stationarity: ∇θL(λ, µ, θ) = 0.
- Primal admissibility: f(θ) = 0 and g(θ) ≤ 0.
- Dual admissibility: µⱼ ≥ 0, ∀j = 1, . . . , m.
- Complementary slackness: µⱼ gⱼ(θ) = 0, ∀j = 1, . . . , m.

Theorem. A convex minimization problem of J under convex constraints f and g has a solution θ∗ if and only if there exist multipliers (λ∗, µ∗) such that (θ∗, λ∗, µ∗) satisfies the KKT conditions.
Example: J(θ) = ½‖θ‖² = ½(θ1² + θ2²) s.t. θ1 − 2θ2 + 2 ≤ 0. We get L(θ, µ) = (θ1² + θ2²)/2 + µ(θ1 − 2θ2 + 2) with µ ≥ 0. Stationarity: (θ1 + µ, θ2 − 2µ) = (0, 0), hence θ2 = −2θ1 with µ ≥ 0. Complementary slackness with µ > 0 forces the constraint to be active, θ1 − 2θ2 + 2 = 0, so µ = 2/5 and we deduce θ∗ = (−2/5, 4/5).
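This KKT example can be checked numerically, assuming the inequality constraint is active at the optimum (a sketch: stationarity plus the active constraint form a linear system):

```python
import numpy as np

# Sketch: solve the KKT system of the example
#   min 0.5*(t1^2 + t2^2)  s.t.  t1 - 2*t2 + 2 <= 0,
# assuming the constraint is active, so stationarity + active constraint give:
K = np.array([[1.0,  0.0,  1.0],    # t1 + mu        = 0
              [0.0,  1.0, -2.0],    # t2 - 2*mu      = 0
              [1.0, -2.0,  0.0]])   # t1 - 2*t2      = -2  (active constraint)
rhs = np.array([0.0, 0.0, -2.0])
t1, t2, mu = np.linalg.solve(K, rhs)

print(t1, t2, mu)        # theta* = (-2/5, 4/5) with mu = 2/5
assert mu >= 0           # dual admissibility confirms the active-constraint guess
```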
We introduce the dual function: L(λ, µ) = min over θ of L(θ, λ, µ). We have the following important result.

Theorem (weak duality). Denote by p∗ = min {J(θ) | f(θ) = 0, g(θ) ≤ 0} the optimal value of the constrained problem; then L(λ, µ) ≤ p∗.

Remark: the dual function L is lower than p∗ for any (λ, µ) ∈ Rⁿ × R₊ᵐ. We aim to make this lower bound as close as possible to p∗, hence the idea of maximizing the function L with respect to (λ, µ).

Definition (Dual problem). max over λ ∈ Rⁿ, µ ∈ R₊ᵐ of L(λ, µ).

L(θ, λ, µ) is an affine function of (λ, µ); hence L, as a pointwise minimum of affine functions, is concave, and the dual problem is a convex, almost unconstrained, maximization problem.
Dual problems are easier than primal ones (the constraints almost disappear). Dual problems are equivalent to primal ones: maximization of the dual ⇔ minimization of the primal (strong duality, not shown in this lecture). Dual solutions permit to recover primal ones through the KKT conditions (Lagrange multipliers).

Example (continued): Lagrangian L(θ, µ) = (θ1² + θ2²)/2 + µ(θ1 − 2θ2 + 2). Dual function: L(µ) = min over θ of L(θ, µ) = −(5/2)µ² + 2µ. Dual solution: max of L(µ) such that µ ≥ 0 gives µ = 2/5. Primal solution: KKT ⟹ θ = (−µ, 2µ) = (−2/5, 4/5).
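A numerical sanity check of this dual function (a sketch; the grid search over µ is an illustrative way to locate the dual maximizer):

```python
import numpy as np

# Sketch: recover the dual function of the example numerically.
# For fixed mu, min_theta L(theta, mu) is attained at theta = (-mu, 2*mu),
# and should match the closed form L(mu) = -(5/2)*mu^2 + 2*mu.
def dual(mu):
    theta = np.array([-mu, 2 * mu])                  # argmin of the Lagrangian
    return 0.5 * theta @ theta + mu * (theta[0] - 2 * theta[1] + 2)

mus = np.linspace(0.0, 1.0, 10001)
values = np.array([dual(m) for m in mus])
assert np.allclose(values, -2.5 * mus**2 + 2 * mus)  # matches the closed form
print(mus[np.argmax(values)])   # maximizer close to mu = 2/5
```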
Conclusion
Big Data problems arise in a large variety of fields. They are complicated for a computational reason (and also for a statistical one, as we will see later). Many Big Data problems can be translated into the optimization of a convex criterion. Efficient algorithms are available to optimize them, independently of the dimension of the underlying space. Primal-dual formulations are important to handle constraints in the optimization. Numerical convex solvers are widely and freely distributed.