SLIDE 1

Optimization Problems

LING 572 Advanced Statistical Methods for NLP January 28, 2020

SLIDE 2

Announcements

  • HW2 grades posted: 85.7 avg
  • Historically the hardest / most time-consuming assignment of 572
    ▪ [NB: we accept +/- 5% differences from our results on test data]
  • Format moving forward (no points this time; will update the specs):
    ▪ Each line needs to have the specified format
    ▪ Blocks for the classes need to be in the same order as the example file (e.g. talk.politics.guns, then talk.politics.mideast, then talk.politics.misc)
    ▪ Include the same commented lines as the example files

SLIDE 3

Performance

  • A fair number of people struggled to get reasonable performance
  • Depth + recursion = explosion
  • General lesson: think about what repeated operations you will be doing a lot, and choose data structures that do those efficiently
    ▪ e.g. dicts/sets are hash tables, so lookup and insertion are very efficient (O(1) avg)
    ▪ Time complexity of the built-in datatypes: https://wiki.python.org/moin/TimeComplexity
    ▪ Common issue: pandas DataFrames can be slow for repeated element-wise access
  • To the quick live demo!
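The hash-table point can be illustrated with a small micro-benchmark (a hypothetical sketch, not from the slides): membership tests against a set are far faster than against a list once the collection is large.

```python
import timeit

# Build a large collection once; look up its worst-case element (the last one).
data_list = list(range(100_000))
data_set = set(data_list)          # same elements, hash-table backed

# list membership scans elements one by one: O(n) per lookup.
list_time = timeit.timeit(lambda: 99_999 in data_list, number=100)

# set membership hashes the key: O(1) average per lookup.
set_time = timeit.timeit(lambda: 99_999 in data_set, number=100)

print(f"list: {list_time:.5f}s  set: {set_time:.5f}s")
```

The gap widens as the collection grows, which is exactly the "repeated operation" cost that dominates in a deep recursive loop.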

SLIDE 4

Linguistics Twitter Field Day

SLIDE 5

Optimization

SLIDE 6

What is an optimization problem?

  • The problem of finding the best solution from all feasible solutions.
  • Given a function f : X → ℝ, find x0 ∈ X that optimizes f.
  • f is called
    ▪ an objective function,
    ▪ a loss function or cost function (minimization), or
    ▪ a utility function or fitness function (maximization), etc.
  • X is an n-dimensional vector space:
    ▪ discrete (possible values are countable): combinatorial optimization problem
    ▪ continuous: e.g., constrained problems

SLIDE 7

Components of each optimization problem

  • Decision variables X: describe the choices that are under our control.
  • We normally use n to represent the number of decision variables, and x_i to represent the i-th decision variable.
  • Objective function f: the function we wish to optimize.
  • Constraints: describe the limitations that restrict our choices for the decision variables.

SLIDE 8

Standard form of a continuous optimization problem
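The formula image on this slide did not survive extraction; the standard form is conventionally written as follows (using f for the objective, g_i for inequality constraints, and h_j for equality constraints; maximization is handled by negating f):

```latex
\begin{aligned}
\min_{x \in \mathbb{R}^n} \quad & f(x) \\
\text{subject to} \quad & g_i(x) \le 0, \quad i = 1, \dots, m, \\
& h_j(x) = 0, \quad j = 1, \dots, p.
\end{aligned}
```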

SLIDE 9

Common types of optimization problem

  • Linear programming (LP) problems:
    ▪ Definition: both the objective function and the constraints are linear
    ▪ The problems can be solved in polynomial time.
    ▪ https://en.wikipedia.org/wiki/Linear_programming
  • Integer linear programming (ILP) problems:
    ▪ Definition: an LP problem in which some or all of the variables are restricted to be integers
    ▪ Often, solving an ILP problem is NP-hard.
    ▪ https://en.wikipedia.org/wiki/Integer_programming

SLIDE 10

Common types of optimization problem (cont’d)

  • Quadratic programming (QP):
    ▪ Definition: the objective function is quadratic, and the constraints are linear
    ▪ Solving QP problems is simple under certain conditions
    ▪ https://en.wikipedia.org/wiki/Quadratic_programming
  • Convex optimization:
    ▪ Definition: f(x) is a convex function, and X is a convex set.
    ▪ Property: if a local minimum exists, then it is a global minimum.
    ▪ https://en.wikipedia.org/wiki/Convex_optimization

SLIDE 11

Convex set


A set C is said to be convex if, for all x and y in C and all t in the interval (0, 1), the point (1 − t)x + ty also belongs to C

SLIDE 12

Convex function


▪ Let X be a convex set in a real vector space and f : X → ℝ a function.
▪ f is convex just in case:
  ∀x1, x2 ∈ X, ∀t ∈ [0,1], f(tx1 + (1 − t)x2) ≤ tf(x1) + (1 − t)f(x2)
▪ (strictly convex: strict inequality for x1 ≠ x2, with t ranging over (0, 1), excluding the endpoints.)
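The inequality can be spot-checked numerically; a minimal sketch (hypothetical helper name, and sampling points is an illustration, not a proof):

```python
# Numerical spot-check of the convexity inequality
# f(t*x1 + (1-t)*x2) <= t*f(x1) + (1-t)*f(x2) on a grid of sample points.
def is_convex_on_samples(f, points, ts):
    """Return False if any sampled (x1, x2, t) violates the inequality."""
    for x1 in points:
        for x2 in points:
            for t in ts:
                lhs = f(t * x1 + (1 - t) * x2)
                rhs = t * f(x1) + (1 - t) * f(x2)
                if lhs > rhs + 1e-12:   # tolerance for floating-point error
                    return False
    return True

points = [i / 10 for i in range(-20, 21)]   # grid on [-2, 2]
ts = [i / 10 for i in range(11)]            # t in [0, 1]

print(is_convex_on_samples(lambda x: x * x, points, ts))   # x^2 is convex
print(is_convex_on_samples(lambda x: -x * x, points, ts))  # -x^2 is not
```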

SLIDE 13

Terms

  • A solution is an assignment of values to all the decision variables.
  • A solution is called feasible if it satisfies all the constraints.
  • The set of all the feasible solutions forms the feasible region.
  • A feasible solution is called optimal if f(x) attains the optimal value at that solution.

SLIDE 14

Terms

  • If a problem has no feasible solution, the problem itself is called infeasible.
  • If the value of the objective function can be made arbitrarily good (arbitrarily large for maximization, arbitrarily small for minimization), the problem is called unbounded.

SLIDE 15

Linear programming

SLIDE 16

Linear Programming

  • The linear programming method was first developed by Leonid Kantorovich in the late 1930s.
  • Main applications: diet problems, supply problems
  • A primary method for solving LP is the simplex method.
  • LP problems can be solved in polynomial time.

SLIDE 17

An example

SLIDE 18

Feasible region


2x + 4y ≤ 220
3x + 2y ≤ 150
x ≥ 0
y ≥ 0

SLIDE 19

Property of LP

  • The feasible region is convex.
  • If the feasible region is non-empty and bounded, then
    ▪ optimal solutions exist, and
    ▪ there is an optimal solution that is a corner point
  ➔ We only need to check the corner points
  • The most well-known method is called the simplex method.
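The corner-point property can be demonstrated directly on the feasible region from the earlier example (2x + 4y ≤ 220, 3x + 2y ≤ 150, x, y ≥ 0). A sketch, assuming a hypothetical objective 3x + 5y since the slides do not state one: enumerate intersections of constraint boundaries, keep the feasible ones, and take the best corner.

```python
from itertools import combinations

# Each constraint as (a, b, c) meaning a*x + b*y <= c; the nonnegativity
# bounds are written the same way (-x <= 0, -y <= 0).
constraints = [(2, 4, 220), (3, 2, 150), (-1, 0, 0), (0, -1, 0)]

def intersect(c1, c2):
    """Intersection of the boundary lines a*x + b*y = c, or None if parallel."""
    (a1, b1, d1), (a2, b2, d2) = c1, c2
    det = a1 * b2 - a2 * b1
    if abs(det) < 1e-12:
        return None
    return ((d1 * b2 - d2 * b1) / det, (a1 * d2 - a2 * d1) / det)

def feasible(p):
    return all(a * p[0] + b * p[1] <= c + 1e-9 for a, b, c in constraints)

# Corner points = feasible intersections of pairs of constraint boundaries.
corners = [p for c1, c2 in combinations(constraints, 2)
           if (p := intersect(c1, c2)) is not None and feasible(p)]

# Hypothetical objective: maximize 3x + 5y over the corners only.
best = max(corners, key=lambda p: 3 * p[0] + 5 * p[1])
print(best)  # (20.0, 45.0): the constraint intersection wins for this objective
```

The simplex method exploits the same fact, but walks between adjacent corners instead of enumerating all of them.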

SLIDE 20

Simplex method


Simplex method:
▪ Start with a feasible corner-point solution; repeatedly move to an adjacent corner point that increases f(x)

SLIDE 21

Integer linear programming

SLIDE 22

Integer programming

  • IP is an active research area and there are still many unsolved problems.
  • Many applications: scheduling, “diet” problems, NLP, …
  • IP is more difficult to solve than LP.
  • Methods:
    ▪ Branch and bound
    ▪ Use LP relaxation
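The two methods combine naturally; a minimal branch-and-bound sketch on a toy 0/1 knapsack ILP (a hypothetical instance, not from the slides): branch on each integer variable, and use the LP relaxation (fractional knapsack, assuming positive weights) as an upper bound to prune branches.

```python
def lp_relaxation_bound(values, weights, capacity, fixed):
    """Upper bound from the LP relaxation: take fixed-in items, then greedily
    fill remaining capacity by value/weight ratio, allowing fractional items."""
    bound, cap, free = 0.0, capacity, []
    for i, (v, w) in enumerate(zip(values, weights)):
        if fixed.get(i) == 1:
            bound += v
            cap -= w
        elif i not in fixed:                 # undecided variable
            free.append((v / w, v, w))
    if cap < 0:
        return float("-inf")                 # partial assignment infeasible
    for _, v, w in sorted(free, reverse=True):
        take = min(1.0, cap / w)             # fractional item allowed in the LP
        bound += v * take
        cap -= w * take
        if cap <= 0:
            break
    return bound

def branch_and_bound(values, weights, capacity):
    best = [0.0]                             # incumbent objective value
    n = len(values)

    def recurse(i, fixed):
        if lp_relaxation_bound(values, weights, capacity, fixed) <= best[0]:
            return                           # prune: relaxation can't beat incumbent
        if i == n:                           # all variables fixed: feasible leaf
            value = sum(v for j, v in enumerate(values) if fixed.get(j) == 1)
            best[0] = max(best[0], value)
            return
        recurse(i + 1, {**fixed, i: 1})      # branch: x_i = 1
        recurse(i + 1, {**fixed, i: 0})      # branch: x_i = 0

    recurse(0, {})
    return best[0]

print(branch_and_bound([60, 100, 120], [10, 20, 30], 50))  # optimal value 220
```

Real ILP solvers refine this with better bounds, cutting planes, and smarter branching order, but the prune-against-the-relaxation structure is the same.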

SLIDE 23

Example: Investment Decisions


Four investment options. Over 3 months, we want to invest up to 14k, 12k, and 15k, respectively.

SLIDE 24

Example: Maximum Spanning Tree


max s(G) = ∑_{(w1, w2, l) ∈ G} s(w1, w2, l)

Constraint: G is a tree (no cycles)

An approach to dependency parsing (see, e.g., 571 slides). More constraints possible: heads cannot have more than one outgoing label of each type.

SLIDE 25

LP vs. ILP

SLIDE 26

Summary

  • Optimization problems have many real-life applications.
  • Common types: LP, IP, ILP, QP, convex optimization problems
  • LP is easy to solve; the most well-known method is the simplex method.
  • IP is hard to solve.
  • QP and convex optimization are used the most in our field.
