Conditional Moment Relaxations and Sums-of-AM/GM-Exponentials Riley - - PowerPoint PPT Presentation

▶

Apr 23, 2023 353 likes •894 views

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices Conditional Moment Relaxations and Sums-of-AM/GM-Exponentials Riley Murray California Institute of Technology MIT

SLIDE 1

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Conditional Moment Relaxations and Sums-of-AM/GM-Exponentials

Riley Murray California Institute of Technology MIT Virtual Seminar on Optimization and Related Areas 17 April 2020

Riley Murray 1

SLIDE 2

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

The functions of interest

polynomials Parameters ai in Nn, ci in R. Using xai = n

j=1 x aij j

, x →

cixai. Care about degree: maxi ai1. signomials Parameters ai in Rn, ci in R. In “exponential form”, x →

ci exp(ai · x). Care about number of terms: m. For historical and modeling reasons, signomials are often written in geometric form y →

ciyai where y ∈ Rn

++ has the correspondence yi = exp(xi). We use the exponential form!

Riley Murray 2

SLIDE 3

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Geometric Programming

The signomial f(x) = m

i=1 ci exp(ai · x)

is called a posynomial when all ci ≥ 0. Geometric programs (GPs): inf

x∈Rn

f(x) : gi(x) ≤ 1 ∀ i ∈ [k]
where f and {gi}k

i=1 are posynomials.

Study of GPs initiated by Zener, Duffin, and Peterson (1967). Exponential-form GPs are convex & poly-time solvable via IPMs [1].

Optimization-based engineering design: electrical [2, 3, 4], structural [5, 6], environmental [7], and aeronautical [8, 9].

Epidemilogical process control [10, 11, 12], power control and storage [13, 14], self-driving cars [15], gas network

peration [16].

Additional applications in healthcare [17], biology [18], economics [19, 20, 21], and statistics [22, 23]

Riley Murray 3

SLIDE 4

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Signomial programming

A signomial program (SP) is an optimization problem stated with signomials, e.g. inf

x∈Rn

f(x) : gi(x) ≤ 0 for all i in [k]
.

Major applications in aircraft design [24, 25, 26, 27, 28] and structrual engineering [29, 30, 31, 32]. Additional applications in EE [33], communications [34], and ML [35].

Riley Murray 4

SLIDE 5

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Motivation. Mathematical Preliminaries. Sums-of-AM/GM-Exponentials. Sparsity preservation. A hierarchy. Extreme rays. Conclusion.

Riley Murray 5

SLIDE 6

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

The AM/GM-inequality

If u, λ ∈ Rm are positive and 1⊺λ = 1, then uλ ≤ λ⊺u.

Proof. If v = log u, then uλ = exp(λ⊺v)≤ m

i=1 λi exp vi= λ⊺u.

A recent history of using the AM/GM inequality to certify function nonnegativity:

1978 and 1989: Reznick [36, 37]. 2009: P´ ebay, Rojas and Thompson [38]. 2012 and 2013: Ghasemi and Marshall [39], Ghasemi, Lasserre, and Marshall [40]. 2012: Paneta, Koeppl, and Craciun [41], and August, Craciun, and Koeppl [42]. 2016: Iliman and de Wolff [43]. When used for computation, exponents {ai}m

i=1 were presumed to be highly structured.

E.g. conv{ai}m

i=1 has m − 1 extreme points, 1 point in its relative interior.

Riley Murray 6

SLIDE 7

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Definitions from convex analysis

A set convex set K is called a cone if x ∈ K ⇒ λx ∈ K for all λ ≥ 0; the dual cone to K is K† = {y : y⊺x ≥ 0 for all x in K}. – and we have (K†)† = cl K A convex set X induces a support function σX(λ) = sup{λ⊺x : x in X}. The relative entropy function continuously extends D(u, v) =

ui log(ui/vi) to Rm

+ × Rm + .

Important: if you evaluate D(·, ·) outside Rm

+ × Rm + , you get +∞.

Riley Murray 7

SLIDE 8

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

A trick with convex duality

Start with a primal problem Val(c) = inf

x {c⊺x : Ax = b, x ≥ 0}.

Obtain a dual problem Val(c) = sup

µ {−b⊺µ : A⊺µ + c ≥ 0}.

We will encounter constraints like Val(c) + L ≥ 0. Write such a constraint as: there exists a µ where A⊺µ + c ≥ 0 and b⊺µ ≤ L.

Riley Murray 8

SLIDE 9

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Nonnegativity and optimization

We’ll work with sets X ⊂ Rn. Speaking abstractly, for any f : Rn → R f ⋆

X = inf{f(x) : x in X}

= sup{γ : f − γ is nonnegative over X}. Make this more concrete. For signomials: CNNS(A, X) . =

ci exp(ai · x) ≥ 0 ∀ x ∈ X

A . =      a2 . . . am      ∈ Rm×n. So for f(x) = m

i=1 ci exp(ai · x),

f ⋆

X = sup {γ : c − γe1 ∈ CNNS(A, X)}

– where e1 is the 1st standard basis vector in Rm.

Riley Murray 9

SLIDE 10

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Duality and moment relaxations

Abbreviate exp(Ax) ∈ Rm elementwise, and express CNNS(A, X) = {c : c⊺exp(Ax) ≥ 0 ∀ x ∈ X}. The definition of “dual cone” requires CNNS(A, X)† = {v : c⊺v ≥ 0 ∀ c ∈ CNNS(A, X)}. So we end up getting CNNS(A, X)† = co {exp(Ax) : x ∈ X} – a “moment cone.” conv {exp(Ax) : x ∈ X} =      Ex [exp(Ax)] : x ∼ F, supp F ⊂ X

conditional probability

     Get moment relaxations from conic duality sup

γ {γ : c − γe1 ∈ CNNS(A, X)}

f⋆

= inf

c⊺v :

v ∈ CNNS(A, X)† satisfies v · e1 = 1

Riley Murray 10

SLIDE 11

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Motivation. Mathematical Preliminaries. Sums-of-AM/GM-Exponentials. Sparsity preservation. A hierarchy. Extreme rays. Conclusion.

Riley Murray 11

SLIDE 12

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

X-AGE functions

Definition. An X-AGE function is an X-nonnegative signomial, which has at most one

negative coefficient. Generalizes X = Rn from [44]; see [45]. Consider f(x) = m

i=1 ci exp(a⊺ i x). If c ∈ Rm has c\k .

= (ci)i∈[m]\k ≥ 0, then f(x) ≥ 0 ⇔

ci exp([ai − ak] · x) ≥ 0 ⇔

ci exp([ai − ak] · x)

convex!

+ck ≥ 0.

Theorem (M., Chandrasekaran, & Wierman (2019))

If X is a convex set, then the conditions c\k ≥ 0 and c ∈ CNNS(A, X) are equivalent to the existence of some ν ∈ Rm satisfying 1⊺ν = 0 and σX (−A⊺ν) + D(ν\k, ec\k) ≤ ck.

Riley Murray 12

SLIDE 13

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

X-SAGE certificates & lower bounds

A signomial is X-SAGE if it can be written as a sum of appropriate X-AGE functions. The cone of coefficients CSAGE(A, X) . =

c : {(ν(k), c(k))}m

k=1 satisfy c = m k=1 c(k), 1⊺ν(k) = 0,

and σX

−Aν(k)

+ D

ν(k)

\k , c(k) \k

≤ c(k)

∀ k ∈ [m]

is contained within CNNS(A, X).

Consider f(x) = m

i=1 ci exp(ai · x) with a1 = 0:

f ⋆

X = sup{γ : c − γe1 in CNNS(A, X)}

≥ sup{ γ : c − γe1 in CSAGE(A, X)} =: f SAGE

. MOSEK + sageopt = off-the-shelf software for computing f SAGE

.

Riley Murray 13

SLIDE 14

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Conditional moment relaxations via SAGE

Consider f(x) = m

i=1 ci exp(ai · x) with a1 = 0. Applying conic duality ...

sup{ γ : c − γe1 in CSAGE(A, X)} = f SAGE

= inf

c⊺v : v in CSAGE(A, X)†

satisfies v · e1 = 1

Conic duality reverses inclusions

CNNS(A, X)† ⊂ CSAGE(A, X)†. The dual X-SAGE cone is CSAGE(A, X)† = cl{v : some z1, . . . , zm in Rn satisfy vk log(v/vk) ≥ [A − 1ak]zk and zk/vk ∈ X for all k in [m]}. The dual helps with solution recovery. Useful even when f SAGE

< f ⋆

Riley Murray 14

SLIDE 15

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

An example in R3

Minimize f(x) = exp(x1 − x2)/2 − exp x1 − 5 exp(−x2)

X =

x : (70, 1, 0.5) ≤ exp x ≤ (150, 30, 21)

exp(x2 − x3) 100 + exp x2 100 + exp(x1 + x3) 2000 ≤ 1

Compute f SAGE

= −147.85713 ≤ f ⋆

X, and recover feasible

˜ x = (5.01063529, 3.40119660, −0.48450710) satisfying f(˜ x) = −147.66666.

Riley Murray 15

SLIDE 16

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Motivation. Mathematical Preliminaries. Sums-of-AM/GM-Exponentials. Sparsity preservation. A hierarchy. Extreme rays. Conclusion.

Riley Murray 16

SLIDE 17

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Sparsity and SAGE signomials

Theorem (M., Chandrasekaran, & Wierman)

Fix a vector c ∈ Rm with nonempty N = {i : ci < 0}. If c ∈ CSAGE(A, X), then there exist X-AGE vectors {c(i)}i∈N where c(i)

= ci and c =

i∈N

c(i). This is true even if X is not convex. Proven formally for X = Rn in [46]. Let K ⊃ Rm

+ induce Ci .

= {c ∈ K : c\i ≥ 0}.

1 Show that for j ∈ N, can eliminate c(j) ∈ Cj from an existing decomposition. 2 Show that conic combinations of {c(i) ∈ Ci}i∈N can reduce to c(i) i

= ci.

Riley Murray 17

SLIDE 18

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Sparsity preservation: a univariate example

f(x) = e−3x + e−2x + 4ex + e2x − 4e−x − 1 − e3x over x ≤ 0

0.5 1.0

1 2 3 4

f1(x) = 0.88 · e−3x + 0.82 · e−2x + 2.69 · ex + 0.12 · e2x − 4 · e−x f2(x) = 0.10 · e−3x + 0.15 · e−2x + 0.90 · ex + 0.12 · e2x − 1 f3(x) = 0.02 · e−3x + 0.03 · e−2x + 0.41 · ex + 0.76 · e2x − e3x

Riley Murray 18

SLIDE 19

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Applying Sums-of-Squares (SOS) to an AGE function

Consider the following function on R2 f(x, y) = 1 − 2e(2x+2y) + 1 2

e8x + e8y

. Use sageopt, round solution, certify f is R2-AGE with ν⋆ = (1, −2, 1/2, 1/2). We can express f as a sum-of-squares, but this requires new terms f(x, y) =

1 − 2e(2x+2y)2

+ 1 2

e4x − e4y2

=

1 − 2e(2x+2y)+e(4x+4y)

+ 1 2

e8x + e8y−2e(4x+4y)

. SOS is the predominant way to certify polynomial nonnegativity. SAGE can certify polynomial nonnegativity [46] with X Rn [45]. Remark: In the special case X = Rn with integer exponents, the sparsity result can also be deduced from Jie Wang’s work on Sums-of-Nonnegative-Circuit polynomials [47, 48].

Riley Murray 19

SLIDE 20

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Motivation. Mathematical Preliminaries. Sums-of-AM/GM-Exponentials. Sparsity preservation. A hierarchy. Extreme rays. Conclusion.

Riley Murray 20

SLIDE 21

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

A hierarchy of stronger convex relaxations.

The earlier example with X ⊂ R3 – f(x) = exp(x1 − x2)/2 − exp x1 − 5 exp(−x2). We found bounds f SAGE

= −147.85713 ≤ f ⋆

and f ⋆

X ≤ f(˜

x) = −147.66666. The modulation trick lets us construct a sequence of bounds f (ℓ)

. = sup{γ : m

i=1 exp(ai · x)

ℓ (f(x) − γ) is X-SAGE}. Using MOSEK + sageopt with this particular example, ℓ SAGE bound solve time (s)

147.85713

0.01 1

147.67225

0.02 2

147.66680

0.08 3

147.66666

0.26

Riley Murray 21

SLIDE 22

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Convergence results

When introducing Rn-SAGE, Chandrasekaran and Shah proved two results for hierarchies. No known convergence conditions for f (ℓ)

X , prior to March 2020.

Theorem (Wang, Jaini, Yu, Poupart [49])

Let A ∈ Qm×n be a rank n matrix with a1 = 0, and consider f(x) = m

i=1 ci exp(ai · x).

If X is a compact convex set, then lim

ℓ→∞ f (ℓ) X = f ⋆ X.

Assumes nothing about the representation of X. Compare to the canonical (non X-SAGE) approach, which uses a Lagrangian relaxation: inf

x {f(x) : g(x) ≥ 0} ≥ sup γ,λ

{γ : f − γ −

i λi · gi ∈ Λ, λi ∈ Λ′}

Λ, Λ′ are sets of nonnegative functions.

Riley Murray 22

SLIDE 23

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Motivation. Mathematical Preliminaries. Sums-of-AM/GM-Exponentials. Sparsity preservation. A hierarchy. Extreme rays. Conclusion.

Riley Murray 23

SLIDE 24

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Affine matroid-theoretic circuits

A circuit is a minimal affinely dependent {ai}i∈I ⊂ Rn. A circuit is simplicial if conv{ai}i∈I has |I| − 1 extreme points. As a matter of notation, let Lk = {λ : λ⊺1 = 0, λk = −1, λ\k ≥ 0}. Simplicial circuits obtained from A ∈ Rm×n are 1-to-1 with certain λ ∈ Rm λ ∈ Lk for some k ∈ [m], A⊺λ = 0 and {ai : λi > 0} is affinely independent.

Riley Murray 24

SLIDE 25

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Circuits and Rn-SAGE

M., Chandrasekaran, and Wierman [46] determined extreme rays of CAGE(A, i) = {c ∈ CNNS(A, Rn) : c\i ≥ 0}. The ordinary SAGE cone is a Minkowski sum CSAGE(A) =

CAGE(A, i). Katth¨ an, Naumann, and Theobald [50] completely determined ext CSAGE(A). Forsg˚ ard and de Wolff [51] studied ∂CSAGE(A) in detail;defined Rez(A) = co{λ ∈ Rm : λ is a simplicial circuit w.r.t. A}. Combine [50, 51] to clearly link ext CSAGE(A) and ext Rez(A).

Riley Murray 25

SLIDE 26

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Circuits and X-SAGE

The following is ongoing, joint work with Helen Naumann and Thorsten Theobald.

Definition. A simplicial X-circuit induced by A ∈ Rm×n is a vector λ⋆ ∈ Rm where
1. λ⋆ ∈ Lk for some k ∈ [m],
2. σX (−A⊺λ⋆) < +∞, and
3. if λ → σX (−A⊺λ) is linear on [λ1, λ2] ⊂ Lk, then λ⋆ ∈ relint[λ1, λ2].

A clean generalization from X = Rn. Provides the basis for a “Reznick cone” with conditional SAGE certificates. Particularly informative when X is a polyhedron. E.g., if X is a polyhedron, then CSAGE(A, X) is power-cone representable.

Riley Murray 26

SLIDE 27

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Motivation. Mathematical Preliminaries. Sums-of-AM/GM-Exponentials. Sparsity preservation. A hierarchy. Extreme rays. Conclusion.

Riley Murray 27

SLIDE 28

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Some open problems

1. When do we have CSAGE(A, X) = CNNS(A, X)?

For X = Rn see [46, 51], and also [46, 47] for polynomials.

2. If f > 0 on compact X, is there some g > 0 so f · g is X-SAGE?

Resolved in the affirmative for A ∈ Qm×n [49]! Follow-up questions ... If “h =standard multiplier,” how to bound least ℓ where hℓ · f is X-SAGE? Irrational A? Perhaps leverage Hausdorff continuity.

3. Complexity of testing “c ∈ CNNS(α, X)” with two ci < 0?

Many possible algorithmic projects (ask me for details). More open problems to follow once “X-circuit” paper is put on arXiv.

Riley Murray 28

SLIDE 29

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Acknowledgements

Special thanks to my collaborators, on all things SAGE-related: Venkat Chandrasekaran, Adam Wierman, Helen Naumann, Thorsten Theobald, Fang Xiao, and Berk Orztuk. Thanks also to the many people who make working in this area exciting: Sadik Iliman, Timo de Wolff, Marieke Dressler, Henning Seidler, Janin Heuer, Jie Wang, Jen Forsg˚ ard, and more!

Riley Murray 29

SLIDE 30

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

References I

[1] Yurii Nesterov and Arkadii Nemirovskii. Interior-Point Polynomial Algorithms in Convex Programming. Society for Industrial and Applied Mathematics, January 1994. [2] Stephen P. Boyd, Seung-Jean Kim, Dinesh D. Patil, and Mark A. Horowitz. Digital circuit optimization via geometric programming. Operations Research, 53(6):899–932, December 2005. [3] R. A. Jabr. Application of geometric programming to transformer design. IEEE Transactions on Magnetics, 41(11):4261–4269, 2005. [4] D. Patil, S. Yun, S. . Kim, A. Cheung, M. Horowitz, and S. Boyd. A new method for design of robust digital circuits. In Sixth international symposium on quality electronic design (isqed’05), pages 676–681, 2005. [5] Ting-Yu Chen. Structural optimization using single-term posynomial geometric programming. Computers & Structures, 45(5-6):911–918, December 1992.

Riley Murray 30

SLIDE 31

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

References II

[6] Prabhat Hajela. Geometric programming strategies in large-scale structural synthesis. AIAA Journal, 24(7):1173–1178, July 1986. [7] Yves Smeers and Daniel Tyteca. A geometric programming model for the optimal design of wastewater treatment plants. Operations Research, 32(2):314–342, April 1984. [8] Warren Hoburg and Pieter Abbeel. Geometric programming for aircraft design optimization. AIAA Journal, 52(11):2414–2426, November 2014. [9] Michael Burton and Warren Hoburg. Solar and gas powered long-endurance unmanned aircraft sizing via geometric programming. Journal of Aircraft, 55(1):212–225, January 2018. [10] C. Nowzari, V. M. Preciado, and G. J. Pappas. Optimal resource allocation for control of networked epidemic models. IEEE Transactions on Control of Network Systems, 4(2):159–169, 2017.

Riley Murray 31

SLIDE 32

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

References III

[11] V. M. Preciado, M. Zargham, C. Enyioha, A. Jadbabaie, and G. J. Pappas. Optimal resource allocation for network protection against spreading processes. IEEE Transactions on Control of Network Systems, 1(1):99–108, 2014. [12] C. Nowzari, V. M. Preciado, and G. J. Pappas. Analysis and control of epidemics: A survey of spreading processes on complex networks. IEEE Control Systems Magazine, 36(1):26–46, 2016. [13] M. Chiang, C. W. Tan, D. P. Palomar, D. O’neill, and D. Julian. Power control by geometric programming. IEEE Transactions on Wireless Communications, 6(7):2640–2651, 2007. [14] J. Rajasekharan and V. Koivunen. Optimal energy consumption model for smart grid households with energy storage. IEEE Journal of Selected Topics in Signal Processing, 8(6):1154–1166, 2014.

Riley Murray 32

SLIDE 33

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

References IV

[15] Abhijit Davare, Qi Zhu, Marco Di Natale, Claudio Pinello, Sri Kanajan, and Alberto Sangiovanni-Vincentelli. Period optimization for hard real-time distributed automotive systems. In Proceedings of the 44th Annual Design Automation Conference, DAC ’07, page 278–283, New York, NY, USA, 2007. Association for Computing Machinery. [16] S. Misra, M. W. Fisher, S. Backhaus, R. Bent, M. Chertkov, and F. Pan. Optimal compression in natural gas networks: A geometric programming approach. IEEE Transactions on Control of Network Systems, 2(1):47–56, 2015. [17] Y. He, W. Zhu, and L. Guan. Optimal resource allocation for pervasive health monitoring systems with body sensor networks. IEEE Transactions on Mobile Computing, 10(11):1558–1575, 2011. [18] Alberto Marin-Sanguino, Eberhard O Voit, Carlos Gonzalez-Alcon, and Nestor V Torres. Optimization of biotechnological systems through geometric programming. Theoretical Biology and Medical Modelling, 4(1), September 2007.

Riley Murray 33

SLIDE 34

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

References V

[19] J.R. Rajasekera and M. Yamada. Estimating the firm value distribution function by entropy optimization and geometric programming. Annals of Operations Research, 105(1/4):61–75, 2001. [20] Farshid Samadi, Abolfazl Mirzazadeh, and Mir Mohsen Pedram. Fuzzy pricing, marketing and service planning in a fuzzy inventory model: A geometric programming approach. Applied Mathematical Modelling, 37(10-11):6683–6694, June 2013. [21] Seyed Ahmad Yazdian, Kamran Shahanaghi, and Ahmad Makui. Joint optimisation of price, warranty and recovery planning in remanufacturing of used products under linear and non-linear demand, return and cost functions. International Journal of Systems Science, 47(5):1155–1175, May 2014. [22] M. MAZUMDAR and T. R. JEFFERSON. Maximum likelihood estimates for multinomial probabilities via geometric programming. Biometrika, 70(1):257–261, 04 1983.

Riley Murray 34

SLIDE 35

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

References VI

[23] Paul D. Feigin and Ury Passy. The geometric programming dual to the extinction probability problem in simple branching processes. The Annals of Probability, 9(3):498–503, 1981. [24] Martin A. York, Warren W. Hoburg, and Mark Drela. Turbofan engine sizing and tradeoff analysis via signomial programming. Journal of Aircraft, 55(3):988–1003, May 2018. [25] Martin A. York, Berk ¨ Ozt¨ urk, Edward Burnell, and Warren W. Hoburg. Efficient aircraft multidisciplinary design optimization and sensitivity analysis via signomial programming. AIAA Journal, 56(11):4546–4561, November 2018. [26] Berk Ozturk and Ali Saab. Optimal aircraft design deicions under uncertainty via robust signomial programming. In AIAA Aviation 2019 Forum. American Institute of Aeronautics and Astronautics, June 2019.

Riley Murray 35

SLIDE 36

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

References VII

[27] Philippe G. Kirschen and Warren W. Hoburg. The power of log transformation: A comparison of geometric and signomial programming with general nonlinear programming techniques for aircraft design optimization. In 2018 AIAA/ASCE/AHS/ASC Structures, Structural Dynamics, and Materials

Conference. American Institute of Aeronautics and Astronautics, January 2018.

[28] Philippe G. Kirschen, Martin A. York, Berk Ozturk, and Warren W. Hoburg. Application of signomial programming to aircraft design. Journal of Aircraft, 55(3):965–987, May 2018. [29] M. Avriel and J. D. Barrett. Optimal design of pitched laminated wood beams. In Advances in Geometric Programming, pages 407–419. Springer US, 1980. [30] Hojjat Adeli and Osama Kamal. Efficient optimization of space trusses. Computers & Structures, 24(3):501 – 511, 1986.

Riley Murray 36

SLIDE 37

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

References VIII

[31] Balaur S. Dhillon and Chen-Hsing Kuo. Optimum design of composite hybrid plate girders. Journal of Structural Engineering, 117(7):2088–2098, 1991. [32] Yun Kang Sui and Xi Cheng Wang. Second-order method of generalized geometric programming for spatial frame

ptimization.

Computer Methods in Applied Mechanics and Engineering, 141(1-2):117–123, February 1997. [33] R.A. Jabr. Inductor design using signomial programming. COMPEL - The international journal for computation and mathematics in electrical and electronic engineering, 26(2):461–475, 2007. [34] Mung Chiang. Nonconvex Optimization for Communication Networks, pages 137–196. Springer US, Boston, MA, 2009.

Riley Murray 37

SLIDE 38

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

References IX

[35] Han Zhao, Pascal Poupart, and Geoffrey J Gordon. A unified approach for learning the parameters of sum-product networks. In D. D. Lee, M. Sugiyama, U. V. Luxburg, I. Guyon, and R. Garnett, editors, Advances in Neural Information Processing Systems 29, pages 433–441. Curran Associates, Inc., 2016. [36] Bruce Reznick. Extremal PSD forms with few terms. Duke Mathematical Journal, 45(2):363–374, jun 1978. [37] Bruce Reznick. Forms derived from the arithmetic-geometric inequality. Mathematische Annalen, 283(3):431–464, 1989. [38] Philippe P. P´ ebay, J. Maurice Rojas, and David C. Thompson. Optimization and NP r-completeness of certain fewnomials. In Proceedings of the 2009 conference on Symbolic numeric computation. ACM Press, 2009.

Riley Murray 38

SLIDE 39

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

References X

[39] Mehdi Ghasemi and Murray Marshall. Lower bounds for polynomials using geometric programming. SIAM Journal on Optimization, 22(2):460–473, jan 2012. [40] M. Ghasemi, J. B. Lasserre, and M. Marshall. Lower bounds on the global minimum of a polynomial. Computational Optimization and Applications, 57(2):387–402, sep 2013. [41] Casian Pantea, Heinz Koeppl, and Gheorghe Craciun. Global injectivity and multiple equilibria in uni- and bi-molecular reaction networks. Discrete and Continuous Dynamical Systems - Series B, 17(6):2153–2170, May 2012. [42] Elias August, Gheorghe Craciun, and Heinz Koeppl. Finding invariant sets for biological systems using monomial domination. In 2012 IEEE 51st IEEE Conference on Decision and Control (CDC). IEEE, December 2012. [43] Sadik Iliman and Timo de Wolff. Amoebas, nonnegative polynomials and sums of squares supported on circuits. Research in the Mathematical Sciences, 3(1):9, Mar 2016.

Riley Murray 39

SLIDE 40

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

References XI

[44] Venkat Chandrasekaran and Parikshit Shah. Relative entropy relaxations for signomial optimization. SIAM Journal on Optimization, 26(2):1147 – 1173, 2016. [45] Riley Murray, Venkat Chandrasekaran, and Adam Wierman. Signomial and polynomial optimization via relative entropy and partial dualization, 2019. [46] Riley Murray, Venkat Chandrasekaran, and Adam Wierman. Newton polytopes and relative entropy optimization, 2018. [47] Jie Wang. Nonnegative polynomials and circuit polynomials, 2018. [48] Jie Wang. On supports of sums of nonnegative circuit polynomials, 2018. [49] Allen Houze Wang, Priyank Jaini, Yaoliang Yu, and Pascal Poupart. Complete hierarchy of relaxation for constrained signomial positivity, 2020.

Riley Murray 40

SLIDE 41

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

References XII

[50] L. Katth¨ an, H. Naumann, and T. Theobald. A unified framework of SAGE and SONC polynomials and its duality theory, 2019. Preprint, arXiv:1903.08966. [51] Jens Forsg˚ ard and Timo de Wolff. The algebraic boundary of the sonc cone, 2019.

Riley Murray 41

SLIDE 42

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Appendices

Exactness analysis. Software. Optimization with nonconvex constraints. Log-log convex (“geometrically convex”) functions.

Riley Murray 42

SLIDE 43

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Rn-SAGE Exactness

10 2 4 8 8 7 6 6 6 8 5 4 10 3 4 2 1 2 10 2 8 4 10 6 6 8 8 6 4 10 4 2 2 2 20 4 6 8 15 20 10 12 15 10 14 10 16 5 5 2 10 3 8 10 4 6 8 5 6 4 6 4 2 2

f g f ·g f +g

Riley Murray 43

SLIDE 44

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Simplicial sign patterns

Theorem (7)

If conv(A) is simplicial, and ci ≤ 0 for all nonextremal ai, then c ∈ CNNS(A, Rn) if and only if c ∈ CSAGE(A, Rn).

0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0.1

Riley Murray 44

SLIDE 45

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Simplicial sign patterns

Theorem (7)

If conv(A) is simplicial, and ci ≤ 0 for all nonextremal ai, then c ∈ CNNS(A, Rn) if and only if c ∈ CSAGE(A, Rn).

1 2 1 1 2 2

f(x) = (ex1 − ex2 − ex3)2 is clearly nonnegative, but

Riley Murray 45

SLIDE 46

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Partitioning a Newton polytope

We say that A can be partitioned into ℓ faces if we can permute its rows so that A = [A(1); . . . ; A(ℓ)] where {conv A(i)}ℓ

i=1 are mutually disjoint faces of conv(A).

Riley Murray 46

SLIDE 47

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Partitioning a Newton polytope

Theorem (8)

If {A(i)}ℓ

i=1 are matrices partitioning A = [A(1); . . . ; A(ℓ)], then

CNNS(A, Rn) = ⊕ℓ

i=1CNNS(A(i), Rn)

–and the same is true of CSAGE(A, Rn). Sanity checks : All matrices A admit a trivial partition with ℓ = 1. If all ai are extremal, then CNNS(A, Rn) = Rm

+ .

Riley Murray 47

SLIDE 48

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

An Rn-SAGE exactness theorem

Theorem (9)

Suppose A can be partitioned into faces where

1 each simplicial face has ≤ 2 nonextremal exponents, and 2 all other faces contain at most one nonextremal exponent.

Then CSAGE(A, Rn) = CNNS(A, Rn). Violate the first hypothesis? Consider f(x) = (ex1 − ex2 − ex3)2 not SAGE, per C&S’16. Violate the second hypothesis? Consider A⊺ = [e1, e2, 2e1, 2e2, 2(e1 + e2), 0], for which (−4, −2, 3, 2, 1, 1.8) ∈ CNNS(A, R2) \ CSAGE(A, R2).

Riley Murray 48

SLIDE 49

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Optimization with nonconvex constraints

Q: What should we do when some constraints are nonconvex? A: Combine X-SAGE certificates with Lagrangian relaxations. Concretely, suppose we want to minimize f over Ω . = X ∩ {x : g(x) ≤ 0} where X is convex, but g1, . . . , gk are nonconvex signomials. Then, if λ1, . . . , λk are nonnegative dual variables, we have inf

x∈Ω f(x) ≥ sup

γ : f +

λigi − γ is X-SAGE

Riley Murray 49

SLIDE 50

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

The SimPleAC aircraft design problem

From Warren Hoburg’s PhD thesis. Problem statistics: 140 variables. 89 inequality constraints (1 nonconvex). 67 equality constraints (15 nonconvex). Performance of the most basic SAGE relaxation: bound “cost ≥ 2957” (roughly match a known solution). MOSEK solves in two seconds, on a six year old laptop. solution recovery fails (numerical issues).

Riley Murray 50

SLIDE 51

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

The sageopt python package

import sageopt as so y = so.standard_sig_monomials(3) f = 0.5y[0]/y[1] - y[0] - 5/y[1] ineqs = [100 - y[1]/y[2] - y[1] - 0.05y[0]*y[2], y[0] - 70, y[1] - 1, y[2] - 0.5, 150 - y[0], 30 - y[1], 21 - y[2]] X = so.infer_domain(f, gts, []) prob = so.sig_relaxation(f, X, form=’dual’) prob.solve() solutions = so.sig_solrec(prob)

Riley Murray 51

SLIDE 52

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

The sageopt python package

... # define f, X as before from sageopt import coniclifts as cl modulator = so.Signomial(f.alpha, np.ones(f.m)) ** 3 gamma = cl.Variable() h = modulator * (f - gamma) con = cl.PrimalSageCone(h.c, h.alpha, X, ’con_name’) prob = cl.Problem(cl.MAX, gamma, [con]) prob.solve() age_vecs = [v.value for v in con.age_vectors.values()] age_sigs = [so.Signomial(h.alpha, v) for v in age_vecs] h_numeric = so.Signomial(h.alpha, h.c.value)

Riley Murray 52

SLIDE 53

Motivation Preliminaries Sums-of-AM/GM-Exponentials Sparsity preservation A Hierarchy Extreme rays Conclusion Appendices

Log-log convexity: examples

With domains D = Rn

++:

g(x) = max{x1, . . . , xn} g(x) = xa1

1 · · · xan n

g(x) = ∞

e−t2dt −1 With more restricted domains: x → (−x log x)−1 D = (0, 1) X → (I − X)−1 D = {X ∈ Rn×n

: ρ(X) < 1} x → (log x)−1 D = (1, ∞) Some tractable constraints for X-SAGE polynomials: xp ≤ a x2

j = a

a ≤ P{N(0, σ) ≥ |x|} where a > 0.

Riley Murray 53