-
From Trees to Polynomials and Back Again: New Capacity Bounds with Applications to TSP
Authors:
Leonid Gurvits,
Nathan Klein,
Jonathan Leake
Abstract:
We give simply exponential lower bounds on the probabilities of a given strongly Rayleigh distribution, depending only on its expectation. This resolves a weak version of a problem left open by Karlin-Klein-Oveis Gharan in their recent breakthrough work on metric TSP, and this resolution leads to a minor improvement of their approximation factor for metric TSP. Our results also allow for a more streamlined analysis of the algorithm.
To achieve these new bounds, we build upon the work of Gurvits-Leake on the use of the productization technique for bounding the capacity of a real stable polynomial. This technique allows one to reduce certain inequalities for real stable polynomials to products of affine linear forms, which have an underlying matrix structure. In this paper, we push this technique further by characterizing the worst-case polynomials via bipartitioned forests. This rigid combinatorial structure yields a clean induction argument, which implies our stronger bounds.
In general, we believe the results of this paper will lead to further improvement and simplification of the analysis of various combinatorial and probabilistic bounds and algorithms.
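As background for the abstract above: the capacity of a polynomial is usually defined as below. This is the standard definition from the literature on stable polynomials, not a quotation from the paper, and the symbols $p$, $κ$, and $x$ are notation assumed here. The second display is a small worked instance in which $p$ is a product of affine linear forms.

```latex
\[
  \operatorname{Cap}_{\kappa}(p)
  \;=\; \inf_{x_1,\dots,x_n > 0}
  \frac{p(x_1,\dots,x_n)}{x_1^{\kappa_1}\cdots x_n^{\kappa_n}}
\]
\[
  p(x_1,x_2) = \tfrac{1}{4}(x_1+x_2)^2,\quad \kappa = (1,1):
  \qquad
  \operatorname{Cap}_{\kappa}(p)
  = \inf_{x_1,x_2>0} \frac{(x_1+x_2)^2}{4\,x_1 x_2}
  = 1
  \quad \text{(by AM-GM, attained at } x_1 = x_2\text{)}.
\]
```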
Submitted 9 May, 2024; v1 submitted 15 November, 2023;
originally announced November 2023.
-
Deterministic Approximation Algorithms for Volumes of Spectrahedra
Authors:
Mahmut Levent Doğan,
Jonathan Leake,
Mohan Ravichandran
Abstract:
We give a method for computing asymptotic formulas and approximations for the volumes of spectrahedra, based on the maximum-entropy principle from statistical physics. The method gives an approximate volume formula based on a single convex optimization problem of minimizing $-\log \det P$ over the spectrahedron. Spectrahedra can be described as affine slices of the convex cone of positive semi-definite (PSD) matrices, and the method yields efficient deterministic approximation algorithms and asymptotic formulas whenever the number of affine constraints is sufficiently dominated by the dimension of the PSD cone.
Our approach is inspired by the work of Barvinok and Hartigan who used an analogous framework for approximately computing volumes of polytopes. Spectrahedra, however, possess a remarkable feature not shared by polytopes, a new fact that we also prove: central sections of the set of density matrices (the quantum version of the simplex) all have asymptotically the same volume. This allows for very general approximation algorithms, which apply to large classes of naturally occurring spectrahedra.
We give two main applications of this method. First, we apply this method to what we call the "multi-way Birkhoff spectrahedron" and obtain an explicit asymptotic formula for its volume. This spectrahedron is the set of quantum states with maximal entanglement (i.e., the quantum states having univariant quantum marginals equal to the identity matrix) and is the quantum analog of the multi-way Birkhoff polytope. Second, we apply this method to explicitly compute the asymptotic volume of central sections of the set of density matrices.
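To make the optimization step in the abstract above concrete, here is a small worked instance (an illustration chosen here, not taken from the paper) for the simplest spectrahedron mentioned above, the set of $n \times n$ density matrices $\{P \succeq 0,\ \mathrm{tr}\,P = 1\}$. Writing $\lambda_1,\dots,\lambda_n$ for the eigenvalues of $P$ (notation assumed here), the minimizer of $-\log \det P$ is the maximally mixed state.

```latex
\[
  -\log\det P \;=\; -\sum_{i=1}^{n} \log \lambda_i
  \;\ge\; -n \log\!\Big(\frac{1}{n}\sum_{i=1}^{n}\lambda_i\Big)
  \;=\; n \log n ,
\]
\[
  \text{with equality if and only if } \lambda_1 = \dots = \lambda_n = \tfrac{1}{n},
  \text{ i.e. } P = \tfrac{1}{n} I .
\]
```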
Submitted 22 November, 2022;
originally announced November 2022.
-
Optimization and Sampling Under Continuous Symmetry: Examples and Lie Theory
Authors:
Jonathan Leake,
Nisheeth K. Vishnoi
Abstract:
In the last few years, the notion of symmetry has provided a powerful and essential lens to view several optimization or sampling problems that arise in areas such as theoretical computer science, statistics, machine learning, quantum inference, and privacy. Here, we present two examples of nonconvex problems in optimization and sampling where continuous symmetries play -- implicitly or explicitly -- a key role in the development of efficient algorithms. These examples rely on deep and hidden connections between nonconvex symmetric manifolds and convex polytopes, and are heavily generalizable. To formulate and understand these generalizations, we then present an introduction to Lie theory -- an indispensable mathematical toolkit for capturing and working with continuous symmetries. We first present the basics of Lie groups, Lie algebras, and the adjoint actions associated with them, and we also mention the classification theorem for Lie algebras. Subsequently, we present Kostant's convexity theorem and show how it allows us to reduce linear optimization problems over orbits of Lie groups to linear optimization problems over polytopes. Finally, we present the Harish-Chandra and the Harish-Chandra--Itzykson--Zuber (HCIZ) formulas, which convert partition functions (integrals) over Lie groups into sums over the corresponding (discrete) Weyl groups, enabling efficient sampling algorithms.
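The reduction from orbits to polytopes described above can be sketched for the unitary group, where Kostant's theorem specializes to the Schur-Horn theorem. This is a minimal illustration under stated assumptions, not code from the paper: both matrices are taken to be diagonal with real entries `lam` and `y`, and the function names are hypothetical. The maximum of $\mathrm{tr}(UΛU^*Y)$ over the continuous orbit is then attained at a permutation of the eigenvalues, so it equals a maximum over the finite Weyl group $S_n$.

```python
from itertools import permutations


def max_over_orbit(lam, y):
    """Maximum of tr(U diag(lam) U* diag(y)) over unitary U.

    By the Schur-Horn theorem the diagonals of the orbit fill the
    permutohedron of lam, so the linear objective is maximized at a
    vertex; the rearrangement inequality picks the sorted pairing.
    """
    return sum(a * b for a, b in zip(sorted(lam), sorted(y)))


def max_over_weyl_group(lam, y):
    """Brute-force check: maximize over all permutations (the Weyl group S_n)."""
    n = len(lam)
    return max(
        sum(lam[i] * y[s[i]] for i in range(n))
        for s in permutations(range(n))
    )


if __name__ == "__main__":
    lam = [3.0, -1.0, 2.0]
    y = [0.5, 4.0, -2.0]
    print(max_over_orbit(lam, y), max_over_weyl_group(lam, y))
```

The brute-force version is exponential in $n$ and is included only to check the sorted pairing on small inputs.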
Submitted 2 September, 2021;
originally announced September 2021.
-
Sampling Matrices from Harish-Chandra-Itzykson-Zuber Densities with Applications to Quantum Inference and Differential Privacy
Authors:
Jonathan Leake,
Colin S. McSwiggen,
Nisheeth K. Vishnoi
Abstract:
Given two $n \times n$ Hermitian matrices $Y$ and $Λ$, the Harish-Chandra-Itzykson-Zuber (HCIZ) distribution on the unitary group $\text{U}(n)$ is $e^{\text{tr}(UΛU^*Y)}dμ(U)$, where $μ$ is the Haar measure on $\text{U}(n)$. The density $e^{\text{tr}(UΛU^*Y)}$ is known as the HCIZ density. Random unitary matrices distributed according to the HCIZ density are important in various settings in physics and random matrix theory. However, the basic question of efficient sampling from the HCIZ distribution has remained open. We present two efficient algorithms to sample matrices from distributions that are close to the HCIZ distribution. The first algorithm outputs samples that are $ξ$-close in total variation distance and requires polynomially many arithmetic operations in $\log 1/ξ$ and the number of bits needed to encode $Y$ and $Λ$. The second algorithm comes with a stronger guarantee that the samples are $ξ$-close in infinity divergence, but the number of arithmetic operations depends polynomially on $1/ξ$, the number of bits needed to encode $Y$ and $Λ$, and the differences of the largest and the smallest eigenvalues of $Y$ and $Λ$.
HCIZ densities can also be viewed as exponential densities on $\text{U}(n)$-orbits, and these densities have been studied in statistics, machine learning, and theoretical computer science. Thus our results have the following applications: 1) an efficient algorithm to sample from complex versions of matrix Langevin distributions studied in statistics, 2) an efficient algorithm to sample from continuous max-entropy distributions on unitary orbits, which implies an efficient algorithm to sample a pure quantum state from the entropy-maximizing ensemble representing a given density matrix, and 3) an efficient algorithm for differentially private rank-$k$ approximation, with improved utility bounds for $k>1$.
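For orientation, the normalizing constant of the HCIZ density defined above has a closed form, the HCIZ formula, which can be evaluated as a signed sum over permutations. The sketch below is not the sampling algorithm of the paper. It assumes that $Λ$ and $Y$ each have distinct real eigenvalues, passed as the lists `lam` and `y`, and that the Haar measure is normalized to total mass 1; all function names are hypothetical.

```python
import math
from itertools import permutations


def perm_sign(p):
    """Sign of a permutation, computed from its inversion count."""
    inversions = sum(
        1 for i in range(len(p)) for j in range(i + 1, len(p)) if p[i] > p[j]
    )
    return -1 if inversions % 2 else 1


def vandermonde(x):
    """Product of (x[j] - x[i]) over all pairs i < j."""
    return math.prod(
        x[j] - x[i] for i in range(len(x)) for j in range(i + 1, len(x))
    )


def hciz_integral(lam, y):
    """Integral of exp(tr(U diag(lam) U* diag(y))) over U(n), Haar mass 1.

    HCIZ formula: (prod_{p=1}^{n-1} p!) * det[exp(lam_i * y_j)]
                  / (vandermonde(lam) * vandermonde(y)),
    with the determinant expanded as a signed sum over permutations.
    Requires distinct entries in lam and in y (otherwise division by zero).
    """
    n = len(lam)
    weyl_sum = sum(
        perm_sign(s) * math.exp(sum(lam[i] * y[s[i]] for i in range(n)))
        for s in permutations(range(n))
    )
    const = math.prod(math.factorial(p) for p in range(1, n))
    return const * weyl_sum / (vandermonde(lam) * vandermonde(y))
```

The permutation expansion costs $n!$ terms and suffers from cancellation when eigenvalues are close together, so it is suitable only for small, well-separated inputs.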
Submitted 6 April, 2021; v1 submitted 10 November, 2020;
originally announced November 2020.
-
On the Computability of Continuous Maximum Entropy Distributions: Adjoint Orbits of Lie Groups
Authors:
Jonathan Leake,
Nisheeth K. Vishnoi
Abstract:
Given a point $A$ in the convex hull of a given adjoint orbit $\mathcal{O}(F)$ of a compact Lie group $G$, we give a polynomial time algorithm to compute the probability density supported on $\mathcal{O}(F)$ whose expectation is $A$ and that minimizes the Kullback-Leibler divergence to the $G$-invariant measure on $\mathcal{O}(F)$. This significantly extends the recent work of the authors (STOC 2020) who presented such a result for the manifold of rank $k$-projections which is a specific adjoint orbit of the unitary group $\mathrm{U}(n)$. Our result relies on the ellipsoid method-based framework proposed in prior work; however, to apply it to the general setting of compact Lie groups, we need tools from Lie theory. For instance, properties of the adjoint representation are used to find the defining equalities of the minimal affine space containing the convex hull of $\mathcal{O}(F)$, and to establish a bound on the optimal dual solution. Also, the Harish-Chandra integral formula is used to obtain an evaluation oracle for the dual objective function. While the Harish-Chandra integral formula allows us to write certain integrals over the adjoint orbit of a Lie group as a sum of a small number of determinants, it is only defined for elements of a chosen Cartan subalgebra of the Lie algebra $\mathfrak{g}$ of $G.$ We show how it can be applied to our setting with the help of Kostant's convexity theorem. Further, the convex hull of an adjoint orbit is a type of orbitope, and the orbitopes studied in this paper are known to be spectrahedral. Thus our main result can be viewed as extending the maximum entropy framework to a class of spectrahedra.
Submitted 3 November, 2020;
originally announced November 2020.
-
On the computability of continuous maximum entropy distributions with applications
Authors:
Jonathan Leake,
Nisheeth K. Vishnoi
Abstract:
We initiate a study of the following problem: Given a continuous domain $Ω$ along with its convex hull $\mathcal{K}$, a point $A \in \mathcal{K}$ and a prior measure $μ$ on $Ω$, find the probability density over $Ω$ whose marginal is $A$ and that minimizes the KL-divergence to $μ$. This framework gives rise to several extremal distributions that arise in mathematics, quantum mechanics, statistics, and theoretical computer science. Our technical contributions include a polynomial bound on the norm of the optimizer of the dual problem that holds in a very general setting and relies on a "balance" property of the measure $μ$ on $Ω$, and exact algorithms for evaluating the dual and its gradient for several interesting settings of $Ω$ and $μ$. Together, along with the ellipsoid method, these results imply polynomial-time algorithms to compute such KL-divergence minimizing distributions in several cases. Applications of our results include: 1) an optimization characterization of the Goemans-Williamson measure that is used to round a positive semidefinite matrix to a vector, 2) the computability of the entropic barrier for polytopes studied by Bubeck and Eldan, and 3) a polynomial-time algorithm to compute the barycentric quantum entropy of a density matrix that was proposed as an alternative to von Neumann entropy in the 1970s: this corresponds to the case when $Ω$ is the set of rank-one projection matrices and $μ$ corresponds to the Haar measure on the unit sphere. Our techniques generalize to the setting of Hermitian rank $k$ projections using the Harish-Chandra-Itzykson-Zuber formula, and are applicable even beyond, to adjoint orbits of compact Lie groups.
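A one-dimensional toy instance may help fix ideas about the dual problem described above. It is an illustration chosen here, not an example from the paper: take $Ω = [-1, 1]$ with $μ$ the uniform measure. The KL-minimizing density with mean $A$ is then proportional to $e^{yx}$, and the dual optimizer $y$ solves $\coth(y) - 1/y = A$. The sketch uses plain bisection where the paper uses the ellipsoid method, and the function names are hypothetical.

```python
import math


def mean_of_exponential_density(y):
    """Mean of the density proportional to exp(y * x) on [-1, 1].

    Closed form: coth(y) - 1/y, which is the gradient of the
    log-partition function log(integral of exp(y * x) dx).
    """
    if abs(y) < 1e-4:
        return y / 3.0  # leading Taylor term; avoids cancellation near y = 0
    return 1.0 / math.tanh(y) - 1.0 / y


def solve_dual(a, iterations=200):
    """Find y such that the density exp(y * x) on [-1, 1] has mean a.

    Requires -1 < a < 1. Uses the odd symmetry of the mean map to
    reduce to a >= 0, then bisection on a bracket that is doubled
    until it contains the solution.
    """
    if not -1.0 < a < 1.0:
        raise ValueError("a must lie strictly inside the convex hull (-1, 1)")
    sign = 1.0 if a >= 0 else -1.0
    target = abs(a)
    lo, hi = 0.0, 1.0
    while mean_of_exponential_density(hi) < target:
        hi *= 2.0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if mean_of_exponential_density(mid) < target:
            lo = mid
        else:
            hi = mid
    return sign * 0.5 * (lo + hi)
```

As $A$ approaches the boundary of the convex hull, the required $y$ grows without bound, which is why a bound on the norm of the dual optimizer of the kind mentioned in the abstract matters for running time.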
Submitted 15 April, 2020;
originally announced April 2020.
-
Relational Grid Monitoring Architecture (R-GMA)
Authors:
Rob Byrom,
Brian Coghlan,
Andrew W Cooke,
Roney Cordenonsi,
Linda Cornwall,
Abdeslem Djaoui,
Laurence Field,
Steve Fisher,
Steve Hicks,
Stuart Kenny,
Jason Leake,
James Magowan,
Werner Nutt,
David O'Callaghan,
Norbert Podhorszki,
John Ryan,
Manish Soni,
Paul Taylor,
Antony J Wilson
Abstract:
We describe R-GMA (Relational Grid Monitoring Architecture), which has been developed within the European DataGrid Project as a Grid Information and Monitoring System. It is based on the GMA from GGF, which is a simple Consumer-Producer model. The special strength of this implementation comes from the power of the relational model. We offer a global view of the information as if each Virtual Organisation had one large relational database. We provide a number of different Producer types with different characteristics; for example some support streaming of information. We also provide combined Consumer/Producers, which are able to combine information and republish it. At the heart of the system is the mediator, which for any query is able to find and connect to the best Producers for the job. We have developed components to allow a measure of inter-working between MDS and R-GMA. We have used it both for information about the grid (primarily to find out about what services are available at any one time) and for application monitoring. R-GMA has been deployed in various testbeds; we describe some preliminary results and experiences of this deployment.
Submitted 15 August, 2003;
originally announced August 2003.