Skip to main content

Showing 1–10 of 10 results for author: Asteris, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:1605.08961  [pdf, other

    stat.ML cs.DS cs.IT math.OC stat.ME

    A simple and provable algorithm for sparse diagonal CCA

    Authors: Megasthenis Asteris, Anastasios Kyrillidis, Oluwasanmi Koyejo, Russell Poldrack

    Abstract: Given two sets of variables, derived from a common set of samples, sparse Canonical Correlation Analysis (CCA) seeks linear combinations of a small number of variables in each set, such that the induced canonical variables are maximally correlated. Sparse CCA is NP-hard. We propose a novel combinatorial algorithm for sparse diagonal CCA, i.e., sparse CCA under the additional assumption that vari… ▽ More

    Submitted 28 May, 2016; originally announced May 2016.

    Comments: To appear at ICML 2016, 14 pages, 4 figures

  2. arXiv:1603.06861  [pdf, other

    stat.ML cs.IT cs.LG math.OC

    Trading-off variance and complexity in stochastic gradient descent

    Authors: Vatsal Shah, Megasthenis Asteris, Anastasios Kyrillidis, Sujay Sanghavi

    Abstract: Stochastic gradient descent is the method of choice for large-scale machine learning problems, by virtue of its light complexity per iteration. However, it lags behind its non-stochastic counterparts with respect to the convergence rate, due to high variance introduced by the stochastic updates. The popular Stochastic Variance-Reduced Gradient (SVRG) method mitigates this shortcoming, introducing… ▽ More

    Submitted 22 March, 2016; originally announced March 2016.

    Comments: 14 pages, 13 figures, first edition on 9th of October 2015

  3. arXiv:1603.02782  [pdf, other

    cs.DS stat.ML

    Bipartite Correlation Clustering -- Maximizing Agreements

    Authors: Megasthenis Asteris, Anastasios Kyrillidis, Dimitris Papailiopoulos, Alexandros G. Dimakis

    Abstract: In Bipartite Correlation Clustering (BCC) we are given a complete bipartite graph $G$ with `+' and `-' edges, and we seek a vertex clustering that maximizes the number of agreements: the number of all `+' edges within clusters plus all `-' edges cut across clusters. BCC is known to be NP-hard. We present a novel approximation algorithm for $k$-BCC, a variant of BCC with an upper bound $k$ on the… ▽ More

    Submitted 9 March, 2016; originally announced March 2016.

    Comments: To appear in AISTATS 2016

  4. arXiv:1508.00625  [pdf, ps, other

    stat.ML cs.DS cs.LG math.OC

    Sparse PCA via Bipartite Matchings

    Authors: Megasthenis Asteris, Dimitris Papailiopoulos, Anastasios Kyrillidis, Alexandros G. Dimakis

    Abstract: We consider the following multi-component sparse PCA problem: given a set of data points, we seek to extract a small number of sparse components with disjoint supports that jointly capture the maximum possible variance. These components can be computed one by one, repeatedly solving the single-component problem and deflating the input data matrix, but as we show this greedy procedure is suboptimal… ▽ More

    Submitted 3 August, 2015; originally announced August 2015.

  5. arXiv:1506.02344  [pdf, other

    stat.ML cs.IT cs.LG math.OC

    Stay on path: PCA along graph paths

    Authors: Megasthenis Asteris, Anastasios Kyrillidis, Alexandros G. Dimakis, Han-Gyol Yi and, Bharath Chandrasekaran

    Abstract: We introduce a variant of (sparse) PCA in which the set of feasible support sets is determined by a graph. In particular, we consider the following setting: given a directed acyclic graph $G$ on $p$ vertices corresponding to variables, the non-zero entries of the extracted principal component must coincide with vertices lying along a path in $G$. From a statistical perspective, information on th… ▽ More

    Submitted 18 June, 2015; v1 submitted 7 June, 2015; originally announced June 2015.

    Comments: 12 pages, 5 figures, In Proceedings of International Conference on Machine Learning (ICML) 2015

  6. arXiv:1504.05294  [pdf, ps, other

    cs.IT

    On Approximating the Sum-Rate for Multiple-Unicasts

    Authors: Karthikeyan Shanmugam, Megasthenis Asteris, Alexandros G. Dimakis

    Abstract: We study upper bounds on the sum-rate of multiple-unicasts. We approximate the Generalized Network Sharing Bound (GNS cut) of the multiple-unicasts network coding problem with $k$ independent sources. Our approximation algorithm runs in polynomial time and yields an upper bound on the joint source entropy rate, which is within an $O(\log^2 k)$ factor from the GNS cut. It further yields a vector-li… ▽ More

    Submitted 15 November, 2015; v1 submitted 20 April, 2015; originally announced April 2015.

    Comments: 10 pages; Shorter version appeared at ISIT (International Symposium on Information Theory) 2015; some typos corrected

  7. arXiv:1401.0734  [pdf, other

    cs.IT

    Repairable Fountain Codes

    Authors: Megasthenis Asteris, Alexandros G. Dimakis

    Abstract: We introduce a new family of Fountain codes that are systematic and also have sparse parities. Given an input of $k$ symbols, our codes produce an unbounded number of output symbols, generating each parity independently by linearly combining a logarithmic number of randomly selected input symbols. The construction guarantees that for any $ε>0$ accessing a random subset of $(1+ε)k$ encoded symbols,… ▽ More

    Submitted 3 January, 2014; originally announced January 2014.

    Comments: To appear in IEEE Journal on Selected Areas in Communications, Issue on Communication Methodologies for Next-Generation Storage Systems 2013, 11 pages, 2 figures

  8. arXiv:1312.5891  [pdf, other

    cs.IT stat.ML

    The Sparse Principal Component of a Constant-rank Matrix

    Authors: Megasthenis Asteris, Dimitris S. Papailiopoulos, George N. Karystinos

    Abstract: The computation of the sparse principal component of a matrix is equivalent to the identification of its principal submatrix with the largest maximum eigenvalue. Finding this optimal submatrix is what renders the problem ${\mathcal{NP}}$-hard. In this work, we prove that, if the matrix is positive semidefinite and its rank is constant, then its sparse principal component is polynomially computable… ▽ More

    Submitted 20 December, 2013; originally announced December 2013.

  9. arXiv:1301.3791  [pdf, other

    cs.IT cs.DC cs.NI

    XORing Elephants: Novel Erasure Codes for Big Data

    Authors: Maheswaran Sathiamoorthy, Megasthenis Asteris, Dimitris Papailiopoulos, Alexandros G. Dimakis, Ramkumar Vadali, Scott Chen, Dhruba Borthakur

    Abstract: Distributed storage systems for large clusters typically use replication to provide reliability. Recently, erasure codes have been used to reduce the large storage overhead of three-replicated systems. Reed-Solomon codes are the standard design choice and their high repair cost is often considered an unavoidable price to pay for high storage efficiency and high reliability. This paper shows how… ▽ More

    Submitted 16 January, 2013; originally announced January 2013.

    Comments: Technical report, paper to appear in Proceedings of VLDB, 2013

  10. arXiv:1106.1651  [pdf, other

    cs.IT cs.LG eess.SY math.OC

    Sparse Principal Component of a Rank-deficient Matrix

    Authors: Megasthenis Asteris, Dimitris S. Papailiopoulos, George N. Karystinos

    Abstract: We consider the problem of identifying the sparse principal component of a rank-deficient matrix. We introduce auxiliary spherical variables and prove that there exists a set of candidate index-sets (that is, sets of indices to the nonzero elements of the vector argument) whose size is polynomially bounded, in terms of rank, and contains the optimal index-set, i.e. the index-set of the nonzero ele… ▽ More

    Submitted 8 June, 2011; originally announced June 2011.

    Comments: 5 pages, 1 figure, to be presented at ISIT