Showing 1–5 of 5 results for author: Garakani, A B

  1. arXiv:2002.05753  [pdf, ps, other]

    cs.IR cs.LG

    Multi-objective Ranking via Constrained Optimization

    Authors: Michinari Momma, Alireza Bagheri Garakani, Nanxun Ma, Yi Sun

    Abstract: In this paper, we introduce an Augmented Lagrangian based method to incorporate the multiple objectives (MO) in a search ranking algorithm. Optimizing MOs is an essential and realistic requirement for building ranking models in production. The proposed method formulates MO in constrained optimization and solves the problem in the popular Boosting framework -- a novel contribution of our work. Furt…

    Submitted 13 February, 2020; originally announced February 2020.

  2. arXiv:1701.03577  [pdf, ps, other]

    stat.ML cs.AI cs.CL cs.LG

    Kernel Approximation Methods for Speech Recognition

    Authors: Avner May, Alireza Bagheri Garakani, Zhiyun Lu, Dong Guo, Kuan Liu, Aurélien Bellet, Linxi Fan, Michael Collins, Daniel Hsu, Brian Kingsbury, Michael Picheny, Fei Sha

    Abstract: We study large-scale kernel methods for acoustic modeling in speech recognition and compare their performance to deep neural networks (DNNs). We perform experiments on four speech recognition datasets, including the TIMIT and Broadcast News benchmark tasks, and compare these two types of models on frame-level performance metrics (accuracy, cross-entropy), as well as on recognition metrics (word/ch…

    Submitted 13 January, 2017; originally announced January 2017.

  3. arXiv:1603.05800  [pdf, ps, other]

    cs.LG stat.ML

    A Comparison between Deep Neural Nets and Kernel Acoustic Models for Speech Recognition

    Authors: Zhiyun Lu, Dong Guo, Alireza Bagheri Garakani, Kuan Liu, Avner May, Aurelien Bellet, Linxi Fan, Michael Collins, Brian Kingsbury, Michael Picheny, Fei Sha

    Abstract: We study large-scale kernel methods for acoustic modeling and compare to DNNs on performance metrics related to both acoustic modeling and recognition. Measuring perplexity and frame-level classification accuracy, kernel-based acoustic models are as effective as their DNN counterparts. However, on token-error-rates DNN models can be significantly better. We have discovered that this might be attri…

    Submitted 18 March, 2016; originally announced March 2016.

    Comments: arXiv admin note: text overlap with arXiv:1411.4000

  4. arXiv:1411.4000  [pdf, other]

    cs.LG cs.AI stat.ML

    How to Scale Up Kernel Methods to Be As Good As Deep Neural Nets

    Authors: Zhiyun Lu, Avner May, Kuan Liu, Alireza Bagheri Garakani, Dong Guo, Aurélien Bellet, Linxi Fan, Michael Collins, Brian Kingsbury, Michael Picheny, Fei Sha

    Abstract: The computational complexity of kernel methods has often been a major barrier for applying them to large-scale learning problems. We argue that this barrier can be effectively overcome. In particular, we develop methods to scale up kernel models to successfully tackle large-scale learning problems that are so far only approachable by deep learning architectures. Based on the seminal work by Rahimi…

    Submitted 17 June, 2015; v1 submitted 14 November, 2014; originally announced November 2014.

  5. arXiv:1404.2644  [pdf, other]

    cs.DC cs.AI cs.LG stat.ML

    A Distributed Frank-Wolfe Algorithm for Communication-Efficient Sparse Learning

    Authors: Aurélien Bellet, Yingyu Liang, Alireza Bagheri Garakani, Maria-Florina Balcan, Fei Sha

    Abstract: Learning sparse combinations is a frequent theme in machine learning. In this paper, we study its associated optimization problem in the distributed setting where the elements to be combined are not centrally located but spread over a network. We address the key challenges of balancing communication costs and optimization errors. To this end, we propose a distributed Frank-Wolfe (dFW) algorithm. W…

    Submitted 12 January, 2015; v1 submitted 9 April, 2014; originally announced April 2014.

    Comments: Extended version of the SIAM Data Mining 2015 paper