
Showing 1–11 of 11 results for author: Fard, M M

  1. arXiv:2311.15402  [pdf, other]

    cs.CL

    Learning Section Weights for Multi-Label Document Classification

    Authors: Maziar Moradi Fard, Paula Sorrolla Bayod, Kiomars Motarjem, Mohammad Alian Nejadi, Saber Akhondi, Camilo Thorne

    Abstract: Multi-label document classification is a traditional task in NLP. Compared to single-label classification, each document can be assigned multiple classes. This problem is crucially important in various domains, such as tagging scientific articles. Documents are often structured into several sections such as abstract and title. Current approaches treat different sections equally for multi-label cla…

    Submitted 26 November, 2023; originally announced November 2023.

    Comments: 7 pages, 4 figures, 5 tables

  2. arXiv:2202.01940  [pdf, other]

    stat.ML cs.LG

    Distribution Embedding Networks for Generalization from a Diverse Set of Classification Tasks

    Authors: Lang Liu, Mahdi Milani Fard, Sen Zhao

    Abstract: We propose Distribution Embedding Networks (DEN) for classification with small data. In the same spirit of meta-learning, DEN learns from a diverse set of training tasks with the goal to generalize to unseen target tasks. Unlike existing approaches which require the inputs of training and target tasks to have the same dimension with possibly similar distributions, DEN allows training and target ta…

    Submitted 31 December, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    Comments: This paper is accepted at TMLR https://openreview.net/forum?id=F2rG2CXsgO

  3. arXiv:2102.09492  [pdf, other]

    cs.LG stat.ML

    Optimizing Black-box Metrics with Iterative Example Weighting

    Authors: Gaurush Hiranandani, Jatin Mathur, Harikrishna Narasimhan, Mahdi Milani Fard, Oluwasanmi Koyejo

    Abstract: We consider learning to optimize a classification metric defined by a black-box function of the confusion matrix. Such black-box learning settings are ubiquitous, for example, when the learner only has query access to the metric of interest, or in noisy-label and domain adaptation applications where the learner must evaluate the metric via performance evaluation using a small validation sample. Ou…

    Submitted 23 June, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: To appear at ICML 2021. This version includes the camera-ready edits. 42 pages, 2 figures, and 7 tables

  4. arXiv:2002.08605  [pdf, other]

    cs.LG cs.AI stat.ML

    Optimizing Black-box Metrics with Adaptive Surrogates

    Authors: Qijia Jiang, Olaoluwa Adigun, Harikrishna Narasimhan, Mahdi Milani Fard, Maya Gupta

    Abstract: We address the problem of training models with black-box and hard-to-optimize metrics by expressing the metric as a monotonic function of a small number of easy-to-optimize surrogates. We pose the training problem as an optimization over a relaxed surrogate space, which we solve by estimating local gradients for the metric and performing inexact convex projections. We analyze gradient estimates ba…

    Submitted 20 February, 2020; originally announced February 2020.

  5. arXiv:1806.11212  [pdf, other]

    cs.LG stat.ML

    Proxy Fairness

    Authors: Maya Gupta, Andrew Cotter, Mahdi Milani Fard, Serena Wang

    Abstract: We consider the problem of improving fairness when one lacks access to a dataset labeled with protected groups, making it difficult to take advantage of strategies that can improve fairness but require protected group labels, either at training or runtime. To address this, we investigate improving fairness metrics for proxy groups, and test whether doing so results in improved fairness for the tru…

    Submitted 28 June, 2018; originally announced June 2018.

  6. arXiv:1806.10069  [pdf, other]

    cs.LG stat.ML

    Deep $k$-Means: Jointly clustering with $k$-Means and learning representations

    Authors: Maziar Moradi Fard, Thibaut Thonet, Eric Gaussier

    Abstract: We study in this paper the problem of jointly clustering and learning representations. As several previous studies have shown, learning representations that are both faithful to the data to be clustered and adapted to the clustering algorithm can lead to better clustering performance, all the more so that the two tasks are performed jointly. We propose here such an approach for $k$-Means clusterin…

    Submitted 12 December, 2018; v1 submitted 26 June, 2018; originally announced June 2018.

    Comments: Under consideration at Pattern Recognition Letters

  7. arXiv:1805.10582  [pdf, other]

    stat.ML cs.AI cs.LG

    Metric-Optimized Example Weights

    Authors: Sen Zhao, Mahdi Milani Fard, Harikrishna Narasimhan, Maya Gupta

    Abstract: Real-world machine learning applications often have complex test metrics, and may have training and test data that are not identically distributed. Motivated by known connections between complex test metrics and cost-weighted learning, we propose addressing these issues by using a weighted loss function with a standard loss, where the weights on the training examples are learned to optimize the te…

    Submitted 15 June, 2019; v1 submitted 27 May, 2018; originally announced May 2018.

    Comments: Proceedings of the 36th International Conference on Machine Learning (ICML'19)

  8. arXiv:1401.3871  [pdf]

    cs.AI cs.LG

    Non-Deterministic Policies in Markovian Decision Processes

    Authors: Mahdi Milani Fard, Joelle Pineau

    Abstract: Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making problems in such environments. In recent years, attempts were made to apply methods from reinforcement learning to construct decision support systems for action selection in Markovian environments. Although conventional methods…

    Submitted 16 January, 2014; originally announced January 2014.

    Journal ref: Journal of Artificial Intelligence Research, Volume 40, pages 1-24, 2011

  9. arXiv:1312.0286  [pdf, other]

    cs.LG stat.ML

    Efficient Learning and Planning with Compressed Predictive States

    Authors: William L. Hamilton, Mahdi Milani Fard, Joelle Pineau

    Abstract: Predictive state representations (PSRs) offer an expressive framework for modelling partially observable systems. By compactly representing systems as functions of observable quantities, the PSR learning approach avoids using local-minima prone expectation-maximization and instead employs a globally optimal moment-based algorithm. Moreover, since PSRs do not require a predetermined latent state st…

    Submitted 20 July, 2014; v1 submitted 1 December, 2013; originally announced December 2013.

    Comments: 45 pages, 10 figures, submitted to the Journal of Machine Learning Research

  10. arXiv:1207.5554  [pdf, other]

    cs.LG stat.ML

    Bellman Error Based Feature Generation using Random Projections on Sparse Spaces

    Authors: Mahdi Milani Fard, Yuri Grinberg, Amir-massoud Farahmand, Joelle Pineau, Doina Precup

    Abstract: We address the problem of automatic generation of features for value function approximation. Bellman Error Basis Functions (BEBFs) have been shown to improve the error of policy evaluation with function approximation, with a convergence rate similar to that of value iteration. We propose a simple, fast and robust algorithm based on random projections to generate BEBFs for sparse feature spaces. We…

    Submitted 21 September, 2012; v1 submitted 23 July, 2012; originally announced July 2012.

  11. arXiv:1202.3717  [pdf]

    cs.LG stat.ML

    PAC-Bayesian Policy Evaluation for Reinforcement Learning

    Authors: Mahdi Milani Fard, Joelle Pineau, Csaba Szepesvari

    Abstract: Bayesian priors offer a compact yet general means of incorporating domain knowledge into many learning tasks. The correctness of the Bayesian analysis and inference, however, largely depends on accuracy and correctness of these priors. PAC-Bayesian methods overcome this problem by providing bounds that hold regardless of the correctness of the prior distribution. This paper introduces the first PA…

    Submitted 14 February, 2012; originally announced February 2012.

    Report number: UAI-P-2011-PG-195-202