Showing 1–4 of 4 results for author: Khandagale, S

  1. arXiv:2305.02997  [pdf, other]

    cs.LG cs.AI stat.ML

    When Do Neural Nets Outperform Boosted Trees on Tabular Data?

    Authors: Duncan McElfresh, Sujay Khandagale, Jonathan Valverde, Vishak Prasad C, Benjamin Feuer, Chinmay Hegde, Ganesh Ramakrishnan, Micah Goldblum, Colin White

    Abstract: Tabular data is one of the most commonly used types of data in machine learning. Despite recent advances in neural nets (NNs) for tabular data, there is still an active discussion on whether or not NNs generally outperform gradient-boosted decision trees (GBDTs) on tabular data, with several recent works arguing either that GBDTs consistently outperform NNs on tabular data, or vice versa. In this…

    Submitted 15 July, 2024; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: NeurIPS Datasets and Benchmarks Track 2023

  2. arXiv:2206.11886  [pdf, other]

    cs.IR cs.AI cs.LG

    On the Generalizability and Predictability of Recommender Systems

    Authors: Duncan McElfresh, Sujay Khandagale, Jonathan Valverde, John P. Dickerson, Colin White

    Abstract: While other areas of machine learning have seen more and more automation, designing a high-performing recommender system still requires a high level of human effort. Furthermore, recent work has shown that modern recommender system algorithms do not always improve over well-tuned baselines. A natural follow-up question is, "how do we choose the right algorithm for a new dataset and performance met…

    Submitted 6 October, 2022; v1 submitted 23 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022

  3. arXiv:2106.12543  [pdf, other]

    cs.LG cs.AI stat.ML

    Synthetic Benchmarks for Scientific Research in Explainable Machine Learning

    Authors: Yang Liu, Sujay Khandagale, Colin White, Willie Neiswanger

    Abstract: As machine learning models grow more complex and their applications become more high-stakes, tools for explaining model predictions have become increasingly important. This has spurred a flurry of research in model explainability and has given rise to feature attribution methods such as LIME and SHAP. Despite their widespread use, evaluating and comparing different feature attribution methods rema…

    Submitted 4 November, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

    Comments: NeurIPS Datasets and Benchmarks Track 2021

  4. arXiv:1904.08249  [pdf, other]

    cs.LG stat.ML

    Bonsai -- Diverse and Shallow Trees for Extreme Multi-label Classification

    Authors: Sujay Khandagale, Han Xiao, Rohit Babbar

    Abstract: Extreme multi-label classification (XMC) refers to supervised multi-label learning involving hundreds of thousands or even millions of labels. In this paper, we develop a suite of algorithms, called Bonsai, which generalizes the notion of label representation in XMC, and partitions the labels in the representation space to learn shallow trees. We show three concrete realizations of this label repre…

    Submitted 10 August, 2019; v1 submitted 17 April, 2019; originally announced April 2019.