Showing 1–8 of 8 results for author: Friedman, H

Searching in archive cs.
  1. arXiv:2403.13141  [pdf, other]

    stat.ML cs.LG

    Function Trees: Transparent Machine Learning

    Authors: Jerome H. Friedman

    Abstract: The output of a machine learning algorithm can usually be represented by one or more multivariate functions of its input variables. Knowing the global properties of such functions can help in understanding the system that produced the data as well as interpreting and explaining corresponding model predictions. A method is presented for representing a general multivariate function as a tree of simp…

    Submitted 19 March, 2024; originally announced March 2024.

  2. arXiv:2207.05112  [pdf, other]

    cs.LG

    An Interpretable Joint Nonnegative Matrix Factorization-Based Point Cloud Distance Measure

    Authors: Hannah Friedman, Amani R. Maina-Kilaas, Julianna Schalkwyk, Hina Ahmed, Jamie Haddock

    Abstract: In this paper, we propose a new method for determining shared features of and measuring the distance between data sets or point clouds. Our approach uses the joint factorization of two data matrices $X_1,X_2$ into non-negative matrices $X_1 = AS_1, X_2 = AS_2$ to derive a similarity measure that determines how well the shared basis $A$ approximates $X_1, X_2$. We also propose a point cloud distanc…

    Submitted 27 November, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

  3. arXiv:2107.07160  [pdf, other]

    cs.LG stat.ML

    Lockout: Sparse Regularization of Neural Networks

    Authors: Gilmer Valdes, Wilmer Arbelo, Yannet Interian, Jerome H. Friedman

    Abstract: Many regression and classification procedures fit a parameterized function $f(x;w)$ of predictor variables $x$ to data $\{x_{i},y_{i}\}_1^N$ based on some loss criterion $L(y,f)$. Often, regularization is applied to improve accuracy by placing a constraint $P(w)\leq t$ on the values of the parameters $w$. Although efficient methods exist for finding solutions to these constrained optimization prob…

    Submitted 15 July, 2021; originally announced July 2021.

  4. arXiv:2001.10102  [pdf, ps, other]

    stat.ML cs.LG

    Predicting Regression Probability Distributions with Imperfect Data Through Optimal Transformations

    Authors: Jerome H. Friedman

    Abstract: The goal of regression analysis is to predict the value of a numeric outcome variable y given a vector of joint values of other (predictor) variables x. Usually a particular x-vector does not specify a repeatable value for y, but rather a probability distribution of possible y-values, p(y|x). This distribution has a location, scale and shape, all of which can depend on x, and are needed to infer…

    Submitted 27 January, 2020; originally announced January 2020.

    Comments: 33 pages, 19 figures

  5. Contrast Trees and Distribution Boosting

    Authors: Jerome H. Friedman

    Abstract: Often machine learning methods are applied and results reported in cases where there is little to no information concerning accuracy of the output. Simply because a computer program returns a result does not insure its validity. If decisions are to be made based on such results it is important to have some notion of their veracity. Contrast trees represent a new approach for assessing the accuracy…

    Submitted 8 December, 2019; originally announced December 2019.

    Comments: 18 pages, 20 figures

  6. Expert-Augmented Machine Learning

    Authors: E. D. Gennatas, J. H. Friedman, L. H. Ungar, R. Pirracchio, E. Eaton, L. Reichman, Y. Interian, C. B. Simone, A. Auerbach, E. Delgado, M. J. Van der Laan, T. D. Solberg, G. Valdes

    Abstract: Machine Learning is proving invaluable across disciplines. However, its success is often limited by the quality and quantity of available data, while its adoption by the level of trust that models afford users. Human vs. machine performance is commonly compared empirically to decide whether a certain task should be performed by a computer or an expert. In reality, the optimal learning strategy may…

    Submitted 5 January, 2021; v1 submitted 22 March, 2019; originally announced March 2019.

  7. arXiv:0805.1386  [pdf, ps, other]

    cs.LO

    A language for mathematical knowledge management

    Authors: Steven Kieffer, Jeremy Avigad, Harvey Friedman

    Abstract: We argue that the language of Zermelo Fraenkel set theory with definitions and partial functions provides the most promising bedrock semantics for communicating and sharing mathematical knowledge. We then describe a syntactic sugaring of that language that provides a way of writing remarkably readable assertions without straying far from the set-theoretic semantics. We illustrate with some example…

    Submitted 3 January, 2011; v1 submitted 9 May, 2008; originally announced May 2008.

    ACM Class: F.4.1; I.2.4

    Journal ref: Studies in Logic, Grammar and Rhetoric, 18:51-66, 2009

  8. Combining decision procedures for the reals

    Authors: Jeremy Avigad, Harvey Friedman

    Abstract: We address the general problem of determining the validity of boolean combinations of equalities and inequalities between real-valued expressions. In particular, we consider methods of establishing such assertions using only restricted forms of distributivity. At the same time, we explore ways in which "local" decision or heuristic procedures for fragments of the theory of the reals can be am…

    Submitted 18 October, 2006; v1 submitted 31 January, 2006; originally announced January 2006.

    Comments: Will appear in Logical Methods in Computer Science

    ACM Class: F.4.1; I.2.3

    Journal ref: Logical Methods in Computer Science, Volume 2, Issue 4 (October 18, 2006) lmcs:2240