Zum Hauptinhalt springen

Showing 1–28 of 28 results for author: Nadler, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.12831  [pdf, other

    cs.CL cs.AI

    Truth is Universal: Robust Detection of Lies in LLMs

    Authors: Lennart Bürger, Fred A. Hamprecht, Boaz Nadler

    Abstract: Large Language Models (LLMs) have revolutionised natural language processing, exhibiting impressive human-like capabilities. In particular, LLMs are capable of "lying", knowingly outputting false statements. Hence, it is of interest and importance to develop methods to detect when LLMs lie. Indeed, several authors trained classifiers to detect LLM lies based on their internal model activations. Ho… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 10 pages, 30 figures

  2. arXiv:2304.13940  [pdf, other

    stat.ML cs.LG

    A Majorization-Minimization Gauss-Newton Method for 1-Bit Matrix Completion

    Authors: Xiaoqian Liu, Xu Han, Eric C. Chi, Boaz Nadler

    Abstract: In 1-bit matrix completion, the aim is to estimate an underlying low-rank matrix from a partial set of binary observations. We propose a novel method for 1-bit matrix completion called MMGN. Our method is based on the majorization-minimization (MM) principle, which converts the original optimization problem into a sequence of standard low-rank matrix completion problems. We solve each of these sub… ▽ More

    Submitted 22 April, 2024; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: 28 pages, 7 figures

  3. arXiv:2301.12559  [pdf, other

    stat.ML cs.LG math.OC stat.ME

    Imbalanced Mixed Linear Regression

    Authors: Pini Zilber, Boaz Nadler

    Abstract: We consider the problem of mixed linear regression (MLR), where each observed sample belongs to one of $K$ unknown linear models. In practical applications, the proportions of the $K$ components are often imbalanced. Unfortunately, most MLR methods do not perform well in such settings. Motivated by this practical challenge, in this work we propose Mix-IRLS, a novel, simple and fast algorithm for M… ▽ More

    Submitted 29 January, 2023; originally announced January 2023.

  4. arXiv:2301.04022  [pdf, other

    cs.LG math.ST

    Distributed Sparse Linear Regression under Communication Constraints

    Authors: Rodney Fonseca, Boaz Nadler

    Abstract: In multiple domains, statistical tasks are performed in distributed settings, with data split among several end machines that are connected to a fusion center. In various applications, the end machines have limited bandwidth and power, and thus a tight communication budget. In this work we focus on distributed learning of a sparse linear regression model, under severe communication constraints. We… ▽ More

    Submitted 9 January, 2023; originally announced January 2023.

    Comments: 33 pages, 4 figures

    MSC Class: 62J07; 62J05; 68W15

  5. arXiv:2209.07230  [pdf, ps, other

    stat.ML cs.LG

    Recovery Guarantees for Distributed-OMP

    Authors: Chen Amiraz, Robert Krauthgamer, Boaz Nadler

    Abstract: We study distributed schemes for high-dimensional sparse linear regression, based on orthogonal matching pursuit (OMP). Such schemes are particularly suited for settings where a central fusion center is connected to end machines, that have both computation and communication limitations. We prove that under suitable assumptions, distributed-OMP schemes recover the support of the regression vector w… ▽ More

    Submitted 31 October, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: 47 pages, 4 figures

  6. arXiv:2201.13052  [pdf, other

    cs.LG math.OC

    Inductive Matrix Completion: No Bad Local Minima and a Fast Algorithm

    Authors: Pini Zilber, Boaz Nadler

    Abstract: The inductive matrix completion (IMC) problem is to recover a low rank matrix from few observed entries while incorporating prior knowledge about its row and column subspaces. In this work, we make three contributions to the IMC problem: (i) we prove that under suitable conditions, the IMC optimization landscape has no bad local minima; (ii) we derive a simple scheme with theoretical guarantees to… ▽ More

    Submitted 31 January, 2022; originally announced January 2022.

  7. arXiv:2106.12933  [pdf, other

    math.OC cs.LG math.NA

    GNMR: A provable one-line algorithm for low rank matrix recovery

    Authors: Pini Zilber, Boaz Nadler

    Abstract: Low rank matrix recovery problems, including matrix completion and matrix sensing, appear in a broad range of applications. In this work we present GNMR -- an extremely simple iterative algorithm for low rank matrix recovery, based on a Gauss-Newton linearization. On the theoretical front, we derive recovery guarantees for GNMR in both the matrix sensing and matrix completion settings. Some of the… ▽ More

    Submitted 27 April, 2022; v1 submitted 24 June, 2021; originally announced June 2021.

    MSC Class: 15A83 (Primary) 49M15; 65F55 (Secondary)

  8. arXiv:2102.13276  [pdf, other

    stat.ML cs.LG q-bio.PE

    Spectral Top-Down Recovery of Latent Tree Models

    Authors: Yariv Aizenbud, Ariel Jaffe, Meng Wang, Amber Hu, Noah Amsel, Boaz Nadler, Joseph T. Chang, Yuval Kluger

    Abstract: Modeling the distribution of high dimensional data by a latent tree graphical model is a prevalent approach in multiple scientific domains. A common task is to infer the underlying tree structure, given only observations of its terminal nodes. Many algorithms for tree recovery are computationally intensive, which limits their applicability to trees of moderate size. For large trees, a common appro… ▽ More

    Submitted 7 December, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

  9. arXiv:2102.03060  [pdf, ps, other

    stat.ML cs.IT cs.LG

    Distributed Sparse Normal Means Estimation with Sublinear Communication

    Authors: Chen Amiraz, Robert Krauthgamer, Boaz Nadler

    Abstract: We consider the problem of sparse normal means estimation in a distributed setting with communication constraints. We assume there are $M$ machines, each holding $d$-dimensional observations of a $K$-sparse vector $μ$ corrupted by additive Gaussian noise. The $M$ machines are connected in a star topology to a fusion center, whose goal is to estimate the vector $μ$ with a low communication budget.… ▽ More

    Submitted 14 February, 2022; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: 36 pages, 2 figures

  10. arXiv:2101.00575  [pdf, other

    cs.LG math.ST

    Improved Convergence Guarantees for Learning Gaussian Mixture Models by EM and Gradient EM

    Authors: Nimrod Segol, Boaz Nadler

    Abstract: We consider the problem of estimating the parameters a Gaussian Mixture Model with K components of known weights, all with an identity covariance matrix. We make two contributions. First, at the population level, we present a sharper analysis of the local convergence of EM and gradient EM, compared to previous works. Assuming a separation of $Ω(\sqrt{\log K})$, we prove convergence of both methods… ▽ More

    Submitted 23 September, 2021; v1 submitted 3 January, 2021; originally announced January 2021.

  11. arXiv:2005.09021  [pdf, other

    cs.LG math.OC stat.ML

    The Trimmed Lasso: Sparse Recovery Guarantees and Practical Optimization by the Generalized Soft-Min Penalty

    Authors: Tal Amir, Ronen Basri, Boaz Nadler

    Abstract: We present a new approach to solve the sparse approximation or best subset selection problem, namely find a $k$-sparse vector ${\bf x}\in\mathbb{R}^d$ that minimizes the $\ell_2$ residual $\lVert A{\bf x}-{\bf y} \rVert_2$. We consider a regularized approach, whereby this residual is penalized by the non-convex $\textit{trimmed lasso}$, defined as the $\ell_1$-norm of ${\bf x}$ excluding its $k$ l… ▽ More

    Submitted 17 June, 2021; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: 49 pages; 7 figures; To appear in SIAM Journal on Mathematics of Data Science (SIMODS)

    MSC Class: 62J05; 90C26

  12. arXiv:2002.12547  [pdf, ps, other

    stat.ML cs.LG

    Spectral neighbor joining for reconstruction of latent tree models

    Authors: Ariel Jaffe, Noah Amsel, Yariv Aizenbud, Boaz Nadler, Joseph T. Chang, Yuval Kluger

    Abstract: A common assumption in multiple scientific applications is that the distribution of observed data can be modeled by a latent tree graphical model. An important example is phylogenetics, where the tree models the evolutionary lineages of a set of observed organisms. Given a set of independent realizations of the random variables at the leaves of the tree, a key challenge is to infer the underlying… ▽ More

    Submitted 22 September, 2020; v1 submitted 28 February, 2020; originally announced February 2020.

  13. arXiv:2002.01849  [pdf, other

    math.OC cs.LG

    Rank $2r$ iterative least squares: efficient recovery of ill-conditioned low rank matrices from few entries

    Authors: Jonathan Bauch, Boaz Nadler, Pini Zilber

    Abstract: We present a new, simple and computationally efficient iterative method for low rank matrix completion. Our method is inspired by the class of factorization-type iterative algorithms, but substantially differs from them in the way the problem is cast. Precisely, given a target rank $r$, instead of optimizing on the manifold of rank $r$ matrices, we allow our interim estimated matrix to have a spec… ▽ More

    Submitted 28 October, 2020; v1 submitted 5 February, 2020; originally announced February 2020.

    Journal ref: SIAM Journal on Mathematics of Data Science, 3(1), 439-465 (2021)

  14. arXiv:1806.01993  [pdf, other

    stat.ML cs.LG

    Beyond Trees: Classification with Sparse Pairwise Dependencies

    Authors: Yaniv Tenzer, Amit Moscovich, Mary Frances Dorn, Boaz Nadler, Clifford Spiegelman

    Abstract: Several classification methods assume that the underlying distributions follow tree-structured graphical models. Indeed, trees capture statistical dependencies between pairs of variables, which may be crucial to attain low classification errors. The resulting classifier is linear in the log-transformed univariate and bivariate densities that correspond to the tree edges. In practice, however, obse… ▽ More

    Submitted 16 April, 2020; v1 submitted 5 June, 2018; originally announced June 2018.

    Comments: 32 pages, 12 figures, 3 tables. Major revision with new feature-selection step and more extensive simulations

    MSC Class: 62H30

    Journal ref: Journal of Machine Learning Research 21:189 (2020) 1-33

  15. arXiv:1801.01587  [pdf, other

    stat.ML cs.LG

    SpectralNet: Spectral Clustering using Deep Neural Networks

    Authors: Uri Shaham, Kelly Stanton, Henry Li, Boaz Nadler, Ronen Basri, Yuval Kluger

    Abstract: Spectral clustering is a leading and popular technique in unsupervised data analysis. Two of its major limitations are scalability and generalization of the spectral embedding (i.e., out-of-sample-extension). In this paper we introduce a deep learning approach to spectral clustering that overcomes the above shortcomings. Our network, which we call SpectralNet, learns a map that embeds input data p… ▽ More

    Submitted 4 April, 2018; v1 submitted 4 January, 2018; originally announced January 2018.

    Comments: Added citations. Accepted to ICLR 2018

  16. On Detection of Faint Edges in Noisy Images

    Authors: Nati Ofir, Meirav Galun, Sharon Alpert, Achi Brandt, Boaz Nadler, Ronen Basri

    Abstract: A fundamental question for edge detection in noisy images is how faint can an edge be and still be detected. In this paper we offer a formalism to study this question and subsequently introduce computationally efficient multiscale edge detection algorithms designed to detect faint edges in noisy images. In our formalism we view edge detection as a search in a discrete, though potentially large, se… ▽ More

    Submitted 22 June, 2017; originally announced June 2017.

  17. arXiv:1703.02965  [pdf, ps, other

    stat.ML cs.LG

    Unsupervised Ensemble Regression

    Authors: Omer Dror, Boaz Nadler, Erhan Bilal, Yuval Kluger

    Abstract: Consider a regression problem where there is no labeled data and the only observations are the predictions $f_i(x_j)$ of $m$ experts $f_{i}$ over many samples $x_j$. With no knowledge on the accuracy of the experts, is it still possible to accurately estimate the unknown responses $y_{j}$? Can one still detect the least or most accurate experts? In this work we propose a framework to study these q… ▽ More

    Submitted 8 March, 2017; originally announced March 2017.

  18. arXiv:1611.02221  [pdf, other

    stat.ML cs.LG

    Minimax-optimal semi-supervised regression on unknown manifolds

    Authors: Amit Moscovich, Ariel Jaffe, Boaz Nadler

    Abstract: We consider semi-supervised regression when the predictor variables are drawn from an unknown manifold. A simple two step approach to this problem is to: (i) estimate the manifold geodesic distance between any pair of points using both the labeled and unlabeled instances; and (ii) apply a k nearest neighbor regressor based on these distance estimates. We prove that given sufficiently many unlabele… ▽ More

    Submitted 6 March, 2017; v1 submitted 7 November, 2016; originally announced November 2016.

    MSC Class: 62G08

    Journal ref: Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, PMLR 54 (2017) 933-942

  19. arXiv:1602.02285  [pdf, other

    stat.ML cs.LG

    A Deep Learning Approach to Unsupervised Ensemble Learning

    Authors: Uri Shaham, Xiuyuan Cheng, Omer Dror, Ariel Jaffe, Boaz Nadler, Joseph Chang, Yuval Kluger

    Abstract: We show how deep learning methods can be applied in the context of crowdsourcing and unsupervised ensemble learning. First, we prove that the popular model of Dawid and Skene, which assumes that all classifiers are conditionally independent, is {\em equivalent} to a Restricted Boltzmann Machine (RBM) with a single hidden node. Hence, under this model, the posterior probabilities of the true labels… ▽ More

    Submitted 6 February, 2016; originally announced February 2016.

    Report number: PMLR 48:30-39

  20. arXiv:1510.05830  [pdf, ps, other

    cs.LG stat.ML

    Unsupervised Ensemble Learning with Dependent Classifiers

    Authors: Ariel Jaffe, Ethan Fetaya, Boaz Nadler, Tingting Jiang, Yuval Kluger

    Abstract: In unsupervised ensemble learning, one obtains predictions from multiple sources or classifiers, yet without knowing the reliability and expertise of each source, and with no labeled data to assess it. The task is to combine these possibly conflicting predictions into an accurate meta-learner. Most works to date assumed perfect diversity between the different sources, a property known as condition… ▽ More

    Submitted 23 February, 2016; v1 submitted 20 October, 2015; originally announced October 2015.

  21. Fast Detection of Curved Edges at Low SNR

    Authors: Nati Ofir, Meirav Galun, Boaz Nadler, Ronen Basri

    Abstract: Detecting edges is a fundamental problem in computer vision with many applications, some involving very noisy images. While most edge detection methods are fast, they perform well only on relatively clean images. Indeed, edges in such images can be reliably detected using only local filters. Detecting faint edges under high levels of noise cannot be done locally at the individual pixel level, and… ▽ More

    Submitted 25 May, 2015; originally announced May 2015.

    Comments: 9 pages, 11 figures

  22. arXiv:1505.03001  [pdf, ps, other

    stat.CO cs.LG stat.ML

    Detecting the large entries of a sparse covariance matrix in sub-quadratic time

    Authors: Ofer Shwartz, Boaz Nadler

    Abstract: The covariance matrix of a $p$-dimensional random variable is a fundamental quantity in data analysis. Given $n$ i.i.d. observations, it is typically estimated by the sample covariance matrix, at a computational cost of $O(np^{2})$ operations. When $n,p$ are large, this computation may be prohibitively slow. Moreover, in several contemporary applications, the population matrix is approximately spa… ▽ More

    Submitted 20 December, 2015; v1 submitted 12 May, 2015; originally announced May 2015.

  23. arXiv:1502.02158  [pdf, other

    cs.LG

    Learning Parametric-Output HMMs with Two Aliased States

    Authors: Roi Weiss, Boaz Nadler

    Abstract: In various applications involving hidden Markov models (HMMs), some of the hidden states are aliased, having identical output distributions. The minimality, identifiability and learnability of such aliased HMMs have been long standing problems, with only partial solutions provided thus far. In this paper we focus on parametric-output HMMs, whose output distributions come from a parametric family,… ▽ More

    Submitted 7 February, 2015; originally announced February 2015.

  24. arXiv:1411.4226  [pdf, ps, other

    math.ST cs.IT

    Roy's largest root under rank-one alternatives:The complex valued case and applications

    Authors: Prathapasinghe Dharmawansa, Boaz Nadler, Ofer Shwartz

    Abstract: The largest eigenvalue of a Wishart matrix, known as Roy's largest root (RLR), plays an important role in a variety of applications. Most works to date derived approximations to its distribution under various asymptotic regimes, such as degrees of freedom, dimension, or both tending to infinity. However, several applications involve finite and relative small parameters, for which the above approxi… ▽ More

    Submitted 16 November, 2014; originally announced November 2014.

    MSC Class: 15B52; 62H10; 62H15; 94A13; 94A14

  25. arXiv:1407.7644  [pdf, ps, other

    stat.ML cs.LG

    Estimating the Accuracies of Multiple Classifiers Without Labeled Data

    Authors: Ariel Jaffe, Boaz Nadler, Yuval Kluger

    Abstract: In various situations one is given only the predictions of multiple classifiers over a large unlabeled test data. This scenario raises the following questions: Without any labeled data and without any a-priori knowledge about the reliability of these different classifiers, is it possible to consistently and computationally efficiently estimate their accuracies? Furthermore, also in a completely un… ▽ More

    Submitted 30 October, 2014; v1 submitted 29 July, 2014; originally announced July 2014.

  26. Ranking and combining multiple predictors without labeled data

    Authors: Fabio Parisi, Francesco Strino, Boaz Nadler, Yuval Kluger

    Abstract: In a broad range of classification and decision making problems, one is given the advice or predictions of several classifiers, of unknown reliability, over multiple questions or queries. This scenario is different from the standard supervised setting, where each classifier accuracy can be assessed using available labeled data, and raises two questions: given only the predictions of several classi… ▽ More

    Submitted 24 November, 2013; v1 submitted 13 March, 2013; originally announced March 2013.

    Comments: Supplementary Information is included at the end of the manuscript. This is a revision of our original submission of the manuscript entitled "The student's dilemma: ranking and improving prediction at test time without access to training data", which is now entitled "Ranking and combining multiple predictors without labeled data"

    Journal ref: Proc. Natl. Acad. Sci. U.S.A. 111 (2014) 1253-1258

  27. arXiv:1302.6009  [pdf, ps, other

    cs.LG math.ST stat.ML

    On learning parametric-output HMMs

    Authors: Aryeh Kontorovich, Boaz Nadler, Roi Weiss

    Abstract: We present a novel approach for learning an HMM whose outputs are distributed according to a parametric family. This is done by {\em decoupling} the learning task into two steps: first estimating the output parameters, and then estimating the hidden states transition probabilities. The first step is accomplished by fitting a mixture model to the output stationary distribution. Given the parameters… ▽ More

    Submitted 25 February, 2013; originally announced February 2013.

  28. arXiv:1210.4909  [pdf

    cs.LG stat.ML

    Active Learning with Distributional Estimates

    Authors: Jens Roeder, Boaz Nadler, Kevin Kunzmann, Fred A. Hamprecht

    Abstract: Active Learning (AL) is increasingly important in a broad range of applications. Two main AL principles to obtain accurate classification with few labeled data are refinement of the current decision boundary and exploration of poorly sampled regions. In this paper we derive a novel AL scheme that balances these two principles in a natural way. In contrast to many AL strategies, which are based on… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-715-725