Showing 1–17 of 17 results for author: Shamir, G

  1. arXiv:2405.19107  [pdf, ps, other]

    cs.LG cs.AI

    Offline Regularised Reinforcement Learning for Large Language Models Alignment

    Authors: Pierre Harvey Richemond, Yunhao Tang, Daniel Guo, Daniele Calandriello, Mohammad Gheshlaghi Azar, Rafael Rafailov, Bernardo Avila Pires, Eugene Tarassov, Lucas Spangher, Will Ellsworth, Aliaksei Severyn, Jonathan Mallinson, Lior Shani, Gil Shamir, Rishabh Joshi, Tianqi Liu, Remi Munos, Bilal Piot

    Abstract: The dominant framework for alignment of large language models (LLM), whether through reinforcement learning from human feedback or direct preference optimisation, is to learn from preference data. This involves building datasets where each element is a quadruplet composed of a prompt, two independent responses (completions of the prompt) and a human preference between the two independent responses…

    Submitted 29 May, 2024; originally announced May 2024.

  2. arXiv:2306.08650  [pdf, other]

    cs.IR cs.LG

    Learning to Rank when Grades Matter

    Authors: Le Yan, Zhen Qin, Gil Shamir, Dong Lin, Xuanhui Wang, Mike Bendersky

    Abstract: Graded labels are ubiquitous in real-world learning-to-rank applications, especially in human-rated relevance data. Traditional learning-to-rank techniques aim to optimize the ranked order of documents. They typically, however, ignore predicting actual grades. This prevents them from being adopted in applications where grades matter, such as filtering out "poor" documents. Achieving both good ra…

    Submitted 20 June, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

  3. arXiv:2209.05310  [pdf, other]

    cs.IR cs.LG

    On the Factory Floor: ML Engineering for Industrial-Scale Ads Recommendation Models

    Authors: Rohan Anil, Sandra Gadanho, Da Huang, Nijith Jacob, Zhuoshu Li, Dong Lin, Todd Phillips, Cristina Pop, Kevin Regan, Gil I. Shamir, Rakesh Shivanna, Qiqi Yan

    Abstract: For industrial-scale advertising systems, prediction of ad click-through rate (CTR) is a central problem. Ad clicks constitute a significant class of user engagements and are often used as the primary signal for the usefulness of ads to users. Additionally, in cost-per-click advertising systems where advertisers are charged per click, click rate expectations feed directly into value estimation. Ac…

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: ORSUM - ACM RecSys, September 23, 2022, Seattle, WA

  4. arXiv:2202.06499  [pdf, other]

    cs.LG cs.IR

    Real World Large Scale Recommendation Systems Reproducibility and Smooth Activations

    Authors: Gil I. Shamir, Dong Lin

    Abstract: Real-world recommendation systems influence a constantly growing set of domains. With the deep networks that now drive such systems, recommendations have become more relevant to the user's interests and tasks. However, they may not always be reproducible even if produced by the same system for the same user, recommendation sequence, request, or query. This problem received almost no attention in academ…

    Submitted 14 February, 2022; originally announced February 2022.

  5. arXiv:2202.04598  [pdf, ps, other]

    math.OC cs.LG stat.ML

    Reproducibility in Optimization: Theoretical Framework and Limits

    Authors: Kwangjun Ahn, Prateek Jain, Ziwei Ji, Satyen Kale, Praneeth Netrapalli, Gil I. Shamir

    Abstract: We initiate a formal study of reproducibility in optimization. We define a quantitative measure of reproducibility of optimization procedures in the face of noisy or error-prone operations such as inexact or stochastic gradient computations or inexact initialization. We then analyze several convex optimization settings of interest such as smooth, non-smooth, and strongly-convex objective functions…

    Submitted 4 December, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: 45 Pages; Accepted to NeurIPS 2022

  6. arXiv:2110.06435  [pdf, other]

    cs.LG

    Dropout Prediction Uncertainty Estimation Using Neuron Activation Strength

    Authors: Haichao Yu, Zhe Chen, Dong Lin, Gil Shamir, Jie Han

    Abstract: Dropout has been commonly used to quantify prediction uncertainty, i.e., the variations of model predictions on a given input example. However, using dropout in practice can be expensive as it requires running dropout inferences many times. In this paper, we study how to estimate dropout prediction uncertainty in a resource-efficient manner. We demonstrate that we can use neuron activation streng…

    Submitted 16 June, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: 8 pages

  7. arXiv:2102.10696  [pdf, other]

    cs.LG

    Synthesizing Irreproducibility in Deep Networks

    Authors: Robert R. Snapp, Gil I. Shamir

    Abstract: The success and superior performance of deep networks is spreading their popularity and use to an increasing number of applications. Very recent works, however, demonstrate that modern day deep networks suffer from irreproducibility (also referred to as nondeterminism or underspecification). Two or more models that are identical in architecture, structure, training hyper-parameters, and parameters…

    Submitted 21 February, 2021; originally announced February 2021.

  8. arXiv:2101.12113  [pdf, other]

    cs.LG stat.ML

    Low Complexity Approximate Bayesian Logistic Regression for Sparse Online Learning

    Authors: Gil I. Shamir, Wojciech Szpankowski

    Abstract: Theoretical results show that Bayesian methods can achieve lower bounds on regret for online logistic regression. In practice, however, such techniques may not be feasible especially for very large feature sets. Various approximations that, for huge sparse feature sets, diminish the theoretical advantages, must be used. Often, they apply stochastic gradient methods with hyper-parameters that must…

    Submitted 28 January, 2021; originally announced January 2021.

  9. arXiv:2010.09931  [pdf, other]

    cs.LG cs.NE stat.ML

    Smooth activations and reproducibility in deep networks

    Authors: Gil I. Shamir, Dong Lin, Lorenzo Coviello

    Abstract: Deep networks are gradually penetrating almost every domain in our lives due to their amazing success. However, with substantive performance accuracy improvements comes the price of irreproducibility. Two identical models, trained on the exact same training dataset may exhibit large differences in predictions on individual examples even when average accuracy is similar, especially when trai…

    Submitted 30 November, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

  10. arXiv:2010.09923  [pdf, other]

    cs.LG cs.NE stat.ML

    Anti-Distillation: Improving reproducibility of deep networks

    Authors: Gil I. Shamir, Lorenzo Coviello

    Abstract: Deep networks have been revolutionary in improving performance of machine learning and artificial intelligence systems. Their high prediction accuracy, however, comes at a price of model irreproducibility in very high levels that do not occur with classical linear models. Two models, even if they are supposedly identical, with identical architecture and identical trained parameter sets, a…

    Submitted 19 October, 2020; originally announced October 2020.

  11. arXiv:2005.10320  [pdf, ps, other]

    cs.IT

    Sequential Universal Modeling for Non-Binary Sequences with Constrained Distributions

    Authors: Michael Drmota, Gil Shamir, Wojciech Szpankowski

    Abstract: Sequential probability assignment and universal compression go hand in hand. We propose sequential probability assignment for non-binary (and large alphabet) sequences with empirical distributions whose parameters are known to be bounded within a limited interval. Sequential probability assignment algorithms are essential in many applications that require fast and accurate estimation of the maximi…

    Submitted 6 February, 2021; v1 submitted 20 May, 2020; originally announced May 2020.

  12. arXiv:2002.02950  [pdf, ps, other]

    cs.LG stat.ML

    Logistic Regression Regret: What's the Catch?

    Authors: Gil I. Shamir

    Abstract: We address the problem of the achievable regret rates with online logistic regression. We derive lower bounds with logarithmic regret under $L_1$, $L_2$, and $L_\infty$ constraints on the parameter values. The bounds are dominated by $d/2 \log T$, where $T$ is the horizon and $d$ is the dimensionality of the parameter space. We show their achievability for $d=o(T^{1/3})$ in all these cases with Ba…

    Submitted 19 February, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

  13. arXiv:0711.2102  [pdf, ps, other]

    cs.IT

    Patterns of i.i.d. Sequences and Their Entropy - Part II: Bounds for Some Distributions

    Authors: Gil I. Shamir

    Abstract: A pattern of a sequence is a sequence of integer indices with each index describing the order of first occurrence of the respective symbol in the original sequence. In a recent paper, tight general bounds on the block entropy of patterns of sequences generated by independent and identically distributed (i.i.d.) sources were derived. In this paper, precise approximations are provided for the patt…

    Submitted 13 November, 2007; originally announced November 2007.

  14. arXiv:0704.0838  [pdf, ps, other]

    cs.IT

    Universal Source Coding for Monotonic and Fast Decaying Monotonic Distributions

    Authors: Gil I. Shamir

    Abstract: We study universal compression of sequences generated by monotonic distributions. We show that for a monotonic distribution over an alphabet of size $k$, each probability parameter costs essentially $0.5 \log (n/k^3)$ bits, where $n$ is the coded sequence length, as long as $k = o(n^{1/3})$. Otherwise, for $k = O(n)$, the total average sequence redundancy is $O(n^{1/3+ε})$ bits overall. We then…

    Submitted 5 April, 2007; originally announced April 2007.

    Comments: Submitted to IEEE Transactions on Information Theory

  15. arXiv:cs/0605046  [pdf, ps, other]

    cs.IT

    Patterns of i.i.d. Sequences and Their Entropy - Part I: General Bounds

    Authors: Gil I. Shamir

    Abstract: Tight bounds on the block entropy of patterns of sequences generated by independent and identically distributed (i.i.d.) sources are derived. A pattern of a sequence is a sequence of integer indices with each index representing the order of first occurrence of the respective symbol in the original sequence. Since a pattern is the result of data processing on the original sequence, its entropy ca…

    Submitted 13 November, 2007; v1 submitted 10 May, 2006; originally announced May 2006.

    Comments: Submitted to IEEE Transactions on Information Theory

  16. Universal Lossless Compression with Unknown Alphabets - The Average Case

    Authors: Gil I. Shamir

    Abstract: Universal compression of patterns of sequences generated by independently identically distributed (i.i.d.) sources with unknown, possibly large, alphabets is investigated. A pattern is a sequence of indices that contains all consecutive indices in increasing order of first occurrence. If the alphabet of a source that generated a sequence is unknown, the inevitable cost of coding the unknown alph…

    Submitted 16 March, 2006; originally announced March 2006.

    Comments: Revised for IEEE Transactions on Information Theory

    ACM Class: G.3

  17. arXiv:cs/0504049  [pdf, ps, other]

    cs.IT

    Bounds on the Entropy of Patterns of I.I.D. Sequences

    Authors: Gil I. Shamir

    Abstract: Bounds on the entropy of patterns of sequences generated by independently identically distributed (i.i.d.) sources are derived. A pattern is a sequence of indices that contains all consecutive integer indices in increasing order of first occurrence. If the alphabet of a source that generated a sequence is unknown, the inevitable cost of coding the unknown alphabet symbols can be exploited to cre…

    Submitted 12 April, 2005; originally announced April 2005.

    Comments: Submitted to ITW2005