Showing 51–61 of 61 results for author: Mozer, M

  1. arXiv:1809.03702  [pdf, other]

    cs.LG stat.ML

    Sparse Attentive Backtracking: Temporal Credit Assignment Through Reminding

    Authors: Nan Rosemary Ke, Anirudh Goyal, Olexa Bilaniuk, Jonathan Binas, Michael C. Mozer, Chris Pal, Yoshua Bengio

    Abstract: Learning long-term dependencies in extended temporal sequences requires credit assignment to events far back in the past. The most common method for training recurrent neural networks, back-propagation through time (BPTT), requires credit information to be propagated backwards through every single step of the forward computation, potentially over thousands or millions of time steps. This becomes c…

    Submitted 11 September, 2018; originally announced September 2018.

    Comments: To appear as a Spotlight presentation at NIPS 2018

  2. arXiv:1805.08402  [pdf, other]

    cs.LG stat.ML

    Adapted Deep Embeddings: A Synthesis of Methods for $k$-Shot Inductive Transfer Learning

    Authors: Tyler R. Scott, Karl Ridgeway, Michael C. Mozer

    Abstract: The focus in machine learning has branched beyond training classifiers on a single task to investigating how previously acquired knowledge in a source domain can be leveraged to facilitate learning in a related target domain, known as inductive transfer learning. Three active lines of research have independently explored transfer learning using neural networks. In weight transfer, a model trained…

    Submitted 27 October, 2018; v1 submitted 22 May, 2018; originally announced May 2018.

  3. arXiv:1805.08394  [pdf, other]

    cs.NE

    State-Denoised Recurrent Neural Networks

    Authors: Michael C. Mozer, Denis Kazakov, Robert V. Lindsey

    Abstract: Recurrent neural networks (RNNs) are difficult to train on sequence processing tasks, not only because input noise may be amplified through feedback, but also because any inaccuracy in the weights has similar consequences as input noise. We describe a method for denoising the hidden state during training to achieve more robust representations thereby improving generalization performance. Attractor…

    Submitted 28 May, 2018; v1 submitted 22 May, 2018; originally announced May 2018.

  4. arXiv:1803.07977  [pdf, other]

    hep-ph

    Les Houches 2017: Physics at TeV Colliders Standard Model Working Group Report

    Authors: J. Bendavid, F. Caola, V. Ciulli, R. Harlander, G. Heinrich, J. Huston, S. Kallweit, S. Prestel, E. Re, K. Tackmann, J. Thaler, K. Theofilatos, J. R. Andersen, J. Bellm, N. Berger, D. Bhatia, B. Biedermann, S. Bräuer, D. Britzger, A. G. Buckley, R. Camacho, G. Chachamis, S. Chatterjee, X. Chen, M. Chiesa, et al. (80 additional authors not shown)

    Abstract: This Report summarizes the proceedings of the 2017 Les Houches workshop on Physics at TeV Colliders. Session 1 dealt with (I) new developments relevant for high precision Standard Model calculations, (II) theoretical uncertainties and dataset dependence of parton distribution functions, (III) new developments in jet substructure techniques, (IV) issues in the theoretical description of the product…

    Submitted 21 March, 2018; originally announced March 2018.

    Comments: Proceedings of the Standard Model Working Group of the 2017 Les Houches Workshop, Physics at TeV Colliders, Les Houches 5-23 June 2017. 314 pages

    Report number: UWTHPH-2018-5

  5. arXiv:1802.05312  [pdf, other]

    cs.LG cs.AI stat.ML

    Learning Deep Disentangled Embeddings with the F-Statistic Loss

    Authors: Karl Ridgeway, Michael C. Mozer

    Abstract: Deep-embedding methods aim to discover representations of a domain that make explicit the domain's class structure and thereby support few-shot learning. Disentangling methods aim to make explicit compositional or factorial structure. We combine these two active but independent lines of research and propose a new paradigm suitable for both goals. We propose and evaluate a novel loss function based…

    Submitted 19 May, 2018; v1 submitted 14 February, 2018; originally announced February 2018.

  6. Vector boson scattering: Recent experimental and theory developments

    Authors: C. F. Anders, A. Ballestrero, J. Balz, R. Bellan, B. Biedermann, C. Bittrich, S. Braß, I. Brivio, L. S. Bruni, J. Butterworth, M. Cacciari, A. Cardini, C. Charlot, V. Ciulli, R. Covarelli, J. Cuevas, A. Denner, L. Di Ciaccio, S. Dittmaier, S. Duric, S. Farrington, P. Ferrari, P. Ferreira Silva, L. Finco, D. Giljanović, et al. (89 additional authors not shown)

    Abstract: This document summarises the talks and discussions happened during the VBSCan Split17 workshop, the first general meeting of the VBSCan COST Action network. This collaboration is aiming at a consistent and coordinated study of vector-boson scattering from the phenomenological and experimental point of view, for the best exploitation of the data that will be delivered by existing and future particl…

    Submitted 13 December, 2018; v1 submitted 12 January, 2018; originally announced January 2018.

    Comments: 41 pages including references, 11 figures, summary of the talks and discussions happened during the first VBSCan workshop: https://indico.cern.ch/event/629638/. Note that in v2 the original title "VBSCan Split 2017 Workshop Summary" has been modified according to the published version

    Report number: VBSCan-PUB-01-17

    Journal ref: Rev.Phys. 3 (2018) 44-63

  7. arXiv:1710.04110  [pdf, other]

    cs.NE cs.LG

    Discrete Event, Continuous Time RNNs

    Authors: Michael C. Mozer, Denis Kazakov, Robert V. Lindsey

    Abstract: We investigate recurrent neural network architectures for event-sequence processing. Event sequences, characterized by discrete observations stamped with continuous-valued times of occurrence, are challenging due to the potentially wide dynamic range of relevant time scales as well as interactions between time scales. We describe four forms of inductive bias that should benefit architectures for e…

    Submitted 11 October, 2017; originally announced October 2017.

    Comments: 21 pages

    ACM Class: I.2.6

  8. arXiv:1612.08117  [pdf, other]

    cs.HC cs.NE

    Improving Human-Machine Cooperative Visual Search With Soft Highlighting

    Authors: Ronald T. Kneusel, Michael C. Mozer

    Abstract: Advances in machine learning have produced systems that attain human-level performance on certain visual tasks, e.g., object identification. Nonetheless, other tasks requiring visual expertise are unlikely to be entrusted to machines for some time, e.g., satellite and medical imagery analysis. We describe a human-machine cooperative approach to visual search, the aim of which is to outperform eith…

    Submitted 23 December, 2016; originally announced December 2016.

  9. arXiv:1604.02416  [pdf, other]

    cs.AI cs.NE

    How deep is knowledge tracing?

    Authors: Mohammad Khajah, Robert V. Lindsey, Michael C. Mozer

    Abstract: In theoretical cognitive science, there is a tension between highly structured models whose parameters have a direct psychological interpretation and highly complex, general-purpose models whose parameters and representations are difficult to interpret. The former typically provide more insight into cognition but the latter often perform better. This tension has recently surfaced in the realm of e…

    Submitted 21 June, 2016; v1 submitted 14 March, 2016; originally announced April 2016.

    Comments: 8 pages, 2 figures

  10. arXiv:1511.06409  [pdf, other]

    cs.LG cs.CV

    Learning to Generate Images with Perceptual Similarity Metrics

    Authors: Jake Snell, Karl Ridgeway, Renjie Liao, Brett D. Roads, Michael C. Mozer, Richard S. Zemel

    Abstract: Deep networks are increasingly being applied to problems involving image synthesis, e.g., generating images from textual descriptions and reconstructing an input image from a compact representation. Supervised training of image-synthesis networks typically uses a pixel-wise loss (PL) to indicate the mismatch between a generated image and its corresponding target image. We propose instead to use a…

    Submitted 23 January, 2017; v1 submitted 19 November, 2015; originally announced November 2015.

  11. arXiv:1411.4413  [pdf, other]

    hep-ex hep-ph

    Observation of the rare $B^0_s\to\mu^+\mu^-$ decay from the combined analysis of CMS and LHCb data

    Authors: The CMS, LHCb Collaborations: V. Khachatryan, A. M. Sirunyan, A. Tumasyan, W. Adam, T. Bergauer, M. Dragicevic, J. Erö, M. Friedl, R. Frühwirth, V. M. Ghete, C. Hartl, N. Hörmann, J. Hrubec, M. Jeitler, W. Kiesenhofer, V. Knünz, M. Krammer, I. Krätschmer, D. Liko, I. Mikulec, D. Rabady, B. Rahbaran, et al. (2807 additional authors not shown)

    Abstract: A joint measurement is presented of the branching fractions $B^0_s\to\mu^+\mu^-$ and $B^0\to\mu^+\mu^-$ in proton-proton collisions at the LHC by the CMS and LHCb experiments. The data samples were collected in 2011 at a centre-of-mass energy of 7 TeV, and in 2012 at 8 TeV. The combined analysis produces the first observation of the $B^0_s\to\mu^+\mu^-$ decay, with a statistical significance exceeding six sta…

    Submitted 17 August, 2015; v1 submitted 17 November, 2014; originally announced November 2014.

    Comments: Correspondence should be addressed to [email protected]

    Report number: CERN-PH-EP-2014-220, CMS-BPH-13-007, LHCb-PAPER-2014-049

    Journal ref: Nature 522, 68-72 (04 June 2015)