Zum Hauptinhalt springen

Showing 1–14 of 14 results for author: Rubenstein, P K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1110 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 8 August, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  2. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  3. arXiv:2306.12925  [pdf, other

    cs.CL cs.AI cs.SD eess.AS stat.ML

    AudioPaLM: A Large Language Model That Can Speak and Listen

    Authors: Paul K. Rubenstein, Chulayuth Asawaroengchai, Duc Dung Nguyen, Ankur Bapna, Zalán Borsos, Félix de Chaumont Quitry, Peter Chen, Dalia El Badawy, Wei Han, Eugene Kharitonov, Hannah Muckenhirn, Dirk Padfield, James Qin, Danny Rozenberg, Tara Sainath, Johan Schalkwyk, Matt Sharifi, Michelle Tadmor Ramanovich, Marco Tagliasacchi, Alexandru Tudor, Mihajlo Velimirović, Damien Vincent, Jiahui Yu, Yongqiang Wang, Vicky Zayats , et al. (5 additional authors not shown)

    Abstract: We introduce AudioPaLM, a large language model for speech understanding and generation. AudioPaLM fuses text-based and speech-based language models, PaLM-2 [Anil et al., 2023] and AudioLM [Borsos et al., 2022], into a unified multimodal architecture that can process and generate text and speech with applications including speech recognition and speech-to-speech translation. AudioPaLM inherits the… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: Technical report

  4. arXiv:2302.03491  [pdf, ps, other

    cs.CL cs.LG

    Learning Translation Quality Evaluation on Low Resource Languages from Large Language Models

    Authors: Amirkeivan Mohtashami, Mauro Verzetti, Paul K. Rubenstein

    Abstract: Learned metrics such as BLEURT have in recent years become widely employed to evaluate the quality of machine translation systems. Training such metrics requires data which can be expensive and difficult to acquire, particularly for lower-resource languages. We show how knowledge can be distilled from Large Language Models (LLMs) to improve upon such learned metrics without requiring human annotat… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

  5. arXiv:2203.06127  [pdf, other

    cs.CV

    Spatial Consistency Loss for Training Multi-Label Classifiers from Single-Label Annotations

    Authors: Thomas Verelst, Paul K. Rubenstein, Marcin Eichner, Tinne Tuytelaars, Maxim Berman

    Abstract: As natural images usually contain multiple objects, multi-label image classification is more applicable "in the wild" than single-label classification. However, exhaustively annotating images with every object of interest is costly and time-consuming. We aim to train multi-label classifiers from single-label annotations only. We show that adding a consistency loss, ensuring that the predictions of… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: 24 pages, 9 figures

  6. arXiv:1910.03962  [pdf, other

    stat.ML cs.LG

    Optimal experimental design via Bayesian optimization: active causal structure learning for Gaussian process networks

    Authors: Julius von Kügelgen, Paul K Rubenstein, Bernhard Schölkopf, Adrian Weller

    Abstract: We study the problem of causal discovery through targeted interventions. Starting from few observational measurements, we follow a Bayesian active learning approach to perform those experiments which, in expectation with respect to the current model, are maximally informative about the underlying causal structure. Unlike previous work, we consider the setting of continuous random variables with no… ▽ More

    Submitted 9 October, 2019; originally announced October 2019.

    Comments: Working paper. Accepted as a poster at the NeurIPS 2019 workshop, "Do the right thing": machine learning and causal inference for improved decision making. (6 pages + references + appendix)

  7. arXiv:1907.13625  [pdf, other

    cs.LG stat.ML

    On Mutual Information Maximization for Representation Learning

    Authors: Michael Tschannen, Josip Djolonga, Paul K. Rubenstein, Sylvain Gelly, Mario Lucic

    Abstract: Many recent methods for unsupervised or self-supervised representation learning train feature extractors by maximizing an estimate of the mutual information (MI) between different views of the data. This comes with several immediate problems: For example, MI is notoriously hard to estimate, and using it as an objective for representation learning may lead to highly entangled representations due to… ▽ More

    Submitted 23 January, 2020; v1 submitted 31 July, 2019; originally announced July 2019.

    Comments: ICLR 2020. Michael Tschannen and Josip Djolonga contributed equally

  8. arXiv:1905.11112  [pdf, other

    stat.ML cs.IT cs.LG

    Practical and Consistent Estimation of f-Divergences

    Authors: Paul K. Rubenstein, Olivier Bousquet, Josip Djolonga, Carlos Riquelme, Ilya Tolstikhin

    Abstract: The estimation of an f-divergence between two probability distributions based on samples is a fundamental problem in statistics and machine learning. Most works study this problem under very weak assumptions, in which case it is provably hard. We consider the case of stronger structural assumptions that are commonly satisfied in modern machine learning, including representation learning and genera… ▽ More

    Submitted 24 October, 2019; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: Accepted to the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

    Journal ref: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  9. arXiv:1905.06642  [pdf, other

    stat.ML cs.LG

    The Incomplete Rosetta Stone Problem: Identifiability Results for Multi-View Nonlinear ICA

    Authors: Luigi Gresele, Paul K. Rubenstein, Arash Mehrjou, Francesco Locatello, Bernhard Schölkopf

    Abstract: We consider the problem of recovering a common latent source with independent components from multiple views. This applies to settings in which a variable is measured with multiple experimental modalities, and where the goal is to synthesize the disparate measurements into a single unified representation. We consider the case that the observed views are a nonlinear mixing of component-wise corrupt… ▽ More

    Submitted 1 August, 2019; v1 submitted 16 May, 2019; originally announced May 2019.

    Journal ref: Proceedings of the 35th Conference on Uncertainty in Artificial Intelligence, 2019

  10. arXiv:1812.07909  [pdf, other

    stat.ML cs.AI cs.LG

    An Empirical Study of Generative Models with Encoders

    Authors: Paul K. Rubenstein, Yunpeng Li, Dominik Roblek

    Abstract: Generative adversarial networks (GANs) are capable of producing high quality image samples. However, unlike variational autoencoders (VAEs), GANs lack encoders that provide the inverse mapping for the generators, i.e., encode images back to the latent space. In this work, we consider adversarially learned generative models that also have encoders. We evaluate models based on their ability to produ… ▽ More

    Submitted 19 December, 2018; originally announced December 2018.

  11. arXiv:1802.03761  [pdf, other

    stat.ML cs.LG

    On the Latent Space of Wasserstein Auto-Encoders

    Authors: Paul K. Rubenstein, Bernhard Schoelkopf, Ilya Tolstikhin

    Abstract: We study the role of latent space dimensionality in Wasserstein auto-encoders (WAEs). Through experimentation on synthetic and real datasets, we argue that random encoders should be preferred over deterministic encoders. We highlight the potential of WAEs for representation learning with promising results on a benchmark disentanglement task.

    Submitted 11 February, 2018; originally announced February 2018.

  12. arXiv:1707.00819  [pdf, other

    stat.ML cs.AI cs.LG stat.ME

    Causal Consistency of Structural Equation Models

    Authors: Paul K. Rubenstein, Sebastian Weichwald, Stephan Bongers, Joris M. Mooij, Dominik Janzing, Moritz Grosse-Wentrup, Bernhard Schölkopf

    Abstract: Complex systems can be modelled at various levels of detail. Ideally, causal models of the same system should be consistent with one another in the sense that they agree in their predictions of the effects of interventions. We formalise this notion of consistency in the case of Structural Equation Models (SEMs) by introducing exact transformations between SEMs. This provides a general language to… ▽ More

    Submitted 4 July, 2017; originally announced July 2017.

    Comments: equal contribution between Rubenstein and Weichwald; accepted manuscript

    Journal ref: Proceedings of the Annual Conference on Uncertainty in Artificial Intelligence, UAI 2017 ( http://auai.org/uai2017/proceedings/papers/11.pdf )

  13. arXiv:1706.10234  [pdf, other

    stat.ML cs.AI cs.LG

    Probabilistic Active Learning of Functions in Structural Causal Models

    Authors: Paul K. Rubenstein, Ilya Tolstikhin, Philipp Hennig, Bernhard Schoelkopf

    Abstract: We consider the problem of learning the functions computing children from parents in a Structural Causal Model once the underlying causal graph has been identified. This is in some sense the second step after causal discovery. Taking a probabilistic approach to estimating these functions, we derive a natural myopic active learning scheme that identifies the intervention which is optimally informat… ▽ More

    Submitted 30 June, 2017; originally announced June 2017.

    Comments: 9 pages main text + 4 pages supplement

  14. arXiv:1608.08028  [pdf, other

    cs.AI

    From Deterministic ODEs to Dynamic Structural Causal Models

    Authors: Paul K. Rubenstein, Stephan Bongers, Bernhard Schoelkopf, Joris M. Mooij

    Abstract: Structural Causal Models are widely used in causal modelling, but how they relate to other modelling tools is poorly understood. In this paper we provide a novel perspective on the relationship between Ordinary Differential Equations and Structural Causal Models. We show how, under certain conditions, the asymptotic behaviour of an Ordinary Differential Equation under non-constant interventions ca… ▽ More

    Submitted 9 July, 2018; v1 submitted 29 August, 2016; originally announced August 2016.

    Comments: Accepted for publication in Conference on Uncertainy in Artificial Intelligence

    Journal ref: Proceedings of the 35th Annual Conference on Uncertainty in Artificial Intelligence (2018), 114-123