Showing 1–30 of 30 results for author: Ramos, S

  1. arXiv:2312.11805 [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  2. arXiv:2306.13649 [pdf, other]

    cs.LG cs.AI cs.CL

    On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes

    Authors: Rishabh Agarwal, Nino Vieillard, Yongchao Zhou, Piotr Stanczyk, Sabela Ramos, Matthieu Geist, Olivier Bachem

    Abstract: Knowledge distillation (KD) is widely used for compressing a teacher model to reduce its inference cost and memory footprint, by training a smaller student model. However, current KD methods for auto-regressive sequence models suffer from distribution mismatch between output sequences seen during training and those generated by the student during inference. To address this issue, we introduce Gene…

    Submitted 16 January, 2024; v1 submitted 23 June, 2023; originally announced June 2023.

    Comments: Accepted at ICLR 2024. First two authors contributed equally

  3. arXiv:2306.00186 [pdf, other]

    cs.CL

    Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback

    Authors: Paul Roit, Johan Ferret, Lior Shani, Roee Aharoni, Geoffrey Cideron, Robert Dadashi, Matthieu Geist, Sertan Girgin, Léonard Hussenot, Orgad Keller, Nikola Momchev, Sabela Ramos, Piotr Stanczyk, Nino Vieillard, Olivier Bachem, Gal Elidan, Avinatan Hassidim, Olivier Pietquin, Idan Szpektor

    Abstract: Despite the seeming success of contemporary grounded text generation systems, they often tend to generate factually inconsistent text with respect to their input. This phenomenon is emphasized in tasks like summarization, in which the generated summaries should be corroborated by their source article. In this work, we leverage recent progress on textual entailment models to directly address this p…

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: ACL 2023

  4. arXiv:2305.03718 [pdf]

    cs.CY cs.CR

    The MEV Saga: Can Regulation Illuminate the Dark Forest?

    Authors: Simona Ramos, Joshua Ellul

    Abstract: In this article, we develop an interdisciplinary analysis of MEV which aims to bridge the gap that exists between technical and legal research, supporting policymakers in their regulatory decisions concerning blockchains, DeFi and associated risks. Consequently, this article is intended for both technical and legal audiences, and while we abstain from a detailed legal analysis, we aim to open a p…

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: 12 pages

    Journal ref: 2nd International Workshop on Decentralized Governance Design at the 35th International Conference on Advanced Information Systems Engineering 2023

  5. arXiv:2304.04749 [pdf]

    cs.SE cs.CY

    Watch the Gap: Making code more intelligible to users without sacrificing decentralization?

    Authors: Simona Ramos, Morshed Mannan

    Abstract: The potential for blockchain technology to eliminate the middleman and replace the top-down hierarchical model of governance with a system of distributed cooperation has opened up many new opportunities, as well as dilemmas. Surpassing the level of acceptance by early tech adopters, the market of smart contracts is now moving towards wider acceptance from regular (non-tech) users. For this to happ…

    Submitted 10 March, 2023; originally announced April 2023.

    Journal ref: IEEE 24th Conference on Business Informatics; Workshop towards Decentralized Governance Design, June 2022

  6. arXiv:2209.07650 [pdf, other]

    cs.IT cs.LG math.NA

    Statistical Properties of the Entropy from Ordinal Patterns

    Authors: Eduarda T. C. Chagas, Alejandro C. Frery, Juliana Gambini, Magdalena M. Lucini, Heitor S. Ramos, Andrea A. Rey

    Abstract: The ultimate purpose of the statistical analysis of ordinal patterns is to characterize the distribution of the features they induce. In particular, knowing the joint distribution of the pair Entropy-Statistical Complexity for a large class of time series models would allow statistical tests that are unavailable to date. Working in this direction, we characterize the asymptotic distribution of the…

    Submitted 15 September, 2022; originally announced September 2022.

    Journal ref: Chaos: An Interdisciplinary Journal of Nonlinear Science (2022)

  7. arXiv:2208.12763 [pdf, other]

    cs.CV cs.GR

    Leveraging Synthetic Data to Learn Video Stabilization Under Adverse Conditions

    Authors: Abdulrahman Kerim, Washington L. S. Ramos, Leandro Soriano Marcolino, Erickson R. Nascimento, Richard Jiang

    Abstract: Video stabilization plays a central role in improving video quality. However, despite the substantial progress made by these methods, they were mainly tested under standard weather and lighting conditions, and may perform poorly under adverse conditions. In this paper, we propose a synthetic-aware adverse weather robust algorithm for video stabilization that does not require real data and can be…

    Submitted 26 August, 2022; originally announced August 2022.

    ACM Class: I.4.0; I.4.1; I.6.0

  8. arXiv:2111.02767 [pdf, other]

    cs.LG

    RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning

    Authors: Sabela Ramos, Sertan Girgin, Léonard Hussenot, Damien Vincent, Hanna Yakubovich, Daniel Toyama, Anita Gergely, Piotr Stanczyk, Raphael Marinier, Jeremiah Harmsen, Olivier Pietquin, Nikola Momchev

    Abstract: We introduce RLDS (Reinforcement Learning Datasets), an ecosystem for recording, replaying, manipulating, annotating and sharing data in the context of Sequential Decision Making (SDM) including Reinforcement Learning (RL), Learning from Demonstrations, Offline RL or Imitation Learning. RLDS enables not only reproducibility of existing research and easy generation of new datasets, but also acceler…

    Submitted 4 November, 2021; originally announced November 2021.

    Comments: https://github.com/google-research/rlds

  9. arXiv:2105.12034 [pdf, other]

    cs.LG

    Hyperparameter Selection for Imitation Learning

    Authors: Leonard Hussenot, Marcin Andrychowicz, Damien Vincent, Robert Dadashi, Anton Raichuk, Lukasz Stafiniak, Sertan Girgin, Raphael Marinier, Nikola Momchev, Sabela Ramos, Manu Orsini, Olivier Bachem, Matthieu Geist, Olivier Pietquin

    Abstract: We address the issue of tuning hyperparameters (HPs) for imitation learning algorithms in the context of continuous-control, when the underlying reward function of the demonstrating expert cannot be observed at any time. The vast literature in imitation learning mostly considers this reward function to be available for HP selection, but this is not a realistic setting. Indeed, would this reward fu…

    Submitted 25 May, 2021; originally announced May 2021.

    Comments: ICML 2021

  10. arXiv:2102.04736 [pdf, other]

    cs.LG cs.AI cs.DC

    Reverb: A Framework For Experience Replay

    Authors: Albin Cassirer, Gabriel Barth-Maron, Eugene Brevdo, Sabela Ramos, Toby Boyd, Thibault Sottiaux, Manuel Kroiss

    Abstract: A central component of training in Reinforcement Learning (RL) is Experience: the data used for training. The mechanisms used to generate and consume this data have an important effect on the performance of RL algorithms. In this paper, we introduce Reverb: an efficient, extensible, and easy to use system designed specifically for experience replay in RL. Reverb is designed to work efficiently i…

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: 11 pages

  11. arXiv:2101.11227 [pdf, other]

    stat.ME cs.MS

    Bayesian Paired-Comparison with the bpcs Package

    Authors: David Issa Mattos, Érika Martins Silva Ramos

    Abstract: This article introduces the bpcs R package (Bayesian Paired Comparison in Stan) and the statistical models implemented in the package. This package aims to facilitate the use of Bayesian models for paired comparison data in behavioral research. Bayesian analysis of paired comparison data allows parameter estimation even in conditions where the maximum likelihood does not exist, allows easy extensi…

    Submitted 20 September, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: Accepted for publication in the Journal of Behavior Research Methods (https://www.springer.com/journal/13428)

  12. arXiv:2011.08325 [pdf, other]

    cs.CV cs.LG

    A New Similarity Space Tailored for Supervised Deep Metric Learning

    Authors: Pedro H. Barros, Fabiane Queiroz, Flavio Figueredo, Jefersson A. dos Santos, Heitor S. Ramos

    Abstract: We propose a novel deep metric learning method. Unlike many works in this area, we define a novel latent space obtained through an autoencoder. The new space, namely S-space, is divided into different regions that describe the positions where pairs of objects are similar/dissimilar. We locate markers to identify these regions. We estimate the similarities between objects through a kernel…

    Submitted 18 November, 2020; v1 submitted 16 November, 2020; originally announced November 2020.

    Comments: 47 pages, 11 figures

  13. A Sparse Sampling-based framework for Semantic Fast-Forward of First-Person Videos

    Authors: Michel Melo Silva, Washington Luis Souza Ramos, Mario Fernando Montenegro Campos, Erickson Rangel Nascimento

    Abstract: Technological advances in sensors have paved the way for digital cameras to become increasingly ubiquitous, which, in turn, led to the popularity of the self-recording culture. As a result, the amount of visual data on the Internet is moving in the opposite direction of the available time and patience of the users. Thus, most of the uploaded videos are doomed to be forgotten and unwatched stashed…

    Submitted 21 September, 2020; originally announced September 2020.

    Comments: Accepted at the IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2020. arXiv admin note: text overlap with arXiv:1802.08722

  14. arXiv:2007.08687 [pdf, other]

    cs.LG eess.SP stat.ML

    Leveraging the Self-Transition Probability of Ordinal Pattern Transition Graph for Transportation Mode Classification

    Authors: I. Cardoso-Pereira, J. B. Borges, P. H. Barros, A. F. Loureiro, O. A. Rosso, H. S. Ramos

    Abstract: The analysis of GPS trajectories is a well-studied problem in Urban Computing and has been used to track people. Analyzing people's mobility and identifying the transportation mode they use is essential for cities that want to reduce traffic jams and travel time between their points, thus helping to improve the quality of life of citizens. The trajectory data of a moving object is represented by…

    Submitted 16 July, 2020; originally announced July 2020.

  15. arXiv:2006.00979 [pdf, other]

    cs.LG cs.AI

    Acme: A Research Framework for Distributed Reinforcement Learning

    Authors: Matthew W. Hoffman, Bobak Shahriari, John Aslanides, Gabriel Barth-Maron, Nikola Momchev, Danila Sinopalnikov, Piotr Stańczyk, Sabela Ramos, Anton Raichuk, Damien Vincent, Léonard Hussenot, Robert Dadashi, Gabriel Dulac-Arnold, Manu Orsini, Alexis Jacq, Johan Ferret, Nino Vieillard, Seyed Kamyar Seyed Ghasemipour, Sertan Girgin, Olivier Pietquin, Feryal Behbahani, Tamara Norman, Abbas Abdolmaleki, Albin Cassirer, Fan Yang, et al. (14 additional authors not shown)

    Abstract: Deep reinforcement learning (RL) has led to many recent and groundbreaking advances. However, these advances have often come at the cost of both increased scale in the underlying architectures being trained as well as increased complexity of the RL algorithms used to train them. These increases have in turn made it more difficult for researchers to rapidly prototype new ideas or reproduce publishe…

    Submitted 20 September, 2022; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: This work presents a second version of the paper which coincides with an increase in modularity, additional emphasis on offline, imitation and learning from demonstrations algorithms, as well as various new agents implemented as part of Acme

  16. arXiv:1912.12655 [pdf, other]

    cs.CV

    Personalizing Fast-Forward Videos Based on Visual and Textual Features from Social Network

    Authors: Washington L. S. Ramos, Michel M. Silva, Edson R. Araujo, Alan C. Neves, Erickson R. Nascimento

    Abstract: The growth of Social Networks has fueled the habit of people logging their day-to-day activities, and long First-Person Videos (FPVs) are one of the main tools in this new habit. Semantic-aware fast-forward methods are able to decrease the watch time and select meaningful moments, which is key to increase the chances of these videos being watched. However, these methods cannot handle semantics in…

    Submitted 29 December, 2019; originally announced December 2019.

  17. 3DBGrowth: volumetric vertebrae segmentation and reconstruction in magnetic resonance imaging

    Authors: Jonathan S. Ramos, Mirela T. Cazzolato, Bruno S. Faiçal, Marcello H. Nogueira-Barbosa, Caetano Traina Jr., Agma J. M. Traina

    Abstract: Segmentation of medical images is critical for making several processes of analysis and classification more reliable. With the growing number of people presenting back pain and related problems, the semi-automatic segmentation and 3D reconstruction of vertebral bodies became even more important to support decision making. A 3D reconstruction allows a fast and objective analysis of each vertebrae c…

    Submitted 8 July, 2019; v1 submitted 24 June, 2019; originally announced June 2019.

    Comments: This is a pre-print of an article published in Computer-Based Medical Systems. The final authenticated version is available online at: https://doi.org/10.1109/CBMS.2019.00091

    Journal ref: Computer-Based Medical Systems, 2019

  18. BGrowth: an efficient approach for the segmentation of vertebral compression fractures in magnetic resonance imaging

    Authors: Jonathan S. Ramos, Carolina Y. V. Watanabe, Marcello H. Nogueira-Barbosa, Agma J. M. Traina

    Abstract: Segmentation of medical images is a critical issue: several processes of analysis and classification rely on this segmentation. With the growing number of people presenting back pain and problems related to it, the automatic or semi-automatic segmentation of fractured vertebral bodies became a challenging task. In general, those fractures present several regions with non-homogeneous intensities and…

    Submitted 24 June, 2019; v1 submitted 20 June, 2019; originally announced June 2019.

    Comments: This is a pre-print of an article published in Symposium on Applied Computing. The final authenticated version is available online at https://doi.org/10.1145/3297280.3299728

    Journal ref: The 34th ACM/SIGAPP Symposium on Applied Computing (SAC2019)

  19. arXiv:1808.00320 [pdf, other]

    cs.CY

    Robots Racialized in the Likeness of Marginalized Social Identities are Subject to Greater Dehumanization than those racialized as White

    Authors: Megan Strait, Ana Sánchez Ramos, Virginia Contreras, Noemi Garcia

    Abstract: The emergence and spread of humanlike robots into increasingly public domains has revealed a concerning phenomenon: people's unabashed dehumanization of robots, particularly those gendered as female. Here we examined this phenomenon further towards understanding whether other socially marginalized cues (racialization in the likeness of Asian and Black identities), like female-gendering, are associ…

    Submitted 1 August, 2018; originally announced August 2018.

    Comments: Accepted to IEEE RO-MAN 2018

  20. A Weighted Sparse Sampling and Smoothing Frame Transition Approach for Semantic Fast-Forward First-Person Videos

    Authors: Michel Melo Silva, Washington Luis Souza Ramos, Joao Klock Ferreira, Felipe Cadar Chamone, Mario Fernando Montenegro Campos, Erickson Rangel Nascimento

    Abstract: Thanks to the advances in the technology of low-cost digital cameras and the popularity of the self-recording culture, the amount of visual data on the Internet is moving in the opposite direction of the available time and patience of the users. Thus, most of the uploaded videos are doomed to be forgotten and unwatched in a computer folder or website. In this work, we address the problem of creating smo…

    Submitted 4 April, 2019; v1 submitted 23 February, 2018; originally announced February 2018.

    Comments: Accepted for publication in the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2018. Link to the project website: https://www.verlab.dcc.ufmg.br/semantic-hyperlapse/

  21. Making a long story short: A Multi-Importance fast-forwarding egocentric videos with the emphasis on relevant objects

    Authors: Michel Melo Silva, Washington Luis Souza Ramos, Felipe Cadar Chamone, João Pedro Klock Ferreira, Mario Fernando Montenegro Campos, Erickson Rangel Nascimento

    Abstract: The emergence of low-cost high-quality personal wearable cameras combined with the increasing storage capacity of video-sharing websites have evoked a growing interest in first-person videos, since most videos are composed of long-running unedited streams which are usually tedious and unpleasant to watch. State-of-the-art semantic fast-forward methods currently face the challenge of providing an a…

    Submitted 7 March, 2018; v1 submitted 9 November, 2017; originally announced November 2017.

    Comments: Accepted for publication in the Journal of Visual Communication and Image Representation (JVCI) 2018. Project website: https://www.verlab.dcc.ufmg.br/semantic-hyperlapse

  22. Fast-Forward Video Based on Semantic Extraction

    Authors: Washington Luis Souza Ramos, Michel Melo Silva, Mario Fernando Montenegro Campos, Erickson Rangel Nascimento

    Abstract: Thanks to the low operational cost and large storage capacity of smartphones and wearable devices, people are recording many hours of daily activities, sport actions and home videos. These videos, also known as egocentric videos, are generally long-running streams with unedited content, which makes them boring and visually unpalatable, bringing up the challenge to make egocentric videos more appeal…

    Submitted 16 August, 2017; v1 submitted 14 August, 2017; originally announced August 2017.

    Comments: Accepted for publication and presented in 2016 IEEE International Conference on Image Processing (ICIP)

  23. Towards Semantic Fast-Forward and Stabilized Egocentric Videos

    Authors: Michel Melo Silva, Washington Luis Souza Ramos, Joao Pedro Klock Ferreira, Mario Fernando Montenegro Campos, Erickson Rangel Nascimento

    Abstract: The emergence of low-cost personal mobile devices and wearable cameras and the increasing storage capacity of video-sharing websites have pushed forward a growing interest towards first-person videos. Since most of the recorded videos compose long-running streams with unedited content, they are tedious and unpleasant to watch. The fast-forward state-of-the-art methods are facing challenges of bal…

    Submitted 16 August, 2017; v1 submitted 14 August, 2017; originally announced August 2017.

    Comments: Accepted for publication and presented in the First International Workshop on Egocentric Perception, Interaction and Computing at European Conference on Computer Vision (EPIC@ECCV) 2016

  24. arXiv:1612.06573 [pdf, other]

    cs.CV cs.RO

    Detecting Unexpected Obstacles for Self-Driving Cars: Fusing Deep Learning and Geometric Modeling

    Authors: Sebastian Ramos, Stefan Gehrig, Peter Pinggera, Uwe Franke, Carsten Rother

    Abstract: The detection of small road hazards, such as lost cargo, is a vital capability for self-driving cars. We tackle this challenging and rarely addressed problem with a vision system that leverages appearance, contextual as well as geometric cues. To utilize the appearance and contextual cues, we propose a new deep learning-based obstacle detection framework. Here a variant of a fully convolutional ne…

    Submitted 20 December, 2016; originally announced December 2016.

    Comments: Submitted to the IEEE International Conference on Robotics and Automation (ICRA) 2017

  25. arXiv:1609.04653 [pdf, other]

    cs.CV cs.RO

    Lost and Found: Detecting Small Road Hazards for Self-Driving Vehicles

    Authors: Peter Pinggera, Sebastian Ramos, Stefan Gehrig, Uwe Franke, Carsten Rother, Rudolf Mester

    Abstract: Detecting small obstacles on the road ahead is a critical part of the driving task which has to be mastered by fully autonomous cars. In this paper, we present a method based on stereo vision to reliably detect such obstacles from a moving vehicle. The proposed algorithm performs statistical hypothesis tests in disparity space directly on stereo image data, assessing freespace and obstacle hypothe…

    Submitted 15 September, 2016; originally announced September 2016.

    Comments: To be presented at IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2016

  26. arXiv:1604.01685 [pdf, other]

    cs.CV

    The Cityscapes Dataset for Semantic Urban Scene Understanding

    Authors: Marius Cordts, Mohamed Omran, Sebastian Ramos, Timo Rehfeld, Markus Enzweiler, Rodrigo Benenson, Uwe Franke, Stefan Roth, Bernt Schiele

    Abstract: Visual understanding of complex urban street scenes is an enabling factor for a wide range of applications. Object detection has benefited enormously from large-scale datasets, especially in the context of deep learning. For semantic urban scene understanding, however, no current dataset adequately captures the complexity of real-world urban scenes. To address this, we introduce Cityscapes, a be…

    Submitted 7 April, 2016; v1 submitted 6 April, 2016; originally announced April 2016.

    Comments: Includes supplemental material

  27. arXiv:1408.5400 [pdf, other]

    cs.CV cs.LG

    Hierarchical Adaptive Structural SVM for Domain Adaptation

    Authors: Jiaolong Xu, Sebastian Ramos, David Vazquez, Antonio M. Lopez

    Abstract: A key topic in classification is the accuracy loss produced when the data distribution in the training (source) domain differs from that in the testing (target) domain. This is being recognized as a very relevant problem for many computer vision tasks such as image classification, object detection, and object category recognition. In this paper, we present a novel domain adaptation method that lev…

    Submitted 22 August, 2014; originally announced August 2014.

  28. arXiv:1407.3686 [pdf, ps, other]

    cs.CV

    Spatiotemporal Stacked Sequential Learning for Pedestrian Detection

    Authors: Alejandro González, Sebastian Ramos, David Vázquez, Antonio M. López, Jaume Amores

    Abstract: Pedestrian classifiers decide which image windows contain a pedestrian. In practice, such classifiers provide a relatively high response at neighbor windows overlapping a pedestrian, while the responses around potential false positives are expected to be lower. An analogous reasoning applies for image sequences. If there is a pedestrian located within a frame, the same pedestrian is expected to ap…

    Submitted 14 July, 2014; originally announced July 2014.

    Comments: 8 pages, 5 figures, 1 table

  29. arXiv:1306.1894 [pdf, other]

    cs.CV

    Speckle Reduction with Adaptive Stack Filters

    Authors: María Elena Buemi, Alejandro C. Frery, Heitor S. Ramos

    Abstract: Stack filters are a special case of non-linear filters. They have a good performance for filtering images with different types of noise while preserving edges and details. A stack filter decomposes an input image into stacks of binary images according to a set of thresholds. Each binary image is then filtered by a Boolean function, which characterizes the filter. Adaptive stack filters can be comp…

    Submitted 8 June, 2013; originally announced June 2013.

    Comments: Accepted for publication in Pattern Recognition Letters. arXiv admin note: substantial text overlap with arXiv:1207.4308

  30. Assessment of SAR Image Filtering using Adaptive Stack Filters

    Authors: Maria E. Buemi, Marta Mejail, Julio Jacobo, Alejandro C. Frery, Heitor S. Ramos

    Abstract: Stack filters are a special case of non-linear filters. They have a good performance for filtering images with different types of noise while preserving edges and details. A stack filter decomposes an input image into several binary images according to a set of thresholds. Each binary image is then filtered by a Boolean function, which characterizes the filter. Adaptive stack filters can be design…

    Submitted 18 July, 2012; originally announced July 2012.

    Journal ref: Proceedings 16th Iberoamerican Congress on Pattern Recognition (CIARP 2011), Lecture Notes in Computer Science vol. 7042, p. 89--96