Zum Hauptinhalt springen

Showing 1–13 of 13 results for author: Rose, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.05467  [pdf, other

    cs.DC cs.AI

    The infrastructure powering IBM's Gen AI model development

    Authors: Talia Gershon, Seetharami Seelam, Brian Belgodere, Milton Bonilla, Lan Hoang, Danny Barnett, I-Hsin Chung, Apoorve Mohan, Ming-Hung Chen, Lixiang Luo, Robert Walkup, Constantinos Evangelinos, Shweta Salaria, Marc Dombrowa, Yoonho Park, Apo Kayi, Liran Schour, Alim Alim, Ali Sydney, Pavlos Maniotis, Laurent Schares, Bernard Metzler, Bengi Karacali-Akyamac, Sophia Wen, Tatsuhiro Chiba , et al. (121 additional authors not shown)

    Abstract: AI Infrastructure plays a key role in the speed and cost-competitiveness of developing and deploying advanced AI models. The current demand for powerful AI infrastructure for model training is driven by the emergence of generative AI and foundational models, where on occasion thousands of GPUs must cooperate on a single training job for the model to be trained in a reasonable time. Delivering effi… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Corresponding Authors: Talia Gershon, Seetharami Seelam,Brian Belgodere, Milton Bonilla

  2. arXiv:2403.02078  [pdf, other

    cs.CL

    Automated Generation of Multiple-Choice Cloze Questions for Assessing English Vocabulary Using GPT-turbo 3.5

    Authors: Qiao Wang, Ralph Rose, Naho Orita, Ayaka Sugawara

    Abstract: A common way of assessing language learners' mastery of vocabulary is via multiple-choice cloze (i.e., fill-in-the-blank) questions. But the creation of test items can be laborious for individual teachers or in large-scale language programs. In this paper, we evaluate a new method for automatically generating these types of questions using large language models (LLM). The VocaTT (vocabulary teachi… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Journal ref: Mika Hämäläinen, Emily Öhman, Flammie Pirinen, Khalid Alnajjar, So Miyagawa, Yuri Bizzoni, Niko Partanen, and Jack Rueter. 2023. Proc. of the Joint 3rd International Conference on NLP4DH and 8th IWCLUL. ACL, Tokyo, Japan, edition

  3. Experimental analysis of variability in WS$_2$-based devices for hardware security

    Authors: M. Vatalaro, H. Neill, F. Gity, P. Magnone, V. Maccaronio, C. Márquez, J. C. Galdon, F. Gamiz, F. Crupi, P. Hurley, R. De Rose

    Abstract: This work investigates the variability of tungsten disulfide (WS$_2$)-based devices by experimental characterization in view of possible application in the field of hardware security. To this aim, a preliminary analysis was performed by measurements across voltages and temperatures on a set of seven Si/SiO$_2$/WS$_2$ back-gated devices, also considering the effect of different stabilization condit… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

    Journal ref: Solid-State Electronics 2023

  4. arXiv:2306.16398  [pdf, other

    cs.SD eess.AS

    Cascaded encoders for fine-tuning ASR models on overlapped speech

    Authors: Richard Rose, Oscar Chang, Olivier Siohan

    Abstract: Multi-talker speech recognition (MT-ASR) has been shown to improve ASR performance on speech containing overlapping utterances from more than one speaker. Multi-talker models have typically been trained from scratch using simulated or actual overlapping speech datasets. On the other hand, the trend in ASR has been to train foundation models using massive datasets collected from a wide variety of t… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  5. arXiv:2211.07357  [pdf, other

    cs.LG cs.AI eess.SY

    Controlling Commercial Cooling Systems Using Reinforcement Learning

    Authors: Jerry Luo, Cosmin Paduraru, Octavian Voicu, Yuri Chervonyi, Scott Munns, Jerry Li, Crystal Qian, Praneet Dutta, Jared Quincy Davis, Ningjia Wu, Xingwei Yang, Chu-Ming Chang, Ted Li, Rob Rose, Mingyan Fan, Hootan Nakhost, Tinglin Liu, Brian Kirkman, Frank Altamura, Lee Cline, Patrick Tonker, Joel Gouker, Dave Uden, Warren Buddy Bryan, Jason Law , et al. (11 additional authors not shown)

    Abstract: This paper is a technical overview of DeepMind and Google's recent work on reinforcement learning for controlling commercial cooling systems. Building on expertise that began with cooling Google's data centers more efficiently, we recently conducted live experiments on two real-world facilities in partnership with Trane Technologies, a building management system provider. These live experiments ha… ▽ More

    Submitted 14 December, 2022; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: 27 pages, 11 figures

  6. arXiv:2205.09388  [pdf

    cs.ET physics.app-ph

    Smart Material Implication Using Spin-Transfer Torque Magnetic Tunnel Junctions for Logic-in-Memory Computing

    Authors: Raffaele De Rose, Tommaso Zanotti, Francesco Maria Puglisi, Felice Crupi, Paolo Pavan, Marco Lanuzza

    Abstract: Smart material implication (SIMPLY) logic has been recently proposed for the design of energy-efficient Logic-in-Memory (LIM) architectures based on non-volatile resistive memory devices. The SIMPLY logic is enabled by adding a comparator to the conventional IMPLY scheme. This allows performing a preliminary READ operation and hence the SET operation only in the case it is actually required. This… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Journal ref: Solid-State Electronics 2022

  7. Adjusting Thermal Stability in Double-Barrier MTJ for Energy Improvement in Cryogenic STT-MRAMs

    Authors: Esteban Garzón, Raffaele De Rose, Felice Crupi, Lionel Trojman, Adam Teman, Marco Lanuzza

    Abstract: This paper investigates the impact of thermal stability relaxation in double-barrier magnetic tunnel junctions (DMTJs) for energy-efficient spin-transfer torque magnetic random access memories (STT-MRAMs) operating at the liquid nitrogen boiling point (77K). Our study is carried out through a macrospin-based Verilog-A compact model of DMTJ, along with a 65nm commercial process design kit (PDK) cal… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Journal ref: Solid-State Electronics, 2022

  8. arXiv:2204.00652  [pdf, other

    cs.SD cs.CL eess.AS

    End-to-end multi-talker audio-visual ASR using an active speaker attention module

    Authors: Richard Rose, Olivier Siohan

    Abstract: This paper presents a new approach for end-to-end audio-visual multi-talker speech recognition. The approach, referred to here as the visual context attention model (VCAM), is important because it uses the available video information to assign decoded text to one of multiple visible faces. This essentially resolves the label ambiguity issue associated with most multi-talker modeling approaches whi… ▽ More

    Submitted 1 April, 2022; originally announced April 2022.

    Comments: 5 pages, 3 figures, 3 tables, 28 citations

  9. arXiv:2001.05863  [pdf, other

    cs.HC cs.RO

    Establishing Human-Robot Trust through Music-Driven Robotic Emotion Prosody and Gesture

    Authors: Richard Savery, Ryan Rose, Gil Weinberg

    Abstract: As human-robot collaboration opportunities continue to expand, trust becomes ever more important for full engagement and utilization of robots. Affective trust, built on emotional relationship and interpersonal bonds is particularly critical as it is more resilient to mistakes and increases the willingness to collaborate. In this paper we present a novel model built on music-driven emotional proso… ▽ More

    Submitted 11 January, 2020; originally announced January 2020.

    Journal ref: The 28th IEEE International Conference on Robot & Human Interactive Communication 2019

  10. arXiv:1808.09016  [pdf, other

    cs.CV cs.AI

    Review Helpfulness Assessment based on Convolutional Neural Network

    Authors: Xianshan Qu, Xiaopeng Li, John R. Rose

    Abstract: In this paper we describe the implementation of a convolutional neural network (CNN) used to assess online review helpfulness. To our knowledge, this is the first use of this architecture to address this problem. We explore the impact of two related factors impacting CNN performance: different word embedding initializations and different input review lengths. We also propose an approach to combini… ▽ More

    Submitted 27 August, 2018; originally announced August 2018.

  11. Multi-level computational methods for interdisciplinary research in the HathiTrust Digital Library

    Authors: Jaimie Murdock, Colin Allen, Katy Börner, Robert Light, Simon McAlister, Andrew Ravenscroft, Robert Rose, Doori Rose, Jun Otsuka, David Bourget, John Lawrence, Chris Reed

    Abstract: We show how faceted search using a combination of traditional classification systems and mixed-membership topic models can go beyond keyword search to inform resource discovery, hypothesis formulation, and argument extraction for interdisciplinary research. Our test domain is the history and philosophy of scientific work on animal mind and cognition. The methods can be generalized to other researc… ▽ More

    Submitted 7 June, 2017; v1 submitted 3 February, 2017; originally announced February 2017.

    Comments: revised, 29 pages, 3 figures

  12. arXiv:1608.00027  [pdf, ps, other

    stat.ML cs.LG

    gLOP: the global and Local Penalty for Capturing Predictive Heterogeneity

    Authors: Rhiannon V. Rose, Daniel J. Lizotte

    Abstract: When faced with a supervised learning problem, we hope to have rich enough data to build a model that predicts future instances well. However, in practice, problems can exhibit predictive heterogeneity: most instances might be relatively easy to predict, while others might be predictive outliers for which a model trained on the entire dataset does not perform well. Identifying these can help focus… ▽ More

    Submitted 29 July, 2016; originally announced August 2016.

    Comments: Presented at 2016 Machine Learning and Healthcare Conference (MLHC 2016), Los Angeles, CA

  13. arXiv:1606.05925  [pdf, other

    stat.ML cs.CL cs.LG

    Graph based manifold regularized deep neural networks for automatic speech recognition

    Authors: Vikrant Singh Tomar, Richard C. Rose

    Abstract: Deep neural networks (DNNs) have been successfully applied to a wide variety of acoustic modeling tasks in recent years. These include the applications of DNNs either in a discriminative feature extraction or in a hybrid acoustic modeling scenario. Despite the rapid progress in this area, a number of challenges remain in training DNNs. This paper presents an effective way of training DNNs using a… ▽ More

    Submitted 19 June, 2016; originally announced June 2016.

    Comments: 12 pages including citations, 2 figures