
Showing 1–16 of 16 results for author: Ye, E

Searching in archive cs.
  1. arXiv:2404.03427

    cs.RO

    GMMCalib: Extrinsic Calibration of LiDAR Sensors using GMM-based Joint Registration

    Authors: Ilir Tahiraj, Felix Fent, Philipp Hafemann, Egon Ye, Markus Lienkamp

    Abstract: State-of-the-art LiDAR calibration frameworks mainly use non-probabilistic registration methods such as Iterative Closest Point (ICP) and its variants. These methods suffer from biased results due to their pair-wise registration procedure as well as their sensitivity to initialization and parameterization. This often leads to misalignments in the calibration process. Probabilistic registration met…

    Submitted 8 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

  2. arXiv:2401.00406

    cs.CV

    Low-cost Geometry-based Eye Gaze Detection using Facial Landmarks Generated through Deep Learning

    Authors: Esther Enhui Ye, John Enzhou Ye, Joseph Ye, Jacob Ye, Runzhou Ye

    Abstract: Introduction: In the realm of human-computer interaction and behavioral research, accurate real-time gaze estimation is critical. Traditional methods often rely on expensive equipment or large datasets, which are impractical in many scenarios. This paper introduces a novel, geometry-based approach to address these challenges, utilizing consumer-grade hardware for broader applicability. Methods: We…

    Submitted 31 December, 2023; originally announced January 2024.

  3. arXiv:2312.05187

    cs.CL cs.SD eess.AS

    Seamless: Multilingual Expressive and Streaming Speech Translation

    Authors: Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek, et al. (40 additional authors not shown)

    Abstract: Large-scale automatic speech translation systems today lack key features that help machine-mediated communication feel seamless when compared to human-to-human dialogue. In this work, we introduce a family of models that enable end-to-end expressive and multilingual translations in a streaming fashion. First, we contribute an improved version of the massively multilingual and multimodal SeamlessM4…

    Submitted 8 December, 2023; originally announced December 2023.

  4. arXiv:2308.11927

    q-bio.QM cs.CV eess.IV

    Recovering a Molecule's 3D Dynamics from Liquid-phase Electron Microscopy Movies

    Authors: Enze Ye, Yuhang Wang, Hong Zhang, Yiqin Gao, Huan Wang, He Sun

    Abstract: The dynamics of biomolecules are crucial for our understanding of their functioning in living systems. However, current 3D imaging techniques, such as cryogenic electron microscopy (cryo-EM), require freezing the sample, which limits the observation of their conformational changes in real time. The innovative liquid-phase electron microscopy (liquid-phase EM) technique allows molecules to be place…

    Submitted 23 August, 2023; originally announced August 2023.

  5. arXiv:2308.11596

    cs.CL

    SeamlessM4T: Massively Multilingual & Multimodal Machine Translation

    Authors: Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Cora Meglioli, David Dale, Ning Dong, Paul-Ambroise Duquenne, Hady Elsahar, Hongyu Gong, Kevin Heffernan, John Hoffman, Christopher Klaiber, Pengwei Li, Daniel Licht, Jean Maillard, Alice Rakotoarison, Kaushik Ram Sadagopan, Guillaume Wenzek, Ethan Ye, Bapi Akula, Peng-Jen Chen, Naji El Hachem, Brian Ellis, Gabriel Mejia Gonzalez, Justin Haaheim, et al. (43 additional authors not shown)

    Abstract: What does it take to create the Babel Fish, a tool that can help individuals translate speech between any two languages? While recent breakthroughs in text-based models have pushed machine translation coverage beyond 200 languages, unified speech-to-speech translation models have yet to achieve similar strides. More specifically, conventional speech-to-speech translation systems rely on cascaded s…

    Submitted 24 October, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    ACM Class: I.2.7

  6. arXiv:2302.02123   

    cs.CL cs.AI cs.NE

    Greedy Ordering of Layer Weight Matrices in Transformers Improves Translation

    Authors: Elicia Ye

    Abstract: Prior work has attempted to understand the internal structures and functionalities of Transformer-based encoder-decoder architectures on the level of multi-head attention and feed-forward sublayers. Interpretations have focused on the encoder and decoder, along with the combinatorial possibilities of the self-attention, cross-attention, and feed-forward sublayers. However, without examining the lo…

    Submitted 16 March, 2023; v1 submitted 4 February, 2023; originally announced February 2023.

    Comments: The paper contains an error in the implementation of the algorithm

  7. arXiv:2212.11890

    quant-ph cs.AI cs.ET cs.LG cs.NE

    Decoding surface codes with deep reinforcement learning and probabilistic policy reuse

    Authors: Elisha Siddiqui Matekole, Esther Ye, Ramya Iyer, Samuel Yen-Chi Chen

    Abstract: Quantum computing (QC) promises significant advantages on certain hard computational tasks over classical computers. However, current quantum hardware, also known as noisy intermediate-scale quantum computers (NISQ), are still unable to carry out computations faithfully mainly because of the lack of quantum error correction (QEC) capability. A significant amount of theoretical studies have provide…

    Submitted 22 December, 2022; originally announced December 2022.

  8. ALL-MASK: A Reconfigurable Logic Locking Method for Multicore Architecture with Sequential-Instruction-Oriented Key

    Authors: Jianfeng Wang, Zhonghao Chen, Jiahao Zhang, Yixin Xu, Tongguang Yu, Enze Ye, Ziheng Zheng, Huazhong Yang, Sumitha George, Yongpan Liu, Vijaykrishnan Narayanan, Xueqing Li

    Abstract: Intellectual property (IP) piracy has become a non-negligible problem as the integrated circuit (IC) production supply chain is becoming increasingly globalized and separated that enables attacks by potentially untrusted attackers. Logic locking is a widely adopted method to lock the circuit module with a key and prevent hackers from cracking it. The key is the critical aspect of logic locking, bu…

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: 15 pages, 17 figures

    ACM Class: B.2.3; B.6.2; B.7.3

    Journal ref: ACM Transactions on Design Automation of Electronic Systems 2024

  9. arXiv:2202.11066

    cs.CY

    Outing Power Outages: Real-time and Predictive Socio-demographic Analytics for New York City

    Authors: Samuel Eckstrom, Graham Murphy, Eileen Ye, Samrat Acharya, Robert Mieth, Yury Dvorkin

    Abstract: Electrical outages continue to occur despite technological innovations and improvements to electric power distribution infrastructure. In this paper, we describe a tool that was designed to acquire and collect data on electric power outages in New York City since July 2020. The electrical outages are then displayed on a front-end application, which is publicly available. We use the collected outag…

    Submitted 22 February, 2022; originally announced February 2022.

    Comments: Accepted for the 2022 IEEE PES General Meeting

  10. NewsPod: Automatic and Interactive News Podcasts

    Authors: Philippe Laban, Elicia Ye, Srujay Korlakunta, John Canny, Marti A. Hearst

    Abstract: News podcasts are a popular medium to stay informed and dive deep into news topics. Today, most podcasts are handcrafted by professionals. In this work, we advance the state-of-the-art in automatically generated podcasts, making use of recent advances in natural language processing and text-to-speech technology. We present NewsPod, an automatically generated, interactive news podcast. The podcast…

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: Accepted at IUI 2022, 16 pages, 10 figures

  11. arXiv:2202.00478

    cs.CL

    NeuraHealth: An Automated Screening Pipeline to Detect Undiagnosed Cognitive Impairment in Electronic Health Records with Deep Learning and Natural Language Processing

    Authors: Tanish Tyagi, Colin G. Magdamo, Ayush Noori, Zhaozhi Li, Xiao Liu, Mayuresh Deodhar, Zhuoqiao Hong, Wendong Ge, Elissa M. Ye, Yi-han Sheu, Haitham Alabsi, Laura Brenner, Gregory K. Robbins, Sahar Zafar, Nicole Benson, Lidia Moura, John Hsu, Alberto Serrano-Pozo, Dimitry Prokopenko, Rudolph E. Tanzi, Bradley T. Hyman, Deborah Blacker, Shibani S. Mukerji, M. Brandon Westover, Sudeshna Das

    Abstract: Dementia related cognitive impairment (CI) is a neurodegenerative disorder, affecting over 55 million people worldwide and growing rapidly at the rate of one new case every 3 seconds. 75% cases go undiagnosed globally with up to 90% in low-and-middle-income countries, leading to an estimated annual worldwide cost of USD 1.3 trillion, forecasted to reach 2.8 trillion by 2030. With no cure, a recurr…

    Submitted 20 June, 2022; v1 submitted 12 January, 2022; originally announced February 2022.

  12. arXiv:2112.05779

    quant-ph cs.AI cs.ET cs.LG cs.NE

    Quantum Architecture Search via Continual Reinforcement Learning

    Authors: Esther Ye, Samuel Yen-Chi Chen

    Abstract: Quantum computing has promised significant improvement in solving difficult computational tasks over classical computers. Designing quantum circuits for practical use, however, is not a trivial objective and requires expert-level knowledge. To aid this endeavor, this paper proposes a machine learning-based method to construct quantum circuit architectures. Previous works have demonstrated that cla…

    Submitted 10 December, 2021; originally announced December 2021.

  13. arXiv:2111.09115

    cs.CL cs.LG

    Using Deep Learning to Identify Patients with Cognitive Impairment in Electronic Health Records

    Authors: Tanish Tyagi, Colin G. Magdamo, Ayush Noori, Zhaozhi Li, Xiao Liu, Mayuresh Deodhar, Zhuoqiao Hong, Wendong Ge, Elissa M. Ye, Yi-han Sheu, Haitham Alabsi, Laura Brenner, Gregory K. Robbins, Sahar Zafar, Nicole Benson, Lidia Moura, John Hsu, Alberto Serrano-Pozo, Dimitry Prokopenko, Rudolph E. Tanzi, Bradley T. Hyman, Deborah Blacker, Shibani S. Mukerji, M. Brandon Westover, Sudeshna Das

    Abstract: Dementia is a neurodegenerative disorder that causes cognitive decline and affects more than 50 million people worldwide. Dementia is under-diagnosed by healthcare professionals - only one in four people who suffer from dementia are diagnosed. Even when a diagnosis is made, it may not be entered as a structured International Classification of Diseases (ICD) diagnosis code in a patient's charts. In…

    Submitted 12 November, 2021; originally announced November 2021.

    Comments: Machine Learning for Health (ML4H) - Extended Abstract

  14. arXiv:2108.12531

    eess.AS cs.CL cs.LG

    Speech Representations and Phoneme Classification for Preserving the Endangered Language of Ladin

    Authors: Zane Durante, Leena Mathur, Eric Ye, Sichong Zhao, Tejas Ramdas, Khalil Iskarous

    Abstract: A vast majority of the world's 7,000 spoken languages are predicted to become extinct within this century, including the endangered language of Ladin from the Italian Alps. Linguists who work to preserve a language's phonetic and phonological structure can spend hours transcribing each minute of speech from native speakers. To address this problem in the context of Ladin, our paper presents the fi…

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: Accepted to ICSA MLSLP 2021 (held with Interspeech 2021)

  15. arXiv:2102.13473

    eess.SP cs.LG

    Sleep Apnea and Respiratory Anomaly Detection from a Wearable Band and Oxygen Saturation

    Authors: Wolfgang Ganglberger, Abigail A. Bucklin, Ryan A. Tesh, Madalena Da Silva Cardoso, Haoqi Sun, Michael J. Leone, Luis Paixao, Ezhil Panneerselvam, Elissa M. Ye, B. Taylor Thompson, Oluwaseun Akeju, David Kuller, Robert J. Thomas, M. Brandon Westover

    Abstract: Objective: Sleep related respiratory abnormalities are typically detected using polysomnography. There is a need in general medicine and critical care for a more convenient method to automatically detect sleep apnea from a simple, easy-to-wear device. The objective is to automatically detect abnormal respiration and estimate the Apnea-Hypopnea-Index (AHI) with a wearable respiratory device, compar…

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: Co-First Authors: Wolfgang Ganglberger, Abigail A. Bucklin; Co-Senior Authors: Robert J. Thomas, M. Brandon Westover

  16. arXiv:2011.06489

    cs.CL

    Natural Language Processing to Detect Cognitive Concerns in Electronic Health Records Using Deep Learning

    Authors: Zhuoqiao Hong, Colin G. Magdamo, Yi-han Sheu, Prathamesh Mohite, Ayush Noori, Elissa M. Ye, Wendong Ge, Haoqi Sun, Laura Brenner, Gregory Robbins, Shibani Mukerji, Sahar Zafar, Nicole Benson, Lidia Moura, John Hsu, Bradley T. Hyman, Michael B. Westover, Deborah Blacker, Sudeshna Das

    Abstract: Dementia is under-recognized in the community, under-diagnosed by healthcare professionals, and under-coded in claims data. Information on cognitive dysfunction, however, is often found in unstructured clinician notes within medical records but manual review by experts is time consuming and often prone to errors. Automated mining of these notes presents a potential opportunity to label patients wi…

    Submitted 12 November, 2020; originally announced November 2020.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2020 - Extended Abstract

    MSC Class: I.2.7