Skip to main content

Showing 1–15 of 15 results for author: Jang, K

Searching in archive eess. Search in all archives.
.
  1. arXiv:2407.03563  [pdf, other

    eess.AS cs.CL cs.LG eess.IV

    Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition

    Authors: Sungnyun Kim, Kangwook Jang, Sangmin Bae, Hoirin Kim, Se-Young Yun

    Abstract: Audio-visual speech recognition (AVSR) aims to transcribe human speech using both audio and video modalities. In practical environments with noise-corrupted audio, the role of video information becomes crucial. However, prior works have primarily focused on enhancing audio features in AVSR, overlooking the importance of video features. In this study, we strengthen the video features by learning th… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2406.16716  [pdf, other

    eess.AS cs.CR cs.SD

    One-Class Learning with Adaptive Centroid Shift for Audio Deepfake Detection

    Authors: Hyun Myung Kim, Kangwook Jang, Hoirin Kim

    Abstract: As speech synthesis systems continue to make remarkable advances in recent years, the importance of robust deepfake detection systems that perform well in unseen systems has grown. In this paper, we propose a novel adaptive centroid shift (ACS) method that updates the centroid representation by continually shifting as the weighted average of bonafide representations. Our approach uses only bonafid… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  3. arXiv:2405.03684  [pdf, other

    eess.IV

    All-in-One Deep Learning Framework for MR Image Reconstruction

    Authors: Geunu Jeong, Hyeonsoo Kim, Joonyoung Yang, Kyungeun Jang, Jeewook Kim

    Abstract: We introduce a novel, all-in-one deep learning framework for MR image reconstruction, enabling a single model to enhance image quality across multiple aspects of k-space sampling and to be effective across a wide range of clinical and technical scenarios. This DICOM-based algorithm serves as the core of SwiftMR (AIRS Medical, Seoul, Korea), which is FDA-cleared, CE-certified, and commercially avai… ▽ More

    Submitted 26 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: 22 pages, 9 figures; number of collected MR raw data corrected

  4. arXiv:2402.17050  [pdf, other

    eess.SY cs.RO

    Reinforcement Learning Based Oscillation Dampening: Scaling up Single-Agent RL algorithms to a 100 AV highway field operational test

    Authors: Kathy Jang, Nathan Lichtlé, Eugene Vinitsky, Adit Shah, Matthew Bunting, Matthew Nice, Benedetto Piccoli, Benjamin Seibold, Daniel B. Work, Maria Laura Delle Monache, Jonathan Sprinkle, Jonathan W. Lee, Alexandre M. Bayen

    Abstract: In this article, we explore the technical details of the reinforcement learning (RL) algorithms that were deployed in the largest field test of automated vehicles designed to smooth traffic flow in history as of 2023, uncovering the challenges and breakthroughs that come with developing RL controllers for automated vehicles. We delve into the fundamental concepts behind RL algorithms and their app… ▽ More

    Submitted 14 May, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  5. arXiv:2402.17043  [pdf, other

    eess.SY

    Traffic Control via Connected and Automated Vehicles: An Open-Road Field Experiment with 100 CAVs

    Authors: Jonathan W. Lee, Han Wang, Kathy Jang, Amaury Hayat, Matthew Bunting, Arwa Alanqary, William Barbour, Zhe Fu, Xiaoqian Gong, George Gunter, Sharon Hornstein, Abdul Rahman Kreidieh, Nathan Lichtlé, Matthew W. Nice, William A. Richardson, Adit Shah, Eugene Vinitsky, Fangyu Wu, Shengquan Xiang, Sulaiman Almatrudi, Fahd Althukair, Rahul Bhadani, Joy Carpio, Raphael Chekroun, Eric Cheng , et al. (39 additional authors not shown)

    Abstract: The CIRCLES project aims to reduce instabilities in traffic flow, which are naturally occurring phenomena due to human driving behavior. These "phantom jams" or "stop-and-go waves,"are a significant source of wasted energy. Toward this goal, the CIRCLES project designed a control system referred to as the MegaController by the CIRCLES team, that could be deployed in real traffic. Our field experim… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  6. arXiv:2401.09666  [pdf, other

    eess.SY cs.AI cs.MA

    Traffic Smoothing Controllers for Autonomous Vehicles Using Deep Reinforcement Learning and Real-World Trajectory Data

    Authors: Nathan Lichtlé, Kathy Jang, Adit Shah, Eugene Vinitsky, Jonathan W. Lee, Alexandre M. Bayen

    Abstract: Designing traffic-smoothing cruise controllers that can be deployed onto autonomous vehicles is a key step towards improving traffic flow, reducing congestion, and enhancing fuel efficiency in mixed autonomy traffic. We bypass the common issue of having to carefully fine-tune a large traffic microsimulator by leveraging real-world trajectory data from the I-24 highway in Tennessee, replayed in a o… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted to be published as part of the 26th IEEE International Conference on Intelligent Transportation Systems (ITSC) 2023, Bilbao, Spain, September 24-28, 2023

  7. arXiv:2312.09040  [pdf, other

    cs.SD cs.CL eess.AS

    STaR: Distilling Speech Temporal Relation for Lightweight Speech Self-Supervised Learning Models

    Authors: Kangwook Jang, Sungnyun Kim, Hoirin Kim

    Abstract: Albeit great performance of Transformer-based speech selfsupervised learning (SSL) models, their large parameter size and computational cost make them unfavorable to utilize. In this study, we propose to compress the speech SSL models by distilling speech temporal relation (STaR). Unlike previous works that directly match the representation for each speech frame, STaR distillation transfers tempor… ▽ More

    Submitted 25 April, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: ICASSP 2024 Best Student Paper Awarded. Code URL: https://github.com/sungnyun/ARMHuBERT

  8. Recycle-and-Distill: Universal Compression Strategy for Transformer-based Speech SSL Models with Attention Map Reusing and Masking Distillation

    Authors: Kangwook Jang, Sungnyun Kim, Se-Young Yun, Hoirin Kim

    Abstract: Transformer-based speech self-supervised learning (SSL) models, such as HuBERT, show surprising performance in various speech processing tasks. However, huge number of parameters in speech SSL models necessitate the compression to a more compact model for wider usage in academia or small companies. In this study, we suggest to reuse attention maps across the Transformer layers, so as to remove key… ▽ More

    Submitted 26 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Proceedings of Interspeech 2023. Code URL: https://github.com/sungnyun/ARMHuBERT

  9. arXiv:2210.12772  [pdf, other

    physics.med-ph eess.IV eess.SP eess.SY

    Electroanatomic Mapping to determine Scar Regions in patients with Atrial Fibrillation

    Authors: Jiyue He, Kuk Jin Jang, Katie Walsh, Jackson Liang, Sanjay Dixit, Rahul Mangharam

    Abstract: Left atrial voltage maps are routinely acquired during electroanatomic mapping in patients undergoing catheter ablation for atrial fibrillation. For patients, who have prior catheter ablation when they are in sinus rhythm, the voltage map can be used to identify low voltage areas using a threshold of 0.2 - 0.45 mV. However, such a voltage threshold for maps acquired during atrial fibrillation has… ▽ More

    Submitted 8 November, 2022; v1 submitted 23 October, 2022; originally announced October 2022.

    Journal ref: 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

  10. arXiv:2207.00555  [pdf, other

    eess.AS cs.CL cs.LG

    FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning

    Authors: Yeonghyeon Lee, Kangwook Jang, Jahyun Goo, Youngmoon Jung, Hoirin Kim

    Abstract: Large-scale speech self-supervised learning (SSL) has emerged to the main field of speech processing, however, the problem of computational cost arising from its vast size makes a high entry barrier to academia. In addition, existing distillation techniques of speech SSL models compress the model by reducing layers, which induces performance degradation in linguistic pattern recognition tasks such… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted to Interspeech 2022

  11. arXiv:2101.10404  [pdf, other

    eess.SY cs.LG cs.RO

    Learning-'N-Flying: A Learning-based, Decentralized Mission Aware UAS Collision Avoidance Scheme

    Authors: Alëna Rodionova, Yash Vardhan Pant, Connor Kurtz, Kuk Jang, Houssam Abbas, Rahul Mangharam

    Abstract: Urban Air Mobility, the scenario where hundreds of manned and Unmanned Aircraft System (UAS) carry out a wide variety of missions (e.g. moving humans and goods within the city), is gaining acceptance as a transportation solution of the future. One of the key requirements for this to happen is safely managing the air traffic in these urban airspaces. Due to the expected density of the airspace, thi… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

    Comments: to be published in ACM Transactions on Cyber-Physical Systems. arXiv admin note: text overlap with arXiv:2006.13267

  12. arXiv:2006.13267  [pdf, other

    eess.SY cs.LG cs.RO

    Learning-to-Fly: Learning-based Collision Avoidance for Scalable Urban Air Mobility

    Authors: Alëna Rodionova, Yash Vardhan Pant, Kuk Jang, Houssam Abbas, Rahul Mangharam

    Abstract: With increasing urban population, there is global interest in Urban Air Mobility (UAM), where hundreds of autonomous Unmanned Aircraft Systems (UAS) execute missions in the airspace above cities. Unlike traditional human-in-the-loop air traffic management, UAM requires decentralized autonomous approaches that scale for an order of magnitude higher aircraft densities and are applicable to urban set… ▽ More

    Submitted 23 June, 2020; originally announced June 2020.

    Comments: To be published in IEEE International Conference on Intelligent Transportation Systems (ITSC), 2020

  13. Zero-Shot Autonomous Vehicle Policy Transfer: From Simulation to Real-World via Adversarial Learning

    Authors: Behdad Chalaki, Logan E. Beaver, Ben Remer, Kathy Jang, Eugene Vinitsky, Alexandre M. Bayen, Andreas A. Malikopoulos

    Abstract: In this article, we demonstrate a zero-shot transfer of an autonomous driving policy from simulation to University of Delaware's scaled smart city with adversarial multi-agent reinforcement learning, in which an adversary attempts to decrease the net reward by perturbing both the inputs and outputs of the autonomous vehicles during training. We train the autonomous vehicles to coordinate with each… ▽ More

    Submitted 22 June, 2020; v1 submitted 12 March, 2019; originally announced March 2019.

    Comments: 6 pages, 4 figures

    Journal ref: IEEE International Conference on Control & Automation, (2020), 35-40

  14. arXiv:1812.06120  [pdf, other

    eess.SY cs.AI cs.RO

    Simulation to Scaled City: Zero-Shot Policy Transfer for Traffic Control via Autonomous Vehicles

    Authors: Kathy Jang, Eugene Vinitsky, Behdad Chalaki, Ben Remer, Logan Beaver, Andreas Malikopoulos, Alexandre Bayen

    Abstract: Using deep reinforcement learning, we train control policies for autonomous vehicles leading a platoon of vehicles onto a roundabout. Using Flow, a library for deep reinforcement learning in micro-simulators, we train two policies, one policy with noise injected into the state and action space and one without any injected noise. In simulation, the autonomous vehicle learns an emergent metering beh… ▽ More

    Submitted 22 February, 2019; v1 submitted 14 December, 2018; originally announced December 2018.

    Comments: To be published at the International Conference on Cyber Physical Systems (ICCPS) 2019. 10 pages, 9 figures

    ACM Class: I.2.1; I.2.4; I.2.6; I.2.10; I.6.5

  15. arXiv:1512.08083  [pdf, other

    eess.SY

    Model Checking Implantable Cardioverter Defibrillators

    Authors: Houssam Abbas, Kuk Jin Jang, Zhihao Jiang, Rahul Mangharam

    Abstract: Ventricular Fibrillation is a disorganized electrical excitation of the heart that results in inadequate blood flow to the body. It usually ends in death within seconds. The most common way to treat the symptoms of fibrillation is to implant a medical device, known as an Implantable Cardioverter Defibrillator (ICD), in the patient's body. Model-based verification can supply rigorous proofs of safe… ▽ More

    Submitted 26 December, 2015; originally announced December 2015.

    Comments: Hybrid Systems: Computation and Control 2016