Zum Hauptinhalt springen

Showing 1–30 of 30 results for author: Jang, Y

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.14372  [pdf, ps, other

    eess.SY

    Ring-LWE based encrypted controller with unlimited number of recursive multiplications and effect of error growth

    Authors: Yeongjun Jang, Joowon Lee, Seonhong Min, Hyesun Kwak, Junsoo Kim, Yongsoo Song

    Abstract: In this paper, we propose a method to encrypt linear dynamic controllers that enables an unlimited number of recursive homomorphic multiplications on a Ring Learning With Errors (Ring-LWE) based cryptosystem without bootstrapping. Unlike LWE based schemes, where a scalar error is injected during encryption for security, Ring-LWE based schemes are based on polynomial rings and inject error as a pol… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 12 pages, 3 figures

  2. arXiv:2405.11614  [pdf, other

    cs.CV eess.IV

    Nickel and Diming Your GAN: A Dual-Method Approach to Enhancing GAN Efficiency via Knowledge Distillation

    Authors: Sangyeop Yeo, Yoojin Jang, Jaejun Yoo

    Abstract: In this paper, we address the challenge of compressing generative adversarial networks (GANs) for deployment in resource-constrained environments by proposing two novel methodologies: Distribution Matching for Efficient compression (DiME) and Network Interactive Compression via Knowledge Exchange and Learning (NICKEL). DiME employs foundation models as embedding kernels for efficient distribution… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  3. arXiv:2405.10272  [pdf, other

    cs.CV cs.AI cs.SD eess.AS eess.IV

    Faces that Speak: Jointly Synthesising Talking Face and Speech from Text

    Authors: Youngjoon Jang, Ji-Hoon Kim, Junseok Ahn, Doyeop Kwak, Hong-Sun Yang, Yoon-Cheol Ju, Il-Hwan Kim, Byeong-Yeol Kim, Joon Son Chung

    Abstract: The goal of this work is to simultaneously generate natural talking faces and speech outputs from text. We achieve this by integrating Talking Face Generation (TFG) and Text-to-Speech (TTS) systems into a unified framework. We address the main challenges of each task: (1) generating a range of head poses representative of real-world scenarios, and (2) ensuring voice consistency despite variations… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: CVPR 2024

  4. arXiv:2405.02066  [pdf, other

    cs.CV eess.IV

    WateRF: Robust Watermarks in Radiance Fields for Protection of Copyrights

    Authors: Youngdong Jang, Dong In Lee, MinHyuk Jang, Jong Wook Kim, Feng Yang, Sangpil Kim

    Abstract: The advances in the Neural Radiance Fields (NeRF) research offer extensive applications in diverse domains, but protecting their copyrights has not yet been researched in depth. Recently, NeRF watermarking has been considered one of the pivotal solutions for safely deploying NeRF-based 3D representations. However, existing methods are designed to apply only to implicit or explicit NeRF representat… ▽ More

    Submitted 11 July, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  5. arXiv:2404.02574  [pdf, ps, other

    eess.SY

    Learning with errors based dynamic encryption that discloses residue signal for anomaly detection

    Authors: Yeongjun Jang, Joowon Lee, Junsoo Kim, Hyungbo Shim

    Abstract: Anomaly detection is a protocol that detects integrity attacks on control systems by comparing the residue signal with a threshold. Implementing anomaly detection on encrypted control systems has been a challenge because it is hard to detect an anomaly from the encrypted residue signal without the secret key. In this paper, we propose a dynamic encryption scheme for a linear system that automatica… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 7 pages, 1 figure

  6. arXiv:2402.05965  [pdf, other

    cs.LG eess.SP

    Hybrid Neural Representations for Spherical Data

    Authors: Hyomin Kim, Yunhui Jang, Jaeho Lee, Sungsoo Ahn

    Abstract: In this paper, we study hybrid neural representations for spherical data, a domain of increasing relevance in scientific research. In particular, our work focuses on weather and climate data as well as comic microwave background (CMB) data. Although previous studies have delved into coordinate-based neural representations for spherical signals, they often fail to capture the intricate details of h… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 13 pages, 8 figures

  7. arXiv:2401.10032  [pdf, other

    eess.AS cs.AI eess.SP

    FreGrad: Lightweight and Fast Frequency-aware Diffusion Vocoder

    Authors: Tan Dat Nguyen, Ji-Hoon Kim, Youngjoon Jang, Jaehun Kim, Joon Son Chung

    Abstract: The goal of this paper is to generate realistic audio with a lightweight and fast diffusion-based vocoder, named FreGrad. Our framework consists of the following three key components: (1) We employ discrete wavelet transform that decomposes a complicated waveform into sub-band wavelets, which helps FreGrad to operate on a simple and concise feature space, (2) We design a frequency-aware dilated co… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted to ICASSP 2024

  8. arXiv:2311.08439  [pdf, other

    eess.IV cs.CV cs.LG

    A Unified Approach for Comprehensive Analysis of Various Spectral and Tissue Doppler Echocardiography

    Authors: Jaeik Jeon, Jiyeon Kim, Yeonggul Jang, Yeonyee E. Yoon, Dawun Jeong, Youngtaek Hong, Seung-Ah Lee, Hyuk-Jae Chang

    Abstract: Doppler echocardiography offers critical insights into cardiac function and phases by quantifying blood flow velocities and evaluating myocardial motion. However, previous methods for automating Doppler analysis, ranging from initial signal processing techniques to advanced deep learning approaches, have been constrained by their reliance on electrocardiogram (ECG) data and their inability to proc… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  9. arXiv:2310.19581  [pdf, other

    eess.AS cs.CV cs.SD

    Seeing Through the Conversation: Audio-Visual Speech Separation based on Diffusion Model

    Authors: Suyeon Lee, Chaeyoung Jung, Youngjoon Jang, Jaehun Kim, Joon Son Chung

    Abstract: The objective of this work is to extract target speaker's voice from a mixture of voices using visual cues. Existing works on audio-visual speech separation have demonstrated their performance with promising intelligibility, but maintaining naturalness remains a challenge. To address this issue, we propose AVDiffuSS, an audio-visual speech separation model based on a diffusion mechanism known for… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Project page with demo: https://mm.kaist.ac.kr/projects/avdiffuss/

  10. arXiv:2310.08897  [pdf, other

    eess.IV cs.CV cs.LG

    Self supervised convolutional kernel based handcrafted feature harmonization: Enhanced left ventricle hypertension disease phenotyping on echocardiography

    Authors: Jina Lee, Youngtaek Hong, Dawun Jeong, Yeonggul Jang, Jaeik Jeon, Sihyeon Jeong, Taekgeun Jung, Yeonyee E. Yoon, Inki Moon, Seung-Ah Lee, Hyuk-Jae Chang

    Abstract: Radiomics, a medical imaging technique, extracts quantitative handcrafted features from images to predict diseases. Harmonization in those features ensures consistent feature extraction across various imaging devices and protocols. Methods for harmonization include standardized imaging protocols, statistical adjustments, and evaluating feature robustness. Myocardial diseases such as Left Ventricul… ▽ More

    Submitted 22 November, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: 11 pages, 7 figures

  11. arXiv:2309.12306  [pdf, other

    cs.CV cs.SD eess.AS

    TalkNCE: Improving Active Speaker Detection with Talk-Aware Contrastive Learning

    Authors: Chaeyoung Jung, Suyeon Lee, Kihyun Nam, Kyeongha Rho, You Jin Kim, Youngjoon Jang, Joon Son Chung

    Abstract: The goal of this work is Active Speaker Detection (ASD), a task to determine whether a person is speaking or not in a series of video frames. Previous works have dealt with the task by exploring network architectures while learning effective representations has been less explored. In this work, we propose TalkNCE, a novel talk-aware contrastive loss. The loss is only applied to part of the full se… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  12. arXiv:2308.16483  [pdf, other

    eess.SP cs.HC cs.LG

    Improving Out-of-Distribution Detection in Echocardiographic View Classication through Enhancing Semantic Features

    Authors: Jaeik Jeon, Seongmin Ha, Yeonggul Jang, Yeonyee E. Yoon, Jiyeon Kim, Hyunseok Jeong, Dawun Jeong, Youngtaek Hong, Seung-Ah Lee Hyuk-Jae Chang

    Abstract: In echocardiographic view classification, accurately detecting out-of-distribution (OOD) data is essential but challenging, especially given the subtle differences between in-distribution and OOD data. While conventional OOD detection methods, such as Mahalanobis distance (MD) are effective in far-OOD scenarios with clear distinctions between distributions, they struggle to discern the less obviou… ▽ More

    Submitted 23 November, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

  13. arXiv:2305.10975  [pdf, other

    eess.IV cs.AI cs.CV

    Benchmarking Deep Learning Frameworks for Automated Diagnosis of Ocular Toxoplasmosis: A Comprehensive Approach to Classification and Segmentation

    Authors: Syed Samiul Alam, Samiul Based Shuvo, Shams Nafisa Ali, Fardeen Ahmed, Arbil Chakma, Yeong Min Jang

    Abstract: Ocular Toxoplasmosis (OT), is a common eye infection caused by T. gondii that can cause vision problems. Diagnosis is typically done through a clinical examination and imaging, but these methods can be complicated and costly, requiring trained personnel. To address this issue, we have created a benchmark study that evaluates the effectiveness of existing pre-trained networks using transfer learnin… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

  14. arXiv:2304.09507  [pdf, other

    eess.IV cs.CV

    Self-supervised Image Denoising with Downsampled Invariance Loss and Conditional Blind-Spot Network

    Authors: Yeong Il Jang, Keuntek Lee, Gu Yong Park, Seyun Kim, Nam Ik Cho

    Abstract: There have been many image denoisers using deep neural networks, which outperform conventional model-based methods by large margins. Recently, self-supervised methods have attracted attention because constructing a large real noise dataset for supervised training is an enormous burden. The most representative self-supervised denoisers are based on blind-spot networks, which exclude the receptive f… ▽ More

    Submitted 28 July, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: Accepted to ICCV 2023

  15. arXiv:2304.04027  [pdf, other

    eess.IV cs.CV cs.LG

    NeBLa: Neural Beer-Lambert for 3D Reconstruction of Oral Structures from Panoramic Radiographs

    Authors: Sihwa Park, Seongjun Kim, Doeyoung Kwon, Yohan Jang, In-Seok Song, Seung Jun Baek

    Abstract: Panoramic radiography (Panoramic X-ray, PX) is a widely used imaging modality for dental examination. However, PX only provides a flattened 2D image, lacking in a 3D view of the oral structure. In this paper, we propose NeBLa (Neural Beer-Lambert) to estimate 3D oral structures from real-world PX. NeBLa tackles full 3D reconstruction for varying subjects (patients) where each reconstruction is bas… ▽ More

    Submitted 6 February, 2024; v1 submitted 8 April, 2023; originally announced April 2023.

    Comments: 18 pages, 16 figures, Accepted to AAAI 2024

  16. arXiv:2302.03022  [pdf, other

    cs.CV cs.RO eess.IV

    SurgT challenge: Benchmark of Soft-Tissue Trackers for Robotic Surgery

    Authors: Joao Cartucho, Alistair Weld, Samyakh Tukra, Haozheng Xu, Hiroki Matsuzaki, Taiyo Ishikawa, Minjun Kwon, Yong Eun Jang, Kwang-Ju Kim, Gwang Lee, Bizhe Bai, Lueder Kahrs, Lars Boecking, Simeon Allmendinger, Leopold Muller, Yitong Zhang, Yueming Jin, Sophia Bano, Francisco Vasconcelos, Wolfgang Reiter, Jonas Hajek, Bruno Silva, Estevao Lima, Joao L. Vilaca, Sandro Queiros , et al. (1 additional authors not shown)

    Abstract: This paper introduces the ``SurgT: Surgical Tracking" challenge which was organised in conjunction with MICCAI 2022. There were two purposes for the creation of this challenge: (1) the establishment of the first standardised benchmark for the research community to assess soft-tissue trackers; and (2) to encourage the development of unsupervised deep learning methods, given the lack of annotated da… ▽ More

    Submitted 30 August, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  17. arXiv:2301.13460  [pdf, other

    eess.SP

    Energy-Efficient Vehicular Edge Computing with One-by-one Access Scheme

    Authors: Youngsu Jang, Seongah Jeong, Joonhyuk Kang

    Abstract: With the advent of ever-growing vehicular applications, vehicular edge computing (VEC) has been a promising solution to augment the computing capacity of future smart vehicles. The ultimate challenge to fulfill the quality of service (QoS) is increasingly prominent with constrained computing and communication resources of vehicles. In this paper, we propose an energy-efficient task offloading stra… ▽ More

    Submitted 31 January, 2023; originally announced January 2023.

    Comments: 5 pages, 5 figures

  18. arXiv:2212.13333  [pdf

    quant-ph cs.NI eess.SY

    Quantum Communication Systems: Vision, Protocols, Applications, and Challenges

    Authors: Syed Rakib Hasan, Mostafa Zaman Chowdhury, Md. Saiam, Yeong Min Jang

    Abstract: The growth of modern technological sectors have risen to such a spectacular level that the blessings of technology have spread to every corner of the world, even to remote corners. At present, technological development finds its basis in the theoretical foundation of classical physics in every field of scientific research, such as wireless communication, visible light communication, machine learni… ▽ More

    Submitted 26 December, 2022; originally announced December 2022.

    Comments: 23 pages, 11 Figures

  19. arXiv:2211.06225  [pdf, other

    cs.IT eess.SP

    Over-the-Air Consensus for Distributed Vehicle Platooning Control (Extended version)

    Authors: Jihoon Lee, Yonghoon Jang, Hansol Kim, Seong-Lyun Kim, Seung-Woo Ko

    Abstract: A distributed control of vehicle platooning is referred to as distributed consensus (DC) since many autonomous vehicles (AVs) reach a consensus to move as one body with the same velocity and inter-distance. For DC control to be stable, other AVs' real-time position information should be inputted to each AV's controller via vehicle-to-vehicle (V2V) communications. On the other hand, too many V2V li… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: This work has been submitted to the IEEE for possible publication

  20. arXiv:2211.00439  [pdf, other

    eess.AS cs.SD

    Metric Learning for User-defined Keyword Spotting

    Authors: Jaemin Jung, Youkyum Kim, Jihwan Park, Youshin Lim, Byeong-Yeol Kim, Youngjoon Jang, Joon Son Chung

    Abstract: The goal of this work is to detect new spoken terms defined by users. While most previous works address Keyword Spotting (KWS) as a closed-set classification problem, this limits their transferability to unseen terms. The ability to define custom keywords has advantages in terms of user experience. In this paper, we propose a metric learning-based training strategy for user-defined keyword spott… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

  21. arXiv:2207.01868  [pdf, other

    eess.IV cs.CV cs.LG

    Bayesian approaches for Quantifying Clinicians' Variability in Medical Image Quantification

    Authors: Jaeik Jeon, Yeonggul Jang, Youngtaek Hong, Hackjoon Shim, Sekeun Kim

    Abstract: Medical imaging, including MRI, CT, and Ultrasound, plays a vital role in clinical decisions. Accurate segmentation is essential to measure the structure of interest from the image. However, manual segmentation is highly operator-dependent, which leads to high inter and intra-variability of quantitative measurements. In this paper, we explore the feasibility that Bayesian predictive distribution p… ▽ More

    Submitted 6 July, 2022; v1 submitted 5 July, 2022; originally announced July 2022.

    Comments: Interpretable Machine Learning in Healthcare

  22. arXiv:2112.06417  [pdf, other

    eess.IV cs.CV

    LC-FDNet: Learned Lossless Image Compression with Frequency Decomposition Network

    Authors: Hochang Rhee, Yeong Il Jang, Seyun Kim, Nam Ik Cho

    Abstract: Recent learning-based lossless image compression methods encode an image in the unit of subimages and achieve comparable performances to conventional non-learning algorithms. However, these methods do not consider the performance drop in the high-frequency region, giving equal consideration to the low and high-frequency areas. In this paper, we propose a new lossless image compression method that… ▽ More

    Submitted 12 December, 2021; originally announced December 2021.

  23. arXiv:2111.07552  [pdf, other

    eess.SY cs.RO

    Dynamic Placement of Rapidly Deployable Mobile Sensor Robots Using Machine Learning and Expected Value of Information

    Authors: Alice Agogino, Hae Young Jang, Vivek Rao, Ritik Batra, Felicity Liao, Rohan Sood, Irving Fang, R. Lily Hu, Emerson Shoichet-Bartus, John Matranga

    Abstract: Although the Industrial Internet of Things has increased the number of sensors permanently installed in industrial plants, there will be gaps in coverage due to broken sensors or sparse density in very large plants, such as in the petrochemical industry. Modern emergency response operations are beginning to use Small Unmanned Aerial Systems (sUAS) that have the ability to drop sensor robots to pre… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: 14 pages, 11 figures, IMECE2021

  24. arXiv:2007.09102  [pdf, other

    eess.SY cs.AI

    Breaking Moravec's Paradox: Visual-Based Distribution in Smart Fashion Retail

    Authors: Shin Woong Sung, Hyunsuk Baek, Hyeonjun Sim, Eun Hie Kim, Hyunwoo Hwangbo, Young Jae Jang

    Abstract: In this paper, we report an industry-academia collaborative study on the distribution method of fashion products using an artificial intelligence (AI) technique combined with an optimization method. To meet the current fashion trend of short product lifetimes and an increasing variety of styles, the company produces limited volumes of a large variety of styles. However, due to the limited volume o… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

    Comments: 10 pages, 19 figures, The fifth international workshop on fashion and KDD, KDD 2020

  25. Energy-Efficient UAV Relaying Robust Resource Allocation in Uncertain Adversarial Networks

    Authors: S. Ahmed, Mostafa Z. Chowdhury, S. R. Sabuj, M. I. Alam, Y. M. Jang

    Abstract: The mobile relaying technique is a critical enhancing technology in wireless communications due to a higher chance of supporting the remote user from the base station (BS) with better quality of service. This paper investigates energy-efficient (EE) mobile relaying networks, mounted on an unmanned aerial vehicle (UAV), while the unknown adversaries try to intercept the legitimate link. We aim to o… ▽ More

    Submitted 23 July, 2021; v1 submitted 28 June, 2020; originally announced June 2020.

    Comments: 12 pages, 9 figures

  26. Opportunities of Optical Spectrum for Future Wireless Communications

    Authors: Mostafa Zaman Chowdhury, Moh Khalid Hasan, Md Shahjalal, Eun Bi Shin, Yeong Min Jang

    Abstract: The requirements in terms of service quality such as data rate, latency, power consumption, number of connectivity of future fifth-generation (5G) communication is very high. Moreover, in Internet of Things (IoT) requires massive connectivity. Optical wireless communication (OWC) technologies such as visible light communication, light fidelity, optical camera communication, and free space optical… ▽ More

    Submitted 30 May, 2020; originally announced June 2020.

    Comments: 2019 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)

  27. Optical wireless hybrid networks for 5G and beyond communications

    Authors: Mostafa Zaman Chowdhury, Moh Khalid Hasan, Md Shahjalal, Md Tanvir Hossan, Yeong Min Jang

    Abstract: The next 5 th generation (5G) and above ultra-high speed, ultra-low latency, and extremely high reliable communication systems will consist of heterogeneous networks. These heterogeneous networks will consist not only radio frequency (RF) based systems but also optical wireless based systems. Hybrid architectures among different networks is an excellent approach for achieving the required level of… ▽ More

    Submitted 30 May, 2020; originally announced June 2020.

    Comments: 2018 International Conference on Information and Communication Technology Convergence (ICTC)

  28. arXiv:1910.06652  [pdf, other

    eess.SP

    Energy-Efficient Task Offloading for Vehicular Edge Computing: Joint Optimization of Offloading and Bit Allocation

    Authors: Youngsu Jang, Jinyeop Na, Seongah Jeong, Joonhyuk Kang

    Abstract: With the rapid development of vehicular networks, various applications that require high computation resources have emerged. To efficiently execute these applications, vehicular edge computing (VEC) can be employed. VEC offloads the computation tasks to the VEC node, i.e., the road side unit (RSU), which improves vehicular service and reduces energy consumption of the vehicle. However, communicati… ▽ More

    Submitted 15 October, 2019; originally announced October 2019.

    Comments: 5 pages, 4 figures

  29. arXiv:1909.11315  [pdf

    cs.NI eess.SP

    6G Wireless Communication Systems: Applications, Requirements, Technologies, Challenges, and Research Directions

    Authors: Mostafa Zaman Chowdhury, Md. Shahjalal, Shakil Ahmed, Yeong Min Jang

    Abstract: Fifth-generation (5G) communication, which has many more features than fourth-generation communication, will be officially launched very soon. A new paradigm of wireless communication, the sixth-generation (6G) system, with the full support of artificial intelligence is expected to be deployed between 2027 and 2030. In beyond 5G, there are some fundamental issues, which need to be addressed are hi… ▽ More

    Submitted 25 September, 2019; originally announced September 2019.

  30. arXiv:1810.04127  [pdf

    eess.SP cs.NI

    A Novel Indoor Mobile Localization System Based on Optical Camera Communication

    Authors: Md. Tanvir Hossan, Mostafa Zaman Chowdhury, Amirul Islam, Yeong Min Jang

    Abstract: Localizing smartphones in indoor environments offers excellent opportunities for e-commerce. In this paper, we propose a localization technique for smartphones in indoor environments. This technique can calculate the coordinates of a smartphone using existing illumination infrastructure with light-emitting diodes (LEDs). The system can locate smartphones without further modification of the existin… ▽ More

    Submitted 5 October, 2018; originally announced October 2018.

    Journal ref: Wireless Communications and Mobile Computing, vol. 2018, Jan. 2018