Skip to main content

Showing 1–50 of 64 results for author: Kong, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.10179  [pdf, other

    cs.CV

    CLIP-Guided Networks for Transferable Targeted Attacks

    Authors: Hao Fang, Jiawei Kong, Bin Chen, Tao Dai, Hao Wu, Shu-Tao Xia

    Abstract: Transferable targeted adversarial attacks aim to mislead models into outputting adversary-specified predictions in black-box scenarios. Recent studies have introduced \textit{single-target} generative attacks that train a generator for each target class to generate highly transferable perturbations, resulting in substantial computational overhead when handling multiple classes. \textit{Multi-targe… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  2. arXiv:2406.05491  [pdf, other

    cs.CV cs.CR

    One Perturbation is Enough: On Generating Universal Adversarial Perturbations against Vision-Language Pre-training Models

    Authors: Hao Fang, Jiawei Kong, Wenbo Yu, Bin Chen, Jiawei Li, Shutao Xia, Ke Xu

    Abstract: Vision-Language Pre-training (VLP) models trained on large-scale image-text pairs have demonstrated unprecedented capability in many practical applications. However, previous studies have revealed that VLP models are vulnerable to adversarial samples crafted by a malicious adversary. While existing attacks have achieved great success in improving attack effect and transferability, they all focus o… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  3. arXiv:2404.06709  [pdf, other

    cs.CL

    CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers

    Authors: Longwei Zou, Qingyang Wang, Han Zhao, Jiangang Kong, Yi Yang, Yangdong Deng

    Abstract: The fast-growing large scale language models are delivering unprecedented performance on almost all natural language processing tasks. However, the effectiveness of large language models are reliant on an exponentially increasing number of parameters. The overwhelming computation complexity incurs a high inference latency that negatively affects user experience. Existing methods to improve inferen… ▽ More

    Submitted 4 July, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: ACL 2024

  4. arXiv:2403.18134  [pdf, other

    eess.IV cs.CV

    Integrative Graph-Transformer Framework for Histopathology Whole Slide Image Representation and Classification

    Authors: Zhan Shi, Jingwei Zhang, Jun Kong, Fusheng Wang

    Abstract: In digital pathology, the multiple instance learning (MIL) strategy is widely used in the weakly supervised histopathology whole slide image (WSI) classification task where giga-pixel WSIs are only labeled at the slide level. However, existing attention-based MIL approaches often overlook contextual information and intrinsic spatial relationships between neighboring tissue tiles, while graph-based… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  5. arXiv:2403.01971  [pdf, other

    cs.SE

    ContrastRepair: Enhancing Conversation-Based Automated Program Repair via Contrastive Test Case Pairs

    Authors: Jiaolong Kong, Mingfei Cheng, Xiaofei Xie, Shangqing Liu, Xiaoning Du, Qi Guo

    Abstract: Automated Program Repair (APR) aims to automatically generate patches for rectifying software bugs. Recent strides in Large Language Models (LLM), such as ChatGPT, have yielded encouraging outcomes in APR, especially within the conversation-driven APR framework. Nevertheless, the efficacy of conversation-driven APR is contingent on the quality of the feedback information. In this paper, we propose… ▽ More

    Submitted 7 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  6. arXiv:2403.00669  [pdf, other

    cs.LG

    Advancing Additive Manufacturing through Deep Learning: A Comprehensive Review of Current Progress and Future Challenges

    Authors: Amirul Islam Saimon, Emmanuel Yangue, Xiaowei Yue, Zhenyu James Kong, Chenang Liu

    Abstract: Additive manufacturing (AM) has already proved itself to be the potential alternative to widely-used subtractive manufacturing due to its extraordinary capacity of manufacturing highly customized products with minimum material wastage. Nevertheless, it is still not being considered as the primary choice for the industry due to some of its major inherent challenges, including complex and dynamic pr… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  7. arXiv:2402.10087  [pdf, ps, other

    cs.NI cs.LG eess.SP

    Decentralized Covert Routing in Heterogeneous Networks Using Reinforcement Learning

    Authors: Justin Kong, Terrence J. Moore, Fikadu T. Dagefu

    Abstract: This letter investigates covert routing communications in a heterogeneous network where a source transmits confidential data to a destination with the aid of relaying nodes where each transmitter judiciously chooses one modality among multiple communication modalities. We develop a novel reinforcement learning-based covert routing algorithm that finds a route from the source to the destination whe… ▽ More

    Submitted 31 January, 2024; originally announced February 2024.

  8. arXiv:2402.04013  [pdf, other

    cs.CV

    Privacy Leakage on DNNs: A Survey of Model Inversion Attacks and Defenses

    Authors: Hao Fang, Yixiang Qiu, Hongyao Yu, Wenbo Yu, Jiawei Kong, Baoli Chong, Bin Chen, Xuan Wang, Shu-Tao Xia

    Abstract: Model Inversion (MI) attacks aim to disclose private information about the training data by abusing access to the pre-trained models. These attacks enable adversaries to reconstruct high-fidelity data that closely aligns with the private training data, which has raised significant privacy concerns. Despite the rapid advances in the field, we lack a comprehensive overview of existing MI attacks and… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  9. arXiv:2312.04106  [pdf, other

    cs.CV

    Identity-Obscured Neural Radiance Fields: Privacy-Preserving 3D Facial Reconstruction

    Authors: Jiayi Kong, Baixin Xu, Xurui Song, Chen Qian, Jun Luo, Ying He

    Abstract: Neural radiance fields (NeRF) typically require a complete set of images taken from multiple camera perspectives to accurately reconstruct geometric details. However, this approach raise significant privacy concerns in the context of facial reconstruction. The critical need for privacy protection often leads invidividuals to be reluctant in sharing their facial images, due to fears of potential mi… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  10. arXiv:2311.11745  [pdf, other

    cs.SD cs.CL eess.AS

    ELF: Encoding Speaker-Specific Latent Speech Feature for Speech Synthesis

    Authors: Jungil Kong, Junmo Lee, Jeongmin Kim, Beomjeong Kim, Jihoon Park, Dohee Kong, Changheon Lee, Sangjin Kim

    Abstract: In this work, we propose a novel method for modeling numerous speakers, which enables expressing the overall characteristics of speakers in detail like a trained multi-speaker model without additional training on the target speaker's dataset. Although various works with similar purposes have been actively studied, their performance has not yet reached that of trained multi-speaker models due to th… ▽ More

    Submitted 31 May, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: ICML 2024

  11. arXiv:2310.17448  [pdf, other

    cs.CL eess.AS

    Dialect Adaptation and Data Augmentation for Low-Resource ASR: TalTech Systems for the MADASR 2023 Challenge

    Authors: Tanel Alumäe, Jiaming Kong, Daniil Robnikov

    Abstract: This paper describes Tallinn University of Technology (TalTech) systems developed for the ASRU MADASR 2023 Challenge. The challenge focuses on automatic speech recognition of dialect-rich Indian languages with limited training audio and text data. TalTech participated in two tracks of the challenge: Track 1 that allowed using only the provided training data and Track 3 which allowed using addition… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Journal ref: 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)

  12. arXiv:2310.04453  [pdf, other

    cs.CL cs.LG cs.SI

    COVID-19 South African Vaccine Hesitancy Models Show Boost in Performance Upon Fine-Tuning on M-pox Tweets

    Authors: Nicholas Perikli, Srimoy Bhattacharya, Blessing Ogbuokiri, Zahra Movahedi Nia, Benjamin Lieberman, Nidhi Tripathi, Salah-Eddine Dahbi, Finn Stevenson, Nicola Bragazzi, Jude Kong, Bruce Mellado

    Abstract: Very large numbers of M-pox cases have, since the start of May 2022, been reported in non-endemic countries leading many to fear that the M-pox Outbreak would rapidly transition into another pandemic, while the COVID-19 pandemic ravages on. Given the similarities of M-pox with COVID-19, we chose to test the performance of COVID-19 models trained on South African twitter data on a hand-labelled M-p… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  13. arXiv:2309.07449  [pdf

    physics.soc-ph cs.MA math.DS nlin.AO

    Rate-Induced Transitions in Networked Complex Adaptive Systems: Exploring Dynamics and Management Implications Across Ecological, Social, and Socioecological Systems

    Authors: Vítor V. Vasconcelos, Flávia M. D. Marquitti, Theresa Ong, Lisa C. McManus, Marcus Aguiar, Amanda B. Campos, Partha S. Dutta, Kristen Jovanelly, Victoria Junquera, Jude Kong, Elisabeth H. Krueger, Simon A. Levin, Wenying Liao, Mingzhen Lu, Dhruv Mittal, Mercedes Pascual, Flávio L. Pinheiro, Juan Rocha, Fernando P. Santos, Peter Sloot, Chenyang, Su, Benton Taylor, Eden Tekwa, Sjoerd Terpstra , et al. (5 additional authors not shown)

    Abstract: Complex adaptive systems (CASs), from ecosystems to economies, are open systems and inherently dependent on external conditions. While a system can transition from one state to another based on the magnitude of change in external conditions, the rate of change -- irrespective of magnitude -- may also lead to system state changes due to a phenomenon known as a rate-induced transition (RIT). This st… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: 25 pages, 4 figures, 1 box, supplementary information

    MSC Class: 37G; 37N; 91B; 91C; 91D; 91E; 92D; 92D25; 92D40; 92F; 93A; 93A14; 93A16 ACM Class: I.6.3; I.6.m; J.3; J.4; J.m; K.4.2

  14. arXiv:2309.05253  [pdf, other

    quant-ph cs.CR cs.LG math-ph

    A quantum tug of war between randomness and symmetries on homogeneous spaces

    Authors: Rahul Arvind, Kishor Bharti, Jun Yong Khoo, Dax Enshan Koh, Jian Feng Kong

    Abstract: We explore the interplay between symmetry and randomness in quantum information. Adopting a geometric approach, we consider states as $H$-equivalent if related by a symmetry transformation characterized by the group $H$. We then introduce the Haar measure on the homogeneous space $\mathbb{U}/H$, characterizing true randomness for $H$-equivalent systems. While this mathematical machinery is well-st… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: 9 + 1 pages, 3 figures

  15. arXiv:2309.05088  [pdf

    cs.CY q-bio.OT

    Towards Trustworthy Artificial Intelligence for Equitable Global Health

    Authors: Hong Qin, Jude Kong, Wandi Ding, Ramneek Ahluwalia, Christo El Morr, Zeynep Engin, Jake Okechukwu Effoduh, Rebecca Hwa, Serena Jingchuan Guo, Laleh Seyyed-Kalantari, Sylvia Kiwuwa Muyingo, Candace Makeda Moore, Ravi Parikh, Reva Schwartz, Dongxiao Zhu, Xiaoqian Wang, Yiye Zhang

    Abstract: Artificial intelligence (AI) can potentially transform global health, but algorithmic bias can exacerbate social inequities and disparity. Trustworthy AI entails the intentional design to ensure equity and mitigate potential biases. To advance trustworthy AI in global health, we convened a workshop on Fairness in Machine Intelligence for Global Health (FairMI4GH). The event brought together a glob… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: 7 pages

  16. arXiv:2307.16430  [pdf, other

    cs.SD cs.LG eess.AS

    VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

    Authors: Jungil Kong, Jihoon Park, Beomjeong Kim, Jeongmin Kim, Dohee Kong, Sangjin Kim

    Abstract: Single-stage text-to-speech models have been actively studied recently, and their results have outperformed two-stage pipeline systems. Although the previous single-stage model has made great progress, there is room for improvement in terms of its intermittent unnaturalness, computational efficiency, and strong dependence on phoneme conversion. In this work, we introduce VITS2, a single-stage text… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: Interspeech 2023

  17. arXiv:2307.15072  [pdf, other

    cs.CY cs.CL cs.LG cs.SI

    Detecting the Presence of COVID-19 Vaccination Hesitancy from South African Twitter Data Using Machine Learning

    Authors: Nicholas Perikli, Srimoy Bhattacharya, Blessing Ogbuokiri, Zahra Movahedi Nia, Benjamin Lieberman, Nidhi Tripathi, Salah-Eddine Dahbi, Finn Stevenson, Nicola Bragazzi, Jude Kong, Bruce Mellado

    Abstract: Very few social media studies have been done on South African user-generated content during the COVID-19 pandemic and even fewer using hand-labelling over automated methods. Vaccination is a major tool in the fight against the pandemic, but vaccine hesitancy jeopardizes any public health effort. In this study, sentiment analysis on South African tweets related to vaccine hesitancy was performed, w… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

  18. arXiv:2306.06862  [pdf, other

    cs.RO

    Saltation Matrices: The Essential Tool for Linearizing Hybrid Dynamical Systems

    Authors: Nathan J. Kong, J. Joe Payne, James Zhu, Aaron M. Johnson

    Abstract: Hybrid dynamical systems, i.e. systems that have both continuous and discrete states, are ubiquitous in engineering, but are difficult to work with due to their discontinuous transitions. For example, a robot leg is able to exert very little control effort while it is in the air compared to when it is on the ground. When the leg hits the ground, the penetrating velocity instantaneously collapses t… ▽ More

    Submitted 20 June, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

  19. arXiv:2305.16143  [pdf, other

    cs.LG

    Condensed Prototype Replay for Class Incremental Learning

    Authors: Jiangtao Kong, Zhenyu Zong, Tianyi Zhou, Huajie Shao

    Abstract: Incremental learning (IL) suffers from catastrophic forgetting of old tasks when learning new tasks. This can be addressed by replaying previous tasks' data stored in a memory, which however is usually prone to size limits and privacy leakage. Recent studies store only class centroids as prototypes and augment them with Gaussian noises to create synthetic data for replay. However, they cannot effe… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  20. arXiv:2305.13048  [pdf, other

    cs.CL cs.AI

    RWKV: Reinventing RNNs for the Transformer Era

    Authors: Bo Peng, Eric Alcaide, Quentin Anthony, Alon Albalak, Samuel Arcadinho, Stella Biderman, Huanqi Cao, Xin Cheng, Michael Chung, Matteo Grella, Kranthi Kiran GV, Xuzheng He, Haowen Hou, Jiaju Lin, Przemyslaw Kazienko, Jan Kocon, Jiaming Kong, Bartlomiej Koptyra, Hayden Lau, Krishna Sri Ipsit Mantri, Ferdinand Mom, Atsushi Saito, Guangyu Song, Xiangru Tang, Bolun Wang , et al. (9 additional authors not shown)

    Abstract: Transformers have revolutionized almost all natural language processing (NLP) tasks but suffer from memory and computational complexity that scales quadratically with sequence length. In contrast, recurrent neural networks (RNNs) exhibit linear scaling in memory and computational requirements but struggle to match the same performance as Transformers due to limitations in parallelization and scala… ▽ More

    Submitted 10 December, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

  21. arXiv:2212.07194  [pdf

    cs.LG cs.IR

    Traffic Flow Prediction via Variational Bayesian Inference-based Encoder-Decoder Framework

    Authors: Jianlei Kong, Xiaomeng Fan, Xue-Bo Jin, Min Zuo

    Abstract: Accurate traffic flow prediction, a hotspot for intelligent transportation research, is the prerequisite for mastering traffic and making travel plans. The speed of traffic flow can be affected by roads condition, weather, holidays, etc. Furthermore, the sensors to catch the information about traffic flow will be interfered with by environmental factors such as illumination, collection time, occlu… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

  22. arXiv:2210.08723  [pdf, other

    cs.CR

    Private Data Valuation and Fair Payment in Data Marketplaces

    Authors: Zhihua Tian, Jian Liu, Jingyu Li, Xinle Cao, Ruoxi Jia, Jun Kong, Mengdi Liu, Kui Ren

    Abstract: Data valuation is an essential task in a data marketplace. It aims at fairly compensating data owners for their contribution. There is increasing recognition in the machine learning community that the Shapley value -- a foundational profit-sharing scheme in cooperative game theory -- has major potential to value data, because it uniquely satisfies basic properties for fair credit allocation and ha… ▽ More

    Submitted 17 February, 2023; v1 submitted 16 October, 2022; originally announced October 2022.

    Comments: 14 pages

  23. arXiv:2209.06421  [pdf, other

    cs.GR

    A Transfer Function Design Using A Knowledge Database based on Deep Image and Primitive Intensity Profile Features Retrieval

    Authors: Younhyun Jung, Jim Kong, Jinman Kim

    Abstract: Transfer function (TF) plays a key role for the generation of direct volume rendering (DVR), by enabling accurate identification of structures of interest (SOIs) interactively as well as ensuring appropriate visibility of them. Attempts at mitigating the repetitive manual process of TF design have led to approaches that make use of a knowledge database consisting of pre-designed TFs by domain expe… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: submitted to Computer Graphics Forum for review

  24. Hybrid iLQR Model Predictive Control for Contact Implicit Stabilization on Legged Robots

    Authors: Nathan J. Kong, Chuanzheng Li, Aaron M. Johnson

    Abstract: Model Predictive Control (MPC) is a popular strategy for controlling robots but is difficult for systems with contact due to the complex nature of hybrid dynamics. To implement MPC for systems with contact, dynamic models are often simplified or contact sequences fixed in time in order to plan trajectories efficiently. In this work, we extend Hybrid iterative Linear Quadratic Regulator to work in… ▽ More

    Submitted 6 November, 2023; v1 submitted 10 July, 2022; originally announced July 2022.

    Comments: in IEEE Transactions on Robotics, 2023. arXiv admin note: substantial text overlap with arXiv:2103.14584

  25. arXiv:2206.00798  [pdf, other

    cs.CV

    Multi-scale frequency separation network for image deblurring

    Authors: Yanni Zhang, Qiang Li, Miao Qi, Di Liu, Jun Kong, Jianzhong Wang

    Abstract: Image deblurring aims to restore the detailed texture information or structures from blurry images, which has become an indispensable step in many computer vision tasks. Although various methods have been proposed to deal with the image deblurring problem, most of them treated the blurry image as a whole and neglected the characteristics of different image frequencies. In this paper, we present a… ▽ More

    Submitted 8 December, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  26. arXiv:2205.14811  [pdf, other

    math.OC cs.LG math.NA

    Last-iterate convergence analysis of stochastic momentum methods for neural networks

    Authors: Dongpo Xu, Jinlan Liu, Yinghua Lu, Jun Kong, Danilo Mandic

    Abstract: The stochastic momentum method is a commonly used acceleration technique for solving large-scale stochastic optimization problems in artificial neural networks. Current convergence results of stochastic momentum methods under non-convex stochastic settings mostly discuss convergence in terms of the random output and minimum output. To this end, we address the convergence of the last iterate output… ▽ More

    Submitted 29 May, 2022; originally announced May 2022.

    Comments: 21pages, 4figures

    MSC Class: 90C26 ACM Class: G.1.6

    Journal ref: Neurocomputing 527 (2023) 27-35

  27. arXiv:2205.06801  [pdf

    cs.CL cs.AI cs.SI

    Twitter-Based Gender Recognition Using Transformers

    Authors: Zahra Movahedi Nia, Ali Ahmadi, Bruce Mellado, Jianhong Wu, James Orbinski, Ali Agary, Jude Dzevela Kong

    Abstract: Social media contains useful information about people and the society that could help advance research in many different areas (e.g. by applying opinion mining, emotion/sentiment analysis, and statistical analysis) such as business and finance, health, socio-economic inequality and gender vulnerability. User demographics provide rich information that could help study the subject further. However,… ▽ More

    Submitted 24 April, 2022; originally announced May 2022.

  28. arXiv:2202.12729  [pdf, other

    cs.RO

    The Uncertainty Aware Salted Kalman Filter: State Estimation for Hybrid Systems with Uncertain Guards

    Authors: J. Joe Payne, Nathan J. Kong, Aaron M. Johnson

    Abstract: In this paper we present a method for updating robotic state belief through contact with uncertain surfaces and apply this update to a Kalman filter for more accurate state estimation. Examining how guard surface uncertainty affects the time spent in each mode, we derive a guard saltation matrix - which maps perturbations prior to hybrid events to perturbations after - accounting for additional va… ▽ More

    Submitted 29 July, 2022; v1 submitted 25 February, 2022; originally announced February 2022.

    Comments: To appear in IROS 2022

  29. arXiv:2112.01442  [pdf, other

    cs.SI cs.AI

    Learning Large-scale Network Embedding from Representative Subgraph

    Authors: Junsheng Kong, Weizhao Li, Ben Liao, Jiezhong Qiu, Chang-Yu, Hsieh, Yi Cai, Jinhui Zhu, Shengyu Zhang

    Abstract: We study the problem of large-scale network embedding, which aims to learn low-dimensional latent representations for network mining applications. Recent research in the field of network embedding has led to significant progress such as DeepWalk, LINE, NetMF, NetSMF. However, the huge size of many real-world networks makes it computationally expensive to learn network embedding from the entire net… ▽ More

    Submitted 2 December, 2021; originally announced December 2021.

    Comments: 10 pages, 5 figures

  30. arXiv:2110.01123  [pdf, ps, other

    cs.RO

    Hybrid Event Shaping to Stabilize Periodic Hybrid Orbits

    Authors: James Zhu, Nathan J. Kong, George Council, Aaron M. Johnson

    Abstract: Many controllers for legged robotic systems leverage open- or closed-loop control at discrete hybrid events to enhance stability. These controllers appear in several well studied phenomena such as the Raibert stepping controller, paddle juggling and swing leg retraction. This work introduces hybrid event shaping (HES): a generalized method for analyzing and producing stable hybrid event controller… ▽ More

    Submitted 3 July, 2022; v1 submitted 3 October, 2021; originally announced October 2021.

    Comments: Presented at IEEE ICRA 2022

  31. Fast Extraction of Word Embedding from Q-contexts

    Authors: Junsheng Kong, Weizhao Li, Zeyi Liu, Ben Liao, Jiezhong Qiu, Chang-Yu Hsieh, Yi Cai, Shengyu Zhang

    Abstract: The notion of word embedding plays a fundamental role in natural language processing (NLP). However, pre-training word embedding for very large-scale vocabulary is computationally challenging for most existing methods. In this work, we show that with merely a small fraction of contexts (Q-contexts)which are typical in the whole corpus (and their mutual information with words), one can construct hi… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: Accepted by CIKM 2021

  32. arXiv:2109.01182  [pdf, other

    physics.soc-ph cs.MA cs.SI

    COVID-19 Vaccine Hesitancy and Information Diffusion: An Agent-based Modeling Approach

    Authors: Pooria Taghizadeh Naderi, Ali Asgary, Jude Kong, Jianhong Wu, Fattaneh Taghiyareh

    Abstract: Despite the unprecedented success in the rapid development of several effective vaccines against the Cov-SARS-2, global vaccination rollout efforts suffer from vaccine distribution inequality and vaccine acceptance, leading to insufficient public immunity provided by the vaccine products. While a major current focus in vaccine acceptance research is how to model and inform vaccine acceptance based… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

  33. arXiv:2106.06103  [pdf, other

    cs.SD eess.AS

    Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

    Authors: Jaehyeon Kim, Jungil Kong, Juhee Son

    Abstract: Several recent end-to-end text-to-speech (TTS) models enabling single-stage training and parallel sampling have been proposed, but their sample quality does not match that of two-stage TTS systems. In this work, we present a parallel end-to-end TTS method that generates more natural sounding audio than current two-stage models. Our method adopts variational inference augmented with normalizing flo… ▽ More

    Submitted 10 June, 2021; originally announced June 2021.

    Comments: ICML 2021

  34. arXiv:2103.14584  [pdf, other

    cs.RO

    iLQR for Piecewise-Smooth Hybrid Dynamical Systems

    Authors: Nathan J. Kong, George Council, Aaron M. Johnson

    Abstract: Trajectory optimization is a popular strategy for planning trajectories for robotic systems. However, many robotic tasks require changing contact conditions, which is difficult due to the hybrid nature of the dynamics. The optimal sequence and timing of these modes are typically not known ahead of time. In this work, we extend the Iterative Linear Quadratic Regulator (iLQR) method to a class of pi… ▽ More

    Submitted 6 September, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: To Appear in IEEE CDC 2021

  35. arXiv:2101.05403  [pdf

    cs.CV cs.LG

    Image deblurring based on lightweight multi-information fusion network

    Authors: Yanni Zhang, Yiming Liu, Qiang Li, Miao Qi, Dahong Xu, Jun Kong, Jianzhong Wang

    Abstract: Recently, deep learning based image deblurring has been well developed. However, exploiting the detailed image features in a deep learning framework always requires a mass of parameters, which inevitably makes the network suffer from high computational burden. To solve this problem, we propose a lightweight multiinformation fusion network (LMFN) for image deblurring. The proposed LMFN is designed… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

  36. arXiv:2011.06228  [pdf, other

    cs.CV

    DSAM: A Distance Shrinking with Angular Marginalizing Loss for High Performance Vehicle Re-identificatio

    Authors: Jiangtao Kong, Yu Cheng, Benjia Zhou, Kai Li, Junliang Xing

    Abstract: Vehicle Re-identification (ReID) is an important yet challenging problem in computer vision. Compared to other visual objects like faces and persons, vehicles simultaneously exhibit much larger intraclass viewpoint variations and interclass visual similarities, making most exiting loss functions designed for face recognition and person ReID unsuitable for vehicle ReID. To obtain a high-performance… ▽ More

    Submitted 8 September, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

  37. arXiv:2010.05646  [pdf, other

    cs.SD cs.LG eess.AS

    HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

    Authors: Jungil Kong, Jaehyeon Kim, Jaekyoung Bae

    Abstract: Several recent work on speech synthesis have employed generative adversarial networks (GANs) to produce raw waveforms. Although such methods improve the sampling efficiency and memory usage, their sample quality has not yet reached that of autoregressive and flow-based generative models. In this work, we propose HiFi-GAN, which achieves both efficient and high-fidelity speech synthesis. As speech… ▽ More

    Submitted 23 October, 2020; v1 submitted 12 October, 2020; originally announced October 2020.

    Comments: NeurIPS 2020. Code available at https://github.com/jik876/hifi-gan

  38. arXiv:2010.04935  [pdf, other

    cs.CL cs.AI

    HPCC-YNU at SemEval-2020 Task 9: A Bilingual Vector Gating Mechanism for Sentiment Analysis of Code-Mixed Text

    Authors: Jun Kong, Jin Wang, Xuejie Zhang

    Abstract: It is fairly common to use code-mixing on a social media platform to express opinions and emotions in multilingual societies. The purpose of this task is to detect the sentiment of code-mixed social media text. Code-mixed text poses a great challenge for the traditional NLP system, which currently uses monolingual resources to deal with the problem of multilingual mixing. This task has been solved… ▽ More

    Submitted 10 October, 2020; originally announced October 2020.

    Comments: 6 pages, 3 figures

  39. arXiv:2007.12233  [pdf, other

    cs.RO

    The Salted Kalman Filter: Kalman Filtering on Hybrid Dynamical Systems

    Authors: Nathan J. Kong, J. Joe Payne, George Council, Aaron M. Johnson

    Abstract: Many state estimation and control algorithms require knowledge of how probability distributions propagate through dynamical systems. However, despite hybrid dynamical systems becoming increasingly important in many fields, there has been little work on utilizing the knowledge of how probability distributions map through hybrid transitions. Here, we make use of a propagation law that employs the sa… ▽ More

    Submitted 8 February, 2021; v1 submitted 23 July, 2020; originally announced July 2020.

    Comments: Submitted to Automatica

  40. arXiv:2005.11129  [pdf, other

    eess.AS cs.SD

    Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search

    Authors: Jaehyeon Kim, Sungwon Kim, Jungil Kong, Sungroh Yoon

    Abstract: Recently, text-to-speech (TTS) models such as FastSpeech and ParaNet have been proposed to generate mel-spectrograms from text in parallel. Despite the advantage, the parallel TTS models cannot be trained without guidance from autoregressive TTS models as their external aligners. In this work, we propose Glow-TTS, a flow-based generative model for parallel TTS that does not require any external al… ▽ More

    Submitted 22 October, 2020; v1 submitted 22 May, 2020; originally announced May 2020.

    Comments: Accepted by NeurIPS2020

  41. arXiv:1911.07088  [pdf, other

    eess.IV cs.CV

    Liver Steatosis Segmentation with Deep Learning Methods

    Authors: Xiaoyuan Guo, Fusheng Wang, George Teodorou, Alton B. Farris, Jun Kong

    Abstract: Liver steatosis is known as the abnormal accumulation of lipids within cells. An accurate quantification of steatosis area within the liver histopathological microscopy images plays an important role in liver disease diagnosis and trans-plantation assessment. Such a quantification analysis often requires a precise steatosis segmentation that is challenging due to abundant presence of highly overla… ▽ More

    Submitted 16 November, 2019; originally announced November 2019.

    Comments: 4 pages

    Journal ref: 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019) Venice, Italy, April 8-11, 2019

  42. arXiv:1910.14548  [pdf, other

    cs.DC

    Run-time Parameter Sensitivity Analysis Optimizations

    Authors: Eduardo Scartezini, Willian Barreiros Jr., Tahsin Kurc, Jun Kong, Alba C. M. A. Melo, Joel Saltz, George Teodoro

    Abstract: Efficient execution of parameter sensitivity analysis (SA) is critical to allow for its routinely use. The pathology image processing application investigated in this work processes high-resolution whole-slide cancer tissue images from large datasets to characterize and classify the disease. However, the application is parameterized and changes in parameter values may significantly affect its resu… ▽ More

    Submitted 31 October, 2019; originally announced October 2019.

    Comments: 8 pages, 8 figures

  43. StateLens: A Reverse Engineering Solution for Making Existing Dynamic Touchscreens Accessible

    Authors: Anhong Guo, Junhan Kong, Michael Rivera, Frank F. Xu, Jeffrey P. Bigham

    Abstract: Blind people frequently encounter inaccessible dynamic touchscreens in their everyday lives that are difficult, frustrating, and often impossible to use independently. Touchscreens are often the only way to control everything from coffee machines and payment terminals, to subway ticket machines and in-flight entertainment systems. Interacting with dynamic touchscreens is difficult non-visually bec… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

    Comments: ACM UIST 2019

  44. arXiv:1906.03385  [pdf, other

    cs.IT

    Applications of Gaussian Binomials to Coding Theory for Deletion Error Correction

    Authors: Manabu Hagiwara, Justin Kong

    Abstract: We present new applications on $q$-binomials, also known as Gaussian binomial coefficients. Our main theorems determine cardinalities of certain error-correcting codes based on Varshamov-Tenengolts codes and prove a curious phenomenon relating to deletion sphere for specific cases.

    Submitted 12 June, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

    Comments: 17 pages, 2 figures

    MSC Class: 05A10; 05A19; 94B25

  45. arXiv:1810.02911  [pdf

    cs.DC

    Tuning for Tissue Image Segmentation Workflows for Accuracy and Performance

    Authors: Luis F. R. Taveira, Tahsin Kurc, Alba C. M. A. Melo, Jun Kong, Erich Bremer, Joel H. Saltz, George Teodoro

    Abstract: We propose a software platform that integrates methods and tools for multi-objective parameter auto- tuning in tissue image segmentation workflows. The goal of our work is to provide an approach for improving the accuracy of nucleus/cell segmentation pipelines by tuning their input parameters. The shape, size and texture features of nuclei in tissue are important biomarkers for disease prognosis,… ▽ More

    Submitted 5 October, 2018; originally announced October 2018.

    Comments: 29 pages, 5 figures

  46. All One Needs to Know about Fog Computing and Related Edge Computing Paradigms: A Complete Survey

    Authors: Ashkan Yousefpour, Caleb Fung, Tam Nguyen, Krishna Kadiyala, Fatemeh Jalali, Amirreza Niakanlahiji, Jian Kong, Jason P. Jue

    Abstract: With the Internet of Things (IoT) becoming part of our daily life and our environment, we expect rapid growth in the number of connected devices. IoT is expected to connect billions of devices and humans to bring promising advantages for us. With this growth, fog computing, along with its related edge computing paradigms, such as multi-access edge computing (MEC) and cloudlet, are seen as promisin… ▽ More

    Submitted 13 February, 2019; v1 submitted 15 August, 2018; originally announced August 2018.

    Comments: 48 pages, 7 tables, 11 figures, 450 references. The data (categories and features/objectives of the papers) of this survey are now available publicly. Accepted by Elsevier Journal of Systems Architecture

  47. arXiv:1808.04795  [pdf, other

    cs.CV

    Clumped Nuclei Segmentation with Adjacent Point Match and Local Shape based Intensity Analysis for Overlapped Nuclei in Fluorescence In-Situ Hybridization Images

    Authors: Xiaoyuan Guo, Hanyi Yu, Blair Rossetti, George Teodoro, Daniel Brat, Jun Kong

    Abstract: Highly clumped nuclei clusters captured in fluorescence in situ hybridization microscopy images are common histology entities under investigations in a wide spectrum of tissue-related biomedical investigations. Due to their large scale in presence, computer based image analysis is used to facilitate such analysis with improved analysis efficiency and reproducibility. To ensure the quality of downs… ▽ More

    Submitted 14 August, 2018; originally announced August 2018.

    Comments: 4 pages

  48. arXiv:1806.09093  [pdf, other

    cs.CV

    Analysis of Cellular Feature Differences of Astrocytomas with Distinct Mutational Profiles Using Digitized Histopathology Images

    Authors: Mousumi Roy, Fusheng Wang, George Teodoro, Jose Velazqeuz Vega, Daniel Brat, Jun Kong

    Abstract: Cellular phenotypic features derived from histopathology images are the basis of pathologic diagnosis and are thought to be related to underlying molecular profiles. Due to overwhelming cell numbers and population heterogeneity, it remains challenging to quantitatively compute and compare features of cells with distinct molecular signatures. In this study, we propose a self-reliant and efficient a… ▽ More

    Submitted 24 June, 2018; originally announced June 2018.

  49. arXiv:1806.09090  [pdf, other

    cs.CV

    Segmentation of Overlapped Steatosis in Whole-Slide Liver Histopathology Microscopy Images

    Authors: Mousumi Roy, Fusheng Wang, George Teodoro, Miriam B Vos, Alton Brad Farris, Jun Kong

    Abstract: An accurate steatosis quantification with pathology tissue samples is of high clinical importance. However, such pathology measurement is manually made in most clinical practices, subject to severe reader variability due to large sampling bias and poor reproducibility. Although some computerized automated methods are developed to quantify the steatosis regions, they present limited analysis capaci… ▽ More

    Submitted 24 June, 2018; originally announced June 2018.

  50. arXiv:1803.04364  [pdf

    q-bio.NC cs.DM stat.ML

    Maturation Trajectories of Cortical Resting-State Networks Depend on the Mediating Frequency Band

    Authors: Sheraz Khan, Javeria Hashmi, Fahimeh Mamashli, Konstantinos Michmizos, Manfred Kitzbichler, Hari Bharadwaj, Yousra Bekhti, Santosh Ganesan, Keri A Garel, Susan Whitfield-Gabrieli, Randy Gollub, Jian Kong, Lucia M Vaina, Kunjan Rana, Steven Stufflebeam, Matti Hamalainen, Tal Kenet

    Abstract: The functional significance of resting state networks and their abnormal manifestations in psychiatric disorders are firmly established, as is the importance of the cortical rhythms in mediating these networks. Resting state networks are known to undergo substantial reorganization from childhood to adulthood, but whether distinct cortical rhythms, which are generated by separable neural mechanisms… ▽ More

    Submitted 12 February, 2018; originally announced March 2018.