Skip to main content

Showing 1–50 of 88 results for author: Jin, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.08678  [pdf, other

    cs.LG math.OC stat.CO stat.ML

    How to beat a Bayesian adversary

    Authors: Zihan Ding, Kexin Jin, Jonas Latz, Chenguang Liu

    Abstract: Deep neural networks and other modern machine learning models are often susceptible to adversarial attacks. Indeed, an adversary may often be able to change a model's prediction through a small, directed perturbation of the model's input - an issue in safety-critical applications. Adversarially robust machine learning is usually based on a minmax optimisation problem that minimises the machine lea… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    MSC Class: 90C15; 65C35; 68T07

  2. arXiv:2407.00623  [pdf, other

    cs.CV

    Consistency Purification: Effective and Efficient Diffusion Purification towards Certified Robustness

    Authors: Yiquan Li, Zhongzhu Chen, Kun Jin, Jiongxiao Wang, Bo Li, Chaowei Xiao

    Abstract: Diffusion Purification, purifying noised images with diffusion models, has been widely used for enhancing certified robustness via randomized smoothing. However, existing frameworks often grapple with the balance between efficiency and effectiveness. While the Denoising Diffusion Probabilistic Model (DDPM) offers an efficient single-step purification, it falls short in ensuring purified images res… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  3. arXiv:2406.16756  [pdf, other

    cs.LG cs.AI cs.CY

    Addressing Polarization and Unfairness in Performative Prediction

    Authors: Kun Jin, Tian Xie, Yang Liu, Xueru Zhang

    Abstract: When machine learning (ML) models are used in applications that involve humans (e.g., online recommendation, school admission, hiring, lending), the model itself may trigger changes in the distribution of targeted data it aims to predict. Performative prediction (PP) is a framework that explicitly considers such model-dependent distribution shifts when learning ML models. While significant efforts… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  4. arXiv:2406.10467  [pdf, other

    cs.DS

    Scheduling two types of jobs with minimum makespan

    Authors: Song Cao, Kai Jin

    Abstract: We consider scheduling two types of jobs (A-job and B-job) to $p$ machines and minimizing their makespan. A group of same type of jobs processed consecutively by a machine is called a batch. For machine $v$, processing $x$ A-jobs in a batch takes $k^A_vx^2$ time units for a given speed $k^A_v$, and processing $x$ B-jobs in a batch takes $k^B_vx^2$ time units for a given speed $k^B_v$. We give an… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  5. arXiv:2406.07436  [pdf, other

    cs.PL

    McEval: Massively Multilingual Code Evaluation

    Authors: Linzheng Chai, Shukai Liu, Jian Yang, Yuwei Yin, Ke Jin, Jiaheng Liu, Tao Sun, Ge Zhang, Changyu Ren, Hongcheng Guo, Zekun Wang, Boyang Wang, Xianjie Wu, Bing Wang, Tongliang Li, Liqun Yang, Sufeng Duan, Zhoujun Li

    Abstract: Code large language models (LLMs) have shown remarkable advances in code understanding, completion, and generation tasks. Programming benchmarks, comprised of a selection of code challenges and corresponding test cases, serve as a standard to evaluate the capability of different LLMs in such tasks. However, most existing benchmarks primarily focus on Python and are still restricted to a limited nu… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 22 pages

  6. arXiv:2406.05247  [pdf, other

    cs.IR

    Measuring Fairness in Large-Scale Recommendation Systems with Missing Labels

    Authors: Yulong Dong, Kun Jin, Xinghai Hu, Yang Liu

    Abstract: In large-scale recommendation systems, the vast array of items makes it infeasible to obtain accurate user preferences for each product, resulting in a common issue of missing labels. Typically, only items previously recommended to users have associated ground truth data. Although there is extensive research on fairness concerning fully observed user-item interactions, the challenge of fairness in… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  7. arXiv:2404.10514  [pdf, other

    cs.DS

    Simple $k$-crashing Plan with a Good Approximation Ratio

    Authors: Ruixi Luo, Kai Jin, Zelin Ye

    Abstract: In project management, a project is typically described as an activity-on-edge network (AOE network), where each activity / job is represented as an edge of some network $N$ (which is a DAG). Some jobs must be finished before others can be started, as described by the topology structure of $N$. It is known that job $j_i$ in normal speed would require $b_i$ days to be finished after it is started.… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    ACM Class: K.6.1

  8. arXiv:2404.09682  [pdf, other

    cs.CL cs.AI

    Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation

    Authors: Juhwan Choi, Jungmin Yun, Kyohoon Jin, YoungBin Kim

    Abstract: The quality of the dataset is crucial for ensuring optimal performance and reliability of downstream task models. However, datasets often contain noisy data inadvertently included during the construction process. Numerous attempts have been made to correct this issue through human annotators. However, hiring and managing human annotators is expensive and time-consuming. As an alternative, recent s… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  9. arXiv:2404.05558  [pdf, other

    eess.IV cs.CV

    JDEC: JPEG Decoding via Enhanced Continuous Cosine Coefficients

    Authors: Woo Kyoung Han, Sunghoon Im, Jaedeok Kim, Kyong Hwan Jin

    Abstract: We propose a practical approach to JPEG image decoding, utilizing a local implicit neural representation with continuous cosine formulation. The JPEG algorithm significantly quantizes discrete cosine transform (DCT) spectra to achieve a high compression rate, inevitably resulting in quality degradation while encoding an image. We have designed a continuous cosine spectrum estimator to address the… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  10. arXiv:2403.17377  [pdf, other

    cs.CV cs.AI cs.LG

    Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance

    Authors: Donghoon Ahn, Hyoungwon Cho, Jaewon Min, Wooseok Jang, Jungwoo Kim, SeonHwa Kim, Hyun Hee Park, Kyong Hwan Jin, Seungryong Kim

    Abstract: Recent studies have demonstrated that diffusion models are capable of generating high-quality samples, but their quality heavily depends on sampling guidance techniques, such as classifier guidance (CG) and classifier-free guidance (CFG). These techniques are often not applicable in unconditional generation or in various downstream tasks such as image restoration. In this paper, we propose a novel… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Project page is available at https://ku-cvlab.github.io/Perturbed-Attention-Guidance

  11. arXiv:2403.15512  [pdf, other

    cs.CL cs.AI cs.LG

    Enhancing Effectiveness and Robustness in a Low-Resource Regime via Decision-Boundary-aware Data Augmentation

    Authors: Kyohoon Jin, Junho Lee, Juhwan Choi, Sangmin Song, Youngbin Kim

    Abstract: Efforts to leverage deep learning models in low-resource regimes have led to numerous augmentation studies. However, the direct application of methods such as mixup and cutout to text data, is limited due to their discrete characteristics. While methods using pretrained language models have exhibited efficiency, they require additional considerations for robustness. Inspired by recent studies on d… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Accepted at LREC-COLING 2024

  12. arXiv:2402.11702  [pdf, other

    cs.SE cs.AI cs.LG

    Can ChatGPT Support Developers? An Empirical Evaluation of Large Language Models for Code Generation

    Authors: Kailun Jin, Chung-Yu Wang, Hung Viet Pham, Hadi Hemmati

    Abstract: Large language models (LLMs) have demonstrated notable proficiency in code generation, with numerous prior studies showing their promising capabilities in various development scenarios. However, these studies mainly provide evaluations in research settings, which leaves a significant gap in understanding how effectively LLMs can support developers in real-world. To address this, we conducted an em… ▽ More

    Submitted 16 March, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: 4 pages, 3 figures, 21st International Conference on Mining Software Repositories (MSR '24), April 15-16, 2024, Lisbon, Portugal

    ACM Class: I.2.2

  13. arXiv:2402.05591  [pdf, ps, other

    cs.CL cs.AI

    SoftEDA: Rethinking Rule-Based Data Augmentation with Soft Labels

    Authors: Juhwan Choi, Kyohoon Jin, Junho Lee, Sangmin Song, Youngbin Kim

    Abstract: Rule-based text data augmentation is widely used for NLP tasks due to its simplicity. However, this method can potentially damage the original meaning of the text, ultimately hurting the performance of the model. To overcome this limitation, we propose a straightforward technique for applying soft labels to augmented data. We conducted experiments across seven different classification tasks and em… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: ICLR 2023 Tiny Papers

  14. arXiv:2402.05584  [pdf, other

    cs.CL cs.AI

    AutoAugment Is What You Need: Enhancing Rule-based Augmentation Methods in Low-resource Regimes

    Authors: Juhwan Choi, Kyohoon Jin, Junho Lee, Sangmin Song, Youngbin Kim

    Abstract: Text data augmentation is a complex problem due to the discrete nature of sentences. Although rule-based augmentation methods are widely adopted in real-world applications because of their simplicity, they suffer from potential semantic damage. Previous researchers have suggested easy data augmentation with soft labels (softEDA), employing label smoothing to mitigate this problem. However, finding… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: EACL 2024 Student Research Workshop

  15. arXiv:2402.05512  [pdf, other

    cs.CL cs.AI

    GPTs Are Multilingual Annotators for Sequence Generation Tasks

    Authors: Juhwan Choi, Eunju Lee, Kyohoon Jin, YoungBin Kim

    Abstract: Data annotation is an essential step for constructing new datasets. However, the conventional approach of data annotation through crowdsourcing is both time-consuming and expensive. In addition, the complexity of this process increases when dealing with low-resource languages owing to the difference in the language pool of crowdworkers. To address these issues, this study proposes an autonomous an… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: EACL 2024 Findings: Camera-ready version

  16. arXiv:2312.15840  [pdf, other

    cs.CV cs.AI

    Masked Contrastive Reconstruction for Cross-modal Medical Image-Report Retrieval

    Authors: Zeqiang Wei, Kai Jin, Xiuzhuang Zhou

    Abstract: Cross-modal medical image-report retrieval task plays a significant role in clinical diagnosis and various medical generative tasks. Eliminating heterogeneity between different modalities to enhance semantic consistency is the key challenge of this task. The current Vision-Language Pretraining (VLP) models, with cross-modal contrastive learning and masked reconstruction as joint training tasks, ca… ▽ More

    Submitted 26 December, 2023; v1 submitted 25 December, 2023; originally announced December 2023.

  17. arXiv:2310.12189  [pdf, other

    cs.CV

    Mesh Represented Recycle Learning for 3D Hand Pose and Mesh Estimation

    Authors: Bosang Kim, Jonghyun Kim, Hyotae Lee, Lanying Jin, Jeongwon Ha, Dowoo Kwon, Jungpyo Kim, Wonhyeok Im, KyungMin Jin, Jungho Lee

    Abstract: In general, hand pose estimation aims to improve the robustness of model performance in the real-world scenes. However, it is difficult to enhance the robustness since existing datasets are obtained in restricted environments to annotate 3D information. Although neural networks quantitatively achieve a high estimation accuracy, unsatisfied results can be observed in visual quality. This discrepanc… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

  18. arXiv:2310.07394  [pdf, ps, other

    cs.CV

    CLIP for Lightweight Semantic Segmentation

    Authors: Ke Jin, Wankou Yang

    Abstract: The large-scale pretrained model CLIP, trained on 400 million image-text pairs, offers a promising paradigm for tackling vision tasks, albeit at the image level. Later works, such as DenseCLIP and LSeg, extend this paradigm to dense prediction, including semantic segmentation, and have achieved excellent results. However, the above methods either rely on CLIP-pretrained visual backbones or use non… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  19. arXiv:2309.11119  [pdf, other

    cs.CV

    BroadBEV: Collaborative LiDAR-camera Fusion for Broad-sighted Bird's Eye View Map Construction

    Authors: Minsu Kim, Giseop Kim, Kyong Hwan Jin, Sunwook Choi

    Abstract: A recent sensor fusion in a Bird's Eye View (BEV) space has shown its utility in various tasks such as 3D detection, map segmentation, etc. However, the approach struggles with inaccurate camera BEV estimation, and a perception of distant areas due to the sparsity of LiDAR points. In this paper, we propose a broad BEV fusion (BroadBEV) that addresses the problems with a spatial synchronization app… ▽ More

    Submitted 8 November, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

  20. arXiv:2309.01409  [pdf, other

    cs.CV

    Implicit Neural Image Stitching

    Authors: Minsu Kim, Jaewon Lee, Byeonghun Lee, Sunghoon Im, Kyong Hwan Jin

    Abstract: Existing frameworks for image stitching often provide visually reasonable stitchings. However, they suffer from blurry artifacts and disparities in illumination, depth level, etc. Although the recent learning-based stitchings relax such disparities, the required methods impose sacrifice of image qualities failing to capture high-frequency details for stitched images. To address the problem, we pro… ▽ More

    Submitted 21 January, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

  21. arXiv:2309.01406  [pdf, other

    cs.CV

    Learning Residual Elastic Warps for Image Stitching under Dirichlet Boundary Condition

    Authors: Minsu Kim, Yongjun Lee, Woo Kyoung Han, Kyong Hwan Jin

    Abstract: Trendy suggestions for learning-based elastic warps enable the deep image stitchings to align images exposed to large parallax errors. Despite the remarkable alignments, the methods struggle with occasional holes or discontinuity between overlapping and non-overlapping regions of a target image as the applied training strategy mostly focuses on overlap region alignment. As a result, they require a… ▽ More

    Submitted 18 October, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

  22. arXiv:2308.13782  [pdf, other

    cs.CL cs.AI

    Planning with Logical Graph-based Language Model for Instruction Generation

    Authors: Fan Zhang, Kebing Jin, Hankz Hankui Zhuo

    Abstract: Despite the superior performance of large language models to generate natural language texts, it is hard to generate texts with correct logic according to a given task, due to the difficulties for neural models to capture implied rules from free-form texts. In this paper, we propose a novel graph-based language model, Logical-GLM, to infuse logic into language models for more valid text generation… ▽ More

    Submitted 5 July, 2024; v1 submitted 26 August, 2023; originally announced August 2023.

    Comments: 9 pages, 8 figures

  23. arXiv:2306.04090  [pdf, other

    cs.AI cs.MA

    PlayBest: Professional Basketball Player Behavior Synthesis via Planning with Diffusion

    Authors: Xiusi Chen, Wei-Yao Wang, Ziniu Hu, David Reynoso, Kun Jin, Mingyan Liu, P. Jeffrey Brantingham, Wei Wang

    Abstract: Dynamically planning in complex systems has been explored to improve decision-making in various domains. Professional basketball serves as a compelling example of a dynamic spatio-temporal game, encompassing context-dependent decision-making. However, processing the diverse on-court signals and navigating the vast space of potential actions and outcomes make it difficult for existing approaches to… ▽ More

    Submitted 16 July, 2024; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: CIKM 2024

  24. arXiv:2306.02582  [pdf, other

    cs.CV

    Enhancing Point Annotations with Superpixel and Confidence Learning Guided for Improving Semi-Supervised OCT Fluid Segmentation

    Authors: Tengjin Weng, Yang Shen, Kai Jin, Zhiming Cheng, Yunxiang Li, Gewen Zhang, Shuai Wang, Yaqi Wang

    Abstract: Automatic segmentation of fluid in Optical Coherence Tomography (OCT) images is beneficial for ophthalmologists to make an accurate diagnosis. Although semi-supervised OCT fluid segmentation networks enhance their performance by introducing additional unlabeled data, the performance enhancement is limited. To address this, we propose Superpixel and Confident Learning Guide Point Annotations Networ… ▽ More

    Submitted 30 November, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Submission to BSPC

  25. arXiv:2306.00256  [pdf, other

    cs.LG

    DSGD-CECA: Decentralized SGD with Communication-Optimal Exact Consensus Algorithm

    Authors: Lisang Ding, Kexin Jin, Bicheng Ying, Kun Yuan, Wotao Yin

    Abstract: Decentralized Stochastic Gradient Descent (SGD) is an emerging neural network training approach that enables multiple agents to train a model collaboratively and simultaneously. Rather than using a central parameter server to collect gradients from all the agents, each agent keeps a copy of the model parameters and communicates with a small number of other agents to exchange model updates. Their c… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

  26. arXiv:2305.17866  [pdf, other

    cs.AI cs.IR

    Sequential Condition Evolved Interaction Knowledge Graph for Traditional Chinese Medicine Recommendation

    Authors: Jingjin Liu, Hankz Hankui Zhuo, Kebing Jin, Jiamin Yuan, Zhimin Yang, Zhengan Yao

    Abstract: Traditional Chinese Medicine (TCM) has a rich history of utilizing natural herbs to treat a diversity of illnesses. In practice, TCM diagnosis and treatment are highly personalized and organically holistic, requiring comprehensive consideration of the patient's state and symptoms over time. However, existing TCM recommendation approaches overlook the changes in patient status and only explore pote… ▽ More

    Submitted 6 October, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

  27. arXiv:2305.13882  [pdf, other

    stat.ML cs.LG stat.CO

    Subsampling Error in Stochastic Gradient Langevin Diffusions

    Authors: Kexin Jin, Chenguang Liu, Jonas Latz

    Abstract: The Stochastic Gradient Langevin Dynamics (SGLD) are popularly used to approximate Bayesian posterior distributions in statistical learning procedures with large-scale data. As opposed to many usual Markov chain Monte Carlo (MCMC) algorithms, SGLD is not stationary with respect to the posterior distribution; two sources of error appear: The first error is introduced by an Euler--Maruyama discretis… ▽ More

    Submitted 26 April, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: AISTATS 2024

    MSC Class: 65C05; 62F15

  28. arXiv:2305.13608  [pdf, other

    cs.CV

    VDD: Varied Drone Dataset for Semantic Segmentation

    Authors: Wenxiao Cai, Ke Jin, Jinyan Hou, Cong Guo, Letian Wu, Wankou Yang

    Abstract: Semantic segmentation of drone images is critical for various aerial vision tasks as it provides essential semantic details to understand scenes on the ground. Ensuring high accuracy of semantic segmentation models for drones requires access to diverse, large-scale, and high-resolution datasets, which are often scarce in the field of aerial image processing. While existing datasets typically focus… ▽ More

    Submitted 2 July, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

  29. arXiv:2305.05090  [pdf, other

    cs.LG cs.DC math.OC

    Performative Federated Learning: A Solution to Model-Dependent and Heterogeneous Distribution Shifts

    Authors: Kun Jin, Tongxin Yin, Zhongzhu Chen, Zeyu Sun, Xueru Zhang, Yang Liu, Mingyan Liu

    Abstract: We consider a federated learning (FL) system consisting of multiple clients and a server, where the clients aim to collaboratively learn a common decision model from their distributed data. Unlike the conventional FL framework that assumes the client's data is static, we consider scenarios where the clients' data distributions may be reshaped by the deployed decision model. In this work, we levera… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

  30. arXiv:2304.12566  [pdf, other

    cs.LG

    AdaNPC: Exploring Non-Parametric Classifier for Test-Time Adaptation

    Authors: Yi-Fan Zhang, Xue Wang, Kexin Jin, Kun Yuan, Zhang Zhang, Liang Wang, Rong Jin, Tieniu Tan

    Abstract: Many recent machine learning tasks focus to develop models that can generalize to unseen distributions. Domain generalization (DG) has become one of the key topics in various fields. Several literatures show that DG can be arbitrarily hard without exploiting target domain information. To address this issue, test-time adaptive (TTA) methods are proposed. Existing TTA methods require offline target… ▽ More

    Submitted 9 May, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: 30 pages, 12 figures

    Journal ref: The Fortieth International Conference on Machine Learning, ICML, 2023

  31. arXiv:2302.07676  [pdf, other

    cs.CV

    DIVOTrack: A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes

    Authors: Shenghao Hao, Peiyuan Liu, Yibing Zhan, Kaixun Jin, Zuozhu Liu, Mingli Song, Jenq-Neng Hwang, Gaoang Wang

    Abstract: Cross-view multi-object tracking aims to link objects between frames and camera views with substantial overlaps. Although cross-view multi-object tracking has received increased attention in recent years, existing datasets still have several issues, including 1) missing real-world scenarios, 2) lacking diverse scenes, 3) owning a limited number of tracks, 4) comprising only static cameras, and 5)… ▽ More

    Submitted 7 October, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: Accepted to IJCV 2023

  32. arXiv:2212.05412  [pdf, other

    cs.AI

    A Hierarchical Temporal Planning-Based Approach for Dynamic Hoist Scheduling Problems

    Authors: Kebing Jin, Yingkai Xiao, Hankz Hankui Zhuo, Renyong Ma

    Abstract: Hoist scheduling has become a bottleneck in electroplating industry applications with the development of autonomous devices. Although there are a few approaches proposed to target at the challenging problem, they generally cannot scale to large-scale scheduling problems. In this paper, we formulate the hoist scheduling problem as a new temporal planning problem in the form of adapted PDDL, and pro… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

  33. arXiv:2211.15868  [pdf, other

    cs.CV

    Kinematic-aware Hierarchical Attention Network for Human Pose Estimation in Videos

    Authors: Kyung-Min Jin, Byoung-Sung Lim, Gun-Hee Lee, Tae-Kyung Kang, Seong-Whan Lee

    Abstract: Previous video-based human pose estimation methods have shown promising results by leveraging aggregated features of consecutive frames. However, most approaches compromise accuracy to mitigate jitter or do not sufficiently comprehend the temporal aspects of human motion. Furthermore, occlusion increases uncertainty between consecutive frames, which results in unsmooth results. To address these is… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

  34. arXiv:2211.15666  [pdf, other

    cs.LG cs.AI cs.CV

    Learning Visual Planning Models from Partially Observed Images

    Authors: Kebing Jin, Zhanhao Xiao, Hankui Hankz Zhuo, Hai Wan, Jiaran Cai

    Abstract: There has been increasing attention on planning model learning in classical planning. Most existing approaches, however, focus on learning planning models from structured data in symbolic representations. It is often difficult to obtain such structured data in real-world scenarios. Although a number of approaches have been developed for learning planning models from fully observed unstructured dat… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: 25 pages, 5 figures

  35. arXiv:2211.00322  [pdf, other

    cs.LG cs.AI cs.CR

    DensePure: Understanding Diffusion Models towards Adversarial Robustness

    Authors: Chaowei Xiao, Zhongzhu Chen, Kun Jin, Jiongxiao Wang, Weili Nie, Mingyan Liu, Anima Anandkumar, Bo Li, Dawn Song

    Abstract: Diffusion models have been recently employed to improve certified robustness through the process of denoising. However, the theoretical understanding of why diffusion models are able to improve the certified robustness is still lacking, preventing from further improvement. In this study, we close this gap by analyzing the fundamental properties of diffusion models and establishing the conditions u… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

  36. arXiv:2210.07881  [pdf, other

    math.OC cs.LG

    Communication-Efficient Topologies for Decentralized Learning with $O(1)$ Consensus Rate

    Authors: Zhuoqing Song, Weijian Li, Kexin Jin, Lei Shi, Ming Yan, Wotao Yin, Kun Yuan

    Abstract: Decentralized optimization is an emerging paradigm in distributed learning in which agents achieve network-wide solutions by peer-to-peer communication without the central server. Since communication tends to be slower than computation, when each agent communicates with only a few neighboring agents per iteration, they can complete iterations faster than with more agents or a central server. Howev… ▽ More

    Submitted 12 March, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  37. arXiv:2209.06208  [pdf, other

    cs.LG cs.AI cs.HC eess.SP

    Identification of Cognitive Workload during Surgical Tasks with Multimodal Deep Learning

    Authors: Kaizhe Jin, Adrian Rubio-Solis, Ravi Naik, Tochukwu Onyeogulu, Amirul Islam, Salman Khan, Izzeddin Teeti, James Kinross, Daniel R Leff, Fabio Cuzzolin, George Mylonas

    Abstract: The operating room (OR) is a dynamic and complex environment consisting of a multidisciplinary team working together in a high take environment to provide safe and efficient patient care. Additionally, surgeons are frequently exposed to multiple psycho-organisational stressors that may cause negative repercussions on their immediate technical performance and long-term health. Many factors can ther… ▽ More

    Submitted 30 September, 2022; v1 submitted 12 September, 2022; originally announced September 2022.

  38. arXiv:2209.05056  [pdf, other

    cs.CV

    Situation Awareness for Automated Surgical Check-listing in AI-Assisted Operating Room

    Authors: Tochukwu Onyeogulu, Salman Khan, Izzeddin Teeti, Amirul Islam, Kaizhe Jin, Adrian Rubio-Solis, Ravi Naik, George Mylonas, Fabio Cuzzolin

    Abstract: Nowadays, there are more surgical procedures that are being performed using minimally invasive surgery (MIS). This is due to its many benefits, such as minimal post-operative problems, less bleeding, minor scarring, and a speedy recovery. However, the MIS's constrained field of view, small operating room, and indirect viewing of the operating scene could lead to surgical tools colliding and potent… ▽ More

    Submitted 23 September, 2022; v1 submitted 12 September, 2022; originally announced September 2022.

  39. arXiv:2209.03705  [pdf, other

    math.OC cs.LG math.NA

    Losing momentum in continuous-time stochastic optimisation

    Authors: Kexin Jin, Jonas Latz, Chenguang Liu, Alessandro Scagliotti

    Abstract: The training of deep neural networks and other modern machine learning models usually consists in solving non-convex optimisation problems that are high-dimensional and subject to large-scale data. Here, momentum-based stochastic optimisation algorithms have become especially popular in recent years. The stochasticity arises from data subsampling which reduces computational cost. Moreover, both, m… ▽ More

    Submitted 8 September, 2022; originally announced September 2022.

    MSC Class: 90C15; 37N40; 37H30; 65C40; 68T07; 68W20

  40. arXiv:2207.09725  [pdf, other

    cs.CV

    OTPose: Occlusion-Aware Transformer for Pose Estimation in Sparsely-Labeled Videos

    Authors: Kyung-Min Jin, Gun-Hee Lee, Seong-Whan Lee

    Abstract: Although many approaches for multi-human pose estimation in videos have shown profound results, they require densely annotated data which entails excessive man labor. Furthermore, there exists occlusion and motion blur that inevitably lead to poor estimation performance. To address these problems, we propose a method that leverages an attention mask for occluded joints and encodes temporal depende… ▽ More

    Submitted 27 July, 2022; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: 6 pages

    MSC Class: 68T45

  41. arXiv:2207.01831  [pdf, other

    cs.CV

    Learning Local Implicit Fourier Representation for Image Warping

    Authors: Jaewon Lee, Kwang Pyo Choi, Kyong Hwan Jin

    Abstract: Image warping aims to reshape images defined on rectangular grids into arbitrary shapes. Recently, implicit neural functions have shown remarkable performances in representing images in a continuous manner. However, a standalone multi-layer perceptron suffers from learning high-frequency Fourier coefficients. In this paper, we propose a local texture estimator for image warping (LTEW) followed by… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

    Comments: ECCV 2022 camera-ready version (https://ipl.dgist.ac.kr/LTEW.pdf)

  42. arXiv:2207.00768  [pdf, other

    cs.DS math.OC

    Sum-of-Max Partition under a Knapsack Constraint

    Authors: Kai Jin, Danna Zhang, Canhui Zhang

    Abstract: Sequence partition problems arise in many fields, such as sequential data analysis, information transmission, and parallel computing. In this paper, we study the following partition problem variant: given a sequence of $n$ items $1,\ldots,n$, where each item $i$ is associated with weight $w_i$ and another parameter $s_i$, partition the sequence into several consecutive subsequences, so that the to… ▽ More

    Submitted 11 October, 2022; v1 submitted 2 July, 2022; originally announced July 2022.

    ACM Class: F.2.2

  43. arXiv:2205.00698  [pdf

    eess.IV cs.CV cs.LG

    Unsupervised Denoising of Optical Coherence Tomography Images with Dual_Merged CycleWGAN

    Authors: Jie Du, Xujian Yang, Kecheng Jin, Xuanzheng Qi, Hu Chen

    Abstract: Nosie is an important cause of low quality Optical coherence tomography (OCT) image. The neural network model based on Convolutional neural networks(CNNs) has demonstrated its excellent performance in image denoising. However, OCT image denoising still faces great challenges because many previous neural network algorithms required a large number of labeled data, which might cost much time or is ex… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

    Comments: Mr. Hu Chen is our corresponding author

  44. arXiv:2204.11213  [pdf, ps, other

    cs.DS

    String Rearrangement Inequalities and a Total Order Between Primitive Words

    Authors: Ruixi Luo, Taikun Zhu, Kai Jin

    Abstract: We study the following rearrangement problem: Given $n$ words, rearrange and concatenate them so that the obtained string is lexicographically smallest (or largest, respectively). We show that this problem reduces to sorting the given words so that their repeating strings are non-decreasing (or non-increasing, respectively), where the repeating string of a word $A$ refers to the infinite string… ▽ More

    Submitted 24 April, 2022; originally announced April 2022.

    MSC Class: 68R15 ACM Class: F.2.2

  45. arXiv:2203.09952  [pdf, other

    cs.AI

    Conquering Ghosts: Relation Learning for Information Reliability Representation and End-to-End Robust Navigation

    Authors: Kefan Jin, Xingyao Han

    Abstract: Environmental disturbances, such as sensor data noises, various lighting conditions, challenging weathers and external adversarial perturbations, are inevitable in real self-driving applications. Existing researches and testings have shown that they can severely influence the vehicles perception ability and performance, one of the main issue is the false positive detection, i.e., the ghost object… ▽ More

    Submitted 20 February, 2023; v1 submitted 14 March, 2022; originally announced March 2022.

  46. arXiv:2202.08373  [pdf, other

    cs.LG cs.AI cs.CL

    Text-Based Action-Model Acquisition for Planning

    Authors: Kebing Jin, Huaixun Chen, Hankz Hankui Zhuo

    Abstract: Although there have been approaches that are capable of learning action models from plan traces, there is no work on learning action models from textual observations, which is pervasive and much easier to collect from real-world applications compared to plan traces. In this paper we propose a novel approach to learning action models from natural language texts by integrating Constraint Satisfactio… ▽ More

    Submitted 17 February, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

  47. arXiv:2202.07138  [pdf, other

    cs.AI cs.CL

    Integrating AI Planning with Natural Language Processing: A Combination of Explicit and Tacit Knowledge

    Authors: Kebing Jin, Hankz Hankui Zhuo

    Abstract: Natural language processing (NLP) aims at investigating the interactions between agents and humans, processing and analyzing large amounts of natural language data. Large-scale language models play an important role in current natural language processing. However, the challenges of explainability and complexity come along with the developments of language models. One way is to introduce logical re… ▽ More

    Submitted 13 April, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

  48. arXiv:2202.04070  [pdf

    cs.NI

    Joint user association and power allocation in ultra-dense mmWave networks: a multi-connectivity approach

    Authors: Ailing Chen, Shengchang Li, Jichen Xiong, Kezhong Jin, Zhenzhou Tang

    Abstract: In ultra-dense millimeter wave (mmWave) networks, mmWave signals suffer from severe path losses and are easily blocked by obstacles. Meanwhile, ultra-dense deployment causes excessive handovers, which reduces the data link reliability. To alleviate the above issues, the novel technology, known as multi-connectivity enabled user association (MCUA) is incorporated in this letter. We aim to jointly o… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

  49. arXiv:2112.09836  [pdf, other

    cs.AI cs.LG

    Creativity of AI: Hierarchical Planning Model Learning for Facilitating Deep Reinforcement Learning

    Authors: Hankz Hankui Zhuo, Shuting Deng, Mu Jin, Zhihao Ma, Kebing Jin, Chen Chen, Chao Yu

    Abstract: Despite of achieving great success in real-world applications, Deep Reinforcement Learning (DRL) is still suffering from three critical issues, i.e., data efficiency, lack of the interpretability and transferability. Recent research shows that embedding symbolic knowledge into DRL is promising in addressing those challenges. Inspired by this, we introduce a novel deep reinforcement learning framew… ▽ More

    Submitted 7 July, 2023; v1 submitted 17 December, 2021; originally announced December 2021.

  50. arXiv:2112.06028  [pdf, other

    cs.AI

    Retrosynthetic Planning with Experience-Guided Monte Carlo Tree Search

    Authors: Siqi Hong, Hankz Hankui Zhuo, Kebing Jin, Guang Shao, Zhanwen Zhou

    Abstract: In retrosynthetic planning, the huge number of possible routes to synthesize a complex molecule using simple building blocks leads to a combinatorial explosion of possibilities. Even experienced chemists often have difficulty to select the most promising transformations. The current approaches rely on human-defined or machine-trained score functions which have limited chemical knowledge or use exp… ▽ More

    Submitted 9 June, 2023; v1 submitted 11 December, 2021; originally announced December 2021.