Skip to main content

Showing 1–50 of 118 results for author: Yi, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.04332  [pdf

    cs.ET

    Energy Efficient Knapsack Optimization Using Probabilistic Memristor Crossbars

    Authors: Jinzhan Li, Suhas Kumar, Su-in Yi

    Abstract: Constrained optimization underlies crucial societal problems (for instance, stock trading and bandwidth allocation), but is often computationally hard (complexity grows exponentially with problem size). The big-data era urgently demands low-latency and low-energy optimization at the edge, which cannot be handled by digital processors due to their non-parallel von Neumann architecture. Recent effor… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 16 pages, 8 figures

  2. arXiv:2407.04295  [pdf, other

    cs.CR cs.AI cs.CL cs.LG

    Jailbreak Attacks and Defenses Against Large Language Models: A Survey

    Authors: Sibo Yi, Yule Liu, Zhen Sun, Tianshuo Cong, Xinlei He, Jiaxing Song, Ke Xu, Qi Li

    Abstract: Large Language Models (LLMs) have performed exceptionally in various text-generative tasks, including question answering, translation, code completion, etc. However, the over-assistance of LLMs has raised the challenge of "jailbreaking", which induces the model to generate malicious responses against the usage policy and society by designing adversarial prompts. With the emergence of jailbreak att… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  3. arXiv:2406.12802  [pdf, other

    cs.RO

    Decentralized Multi-Robot Line-of-Sight Connectivity Maintenance under Uncertainty

    Authors: Yupeng Yang, Yiwei Lyu, Yanze Zhang, Sha Yi, Wenhao Luo

    Abstract: In this paper, we propose a novel decentralized control method to maintain Line-of-Sight connectivity for multi-robot networks in the presence of Guassian-distributed localization uncertainty. In contrast to most existing work that assumes perfect positional information about robots or enforces overly restrictive rigid formation against uncertainty, our method enables robots to preserve Line-of-Si… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted by RSS 2024

  4. arXiv:2406.12225  [pdf, other

    cs.CV

    The Solution for CVPR2024 Foundational Few-Shot Object Detection Challenge

    Authors: Hongpeng Pan, Shifeng Yi, Shouwei Yang, Lei Qi, Bing Hu, Yi Xu, Yang Yang

    Abstract: This report introduces an enhanced method for the Foundational Few-Shot Object Detection (FSOD) task, leveraging the vision-language model (VLM) for object detection. However, on specific datasets, VLM may encounter the problem where the detected targets are misaligned with the target concepts of interest. This misalignment hinders the zero-shot performance of VLM and the application of fine-tunin… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: CVPR2024 Foundational Few-Shot Object Detection Challenge

  5. arXiv:2405.11868  [pdf, other

    cs.LG cs.AI cs.CE cs.IR cs.SI

    Towards Graph Contrastive Learning: A Survey and Beyond

    Authors: Wei Ju, Yifan Wang, Yifang Qin, Zhengyang Mao, Zhiping Xiao, Junyu Luo, Junwei Yang, Yiyang Gu, Dongjie Wang, Qingqing Long, Siyu Yi, Xiao Luo, Ming Zhang

    Abstract: In recent years, deep learning on graphs has achieved remarkable success in various domains. However, the reliance on annotated graph data remains a significant bottleneck due to its prohibitive cost and time-intensive nature. To address this challenge, self-supervised learning (SSL) on graphs has gained increasing attention and has made significant progress. SSL enables machine learning models to… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  6. arXiv:2405.04773  [pdf, other

    cs.LG cs.AI cs.IR cs.SI

    Hypergraph-enhanced Dual Semi-supervised Graph Classification

    Authors: Wei Ju, Zhengyang Mao, Siyu Yi, Yifang Qin, Yiyang Gu, Zhiping Xiao, Yifan Wang, Xiao Luo, Ming Zhang

    Abstract: In this paper, we study semi-supervised graph classification, which aims at accurately predicting the categories of graphs in scenarios with limited labeled graphs and abundant unlabeled graphs. Despite the promising capability of graph neural networks (GNNs), they typically require a large number of costly labeled graphs, while a wealth of unlabeled graphs fail to be effectively utilized. Moreove… ▽ More

    Submitted 28 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted by Proceedings of the 41st International Conference on Machine Learning (ICML 2024)

  7. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  8. arXiv:2403.13660  [pdf

    cs.CV

    ProMamba: Prompt-Mamba for polyp segmentation

    Authors: Jianhao Xie, Ruofan Liao, Ziang Zhang, Sida Yi, Yuesheng Zhu, Guibo Luo

    Abstract: Detecting polyps through colonoscopy is an important task in medical image segmentation, which provides significant assistance and reference value for clinical surgery. However, accurate segmentation of polyps is a challenging task due to two main reasons. Firstly, polyps exhibit various shapes and colors. Secondly, the boundaries between polyps and their normal surroundings are often unclear. Add… ▽ More

    Submitted 26 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: 10 pages, 2 figures,3 tabels

  9. arXiv:2403.04468  [pdf, other

    cs.LG cs.AI cs.IR cs.SI

    A Survey of Graph Neural Networks in Real world: Imbalance, Noise, Privacy and OOD Challenges

    Authors: Wei Ju, Siyu Yi, Yifan Wang, Zhiping Xiao, Zhengyang Mao, Hourun Li, Yiyang Gu, Yifang Qin, Nan Yin, Senzhang Wang, Xinwang Liu, Xiao Luo, Philip S. Yu, Ming Zhang

    Abstract: Graph-structured data exhibits universality and widespread applicability across diverse domains, such as social network analysis, biochemistry, financial fraud detection, and network security. Significant strides have been made in leveraging Graph Neural Networks (GNNs) to achieve remarkable success in these areas. However, in real-world scenarios, the training environment for models is often far… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  10. arXiv:2403.01091  [pdf, other

    cs.LG cs.AI cs.IR cs.SI

    COOL: A Conjoint Perspective on Spatio-Temporal Graph Neural Network for Traffic Forecasting

    Authors: Wei Ju, Yusheng Zhao, Yifang Qin, Siyu Yi, Jingyang Yuan, Zhiping Xiao, Xiao Luo, Xiting Yan, Ming Zhang

    Abstract: This paper investigates traffic forecasting, which attempts to forecast the future state of traffic based on historical situations. This problem has received ever-increasing attention in various scenarios and facilitated the development of numerous downstream applications such as urban planning and transportation management. However, the efficacy of existing methods remains sub-optimal due to thei… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: Accepted by Information Fusion 2024

  11. arXiv:2402.00447  [pdf, ps, other

    cs.LG cs.AI cs.SI

    A Survey of Data-Efficient Graph Learning

    Authors: Wei Ju, Siyu Yi, Yifan Wang, Qingqing Long, Junyu Luo, Zhiping Xiao, Ming Zhang

    Abstract: Graph-structured data, prevalent in domains ranging from social networks to biochemical analysis, serve as the foundation for diverse real-world systems. While graph neural networks demonstrate proficiency in modeling this type of data, their success is often reliant on significant amounts of labeled data, posing a challenge in practical scenarios with limited annotation resources. To tackle this… ▽ More

    Submitted 19 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted by Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence (IJCAI 2024)

  12. arXiv:2401.08718  [pdf, other

    cs.LG

    Investigating Fouling Efficiency in Football Using Expected Booking (xB) Model

    Authors: Adnan Azmat, Su Su Yi

    Abstract: This paper introduces the Expected Booking (xB) model, a novel metric designed to estimate the likelihood of a foul resulting in a yellow card in football. Through three iterative experiments, employing ensemble methods, the model demonstrates improved performance with additional features and an expanded dataset. Analysis of FIFA World Cup 2022 data validates the model's efficacy in providing insi… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  13. arXiv:2311.15210  [pdf, other

    cs.LG math.ST

    Topology combined machine learning for consonant recognition

    Authors: Pingyao Feng, Siheng Yi, Qingrui Qu, Zhiwang Yu, Yifei Zhu

    Abstract: In artificial-intelligence-aided signal processing, existing deep learning models often exhibit a black-box structure, and their validity and comprehensibility remain elusive. The integration of topological methods, despite its relatively nascent application, serves a dual purpose of making models more interpretable as well as extracting structural information from time-dependent data for smarter… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  14. arXiv:2310.04162  [pdf, other

    cs.RO

    Light-LOAM: A Lightweight LiDAR Odometry and Mapping based on Graph-Matching

    Authors: Shiquan Yi, Yang Lyu, Lin Hua, Quan Pan, Chunhui Zhao

    Abstract: Simultaneous Localization and Mapping (SLAM) plays an important role in robot autonomy. Reliability and efficiency are the two most valued features for applying SLAM in robot applications. In this paper, we consider achieving a reliable LiDAR-based SLAM function in computation-limited platforms, such as quadrotor UAVs based on graph-based point cloud association. First, contrary to most works sele… ▽ More

    Submitted 6 October, 2023; originally announced October 2023.

  15. Continuous 3D Myocardial Motion Tracking via Echocardiography

    Authors: Chengkang Shen, Hao Zhu, You Zhou, Yu Liu, Si Yi, Lili Dong, Weipeng Zhao, David J. Brady, Xun Cao, Zhan Ma, Yi Lin

    Abstract: Myocardial motion tracking stands as an essential clinical tool in the prevention and detection of cardiovascular diseases (CVDs), the foremost cause of death globally. However, current techniques suffer from incomplete and inaccurate motion estimation of the myocardium in both spatial and temporal dimensions, hindering the early identification of myocardial dysfunction. To address these challenge… ▽ More

    Submitted 27 June, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: 18 pages, 11 figures

    Journal ref: IEEE Transactions on Medical Imaging, June 2024

  16. arXiv:2309.05287  [pdf, other

    cs.SD cs.AI eess.AS

    Addressing Feature Imbalance in Sound Source Separation

    Authors: Jaechang Kim, Jeongyeon Hwang, Soheun Yi, Jaewoong Cho, Jungseul Ok

    Abstract: Neural networks often suffer from a feature preference problem, where they tend to overly rely on specific features to solve a task while disregarding other features, even if those neglected features are essential for the task. Feature preference problems have primarily been investigated in classification task. However, we observe that feature preference occurs in high-dimensional regression task,… ▽ More

    Submitted 4 October, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

  17. arXiv:2309.04694  [pdf, other

    cs.LG

    Redundancy-Free Self-Supervised Relational Learning for Graph Clustering

    Authors: Si-Yu Yi, Wei Ju, Yifang Qin, Xiao Luo, Luchen Liu, Yong-Dao Zhou, Ming Zhang

    Abstract: Graph clustering, which learns the node representations for effective cluster assignments, is a fundamental yet challenging task in data analysis and has received considerable attention accompanied by graph neural networks in recent years. However, most existing methods overlook the inherent relational information among the non-independent and non-identically distributed nodes in a graph. Due to t… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

    Comments: Accepted by IEEE Transactions on Neural Networks and Learning Systems (TNNLS 2024)

  18. arXiv:2309.00962  [pdf, other

    cs.RO cs.CV

    NTU4DRadLM: 4D Radar-centric Multi-Modal Dataset for Localization and Mapping

    Authors: Jun Zhang, Huayang Zhuge, Yiyao Liu, Guohao Peng, Zhenyu Wu, Haoyuan Zhang, Qiyang Lyu, Heshan Li, Chunyang Zhao, Dogan Kircali, Sanat Mharolkar, Xun Yang, Su Yi, Yuanzhe Wang, Danwei Wang

    Abstract: Simultaneous Localization and Mapping (SLAM) is moving towards a robust perception age. However, LiDAR- and visual- SLAM may easily fail in adverse conditions (rain, snow, smoke and fog, etc.). In comparison, SLAM based on 4D Radar, thermal camera and IMU can work robustly. But only a few literature can be found. A major reason is the lack of related datasets, which seriously hinders the research.… ▽ More

    Submitted 2 September, 2023; originally announced September 2023.

    Comments: 2023 IEEE International Intelligent Transportation Systems Conference (ITSC 2023)

  19. arXiv:2308.16609  [pdf, other

    cs.LG cs.AI cs.IR cs.SI

    Towards Long-Tailed Recognition for Graph Classification via Collaborative Experts

    Authors: Siyu Yi, Zhengyang Mao, Wei Ju, Yongdao Zhou, Luchen Liu, Xiao Luo, Ming Zhang

    Abstract: Graph classification, aiming at learning the graph-level representations for effective class assignments, has received outstanding achievements, which heavily relies on high-quality datasets that have balanced class distribution. In fact, most real-world graph data naturally presents a long-tailed form, where the head classes occupy much more samples than the tail classes, it thus is essential to… ▽ More

    Submitted 5 September, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted by IEEE Transactions on Big Data (TBD 2024)

  20. arXiv:2308.10058  [pdf

    cs.CV

    R-C-P Method: An Autonomous Volume Calculation Method Using Image Processing and Machine Vision

    Authors: MA Muktadir, Sydney Parker, Sun Yi

    Abstract: Machine vision and image processing are often used with sensors for situation awareness in autonomous systems, from industrial robots to self-driving cars. The 3D depth sensors, such as LiDAR (Light Detection and Ranging), Radar, are great invention for autonomous systems. Due to the complexity of the setup, LiDAR may not be suitable for some operational environments, for example, a space environm… ▽ More

    Submitted 3 February, 2024; v1 submitted 19 August, 2023; originally announced August 2023.

  21. FoodWise: Food Waste Reduction and Behavior Change on Campus with Data Visualization and Gamification

    Authors: Yue Yu, Sophia Yi, Xi Nan, Leo Yu-Ho Lo, Kento Shigyo, Liwenhan Xie, Jeffry Wicaksana, Kwang-Ting Cheng, Huamin Qu

    Abstract: Food waste presents a substantial challenge with significant environmental and economic ramifications, and its severity on campus environments is of particular concern. In response to this, we introduce FoodWise, a dual-component system tailored to inspire and incentivize campus communities to reduce food waste. The system consists of a data storytelling dashboard that graphically displays food wa… ▽ More

    Submitted 27 July, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: Accepted in ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies (COMPASS) 2023

  22. arXiv:2307.05906  [pdf, other

    cs.LG

    Mini-Batch Optimization of Contrastive Loss

    Authors: Jaewoong Cho, Kartik Sreenivasan, Keon Lee, Kyunghoo Mun, Soheun Yi, Jeong-Gwan Lee, Anna Lee, Jy-yong Sohn, Dimitris Papailiopoulos, Kangwook Lee

    Abstract: Contrastive learning has gained significant attention as a method for self-supervised learning. The contrastive loss function ensures that embeddings of positive sample pairs (e.g., different samples from the same class or different views of the same object) are similar, while embeddings of negative pairs are dissimilar. Practical constraints such as large memory requirements make it challenging t… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

  23. arXiv:2307.05358  [pdf, other

    cs.LG cs.AI

    Combating Data Imbalances in Federated Semi-supervised Learning with Dual Regulators

    Authors: Sikai Bai, Shuaicheng Li, Weiming Zhuang, Jie Zhang, Song Guo, Kunlin Yang, Jun Hou, Shuai Zhang, Junyu Gao, Shuai Yi

    Abstract: Federated learning has become a popular method to learn from decentralized heterogeneous data. Federated semi-supervised learning (FSSL) emerges to train models from a small fraction of labeled data due to label scarcity on decentralized clients. Existing FSSL methods assume independent and identically distributed (IID) labeled data across clients and consistent class distribution between labeled… ▽ More

    Submitted 11 March, 2024; v1 submitted 11 July, 2023; originally announced July 2023.

    Journal ref: The 38th Annual AAAI Conference on Artificial Intelligence, 2024

  24. arXiv:2306.16265  [pdf, other

    cs.RO

    Reconfigurable Robot Control Using Flexible Coupling Mechanisms

    Authors: Sha Yi, Katia Sycara, Zeynep Temel

    Abstract: Reconfigurable robot swarms are capable of connecting with each other to form complex structures. Current mechanical or magnetic connection mechanisms can be complicated to manufacture, consume high power, have a limited load-bearing capacity, or can only form rigid structures. In this paper, we present our low-cost soft anchor design that enables flexible coupling and decoupling between robots. O… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  25. arXiv:2306.10503  [pdf, other

    cs.OS

    A Survey on User-Space Storage and Its Implementations

    Authors: Junzhe Li, Xiurui Pan, Shushu Yi, Jie Zhang

    Abstract: The storage stack in the traditional operating system is primarily optimized towards improving the CPU utilization and hiding the long I/O latency imposed by the slow I/O devices such as hard disk drivers (HDDs). However, the emerging storage media experience significant technique shifts in the past decade, which exhibit high bandwidth and low latency. These high-performance storage devices, unfor… ▽ More

    Submitted 18 June, 2023; originally announced June 2023.

  26. arXiv:2304.13017  [pdf, other

    cs.LG

    DuETT: Dual Event Time Transformer for Electronic Health Records

    Authors: Alex Labach, Aslesha Pokhrel, Xiao Shi Huang, Saba Zuberi, Seung Eun Yi, Maksims Volkovs, Tomi Poutanen, Rahul G. Krishnan

    Abstract: Electronic health records (EHRs) recorded in hospital settings typically contain a wide range of numeric time series data that is characterized by high sparsity and irregular observations. Effective modelling for such data must exploit its time series nature, the semantic relationship between different types of observations, and information in the sparsity structure of the data. Self-supervised Tr… ▽ More

    Submitted 15 August, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: Accepted at MLHC 2023, camera-ready version

  27. arXiv:2304.09247  [pdf, other

    cs.CV

    SigSegment: A Signal-Based Segmentation Algorithm for Identifying Anomalous Driving Behaviours in Naturalistic Driving Videos

    Authors: Kelvin Kwakye, Younho Seong, Armstrong Aboah, Sun Yi

    Abstract: In recent years, distracted driving has garnered considerable attention as it continues to pose a significant threat to public safety on the roads. This has increased the need for innovative solutions that can identify and eliminate distracted driving behavior before it results in fatal accidents. In this paper, we propose a Signal-Based anomaly detection algorithm that segments videos into anomal… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

  28. arXiv:2304.09131  [pdf, other

    cs.CV cs.LG

    Variational Relational Point Completion Network for Robust 3D Classification

    Authors: Liang Pan, Xinyi Chen, Zhongang Cai, Junzhe Zhang, Haiyu Zhao, Shuai Yi, Ziwei Liu

    Abstract: Real-scanned point clouds are often incomplete due to viewpoint, occlusion, and noise, which hampers 3D geometric modeling and perception. Existing point cloud completion methods tend to generate global shape skeletons and hence lack fine local details. Furthermore, they mostly learn a deterministic partial-to-complete mapping, but overlook structural relations in man-made objects. To tackle these… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 12 pages, 10 figures, accepted by PAMI. project webpage: https://mvp-dataset.github.io/. arXiv admin note: substantial text overlap with arXiv:2104.10154

  29. arXiv:2211.10582  [pdf, other

    cs.LG eess.SY

    Linear RNNs Provably Learn Linear Dynamic Systems

    Authors: Lifu Wang, Tianyu Wang, Shengwei Yi, Bo Shen, Bo Hu, Xing Cao

    Abstract: We study the learning ability of linear recurrent neural networks with Gradient Descent. We prove the first theoretical guarantee on linear RNNs to learn any stable linear dynamic system using any a large type of loss functions. For an arbitrary stable linear system with a parameter $ρ_C$ related to the transition matrix $C$, we show that despite the non-convexity of the parameter optimization los… ▽ More

    Submitted 22 October, 2023; v1 submitted 18 November, 2022; originally announced November 2022.

    Comments: 14 pages

  30. arXiv:2211.04454  [pdf, other

    cs.CL cs.LG

    SLATE: A Sequence Labeling Approach for Task Extraction from Free-form Inked Content

    Authors: Apurva Gandhi, Ryan Serrao, Biyi Fang, Gilbert Antonius, Jenna Hong, Tra My Nguyen, Sheng Yi, Ehi Nosakhare, Irene Shaffer, Soundararajan Srinivasan, Vivek Gupta

    Abstract: We present SLATE, a sequence labeling approach for extracting tasks from free-form content such as digitally handwritten (or "inked") notes on a virtual whiteboard. Our approach allows us to create a single, low-latency model to simultaneously perform sentence segmentation and classification of these sentences into task/non-task sentences. SLATE greatly outperforms a baseline two-model (sentence s… ▽ More

    Submitted 17 November, 2022; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: Accepted at EMNLP 2022 as an Industry Track paper

  31. arXiv:2210.04024  [pdf, other

    cs.LG

    Demand Layering for Real-Time DNN Inference with Minimized Memory Usage

    Authors: Mingoo Ji, Saehanseul Yi, Changjin Koo, Sol Ahn, Dongjoo Seo, Nikil Dutt, Jong-Chan Kim

    Abstract: When executing a deep neural network (DNN), its model parameters are loaded into GPU memory before execution, incurring a significant GPU memory burden. There are studies that reduce GPU memory usage by exploiting CPU memory as a swap device. However, this approach is not applicable in most embedded systems with integrated GPUs where CPU and GPU share a common memory. In this regard, we present De… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

    Comments: 14 pages, 16 figures. Accepted to the 43rd IEEE Real-Time Systems Symposium (RTSS), 2022

  32. arXiv:2208.07137  [pdf, other

    cs.CV

    An Empirical Study of Pseudo-Labeling for Image-based 3D Object Detection

    Authors: Xinzhu Ma, Yuan Meng, Yinmin Zhang, Lei Bai, Jun Hou, Shuai Yi, Wanli Ouyang

    Abstract: Image-based 3D detection is an indispensable component of the perception system for autonomous driving. However, it still suffers from the unsatisfying performance, one of the main reasons for which is the limited training data. Unfortunately, annotating the objects in the 3D space is extremely time/resource-consuming, which makes it hard to extend the training set arbitrarily. In this work, we fo… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: tech report

  33. arXiv:2208.00173  [pdf, other

    cs.CV cs.AI cs.LG

    A Survey on Masked Autoencoder for Self-supervised Learning in Vision and Beyond

    Authors: Chaoning Zhang, Chenshuang Zhang, Junha Song, John Seon Keun Yi, Kang Zhang, In So Kweon

    Abstract: Masked autoencoders are scalable vision learners, as the title of MAE \cite{he2022masked}, which suggests that self-supervised learning (SSL) in vision might undertake a similar trajectory as in NLP. Specifically, generative pretext tasks with the masked prediction (e.g., BERT) have become a de facto standard SSL practice in NLP. By contrast, early attempts at generative methods in vision have bee… ▽ More

    Submitted 30 July, 2022; originally announced August 2022.

    Comments: First survey on masked autoencoder (under progress)

  34. arXiv:2207.11965  [pdf, other

    cs.FL

    Machine-checked executable semantics of Stateflow

    Authors: Shicheng Yi, Shuling Wang, Bohua Zhan, Naijun Zhan

    Abstract: Simulink is a widely used model-based development environment for embedded systems. Stateflow is a component of Simulink for modeling event-driven control via hierarchical state machines and flow charts. However, Stateflow lacks an official formal semantics, making it difficult to formally prove properties of its models in safety-critical applications. In this paper, we define a formal semantics f… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: 26 pages

  35. Autoencoding Conditional GAN for Portfolio Allocation Diversification

    Authors: Jun Lu, Shao Yi

    Abstract: Over the decades, the Markowitz framework has been used extensively in portfolio analysis though it puts too much emphasis on the analysis of the market uncertainty rather than on the trend prediction. While generative adversarial network (GAN) and conditional GAN (CGAN) have been explored to generate financial time series and extract features that can help portfolio analysis. The limitation of th… ▽ More

    Submitted 17 June, 2022; originally announced July 2022.

    Journal ref: Applied Economics and Finance 9 (3), 55-68, 2022

  36. arXiv:2207.01909  [pdf, other

    cs.CV cs.LG eess.IV

    StyleFlow For Content-Fixed Image to Image Translation

    Authors: Weichen Fan, Jinghuan Chen, Jiabin Ma, Jun Hou, Shuai Yi

    Abstract: Image-to-image (I2I) translation is a challenging topic in computer vision. We divide this problem into three tasks: strongly constrained translation, normally constrained translation, and weakly constrained translation. The constraint here indicates the extent to which the content or semantic information in the original image is preserved. Although previous approaches have achieved good performan… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

  37. arXiv:2206.10157  [pdf, other

    cs.CV

    Probing Visual-Audio Representation for Video Highlight Detection via Hard-Pairs Guided Contrastive Learning

    Authors: Shuaicheng Li, Feng Zhang, Kunlin Yang, Lingbo Liu, Shinan Liu, Jun Hou, Shuai Yi

    Abstract: Video highlight detection is a crucial yet challenging problem that aims to identify the interesting moments in untrimmed videos. The key to this task lies in effective video representations that jointly pursue two goals, \textit{i.e.}, cross-modal representation learning and fine-grained feature discrimination. In this paper, these two challenges are tackled by not only enriching intra-modality a… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

  38. Differentiable Transient Rendering

    Authors: Shinyoung Yi, Donggun Kim, Kiseok Choi, Adrian Jarabo, Diego Gutierrez, Min H. Kim

    Abstract: Recent differentiable rendering techniques have become key tools to tackle many inverse problems in graphics and vision. Existing models, however, assume steady-state light transport, i.e., infinite speed of light. While this is a safe assumption for many applications, recent advances in ultrafast imaging leverage the wealth of information that can be extracted from the exact time of flight of lig… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

    Journal ref: ACM Transactions on Graphics 40, 6, Article 286 (December 2021)

  39. arXiv:2206.06067  [pdf, other

    cs.CV

    Better Teacher Better Student: Dynamic Prior Knowledge for Knowledge Distillation

    Authors: Zengyu Qiu, Xinzhu Ma, Kunlin Yang, Chunya Liu, Jun Hou, Shuai Yi, Wanli Ouyang

    Abstract: Knowledge distillation (KD) has shown very promising capabilities in transferring learning representations from large models (teachers) to small models (students). However, as the capacity gap between students and teachers becomes larger, existing KD methods fail to achieve better results. Our work shows that the `prior knowledge' is vital to KD, especially when applying large teachers. Particular… ▽ More

    Submitted 23 March, 2023; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: ICLR'23 accepted

  40. arXiv:2205.10507  [pdf

    cs.LG cs.CV

    Travel Time, Distance and Costs Optimization for Paratransit Operations using Graph Convolutional Neural Network

    Authors: Kelvin Kwakye, Younho Seong, Sun Yi

    Abstract: The provision of paratransit services is one option to meet the transportation needs of Vulnerable Road Users (VRUs). Like any other means of transportation, paratransit has obstacles such as high operational costs and longer trip times. As a result, customers are dissatisfied, and paratransit operators have a low approval rating. Researchers have undertaken various studies over the years to bette… ▽ More

    Submitted 21 May, 2022; originally announced May 2022.

  41. arXiv:2204.08970  [pdf, other

    cs.CV eess.IV

    Rendering Nighttime Image Via Cascaded Color and Brightness Compensation

    Authors: Zhihao Li, Si Yi, Zhan Ma

    Abstract: Image signal processing (ISP) is crucial for camera imaging, and neural networks (NN) solutions are extensively deployed for daytime scenes. The lack of sufficient nighttime image dataset and insights on nighttime illumination characteristics poses a great challenge for high-quality rendering using existing NN ISPs. To tackle it, we first built a high-resolution nighttime RAW-RGB (NR2R) dataset wi… ▽ More

    Submitted 21 April, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: Accepted by NTIRE 2022 (CVPR Workshop)

  42. arXiv:2204.04382  [pdf, other

    cs.CV cs.AI cs.DC

    Federated Unsupervised Domain Adaptation for Face Recognition

    Authors: Weiming Zhuang, Xin Gan, Yonggang Wen, Xuesen Zhang, Shuai Zhang, Shuai Yi

    Abstract: Given labeled data in a source domain, unsupervised domain adaptation has been widely adopted to generalize models for unlabeled data in a target domain, whose data distributions are different. However, existing works are inapplicable to face recognition under privacy constraints because they require sharing of sensitive face images between domains. To address this problem, we propose federated un… ▽ More

    Submitted 9 April, 2022; originally announced April 2022.

    Comments: ICME'22. arXiv admin note: substantial text overlap with arXiv:2105.07606

  43. Reducing overestimating and underestimating volatility via the augmented blending-ARCH model

    Authors: Jun Lu, Shao Yi

    Abstract: SVR-GARCH model tends to "backward eavesdrop" when forecasting the financial time series volatility in which case it tends to simply produce the prediction by deviating the previous volatility. Though the SVR-GARCH model has achieved good performance in terms of various performance measurements, trading opportunities, peak or trough behaviors in the time series are all hampered by underestimating… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

    Journal ref: Applied Economics and Finance 9 (2), 48-59, 2022

  44. arXiv:2202.13461  [pdf, other

    cs.RO

    Configuration Control for Physical Coupling of Heterogeneous Robot Swarms

    Authors: Sha Yi, Zeynep Temel, Katia Sycara

    Abstract: In this paper, we present a heterogeneous robot swarm system that can physically couple with each other to form functional structures and dynamically decouple to perform individual tasks. The connection between robots can be formed with a passive coupling mechanism, ensuring minimum energy consumption during coupling and decoupling behavior. The heterogeneity of the system enables the robots to pe… ▽ More

    Submitted 1 March, 2022; v1 submitted 27 February, 2022; originally announced February 2022.

  45. PuzzleBots: Physical Coupling of Robot Swarms

    Authors: Sha Yi, Zeynep Temel, Katia Sycara

    Abstract: Robot swarms have been shown to improve the ability of individual robots by inter-robot collaboration. In this paper, we present the PuzzleBots - a low-cost robotic swarm system where robots can physically couple with each other to form functional structures with minimum energy consumption while maintaining individual mobility to navigate within the environment. Each robot has knobs and holes alon… ▽ More

    Submitted 5 February, 2022; originally announced February 2022.

    Journal ref: 2021 IEEE International Conference on Robotics and Automation (ICRA), pp. 8742-8748

  46. arXiv:2201.07459  [pdf, other

    cs.CV

    PT4AL: Using Self-Supervised Pretext Tasks for Active Learning

    Authors: John Seon Keun Yi, Minseok Seo, Jongchan Park, Dong-Geol Choi

    Abstract: Labeling a large set of data is expensive. Active learning aims to tackle this problem by asking to annotate only the most informative data from the unlabeled set. We propose a novel active learning approach that utilizes self-supervised pretext tasks and a unique data sampler to select data that are both difficult and representative. We discover that the loss of a simple self-supervised pretext t… ▽ More

    Submitted 26 July, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: Code is available at https://github.com/johnsk95/PT4AL Updated for ECCV 2022 submission

  47. arXiv:2201.04019  [pdf, other

    cs.CV cs.AI

    Pyramid Fusion Transformer for Semantic Segmentation

    Authors: Zipeng Qin, Jianbo Liu, Xiaolin Zhang, Maoqing Tian, Aojun Zhou, Shuai Yi, Hongsheng Li

    Abstract: The recently proposed MaskFormer gives a refreshed perspective on the task of semantic segmentation: it shifts from the popular pixel-level classification paradigm to a mask-level classification method. In essence, it generates paired probabilities and masks corresponding to category segments and combines them during inference for the segmentation maps. In our study, we find that per-mask classifi… ▽ More

    Submitted 30 May, 2023; v1 submitted 11 January, 2022; originally announced January 2022.

  48. arXiv:2201.01901  [pdf, other

    cs.CV cs.CL

    Incremental Object Grounding Using Scene Graphs

    Authors: John Seon Keun Yi, Yoonwoo Kim, Sonia Chernova

    Abstract: Object grounding tasks aim to locate the target object in an image through verbal communications. Understanding human command is an important process needed for effective human-robot communication. However, this is challenging because human commands can be ambiguous and erroneous. This paper aims to disambiguate the human's referring expressions by allowing the agent to ask relevant questions base… ▽ More

    Submitted 13 November, 2022; v1 submitted 5 January, 2022; originally announced January 2022.

  49. arXiv:2112.12322  [pdf

    cs.ET physics.optics

    High-order tensor flow processing using integrated photonic circuits

    Authors: Shaofu Xu, Jing Wang, Sicheng Yi, Weiwen Zou

    Abstract: Tensor analytics lays mathematical basis for the prosperous promotion of multiway signal processing. To increase computing throughput, mainstream processors transform tensor convolutions to matrix multiplications to enhance parallelism of computing. However, such order-reducing transformation produces data duplicates and consumes additional memory. Here, we demonstrate an integrated photonic tenso… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

  50. arXiv:2109.07154  [pdf, other

    cs.CL

    Can Language Models be Biomedical Knowledge Bases?

    Authors: Mujeen Sung, Jinhyuk Lee, Sean Yi, Minji Jeon, Sungdong Kim, Jaewoo Kang

    Abstract: Pre-trained language models (LMs) have become ubiquitous in solving various natural language processing (NLP) tasks. There has been increasing interest in what knowledge these LMs contain and how we can extract that knowledge, treating LMs as knowledge bases (KBs). While there has been much work on probing LMs in the general domain, there has been little attention to whether these powerful LMs can… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021. Code available at https://github.com/dmis-lab/BioLAMA