Showing 1–50 of 59 results for author: Hong, W

Searching in archive cs.
  1. arXiv:2406.08035  [pdf, other]

    cs.CV cs.AI

    LVBench: An Extreme Long Video Understanding Benchmark

    Authors: Weihan Wang, Zehai He, Wenyi Hong, Yean Cheng, Xiaohan Zhang, Ji Qi, Shiyu Huang, Bin Xu, Yuxiao Dong, Ming Ding, Jie Tang

    Abstract: Recent progress in multimodal large language models has markedly enhanced the understanding of short videos (typically under one minute), and several evaluation datasets have emerged accordingly. However, these advancements fall short of meeting the demands of real-world applications such as embodied intelligence for long-term decision-making, in-depth movie reviews and discussions, and live sport…

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2406.01641  [pdf, other]

    cs.MA cs.AI

    Reciprocal Reward Influence Encourages Cooperation From Self-Interested Agents

    Authors: John L. Zhou, Weizhe Hong, Jonathan C. Kao

    Abstract: Emergent cooperation among self-interested individuals is a widespread phenomenon in the natural world, but remains elusive in interactions between artificially intelligent agents. Instead, naïve reinforcement learning algorithms typically converge to Pareto-dominated outcomes in even the simplest of social dilemmas. An emerging class of opponent-shaping methods have demonstrated the ability to re…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 9 pages, 4 figures

  3. arXiv:2405.13197  [pdf, other]

    cs.CV

    Global-Local Detail Guided Transformer for Sea Ice Recognition in Optical Remote Sensing Images

    Authors: Zhanchao Huang, Wenjun Hong, Hua Su

    Abstract: The recognition of sea ice is of great significance for reflecting climate change and ensuring the safety of ship navigation. Recently, many deep learning based methods have been proposed and applied to segment and recognize sea ice regions. However, the diverse scales of sea ice areas, the zigzag and fine edge contours, and the difficulty in distinguishing different types of sea ice pose challeng…

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 5 pages, 5 figures

    Journal ref: IEEE IGARSS 2024

  4. arXiv:2405.04312  [pdf, other]

    cs.CV

    Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

    Authors: Zhuoyi Yang, Heyang Jiang, Wenyi Hong, Jiayan Teng, Wendi Zheng, Yuxiao Dong, Ming Ding, Jie Tang

    Abstract: Diffusion models have shown remarkable performance in image generation in recent years. However, due to a quadratic increase in memory during generating ultra-high-resolution images (e.g. 4096*4096), the resolution of generated images is often limited to 1024*1024. In this work, we propose a unidirectional block attention mechanism that can adaptively adjust the memory overhead during the inferenc…

    Submitted 8 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  5. arXiv:2404.12623  [pdf, other]

    cs.LG cs.CR cs.DC

    End-to-End Verifiable Decentralized Federated Learning

    Authors: Chaehyeon Lee, Jonathan Heiss, Stefan Tai, James Won-Ki Hong

    Abstract: Verifiable decentralized federated learning (FL) systems combining blockchains and zero-knowledge proofs (ZKP) make the computational integrity of local learning and global aggregation verifiable across workers. However, they are not end-to-end: data can still be corrupted prior to the learning. In this paper, we propose a verifiable decentralized FL system for end-to-end integrity and authenticit…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 9 pages, 5 figures. This article has been accepted for presentation at the IEEE International Conference on Blockchain and Cryptocurrency (ICBC 2024)

  6. arXiv:2403.14943  [pdf, ps, other]

    cs.IT eess.SP

    Primary Rate Maximization in Movable Antennas Empowered Symbiotic Radio Communications

    Authors: Bin Lyu, Hao Liu, Wenqing Hong, Shimin Gong, Feng Tian

    Abstract: In this paper, we propose a movable antenna (MA) empowered scheme for symbiotic radio (SR) communication systems. Specifically, multiple antennas at the primary transmitter (PT) can be flexibly moved to favorable locations to boost the channel conditions of the primary and secondary transmissions. The primary transmission is achieved by the active transmission from the PT to the primary user (PU),…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: To appear in IEEE VTC-Spring 2024. 6 pages, 5 figures

  7. arXiv:2402.04236  [pdf, other]

    cs.CV cs.CL

    CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations

    Authors: Ji Qi, Ming Ding, Weihan Wang, Yushi Bai, Qingsong Lv, Wenyi Hong, Bin Xu, Lei Hou, Juanzi Li, Yuxiao Dong, Jie Tang

    Abstract: Vision-Language Models (VLMs) have demonstrated their broad effectiveness thanks to extensive training in aligning visual instructions to responses. However, such training of conclusive alignment leads models to ignore essential visual reasoning, further resulting in failures in meticulous visual problems and unfaithful responses. Drawing inspiration from human cognition in solving visual problems…

    Submitted 22 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: 19 pages, 9 figures

  8. arXiv:2312.08914  [pdf, other]

    cs.CV

    CogAgent: A Visual Language Model for GUI Agents

    Authors: Wenyi Hong, Weihan Wang, Qingsong Lv, Jiazheng Xu, Wenmeng Yu, Junhui Ji, Yan Wang, Zihan Wang, Yuxuan Zhang, Juanzi Li, Bin Xu, Yuxiao Dong, Ming Ding, Jie Tang

    Abstract: People are spending an enormous amount of time on digital devices through graphical user interfaces (GUIs), e.g., computer or smartphone screens. Large language models (LLMs) such as ChatGPT can assist people in tasks like writing emails, but struggle to understand and interact with GUIs, thus limiting their potential to increase automation levels. In this paper, we introduce CogAgent, an 18-billi…

    Submitted 21 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 27 pages, 19 figures

  9. arXiv:2312.06708  [pdf, other]

    cs.CV

    Neutral Editing Framework for Diffusion-based Video Editing

    Authors: Sunjae Yoon, Gwanhyeong Koo, Ji Woo Hong, Chang D. Yoo

    Abstract: Text-conditioned image editing has succeeded in various types of editing based on a diffusion framework. Unfortunately, this success did not carry over to a video, which continues to be challenging. Existing video editing systems are still limited to rigid-type editing such as style transfer and object overlay. To this end, this paper proposes Neutral Editing (NeuEdit) framework to enable complex…

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 18 pages, 14 figures

  10. arXiv:2312.05454  [pdf, other]

    cs.CV cs.AI cs.LG

    Model Evaluation for Domain Identification of Unknown Classes in Open-World Recognition: A Proposal

    Authors: Gusti Ahmad Fanshuri Alfarisy, Owais Ahmed Malik, Ong Wee Hong

    Abstract: Open-World Recognition (OWR) is an emerging field that makes a machine learning model competent in rejecting the unknowns, managing them, and incrementally adding novel samples to the base knowledge. However, this broad objective is not practical for an agent that works on a specific task. Not all rejected samples will be used for learning continually in the future. Some novel images in the open e…

    Submitted 8 December, 2023; originally announced December 2023.

  11. arXiv:2311.03707  [pdf, other]

    cs.AI cs.LG cs.MA

    The NeurIPS 2022 Neural MMO Challenge: A Massively Multiagent Competition with Specialization and Trade

    Authors: Enhong Liu, Joseph Suarez, Chenhui You, Bo Wu, Bingcheng Chen, Jun Hu, Jiaxin Chen, Xiaolong Zhu, Clare Zhu, Julian Togelius, Sharada Mohanty, Weijun Hong, Rui Du, Yibing Zhang, Qinwen Wang, Xinhang Li, Zheng Yuan, Xiang Li, Yuejia Huang, Kun Zhang, Hanhui Yang, Shiqi Tang, Phillip Isola

    Abstract: In this paper, we present the results of the NeurIPS-2022 Neural MMO Challenge, which attracted 500 participants and received over 1,600 submissions. Like the previous IJCAI-2022 Neural MMO Challenge, it involved agents from 16 populations surviving in procedurally generated worlds by collecting resources and defeating opponents. This year's competition runs on the latest v1.6 Neural MMO, which in…

    Submitted 6 November, 2023; originally announced November 2023.

  12. arXiv:2311.03079  [pdf, other]

    cs.CV

    CogVLM: Visual Expert for Pretrained Language Models

    Authors: Weihan Wang, Qingsong Lv, Wenmeng Yu, Wenyi Hong, Ji Qi, Yan Wang, Junhui Ji, Zhuoyi Yang, Lei Zhao, Xixuan Song, Jiazheng Xu, Bin Xu, Juanzi Li, Yuxiao Dong, Ming Ding, Jie Tang

    Abstract: We introduce CogVLM, a powerful open-source visual language foundation model. Different from the popular shallow alignment method which maps image features into the input space of language model, CogVLM bridges the gap between the frozen pretrained language model and image encoder by a trainable visual expert module in the attention and FFN layers. As a result, CogVLM enables deep fusion of vision…

    Submitted 4 February, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

  13. arXiv:2310.10606  [pdf, other]

    cs.RO cs.LG

    BayRnTune: Adaptive Bayesian Domain Randomization via Strategic Fine-tuning

    Authors: Tianle Huang, Nitish Sontakke, K. Niranjan Kumar, Irfan Essa, Stefanos Nikolaidis, Dennis W. Hong, Sehoon Ha

    Abstract: Domain randomization (DR), which entails training a policy with randomized dynamics, has proven to be a simple yet effective algorithm for reducing the gap between simulation and the real world. However, DR often requires careful tuning of randomization parameters. Methods like Bayesian Domain Randomization (Bayesian DR) and Active Domain Randomization (Adaptive DR) address this issue by automatin…

    Submitted 16 October, 2023; originally announced October 2023.

  14. arXiv:2309.09725  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Neural Collapse for Unconstrained Feature Model under Cross-entropy Loss with Imbalanced Data

    Authors: Wanli Hong, Shuyang Ling

    Abstract: Recent years have witnessed the huge success of deep neural networks (DNNs) in various tasks of computer vision and text processing. Interestingly, these DNNs with massive number of parameters share similar structural properties on their feature representation and last-layer classifier at terminal phase of training (TPT). Specifically, if the training data are balanced (each class shares the same…

    Submitted 24 October, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 38 pages, 10 figures

  15. arXiv:2309.03350  [pdf, other]

    cs.CV cs.LG

    Relay Diffusion: Unifying diffusion process across resolutions for image synthesis

    Authors: Jiayan Teng, Wendi Zheng, Ming Ding, Wenyi Hong, Jianqiao Wangni, Zhuoyi Yang, Jie Tang

    Abstract: Diffusion models achieved great success in image synthesis, but still face challenges in high-resolution generation. Through the lens of discrete cosine transformation, we find the main reason is that the same noise level on a higher resolution results in a higher Signal-to-Noise Ratio in the frequency domain. In this work, we present Relay Diffusion Model (RDM), which transfers a low-resol…

    Submitted 4 September, 2023; originally announced September 2023.

  16. arXiv:2308.15802  [pdf, other]

    cs.AI

    Benchmarking Robustness and Generalization in Multi-Agent Systems: A Case Study on Neural MMO

    Authors: Yangkun Chen, Joseph Suarez, Junjie Zhang, Chenghui Yu, Bo Wu, Hanmo Chen, Hengman Zhu, Rui Du, Shanliang Qian, Shuai Liu, Weijun Hong, Jinke He, Yibing Zhang, Liang Zhao, Clare Zhu, Julian Togelius, Sharada Mohanty, Jiaxin Chen, Xiu Li, Xiaolong Zhu, Phillip Isola

    Abstract: We present the results of the second Neural MMO challenge, hosted at IJCAI 2022, which received 1600+ submissions. This competition targets robustness and generalization in multi-agent systems: participants train teams of agents to complete a multi-task objective against opponents not seen during training. The competition combines relatively complex environment design with large numbers of agents…

    Submitted 30 August, 2023; originally announced August 2023.

  17. Proprioceptive Learning with Soft Polyhedral Networks

    Authors: Xiaobo Liu, Xudong Han, Wei Hong, Fang Wan, Chaoyang Song

    Abstract: Proprioception is the "sixth sense" that detects limb postures with motor neurons. It requires a natural integration between the musculoskeletal systems and sensory receptors, which is challenging among modern robots that aim for lightweight, adaptive, and sensitive designs at a low cost. Here, we present the Soft Polyhedral Network with an embedded vision for physical interactions, capable of ada…

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: 20 pages, 10 figures, 2 tables, submitted to the International Journal of Robotics Research for review

  18. arXiv:2307.09729  [pdf, other]

    cs.CV cs.MM eess.IV

    NTIRE 2023 Quality Assessment of Video Enhancement Challenge

    Authors: Xiaohong Liu, Xiongkuo Min, Wei Sun, Yulun Zhang, Kai Zhang, Radu Timofte, Guangtao Zhai, Yixuan Gao, Yuqin Cao, Tengchuan Kou, Yunlong Dong, Ziheng Jia, Yilin Li, Wei Wu, Shuming Hu, Sibin Deng, Pengxiang Xiao, Ying Chen, Kai Li, Kai Zhao, Kun Yuan, Ming Sun, Heng Cong, Hao Wang, Lingzhi Fu, et al. (47 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2023 Quality Assessment of Video Enhancement Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2023. This challenge is to address a major challenge in the field of video processing, namely, video quality assessment (VQA) for enhanced videos. The challenge uses the VQA Dataset for Perceptual…

    Submitted 18 July, 2023; originally announced July 2023.

  19. arXiv:2306.14649  [pdf, other]

    cs.NE

    CIMulator: A Comprehensive Simulation Platform for Computing-In-Memory Circuit Macros with Low Bit-Width and Real Memory Materials

    Authors: Hoang-Hiep Le, Md. Aftab Baig, Wei-Chen Hong, Cheng-Hsien Tsai, Cheng-Jui Yeh, Fu-Xiang Liang, I-Ting Huang, Wei-Tzu Tsai, Ting-Yin Cheng, Sourav De, Nan-Yow Chen, Wen-Jay Lee, Ing-Chao Lin, Da-Wei Chang, Darsen D. Lu

    Abstract: This paper presents a simulation platform, namely CIMulator, for quantifying the efficacy of various synaptic devices in neuromorphic accelerators for different neural network architectures. Nonvolatile memory devices, such as resistive random-access memory, ferroelectric field-effect transistor, and volatile static random-access memory devices, can be selected as synaptic devices. A multilayer pe…

    Submitted 26 June, 2023; originally announced June 2023.

  20. Bitcoin Double-Spending Attack Detection using Graph Neural Network

    Authors: Changhoon Kang, Jongsoo Woo, James Won-Ki Hong

    Abstract: Bitcoin transactions include unspent transaction outputs (UTXOs) as their inputs and generate one or more newly owned UTXOs at specified addresses. Each UTXO can only be used as an input in a transaction once, and using it in two or more different transactions is referred to as a double-spending attack. Ultimately, due to the characteristics of the Bitcoin protocol, double-spending is impossible.…

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: 3 pages, 1 table, Accepted as poster at IEEE ICBC 2023

  21. arXiv:2304.03593  [pdf, other]

    cs.RO cs.AI cs.LG

    Deep Reinforcement Learning-Based Mapless Crowd Navigation with Perceived Risk of the Moving Crowd for Mobile Robots

    Authors: Hafiq Anas, Ong Wee Hong, Owais Ahmed Malik

    Abstract: Current state-of-the-art crowd navigation approaches are mainly deep reinforcement learning (DRL)-based. However, DRL-based methods suffer from the issues of generalization and scalability. To overcome these challenges, we propose a method that includes a Collision Probability (CP) in the observation space to give the robot a sense of the level of danger of the moving crowd to help the robot navig…

    Submitted 23 September, 2023; v1 submitted 7 April, 2023; originally announced April 2023.

    Comments: 6 pages, 7 figures

  22. arXiv:2304.00536  [pdf, other]

    cs.RO

    Design of a Jumping Control Framework with Heuristic Landing for Bipedal Robots

    Authors: Jingwen Zhang, Junjie Shen, Yeting Liu, Dennis W. Hong

    Abstract: Generating dynamic jumping motions on legged robots remains a challenging control problem as the full flight phase and large landing impact are expected. Compared to quadrupedal robots or other multi-legged robots, bipedal robots place higher requirements for the control strategy given a much smaller footprint. To solve this problem, a novel heuristic landing planner is proposed in this paper. Wit…

    Submitted 2 April, 2023; originally announced April 2023.

  23. arXiv:2303.09597  [pdf, other]

    cs.RO cs.AI

    Residual Physics Learning and System Identification for Sim-to-real Transfer of Policies on Buoyancy Assisted Legged Robots

    Authors: Nitish Sontakke, Hosik Chae, Sangjoon Lee, Tianle Huang, Dennis W. Hong, Sehoon Ha

    Abstract: The light and soft characteristics of Buoyancy Assisted Lightweight Legged Unit (BALLU) robots have a great potential to provide intrinsically safe interactions in environments involving humans, unlike many heavy and rigid robots. However, their unique and sensitive dynamics impose challenges to obtaining robust control policies in the real world. In this work, we demonstrate robust sim-to-real tr…

    Submitted 16 March, 2023; originally announced March 2023.

  24. arXiv:2211.09861  [pdf, other]

    cs.CV

    Self-Supervised Visual Representation Learning via Residual Momentum

    Authors: Trung X. Pham, Axi Niu, Zhang Kang, Sultan Rizky Madjid, Ji Woo Hong, Daehyeok Kim, Joshua Tian Jin Tee, Chang D. Yoo

    Abstract: Self-supervised learning (SSL) approaches have shown promising capabilities in learning the representation from unlabeled data. Amongst them, momentum-based frameworks have attracted significant attention. Despite being a great success, these momentum-based SSL frameworks suffer from a large gap in representation between the online encoder (student) and the momentum encoder (teacher), which hinder…

    Submitted 21 November, 2022; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: 18 pages, 16 figures

  25. arXiv:2211.09498  [pdf, other]

    cs.NE

    Automatic Construction of Parallel Algorithm Portfolios for Multi-objective Optimization

    Authors: Xiasheng Ma, Shengcai Liu, Wenjing Hong

    Abstract: It has been widely observed that there exists no universal best Multi-objective Evolutionary Algorithm (MOEA) dominating all other MOEAs on all possible Multi-objective Optimization Problems (MOPs). In this work, we advocate using the Parallel Algorithm Portfolio (PAP), which runs multiple MOEAs independently in parallel and gets the best out of them, to combine the advantages of different MOEAs.…

    Submitted 6 June, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

  26. arXiv:2211.04803  [pdf]

    cs.CR

    DSCOT: An NFT-Based Blockchain Architecture for the Authentication of IoT-Enabled Smart Devices in Smart Cities

    Authors: Usman Khalil, Owais Ahmed Malik, Ong Wee Hong, Mueen Uddin

    Abstract: Smart city architecture brings all the underlying architectures, i.e., Internet of Things (IoT), Cyber-Physical Systems (CPSs), Internet of Cyber-Physical Things (IoCPT), and Internet of Everything (IoE), together to work as a system under its umbrella. The goal of smart city architecture is to come up with a solution that may integrate all the real-time response applications. However, the cyber-p…

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: 18 pages, 15 figures, 5 tables, journal

  27. Selective Query-guided Debiasing for Video Corpus Moment Retrieval

    Authors: Sunjae Yoon, Ji Woo Hong, Eunseop Yoon, Dahyun Kim, Junyeong Kim, Hee Suk Yoon, Chang D. Yoo

    Abstract: Video moment retrieval (VMR) aims to localize target moments in untrimmed videos pertinent to a given textual query. Existing retrieval systems tend to rely on retrieval bias as a shortcut and thus, fail to sufficiently learn multi-modal interactions between query and video. This retrieval bias stems from learning frequent co-occurrence patterns between query and moments, which spuriously correlat…

    Submitted 26 November, 2022; v1 submitted 16 October, 2022; originally announced October 2022.

    Comments: 16 pages, 6 figures, Accepted in ECCV 2022

    Journal ref: In European Conference on Computer Vision (pp. 185-200). Springer, Cham (2022)

  28. arXiv:2209.12713  [pdf, other]

    cs.MA cs.LG

    Multi-Agent Sequential Decision-Making via Communication

    Authors: Ziluo Ding, Kefan Su, Weixin Hong, Liwen Zhu, Tiejun Huang, Zongqing Lu

    Abstract: Communication helps agents to obtain information about others so that better coordinated behavior can be learned. Some existing work communicates predicted future trajectory with others, hoping to get clues about what others would do for better coordination. However, circular dependencies sometimes can occur when agents are treated synchronously so it is hard to coordinate decision-making. In this…

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: 20 pages

  29. arXiv:2208.13158  [pdf, other]

    math.OC cs.RO

    Benchmark Results for Bookshelf Organization Problem as Mixed Integer Nonlinear Program with Mode Switch and Collision Avoidance

    Authors: Xuan Lin, Gabriel I. Fernandez, Dennis W. Hong

    Abstract: Mixed integer convex and nonlinear programs, MICP and MINLP, are expressive but require long solving times. Recent work that combines data-driven methods on solver heuristics has shown potential to overcome this issue allowing for applications on larger scale practical problems. To solve mixed-integer bilinear programs online with data-driven methods, several formulations exist including mathemati…

    Submitted 28 August, 2022; originally announced August 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2110.00666

  30. arXiv:2205.15868  [pdf, other]

    cs.CV cs.CL cs.LG

    CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers

    Authors: Wenyi Hong, Ming Ding, Wendi Zheng, Xinghan Liu, Jie Tang

    Abstract: Large-scale pretrained transformers have created milestones in text (GPT-3) and text-to-image (DALL-E and CogView) generation. Its application to video generation is still facing many challenges: The potential huge computation cost makes the training from scratch unaffordable; The scarcity and weak relevance of text-video datasets hinder the model understanding complex movement semantics. In this…

    Submitted 29 May, 2022; originally announced May 2022.

  31. arXiv:2204.14217  [pdf, other]

    cs.CV cs.LG

    CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers

    Authors: Ming Ding, Wendi Zheng, Wenyi Hong, Jie Tang

    Abstract: The development of transformer-based text-to-image models is impeded by their slow generation and complexity for high-resolution images. In this work, we put forward a solution based on hierarchical transformers and local parallel auto-regressive generation. We pretrain a 6B-parameter transformer with a simple and flexible self-supervised task, Cross-modal general language model (CogLM), and fi…

    Submitted 27 May, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

  32. arXiv:2204.13508  [pdf, other]

    cs.CE

    Design of Blockchain-based Travel Rule Compliance System

    Authors: Chaehyeon Lee, Changhoon Kang, Wonseok Choi, Jehoon Lee, Myunghun Cha, Jongsoo Woo, James Won-Ki Hong

    Abstract: In accordance with the guidelines of the Financial Action Task Force (FATF), Virtual Asset Service Providers (VASPs) should comply with a 'travel rule', which requires them to exchange originator's and beneficiary's personal information when transferring virtual assets. In this paper, we propose a novel blockchain-based travel rule compliance system that supports fully-decentralized data exchange.…

    Submitted 28 April, 2022; originally announced April 2022.

    Comments: 3 pages, 1 figure, 1 table. Accepted to IEEE ICBC 2022 as a poster paper

  33. arXiv:2204.11116  [pdf, other]

    cs.RO

    Human-Robot Shared Control for Surgical Robot Based on Context-Aware Sim-to-Real Adaptation

    Authors: Dandan Zhang, Zicong Wu, Junhong Chen, Ruiqi Zhu, Adnan Munawar, Bo Xiao, Yuan Guan, Hang Su, Wuzhou Hong, Yao Guo, Gregory S. Fischer, Benny Lo, Guang-Zhong Yang

    Abstract: Human-robot shared control, which integrates the advantages of both humans and robots, is an effective approach to facilitate efficient surgical operation. Learning from demonstration (LfD) techniques can be used to automate some of the surgical subtasks for the construction of the shared control framework. However, a sufficient amount of data is required for the robot to learn the manoeuvres. Usi…

    Submitted 4 June, 2022; v1 submitted 23 April, 2022; originally announced April 2022.

    Comments: Accepted by ICRA 2022

  34. arXiv:2204.06697  [pdf, other]

    cs.CV cs.AI cs.LG

    HASA: Hybrid Architecture Search with Aggregation Strategy for Echinococcosis Classification and Ovary Segmentation in Ultrasound Images

    Authors: Jikuan Qian, Rui Li, Xin Yang, Yuhao Huang, Mingyuan Luo, Zehui Lin, Wenhui Hong, Ruobing Huang, Haining Fan, Dong Ni, Jun Cheng

    Abstract: Different from handcrafted features, deep neural networks can automatically learn task-specific features from data. Due to this data-driven nature, they have achieved remarkable success in various areas. However, manual design and selection of suitable network architectures are time-consuming and require substantial effort of human experts. To address this problem, researchers have proposed neural…

    Submitted 20 April, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: 17 pages, 11 figures. Accepted by Expert Systems and Applications, 2022

  35. arXiv:2204.05639  [pdf, other]

    cs.NE

    Neural Network Pruning by Cooperative Coevolution

    Authors: Haopu Shang, Jia-Liang Wu, Wenjing Hong, Chao Qian

    Abstract: Neural network pruning is a popular model compression method which can significantly reduce the computing cost with negligible loss of accuracy. Recently, filters are often pruned directly by designing proper criteria or using auxiliary modules to measure their importance, which, however, requires expertise and trial-and-error. Due to the advantage of automation, pruning by evolutionary algorithms…

    Submitted 9 May, 2022; v1 submitted 12 April, 2022; originally announced April 2022.

  36. arXiv:2203.16406  [pdf, other]

    cs.AI cs.GT cs.LG

    PerfectDou: Dominating DouDizhu with Perfect Information Distillation

    Authors: Guan Yang, Minghuan Liu, Weijun Hong, Weinan Zhang, Fei Fang, Guangjun Zeng, Yue Lin

    Abstract: As a challenging multi-player card game, DouDizhu has recently drawn much attention for analyzing competition and collaboration in imperfect-information games. In this paper, we propose PerfectDou, a state-of-the-art DouDizhu AI system that dominates the game, in an actor-critic framework with a proposed technique named perfect information distillation. In detail, we adopt a perfect-training-imper…

    Submitted 27 February, 2024; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: Fix minor errors. 23 pages, 8 figures, 13 tables. Published at NeurIPS 2022. The first two authors contribute equally. Project page at https://github.com/Netease-Games-AI-Lab-Guangzhou/PerfectDou

  37. arXiv:2203.05769  [pdf, other]

    cs.CR

    DeTRM: Decentralised Trust and Reputation Management for Blockchain-based Supply Chains

    Authors: Guntur Dharma Putra, Changhoon Kang, Salil S. Kanhere, James Won-Ki Hong

    Abstract: Blockchain has the potential to enhance supply chain management systems by providing stronger assurance in transparency and traceability of traded commodities. However, blockchain does not overcome the inherent issues of data trust in IoT enabled supply chains. Recent proposals attempt to tackle these issues by incorporating generic trust and reputation management, which does not entirely address…

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: 9 pages, 8 figures. Accepted to IEEE ICBC 2022 as a short paper

  38. arXiv:2202.10583  [pdf, other]

    cs.LG cs.AI

    MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned

    Authors: Anssi Kanervisto, Stephanie Milani, Karolis Ramanauskas, Nicholay Topin, Zichuan Lin, Junyou Li, Jianing Shi, Deheng Ye, Qiang Fu, Wei Yang, Weijun Hong, Zhongyue Huang, Haicheng Chen, Guangjun Zeng, Yue Lin, Vincent Micheli, Eloi Alonso, François Fleuret, Alexander Nikulin, Yury Belousov, Oleg Svidchenko, Aleksei Shpilman

    Abstract: Reinforcement learning competitions advance the field by providing appropriate scope and support to develop solutions toward a specific problem. To promote the development of more broadly applicable methods, organizers need to enforce the use of general techniques, the use of sample-efficient methods, and the reproducibility of the results. While beneficial for the research community, these restri…

    Submitted 17 February, 2022; originally announced February 2022.

    Comments: Under review for PMLR volume on NeurIPS 2021 competitions

  39. arXiv:2202.04243  [pdf, other]

    cs.CV

    Motion-Aware Transformer For Occluded Person Re-identification

    Authors: Mi Zhou, Hongye Liu, Zhekun Lv, Wei Hong, Xiai Chen

    Abstract: Recently, occluded person re-identification (Re-ID) remains a challenging task that people are frequently obscured by other people or obstacles, especially in a crowd massing situation. In this paper, we propose a self-supervised deep learning method to improve the location performance for human parts through occluded person Re-ID. Unlike previous works, we find that motion information derived from…

    Submitted 10 February, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

    Comments: 10 pages, 3 figures

  40. arXiv:2201.11685  [pdf, other]

    cs.LG cs.AI

    Generative Adversarial Exploration for Reinforcement Learning

    Authors: Weijun Hong, Menghui Zhu, Minghuan Liu, Weinan Zhang, Ming Zhou, Yong Yu, Peng Sun

    Abstract: Exploration is crucial for training the optimal reinforcement learning (RL) policy, where the key is to discriminate whether a state visiting is novel. Most previous work focuses on designing heuristic rules or distance metrics to check whether a state is novel without considering such a discrimination process that can be learned. In this paper, we propose a novel method called generative adversar…

    Submitted 27 January, 2022; originally announced January 2022.

  41. arXiv:2201.11679  [pdf, other]

    cs.LG cs.CV

    DropNAS: Grouped Operation Dropout for Differentiable Architecture Search

    Authors: Weijun Hong, Guilin Li, Weinan Zhang, Ruiming Tang, Yunhe Wang, Zhenguo Li, Yong Yu

    Abstract: Neural architecture search (NAS) has shown encouraging results in automating the architecture design. Recently, DARTS relaxes the search process with a differentiable formulation that leverages weight-sharing and SGD where all candidate operations are trained simultaneously. Our empirical results show that such procedure results in the co-adaption problem and Matthew Effect: operations with fewer…

    Submitted 27 January, 2022; originally announced January 2022.

  42. arXiv:2111.01528  [pdf, other]

    cs.CL cs.NE

    Effective and Imperceptible Adversarial Textual Attack via Multi-objectivization

    Authors: Shengcai Liu, Ning Lu, Wenjing Hong, Chao Qian, Ke Tang

    Abstract: The field of adversarial textual attack has significantly grown over the last few years, where the commonly considered objective is to craft adversarial examples (AEs) that can successfully fool the target model. However, the imperceptibility of attacks, which is also essential for practical attackers, is often left out by previous studies. In consequence, the crafted AEs tend to have obvious stru…

    Submitted 14 December, 2023; v1 submitted 2 November, 2021; originally announced November 2021.

  43. arXiv:2110.00666  [pdf, other]

    cs.RO

    ReDUCE: Reformulation of Mixed Integer Programs using Data from Unsupervised Clusters for Learning Efficient Strategies

    Authors: Xuan Lin, Gabriel I. Fernandez, Dennis W. Hong

    Abstract: Mixed integer convex and nonlinear programs, MICP and MINLP, are expressive but require long solving times. Recent work that combines learning methods on solver heuristics has shown potential to overcome this issue allowing for applications on larger scale practical problems. Gathering sufficient training data to employ these methods still presents a challenge since getting data from traditional so…

    Submitted 1 October, 2021; originally announced October 2021.

  44. arXiv:2106.07127  [pdf, other]

    cs.RO

    Transition Motion Planning for Multi-Limbed Vertical Climbing Robots Using Complementarity Constraints

    Authors: Jingwen Zhang, Xuan Lin, Dennis W Hong

    Abstract: In order to achieve autonomous vertical wall climbing, the transition phase from the ground to the wall requires extra consideration inevitably. This paper focuses on the contact sequence planner to transition between flat terrain and vertical surfaces for multi-limbed climbing robots. To overcome the transition phase, it requires planning both multi-contact and contact wrenches simultaneously whi…

    Submitted 13 June, 2021; originally announced June 2021.

    Comments: ICRA 2021 Accepted. Optimization, Climbing motion planning, Complementarity Constraints

  45. arXiv:2105.13290  [pdf, other]

    cs.CV cs.LG

    CogView: Mastering Text-to-Image Generation via Transformers

    Authors: Ming Ding, Zhuoyi Yang, Wenyi Hong, Wendi Zheng, Chang Zhou, Da Yin, Junyang Lin, Xu Zou, Zhou Shao, Hongxia Yang, Jie Tang

    Abstract: Text-to-Image generation in the general domain has long been an open problem, which requires both a powerful generative model and cross-modal understanding. We propose CogView, a 4-billion-parameter Transformer with VQ-VAE tokenizer to advance this problem. We also demonstrate the finetuning strategies for various downstream tasks, e.g. style learning, super-resolution, text-image ranking and fash…

    Submitted 5 November, 2021; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: to appear in NeurIPS 2021

  46. arXiv:2012.12453  [pdf, other]

    cs.CV

    CholecSeg8k: A Semantic Segmentation Dataset for Laparoscopic Cholecystectomy Based on Cholec80

    Authors: W. -Y. Hong, C. -L. Kao, Y. -H. Kuo, J. -R. Wang, W. -L. Chang, C. -S. Shih

    Abstract: Computer-assisted surgery has been developed to enhance surgery correctness and safety. However, researchers and engineers suffer from limited annotated data to develop and train better algorithms. Consequently, the development of fundamental algorithms such as Simultaneous Localization and Mapping (SLAM) is limited. This article elaborates on the efforts of preparing the dataset for semantic segm…

    Submitted 22 December, 2020; originally announced December 2020.

    Comments: 6 pages

  47. arXiv:2011.09967  [pdf, other]

    cs.AI cs.NE eess.SP math.OC

    Electric Vehicle Charging Infrastructure Planning: A Scalable Computational Framework

    Authors: Wanshi Hong, Cong Zhang, Cy Chan, Bin Wang

    Abstract: The optimal charging infrastructure planning problem over a large geospatial area is challenging due to the increasing network sizes of the transportation system and the electric grid. The coupling between the electric vehicle travel behaviors and charging events is therefore complex. This paper focuses on the demonstration of a scalable computational framework for the electric vehicle charging in…

    Submitted 17 November, 2020; originally announced November 2020.

  48. arXiv:2009.00577  [pdf, ps, other]

    cs.CV

    A Short Review on Data Modelling for Vector Fields

    Authors: Jun Li, Wanrong Hong, Yusheng Xiang

    Abstract: Machine learning methods based on statistical principles have proven highly successful in dealing with a wide variety of data analysis and analytics tasks. Traditional data models are mostly concerned with independent identically distributed data. The recent success of end-to-end modelling scheme using deep neural networks equipped with effective structures such as convolutional layers or skip con…

    Submitted 1 September, 2020; originally announced September 2020.

    Comments: 18 pages, 0 figures

  49. arXiv:2008.03973  [pdf, other]

    cs.CV

    Deep Reinforcement Learning with Label Embedding Reward for Supervised Image Hashing

    Authors: Zhenzhen Wang, Weixiang Hong, Junsong Yuan

    Abstract: Deep hashing has shown promising results in image retrieval and recognition. Despite its success, most existing deep hashing approaches are rather similar: either multi-layer perceptron or CNN is applied to extract image feature, followed by different binarization activation functions such as sigmoid, tanh or autoencoder to generate binary code. In this work, we introduce a novel decision-making a…

    Submitted 10 August, 2020; originally announced August 2020.

  50. arXiv:2002.09820  [pdf, other]

    cs.LG cs.AI cs.RO eess.SY

    Deep Reinforcement Learning with Linear Quadratic Regulator Regions

    Authors: Gabriel I. Fernandez, Colin Togashi, Dennis W. Hong, Lin F. Yang

    Abstract: Practitioners often rely on compute-intensive domain randomization to ensure reinforcement learning policies trained in simulation can robustly transfer to the real world. Due to unmodeled nonlinearities in the real system, however, even such simulated policies can still fail to perform stably enough to acquire experience in real environments. In this paper we propose a novel method that guarantee…

    Submitted 25 February, 2020; v1 submitted 22 February, 2020; originally announced February 2020.