Zum Hauptinhalt springen

Showing 1–50 of 70 results for author: Fang, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.16659  [pdf, other

    physics.med-ph cs.GR

    Motion-Driven Neural Optimizer for Prophylactic Braces Made by Distributed Microstructures

    Authors: Xingjian Han, Yu Jiang, Weiming Wang, Guoxin Fang, Simeon Gill, Zhiqiang Zhang, Shengfa Wang, Jun Saito, Deepak Kumar, Zhongxuan Luo, Emily Whiting, Charlie C. L. Wang

    Abstract: Joint injuries, and their long-term consequences, present a substantial global health burden. Wearable prophylactic braces are an attractive potential solution to reduce the incidence of joint injuries by limiting joint movements that are related to injury risk. Given human motion and ground reaction forces, we present a computational framework that enables the design of personalized braces by opt… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

  2. arXiv:2408.09198  [pdf, other

    cs.RO

    Learning Based Toolpath Planner on Diverse Graphs for 3D Printing

    Authors: Yuming Huang, Yuhu Guo, Renbo Su, Xingjian Han, Junhao Ding, Tianyu Zhang, Tao Liu, Weiming Wang, Guoxin Fang, Xu Song, Emily Whiting, Charlie C. L. Wang

    Abstract: This paper presents a learning based planner for computing optimized 3D printing toolpaths on prescribed graphs, the challenges of which include the varying graph structures on different models and the large scale of nodes & edges on a graph. We adopt an on-the-fly strategy to tackle these challenges, formulating the planner as a Deep Q-Network (DQN) based optimizer to decide the next `best' node… ▽ More

    Submitted 17 August, 2024; originally announced August 2024.

  3. arXiv:2407.10468  [pdf, other

    cs.SD cs.AI eess.AS

    LiteFocus: Accelerated Diffusion Inference for Long Audio Synthesis

    Authors: Zhenxiong Tan, Xinyin Ma, Gongfan Fang, Xinchao Wang

    Abstract: Latent diffusion models have shown promising results in audio generation, making notable advancements over traditional methods. However, their performance, while impressive with short audio clips, faces challenges when extended to longer audio sequences. These challenges are due to model's self-attention mechanism and training predominantly on 10-second clips, which complicates the extension to lo… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Interspeech 2024; Code: https://github.com/Yuanshi9815/LiteFocus

  4. arXiv:2407.06937  [pdf, other

    cs.CV

    HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance

    Authors: Guian Fang, Wenbiao Yan, Yuanfan Guo, Jianhua Han, Zutao Jiang, Hang Xu, Shengcai Liao, Xiaodan Liang

    Abstract: Text-to-image diffusion models have significantly advanced in conditional image generation. However, these models usually struggle with accurately rendering images featuring humans, resulting in distorted limbs and other anomalies. This issue primarily stems from the insufficient recognition and evaluation of limb qualities in diffusion models. To address this issue, we introduce AbHuman, the firs… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV2024

  5. arXiv:2407.04616  [pdf, other

    cs.CV cs.AI cs.LG

    Isomorphic Pruning for Vision Models

    Authors: Gongfan Fang, Xinyin Ma, Michael Bi Mi, Xinchao Wang

    Abstract: Structured pruning reduces the computational overhead of deep neural networks by removing redundant sub-structures. However, assessing the relative importance of different sub-structures remains a significant challenge, particularly in advanced vision models featuring novel mechanisms and architectures like self-attention, depth-wise convolutions, or residual connections. These heterogeneous subst… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  6. arXiv:2406.12315  [pdf, other

    cs.AI

    PruningBench: A Comprehensive Benchmark of Structural Pruning

    Authors: Haoling Li, Changhao Li, Mengqi Xue, Gongfan Fang, Sheng Zhou, Zunlei Feng, Huiqiong Wang, Yong Wang, Lechao Cheng, Mingli Song, Jie Song

    Abstract: Structural pruning has emerged as a promising approach for producing more efficient models. Nevertheless, the community suffers from a lack of standardized benchmarks and metrics, leaving the progress in this area not fully comprehended. To fill this gap, we present the first comprehensive benchmark, termed \textit{PruningBench}, for structural pruning. PruningBench showcases the following three c… ▽ More

    Submitted 20 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: This is a paper aims to present a evaluation benchmark for structural pruning. The full text is 30 pages

  7. arXiv:2406.06911  [pdf, other

    cs.CV cs.AI

    AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising

    Authors: Zigeng Chen, Xinyin Ma, Gongfan Fang, Zhenxiong Tan, Xinchao Wang

    Abstract: Diffusion models have garnered significant interest from the community for their great generative ability across various applications. However, their typical multi-step sequential-denoising nature gives rise to high cumulative latency, thereby precluding the possibilities of parallel computation. To address this, we introduce AsyncDiff, a universal and plug-and-play acceleration scheme that enable… ▽ More

    Submitted 27 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: Work in progress. Project Page: https://czg1225.github.io/asyncdiff_page/

  8. arXiv:2406.01733  [pdf, other

    cs.LG cs.CV

    Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching

    Authors: Xinyin Ma, Gongfan Fang, Michael Bi Mi, Xinchao Wang

    Abstract: Diffusion Transformers have recently demonstrated unprecedented generative capabilities for various tasks. The encouraging results, however, come with the cost of slow inference, since each denoising step requires inference on a transformer model with a large scale of parameters. In this study, we make an interesting and somehow surprising observation: the computation of a large proportion of laye… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Code is available at https://github.com/horseee/learning-to-cache

  9. arXiv:2406.00630  [pdf, other

    stat.ML cs.LG

    On Non-asymptotic Theory of Recurrent Neural Networks in Temporal Point Processes

    Authors: Zhiheng Chen, Guanhua Fang, Wen Yu

    Abstract: Temporal point process (TPP) is an important tool for modeling and predicting irregularly timed events across various domains. Recently, the recurrent neural network (RNN)-based TPPs have shown practical advantages over traditional parametric TPP models. However, in the current literature, it remains nascent in understanding neural TPPs from theoretical viewpoints. In this paper, we establish the… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  10. arXiv:2405.08935  [pdf, other

    cs.RO

    Function based sim-to-real learning for shape control of deformable free-form surfaces

    Authors: Yingjun Tian, Guoxin Fang, Renbo Su, Weiming Wang, Simeon Gill, Andrew Weightman, Charlie C. L. Wang

    Abstract: For the shape control of deformable free-form surfaces, simulation plays a crucial role in establishing the mapping between the actuation parameters and the deformed shapes. The differentiation of this forward kinematic mapping is usually employed to solve the inverse kinematic problem for determining the actuation parameters that can realize a target shape. However, the free-form surfaces obtaine… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  11. arXiv:2405.01882  [pdf, other

    cs.RO cs.AI eess.SP

    Millimeter Wave Radar-based Human Activity Recognition for Healthcare Monitoring Robot

    Authors: Zhanzhong Gu, Xiangjian He, Gengfa Fang, Chengpei Xu, Feng Xia, Wenjing Jia

    Abstract: Healthcare monitoring is crucial, especially for the daily care of elderly individuals living alone. It can detect dangerous occurrences, such as falls, and provide timely alerts to save lives. Non-invasive millimeter wave (mmWave) radar-based healthcare monitoring systems using advanced human activity recognition (HAR) models have recently gained significant attention. However, they encounter cha… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  12. arXiv:2404.06103  [pdf, other

    cs.SD cs.IR eess.AS

    Exploring Diverse Sounds: Identifying Outliers in a Music Corpus

    Authors: Le Cai, Sam Ferguson, Gengfa Fang, Hani Alshamrani

    Abstract: Existing research on music recommendation systems primarily focuses on recommending similar music, thereby often neglecting diverse and distinctive musical recordings. Musical outliers can provide valuable insights due to the inherent diversity of music itself. In this paper, we explore music outliers, investigating their potential usefulness for music discovery and recommendation systems. We argu… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Journal ref: The 16th International Symposium on Computer Music Multidisciplinary Research,2023

  13. arXiv:2403.14166  [pdf, other

    cs.CV

    Mini-Splatting: Representing Scenes with a Constrained Number of Gaussians

    Authors: Guangchi Fang, Bing Wang

    Abstract: In this study, we explore the challenge of efficiently representing scenes with a constrained number of Gaussians. Our analysis shifts from traditional graphics and 2D computer vision to the perspective of point clouds, highlighting the inefficient spatial distribution of Gaussian representation as a key limitation in model performance. To address this, we introduce strategies for densification in… ▽ More

    Submitted 18 May, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

  14. arXiv:2403.11236  [pdf, other

    cs.CL

    ChartThinker: A Contextual Chain-of-Thought Approach to Optimized Chart Summarization

    Authors: Mengsha Liu, Daoyuan Chen, Yaliang Li, Guian Fang, Ying Shen

    Abstract: Data visualization serves as a critical means for presenting data and mining its valuable insights. The task of chart summarization, through natural language processing techniques, facilitates in-depth data analysis of charts. However, there still are notable deficiencies in terms of visual-language matching and reasoning ability for existing approaches. To address these limitations, this study co… ▽ More

    Submitted 24 April, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING 2024

  15. arXiv:2402.04033  [pdf, other

    cs.LG

    On provable privacy vulnerabilities of graph representations

    Authors: Ruofan Wu, Guanhua Fang, Qiying Pan, Mingyang Zhang, Tengfei Liu, Weiqiang Wang

    Abstract: Graph representation learning (GRL) is critical for extracting insights from complex network structures, but it also raises security concerns due to potential privacy vulnerabilities in these representations. This paper investigates the structural vulnerabilities in graph neural models where sensitive topological information can be inferred through edge reconstruction attacks. Our research primari… ▽ More

    Submitted 23 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  16. arXiv:2401.07781  [pdf, other

    cs.CV

    Towards A Better Metric for Text-to-Video Generation

    Authors: Jay Zhangjie Wu, Guian Fang, Haoning Wu, Xintao Wang, Yixiao Ge, Xiaodong Cun, David Junhao Zhang, Jia-Wei Liu, Yuchao Gu, Rui Zhao, Weisi Lin, Wynne Hsu, Ying Shan, Mike Zheng Shou

    Abstract: Generative models have demonstrated remarkable capability in synthesizing high-quality text, images, and videos. For video generation, contemporary text-to-video models exhibit impressive capabilities, crafting visually stunning videos. Nonetheless, evaluating such videos poses significant challenges. Current research predominantly employs automated metrics such as FVD, IS, and CLIP Score. However… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: Project page: https://showlab.github.io/T2VScore/

  17. arXiv:2312.05284  [pdf, other

    cs.CV

    SlimSAM: 0.1% Data Makes Segment Anything Slim

    Authors: Zigeng Chen, Gongfan Fang, Xinyin Ma, Xinchao Wang

    Abstract: Current approaches for compressing the Segment Anything Model (SAM) yield commendable results, yet necessitate extensive data to train a new network from scratch. Employing conventional pruning techniques can remarkably reduce data requirements but would suffer from a degradation in performance. To address this challenging trade-off, we introduce SlimSAM, a novel data-efficient SAM compression met… ▽ More

    Submitted 18 March, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: Work in progress. Code reposity: http://github.com/czg1225/SlimSAM

  18. arXiv:2312.00858  [pdf, other

    cs.CV cs.AI

    DeepCache: Accelerating Diffusion Models for Free

    Authors: Xinyin Ma, Gongfan Fang, Xinchao Wang

    Abstract: Diffusion models have recently gained unprecedented attention in the field of image synthesis due to their remarkable generative capabilities. Notwithstanding their prowess, these models often incur substantial computational costs, primarily attributed to the sequential denoising process and cumbersome model size. Traditional methods for compressing diffusion models typically involve extensive ret… ▽ More

    Submitted 7 December, 2023; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Work in progress. Project Page: https://horseee.github.io/Diffusion_DeepCache/

  19. arXiv:2311.17265  [pdf, other

    cs.CG

    Exceptional Mechanical Performance by Spatial Printing with Continuous Fiber: Curved Slicing, Toolpath Generation and Physical Verification

    Authors: Guoxin Fang, Tianyu Zhang, Yuming Huang, Zhizhou Zhang, Kunal Masania, Charlie C. L. Wang

    Abstract: This work explores a spatial printing method to fabricate continuous fiber-reinforced thermoplastic composites (CFRTPCs), which can achieve exceptional mechanical performance. For models giving complex 3D stress distribution under loads, typical planar-layer based fiber placement usually fails to provide sufficient reinforcement due to their orientations being constrained to planes. The effectiven… ▽ More

    Submitted 25 January, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

  20. arXiv:2311.12842  [pdf, other

    eess.IV cs.CV

    Multimodal Identification of Alzheimer's Disease: A Review

    Authors: Guian Fang, Mengsha Liu, Yi Zhong, Zhuolin Zhang, Jiehui Huang, Zhenchao Tang, Calvin Yu-Chian Chen

    Abstract: Alzheimer's disease is a progressive neurological disorder characterized by cognitive impairment and memory loss. With the increasing aging population, the incidence of AD is continuously rising, making early diagnosis and intervention an urgent need. In recent years, a considerable number of teams have applied computer-aided diagnostic techniques to early classification research of AD. Most studi… ▽ More

    Submitted 6 October, 2023; originally announced November 2023.

  21. arXiv:2311.08791  [pdf, other

    cs.DC

    A Direct Approach for Solving Cloud Computing Task Assignment with Soft Deadlines

    Authors: Guang Fang, Yuxiang Zhao

    Abstract: Job scheduling in cloud computing environments is a critical yet complex problem. Cloud computing user job requirements are highly dynamic and uncertain, while cloud computing resources are heterogeneous and constrained. This paper studies the online resource allocation problem for elastic computing jobs with soft deadlines in cloud computing environments. The main contributions include: 1) Intege… ▽ More

    Submitted 22 December, 2023; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: 17 pages,8 figures

  22. arXiv:2310.06311  [pdf, other

    cs.CV cs.MM

    Improving Compositional Text-to-image Generation with Large Vision-Language Models

    Authors: Song Wen, Guian Fang, Renrui Zhang, Peng Gao, Hao Dong, Dimitris Metaxas

    Abstract: Recent advancements in text-to-image models, particularly diffusion models, have shown significant promise. However, compositional text-to-image models frequently encounter difficulties in generating high-quality images that accurately align with input texts describing multiple objects, variable attributes, and intricate spatial relationships. To address this limitation, we employ large vision-lan… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  23. arXiv:2309.14279  [pdf, other

    cs.RO

    Spring-IMU Fusion Based Proprioception for Feedback Control of Soft Manipulators

    Authors: Yinan Meng, Guoxin Fang, Jiong Yang, Yuhu Guo, Charlie C. L. Wang

    Abstract: This paper presents a novel framework to realize proprioception and closed-loop control for soft manipulators. Deformations with large elongation and large bending can be precisely predicted using geometry-based sensor signals obtained from the inductive springs and the inertial measurement units (IMUs) with the help of machine learning techniques. Multiple geometric signals are fused into robust… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

  24. arXiv:2309.03818  [pdf, other

    stat.ML cs.LG

    Empirical Risk Minimization for Losses without Variance

    Authors: Guanhua Fang, Ping Li, Gennady Samorodnitsky

    Abstract: This paper considers an empirical risk minimization problem under heavy-tailed settings, where data does not have finite variance, but only has $p$-th moment with $p \in (1,2)$. Instead of using estimation procedure based on truncated observed data, we choose the optimizer by minimizing the risk value. Those risk values can be robustly estimated via using the remarkable Catoni's method (Catoni, 20… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

  25. arXiv:2308.01537  [pdf, other

    cs.MM

    Learning Causality-inspired Representation Consistency for Video Anomaly Detection

    Authors: Yang Liu, Zhaoyang Xia, Mengyang Zhao, Donglai Wei, Yuzheng Wang, Liu Siao, Bobo Ju, Gaoyun Fang, Jing Liu, Liang Song

    Abstract: Video anomaly detection is an essential yet challenging task in the multimedia community, with promising applications in smart cities and secure communities. Existing methods attempt to learn abstract representations of regular events with statistical dependence to model the endogenous normality, which discriminates anomalies by measuring the deviations to the learned distribution. However, conven… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: Accepted to ACM MM 2023

  26. arXiv:2308.00549  [pdf, other

    cs.LG cs.AI cs.CV

    Copula for Instance-wise Feature Selection and Ranking

    Authors: Hanyu Peng, Guanhua Fang, Ping Li

    Abstract: Instance-wise feature selection and ranking methods can achieve a good selection of task-friendly features for each sample in the context of neural networks. However, existing approaches that assume feature subsets to be independent are imperfect when considering the dependency between features. To address this limitation, we propose to incorporate the Gaussian copula, a powerful mathematical tech… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 15 pages, UAI poster

  27. arXiv:2308.00472  [pdf, other

    cs.CG

    Vector Field Based Volume Peeling for Multi-Axis Machining

    Authors: Neelotpal Dutta, Tianyu Zhang, Guoxin Fang, Ismail E. Yigit, Charlie C. L. Wang

    Abstract: This paper presents an easy-to-control volume peeling method for multi-axis machining based on the computation taken on vector fields. The current scalar field based methods are not flexible and the vector-field based methods do not guarantee the satisfaction of the constraints in the final results. We first conduct an optimization formulation to compute an initial vector field that is well aligne… ▽ More

    Submitted 4 October, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

    Comments: ASME Journal of Computing and Information Science in Engineering, accepted, October 2023

  28. arXiv:2306.07223  [pdf, other

    cs.CY

    Wise in Vaccine Allocation

    Authors: Baiqiao Yin, Jiaqing Yuan, Weichen Lv, Jiehui Huang, Guian Fang

    Abstract: The paper uses machine learning and mathematical modeling to predict future vaccine distribution and solve the problem of allocating vaccines to different types of hospitals. They collected data and analyzed it, finding factors such as nearby residents, transportation, and medical personnel that impact distribution. They used the results to create a model and allocate vaccines to central and commu… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  29. arXiv:2306.07201  [pdf, other

    cs.CL

    LTCR: Long-Text Chinese Rumor Detection Dataset

    Authors: Ziyang Ma, Mengsha Liu, Guian Fang, Ying Shen

    Abstract: False information can spread quickly on social media, negatively influencing the citizens' behaviors and responses to social events. To better detect all of the fake news, especially long texts which are harder to find completely, a Long-Text Chinese Rumor detection dataset named LTCR is proposed. The LTCR dataset provides a valuable resource for accurately detecting misinformation, especially in… ▽ More

    Submitted 13 June, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

  30. arXiv:2306.04902  [pdf, other

    cs.DS cs.LG math.ST

    A Cover Time Study of a non-Markovian Algorithm

    Authors: Guanhua Fang, Gennady Samorodnitsky, Zhiqiang Xu

    Abstract: Given a traversal algorithm, cover time is the expected number of steps needed to visit all nodes in a given graph. A smaller cover time means a higher exploration efficiency of traversal algorithm. Although random walk algorithms have been studied extensively in the existing literature, there has been no cover time result for any non-Markovian method. In this work, we stand on a theoretical persp… ▽ More

    Submitted 11 August, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 25 pages

  31. arXiv:2305.19599  [pdf, other

    cs.CV cs.AI

    RealignDiff: Boosting Text-to-Image Diffusion Model with Coarse-to-fine Semantic Re-alignment

    Authors: Guian Fang, Zutao Jiang, Jianhua Han, Guansong Lu, Hang Xu, Shengcai Liao, Xiaodan Liang

    Abstract: Recent advances in text-to-image diffusion models have achieved remarkable success in generating high-quality, realistic images from textual descriptions. However, these approaches have faced challenges in precisely aligning the generated visual content with the textual concepts described in the prompts. In this paper, we propose a two-stage coarse-to-fine semantic re-alignment method, named Reali… ▽ More

    Submitted 27 November, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

  32. arXiv:2305.11627  [pdf, other

    cs.CL

    LLM-Pruner: On the Structural Pruning of Large Language Models

    Authors: Xinyin Ma, Gongfan Fang, Xinchao Wang

    Abstract: Large language models (LLMs) have shown remarkable capabilities in language understanding and generation. However, such impressive capability typically comes with a substantial model size, which presents significant challenges in both the deployment, inference, and training stages. With LLM being a general-purpose task solver, we explore its compression in a task-agnostic manner, which aims to pre… ▽ More

    Submitted 27 September, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Accepted at NeurIPS 2023

  33. arXiv:2305.10924  [pdf, other

    cs.LG cs.AI cs.CV

    Structural Pruning for Diffusion Models

    Authors: Gongfan Fang, Xinyin Ma, Xinchao Wang

    Abstract: Generative modeling has recently undergone remarkable advancements, primarily propelled by the transformative implications of Diffusion Probabilistic Models (DPMs). The impressive capability of these models, however, often entails significant computational overhead during both training and inference. To tackle this challenge, we present Diff-Pruning, an efficient compression method tailored for le… ▽ More

    Submitted 30 September, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Preprint version

  34. arXiv:2304.11633  [pdf, other

    cs.CL

    Evaluating ChatGPT's Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness

    Authors: Bo Li, Gexiang Fang, Yang Yang, Quansen Wang, Wei Ye, Wen Zhao, Shikun Zhang

    Abstract: The capability of Large Language Models (LLMs) like ChatGPT to comprehend user intent and provide reasonable responses has made them extremely popular lately. In this paper, we focus on assessing the overall ability of ChatGPT using 7 fine-grained information extraction (IE) tasks. Specially, we present the systematically analysis by measuring ChatGPT's performance, explainability, calibration, an… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

  35. arXiv:2302.12007  [pdf, other

    cs.CV

    DMMG: Dual Min-Max Games for Self-Supervised Skeleton-Based Action Recognition

    Authors: Shannan Guan, Xin Yu, Wei Huang, Gengfa Fang, Haiyan Lu

    Abstract: In this work, we propose a new Dual Min-Max Games (DMMG) based self-supervised skeleton action recognition method by augmenting unlabeled data in a contrastive learning framework. Our DMMG consists of a viewpoint variation min-max game and an edge perturbation min-max game. These two min-max games adopt an adversarial paradigm to perform data augmentation on the skeleton sequences and graph-struct… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

  36. arXiv:2302.05510  [pdf, other

    cs.RO cs.CG

    Support Generation for Robot-Assisted 3D Printing with Curved Layers

    Authors: Tianyu Zhang, Yuming Huang, Piotr Kukulski, Neelotpal Dutta, Guoxin Fang, Charlie C. L. Wang

    Abstract: Robot-assisted 3D printing has drawn a lot of attention by its capability to fabricate curved layers that are optimized according to different objectives. However, the support generation algorithm based on a fixed printing direction for planar layers cannot be directly applied for curved layers as the orientation of material accumulation is dynamically varied. In this paper, we propose a skeleton-… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

  37. arXiv:2301.12900  [pdf, other

    cs.AI cs.CV

    DepGraph: Towards Any Structural Pruning

    Authors: Gongfan Fang, Xinyin Ma, Mingli Song, Michael Bi Mi, Xinchao Wang

    Abstract: Structural pruning enables model acceleration by removing structurally-grouped parameters from neural networks. However, the parameter-grouping patterns vary widely across different models, making architecture-specific pruners, which rely on manually-designed grouping schemes, non-generalizable to new architectures. In this work, we study a highly-challenging yet barely-explored task, any structur… ▽ More

    Submitted 23 March, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

  38. arXiv:2212.13369  [pdf

    cs.SD cs.LG cs.MM eess.AS

    Feature Selection Approaches for Optimising Music Emotion Recognition Methods

    Authors: Le Cai, Sam Ferguson, Haiyan Lu, Gengfa Fang

    Abstract: The high feature dimensionality is a challenge in music emotion recognition. There is no common consensus on a relation between audio features and emotion. The MER system uses all available features to recognize emotion; however, this is not an optimal solution since it contains irrelevant data acting as noise. In this paper, we introduce a feature selection approach to eliminate redundant feature… ▽ More

    Submitted 27 December, 2022; originally announced December 2022.

  39. arXiv:2211.08311  [pdf, ps, other

    stat.ML cs.LG

    On Penalization in Stochastic Multi-armed Bandits

    Authors: Guanhua Fang, Ping Li, Gennady Samorodnitsky

    Abstract: We study an important variant of the stochastic multi-armed bandit (MAB) problem, which takes penalization into consideration. Instead of directly maximizing cumulative expected reward, we need to balance between the total reward and fairness level. In this paper, we present some new insights in MAB and formulate the problem in the penalization framework, where rigorous penalized regret can be wel… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  40. arXiv:2210.10592  [pdf, other

    cs.SI cs.AI cs.LG

    DyTed: Disentangled Representation Learning for Discrete-time Dynamic Graph

    Authors: Kaike Zhang, Qi Cao, Gaolin Fang, Bingbing Xu, Hongjian Zou, Huawei Shen, Xueqi Cheng

    Abstract: Unsupervised representation learning for dynamic graphs has attracted a lot of research attention in recent years. Compared with static graph, the dynamic graph is a comprehensive embodiment of both the intrinsic stable characteristics of nodes and the time-related dynamic preference. However, existing methods generally mix these two types of information into a single representation space, which m… ▽ More

    Submitted 15 August, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

  41. arXiv:2209.11318  [pdf, other

    cs.RO

    OpenPneu: Compact platform for pneumatic actuation with multi-channels

    Authors: Yingjun Tian, Renbo Su, Xilong Wang, Nur Banu Altin, Guoxin Fang, Charlie C. L. Wang

    Abstract: This paper presents a compact system, OpenPneu, to support the pneumatic actuation for multi-chambers on soft robots. Micro-pumps are employed in the system to generate airflow and therefore no extra input as compressed air is needed. Our system conducts modular design to provide good scalability, which has been demonstrated on a prototype with ten air channels. Each air channel of OpenPneu is equ… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

  42. arXiv:2208.03444  [pdf, other

    cs.CV

    AFE-CNN: 3D Skeleton-based Action Recognition with Action Feature Enhancement

    Authors: Shannan Guan, Haiyan Lu, Linchao Zhu, Gengfa Fang

    Abstract: Existing 3D skeleton-based action recognition approaches reach impressive performance by encoding handcrafted action features to image format and decoding by CNNs. However, such methods are limited in two ways: a) the handcrafted action features are difficult to handle challenging actions, and b) they generally require complex CNN models to improve action recognition accuracy, which usually occur… ▽ More

    Submitted 6 August, 2022; originally announced August 2022.

  43. arXiv:2208.03185  [pdf, ps, other

    math.ST cs.LG stat.ML

    Catoni-style Confidence Sequences under Infinite Variance

    Authors: Sujay Bhatt, Guanhua Fang, Ping Li, Gennady Samorodnitsky

    Abstract: In this paper, we provide an extension of confidence sequences for settings where the variance of the data-generating distribution does not exist or is infinite. Confidence sequences furnish confidence intervals that are valid at arbitrary data-dependent stopping times, naturally having a wide range of applications. We first establish a lower bound for the width of the Catoni-style confidence sequ… ▽ More

    Submitted 5 August, 2022; originally announced August 2022.

    Comments: 10 pages

  44. arXiv:2207.13309  [pdf, other

    cs.CV

    Federated Selective Aggregation for Knowledge Amalgamation

    Authors: Donglin Xie, Ruonan Yu, Gongfan Fang, Jie Song, Zunlei Feng, Xinchao Wang, Li Sun, Mingli Song

    Abstract: In this paper, we explore a new knowledge-amalgamation problem, termed Federated Selective Aggregation (FedSA). The goal of FedSA is to train a student model for a new task with the help of several decentralized teachers, whose pre-training tasks and data are different and agnostic. Our motivation for investigating such a problem setup stems from a recent dilemma of model sharing. Many researchers… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: 18 pages, 4 figures

  45. arXiv:2207.03618  [pdf, other

    cs.CV cs.LG

    PoseGU: 3D Human Pose Estimation with Novel Human Pose Generator and Unbiased Learning

    Authors: Shannan Guan, Haiyan Lu, Linchao Zhu, Gengfa Fang

    Abstract: 3D pose estimation has recently gained substantial interests in computer vision domain. Existing 3D pose estimation methods have a strong reliance on large size well-annotated 3D pose datasets, and they suffer poor model generalization on unseen poses due to limited diversity of 3D poses in training sets. In this work, we propose PoseGU, a novel human pose generator that generates diverse poses wi… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

  46. arXiv:2205.07523  [pdf, other

    cs.CL

    Prompting to Distill: Boosting Data-Free Knowledge Distillation via Reinforced Prompt

    Authors: Xinyin Ma, Xinchao Wang, Gongfan Fang, Yongliang Shen, Weiming Lu

    Abstract: Data-free knowledge distillation (DFKD) conducts knowledge distillation via eliminating the dependence of original training data, and has recently achieved impressive results in accelerating pre-trained language models. At the heart of DFKD is to reconstruct a synthetic dataset by inverting the parameters of the uncompressed model. Prior DFKD approaches, however, have largely relied on hand-crafte… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: Accepted by IJCAI2022

  47. arXiv:2205.05166  [pdf, other

    cs.RO

    Soft Robotic Mannequin: Design and Algorithm for Deformation Control

    Authors: Yingjun Tian, Guoxin Fang, Justas Petrulis, Andrew Weightman, Charlie C. L. Wang

    Abstract: This paper presents a novel soft robotic system for a deformable mannequin that can be employed to physically realize the 3D geometry of different human bodies. The soft membrane on a mannequin is deformed by inflating several curved chambers using pneumatic actuation. Controlling the freeform surface of a soft membrane by adjusting the pneumatic actuation in different chambers is challenging as t… ▽ More

    Submitted 23 May, 2022; v1 submitted 10 May, 2022; originally announced May 2022.

  48. arXiv:2204.11723  [pdf, other

    cs.CV eess.IV

    4DAC: Learning Attribute Compression for Dynamic Point Clouds

    Authors: Guangchi Fang, Qingyong Hu, Yiling Xu, Yulan Guo

    Abstract: With the development of the 3D data acquisition facilities, the increasing scale of acquired 3D point clouds poses a challenge to the existing data compression techniques. Although promising performance has been achieved in static point cloud compression, it remains under-explored and challenging to leverage temporal correlations within a point cloud sequence for effective dynamic point cloud comp… ▽ More

    Submitted 25 April, 2022; originally announced April 2022.

  49. arXiv:2203.09931  [pdf, other

    cs.CV eess.IV

    3DAC: Learning Attribute Compression for Point Clouds

    Authors: Guangchi Fang, Qingyong Hu, Hanyun Wang, Yiling Xu, Yulan Guo

    Abstract: We study the problem of attribute compression for large-scale unstructured 3D point clouds. Through an in-depth exploration of the relationships between different encoding steps and different attribute channels, we introduce a deep compression network, termed 3DAC, to explicitly compress the attributes of 3D point clouds and reduce storage usage in this paper. Specifically, the point cloud attribu… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

  50. Knowledge Amalgamation for Object Detection with Transformers

    Authors: Haofei Zhang, Feng Mao, Mengqi Xue, Gongfan Fang, Zunlei Feng, Jie Song, Mingli Song

    Abstract: Knowledge amalgamation (KA) is a novel deep model reusing task aiming to transfer knowledge from several well-trained teachers to a multi-talented and compact student. Currently, most of these approaches are tailored for convolutional neural networks (CNNs). However, there is a tendency that transformers, with a completely different architecture, are starting to challenge the domination of CNNs in… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible