
Showing 1–50 of 92 results for author: Jing, Y

  1. arXiv:2407.03876  [pdf, other]

    cs.CR cs.CL

    DART: Deep Adversarial Automated Red Teaming for LLM Safety

    Authors: Bojian Jiang, Yi Jing, Tianhao Shen, Qing Yang, Deyi Xiong

    Abstract: Manual red teaming is a commonly used method to identify vulnerabilities in large language models (LLMs), but it is costly and unscalable. In contrast, automated red teaming uses a Red LLM to automatically generate adversarial prompts to the Target LLM, offering a scalable way for safety vulnerability detection. However, the difficulty of building a powerful automated Red LLM lies in the fact that…

    Submitted 4 July, 2024; originally announced July 2024.

  2. arXiv:2403.20014  [pdf, other]

    cs.DB cs.AI cs.CL

    PURPLE: Making a Large Language Model a Better SQL Writer

    Authors: Tonghui Ren, Yuankai Fan, Zhenying He, Ren Huang, Jiaqi Dai, Can Huang, Yinan Jing, Kai Zhang, Yifan Yang, X. Sean Wang

    Abstract: Large Language Model (LLM) techniques play an increasingly important role in Natural Language to SQL (NL2SQL) translation. LLMs trained on extensive corpora have strong natural language understanding and basic SQL generation abilities without additional tuning specific to NL2SQL tasks. Existing LLM-based NL2SQL approaches try to improve the translation by enhancing the LLMs with an emphasis on us…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: 12 pages, accepted by ICDE 2024 (40th IEEE International Conference on Data Engineering)

  3. arXiv:2403.19275  [pdf, other]

    cs.CL cs.AI

    Knowledge Boundary and Persona Dynamic Shape A Better Social Media Agent

    Authors: Junkai Zhou, Liang Pang, Ya Jing, Jia Gu, Huawei Shen, Xueqi Cheng

    Abstract: Constructing personalized and anthropomorphic agents holds significant importance in the simulation of social networks. However, there are still two key problems in existing works: the agent possesses world knowledge that does not belong to its personas, and it cannot eliminate the interference of diverse persona information on current actions, which reduces the personalization and anthropomorphis…

    Submitted 2 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  4. arXiv:2403.17745  [pdf, other]

    cs.LG

    Leave No Patient Behind: Enhancing Medication Recommendation for Rare Disease Patients

    Authors: Zihao Zhao, Yi Jing, Fuli Feng, Jiancan Wu, Chongming Gao, Xiangnan He

    Abstract: Medication recommendation systems have gained significant attention in healthcare as a means of providing tailored and effective drug combinations based on patients' clinical information. However, existing approaches often suffer from fairness issues, as recommendations tend to be more accurate for patients with common diseases compared to those with rare conditions. In this paper, we propose a no…

    Submitted 26 March, 2024; originally announced March 2024.

  5. arXiv:2403.16208  [pdf, ps, other]

    math.NA cs.LG

    Convergence analysis of OT-Flow for sample generation

    Authors: Yang Jing, Lei Li

    Abstract: Deep generative models aim to learn the underlying distribution of data and generate new ones. Despite the diversity of generative models and their high-quality generation performance in practice, most of them lack rigorous theoretical convergence proofs. In this work, we aim to establish some convergence results for OT-Flow, one of the deep generative models. First, by reformulating the framework…

    Submitted 24 March, 2024; originally announced March 2024.

  6. arXiv:2403.00211  [pdf, other]

    cs.CV

    Trustworthy Self-Attention: Enabling the Network to Focus Only on the Most Relevant References

    Authors: Yu Jing, Tan Yujuan, Ren Ao, Liu Duo

    Abstract: The prediction of optical flow for occluded points is still a difficult problem that has not yet been solved. Recent methods use self-attention to find relevant non-occluded points as references for estimating the optical flow of occluded points based on the assumption of self-similarity. However, they rely on visual features of a single image and weak constraints, which are not sufficient to cons…

    Submitted 26 March, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

    Comments: Correct Figure 1

  7. arXiv:2402.17144  [pdf, other]

    cs.DB cs.AI

    Metasql: A Generate-then-Rank Framework for Natural Language to SQL Translation

    Authors: Yuankai Fan, Zhenying He, Tonghui Ren, Can Huang, Yinan Jing, Kai Zhang, X. Sean Wang

    Abstract: The Natural Language Interface to Databases (NLIDB) empowers non-technical users with database access through intuitive natural language (NL) interactions. Advanced approaches, utilizing neural sequence-to-sequence models or large-scale language models, typically employ auto-regressive decoding to generate unique SQL queries sequentially. While these translation models have greatly improved the ov…

    Submitted 26 February, 2024; originally announced February 2024.

  8. arXiv:2402.15140  [pdf, other]

    cs.AI

    A Relation-Interactive Approach for Message Passing in Hyper-relational Knowledge Graphs

    Authors: Yonglin Jing

    Abstract: Hyper-relational knowledge graphs (KGs) contain additional key-value pairs, providing more information about the relations. In many scenarios, the same relation can have distinct key-value pairs, making the original triple fact more recognizable and specific. Prior studies on hyper-relational KGs have established a solid standard method for hyper-relational graph encoding. In this work, we propose…

    Submitted 1 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  9. arXiv:2401.05879  [pdf]

    cs.CV

    YOIO: You Only Iterate Once by mining and fusing multiple necessary global information in the optical flow estimation

    Authors: Yu Jing, Tan Yujuan, Ren Ao, Liu Duo

    Abstract: Occlusions pose a significant challenge even to optical flow algorithms that rely on global evidence. We consider an occluded point to be one that is imaged in the reference frame but not in the next. Estimating the motion of these points is extremely difficult, particularly in the two-frame setting. Previous work used the current frame as the only input, which could not guarantee providing…

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: text overlap with arXiv:2104.02409 by other authors

  10. arXiv:2312.13139  [pdf, other]

    cs.RO cs.CV

    Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation

    Authors: Hongtao Wu, Ya Jing, Chilam Cheang, Guangzeng Chen, Jiafeng Xu, Xinghang Li, Minghuan Liu, Hang Li, Tao Kong

    Abstract: Generative pre-trained models have demonstrated remarkable effectiveness in language and vision domains by learning useful representations. In this paper, we extend the scope of this effectiveness by showing that visual robot manipulation can significantly benefit from large-scale video generative pre-training. We introduce GR-1, a straightforward GPT-style model designed for multi-task language-c…

    Submitted 21 December, 2023; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: Project page: https://GR1-Manipulation.github.io

  11. arXiv:2311.09829  [pdf, other]

    cs.CL

    FollowEval: A Multi-Dimensional Benchmark for Assessing the Instruction-Following Capability of Large Language Models

    Authors: Yimin Jing, Renren Jin, Jiahao Hu, Huishi Qiu, Xiaohua Wang, Peng Wang, Deyi Xiong

    Abstract: The effective assessment of the instruction-following ability of large language models (LLMs) is of paramount importance. A model that cannot adhere to human instructions might not be able to provide reliable and helpful responses. In pursuit of this goal, various benchmarks have been constructed to evaluate the instruction-following capacity of these models. However, these benchmarks are limited…

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: Work in progress

  12. arXiv:2311.01378  [pdf, other]

    cs.RO cs.AI cs.LG

    Vision-Language Foundation Models as Effective Robot Imitators

    Authors: Xinghang Li, Minghuan Liu, Hanbo Zhang, Cunjun Yu, Jie Xu, Hongtao Wu, Chilam Cheang, Ya Jing, Weinan Zhang, Huaping Liu, Hang Li, Tao Kong

    Abstract: Recent progress in vision language foundation models has shown their ability to understand multimodal data and resolve complicated vision language tasks, including robotics manipulation. We seek a straightforward way of making use of existing vision-language models (VLMs) with simple fine-tuning on robotics data. To this end, we derive a simple and novel vision-language manipulation framework, dub…

    Submitted 4 February, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: Fix typos. Project page: https://roboflamingo.github.io

  13. arXiv:2309.05073  [pdf, other]

    cs.CV

    FreeMan: Towards Benchmarking 3D Human Pose Estimation under Real-World Conditions

    Authors: Jiong Wang, Fengyu Yang, Wenbo Gou, Bingliang Li, Danqi Yan, Ailing Zeng, Yijun Gao, Junle Wang, Yanqing Jing, Ruimao Zhang

    Abstract: Estimating the 3D structure of the human body from natural scenes is a fundamental aspect of visual perception. 3D human pose estimation is a vital step in advancing fields like AIGC and human-robot interaction, serving as a crucial technique for understanding and interacting with human actions in real-world settings. However, the current datasets, often collected under single laboratory condition…

    Submitted 3 April, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

    Comments: CVPR2024 camera ready version. 19 pages, 16 figures. Project page: https://wangjiongw.github.io/freeman/ ; API: https://github.com/wangjiongw/FreeMan_API

  14. arXiv:2308.03624  [pdf, other]

    cs.RO cs.CV

    MOMA-Force: Visual-Force Imitation for Real-World Mobile Manipulation

    Authors: Taozheng Yang, Ya Jing, Hongtao Wu, Jiafeng Xu, Kuankuan Sima, Guangzeng Chen, Qie Sima, Tao Kong

    Abstract: In this paper, we present a novel method for mobile manipulators to perform multiple contact-rich manipulation tasks. While learning-based methods have the potential to generate actions in an end-to-end manner, they often suffer from insufficient action accuracy and robustness against noise. On the other hand, classical control-based methods can enhance system robustness, but at the cost of extens…

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023

  15. arXiv:2308.03620  [pdf, other]

    cs.RO cs.CV

    Exploring Visual Pre-training for Robot Manipulation: Datasets, Models and Methods

    Authors: Ya Jing, Xuelin Zhu, Xingbin Liu, Qie Sima, Taozheng Yang, Yunhai Feng, Tao Kong

    Abstract: Visual pre-training with large-scale real-world data has made great progress in recent years, showing great potential in robot learning with pixel observations. However, the recipes of visual pre-training for robot manipulation tasks are yet to be built. In this paper, we thoroughly investigate the effects of visual pre-training strategies on robot manipulation tasks from three fundamental perspec…

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023

  16. arXiv:2307.16356  [pdf, other]

    cs.IT eess.SP

    Interleaved Training for Massive MIMO Downlink via Exploring Spatial Correlation

    Authors: Cheng Zhang, Chang Liu, Yindi Jing, Minjie Ding, Yongming Huang

    Abstract: Interleaved training has been studied for single-user and multi-user massive MIMO downlink with either fully-digital or hybrid beamforming. However, the impact of channel correlation on its average training overhead is rarely addressed. In this paper, we explore the channel correlation to improve the interleaved training for single-user massive MIMO downlink. For the beam-domain interleaved traini…

    Submitted 16 January, 2024; v1 submitted 30 July, 2023; originally announced July 2023.

    Comments: 14 pages (double column), 8 figures. The paper has been accepted by IEEE Transactions on Wireless Communications

  17. arXiv:2307.10730  [pdf, other]

    cs.IT eess.SP

    Joint Port Selection Based Channel Acquisition for FDD Cell-Free Massive MIMO

    Authors: Cheng Zhang, Pengguang Du, Minjie Ding, Yindi Jing, Yongming Huang

    Abstract: In frequency division duplexing (FDD) cell-free massive MIMO, the acquisition of the channel state information (CSI) is very challenging because of the large overhead required for the training and feedback of the downlink channels of multiple cooperating base stations (BSs). In this paper, for systems with partial uplink-downlink channel reciprocity, and a general spatial domain channel model with…

    Submitted 12 January, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: 15 pages, 11 figures. The paper has been accepted by IEEE TRANSACTIONS ON COMMUNICATIONS

  18. arXiv:2306.07610  [pdf, other]

    cs.CL

    Soft Language Clustering for Multilingual Model Pre-training

    Authors: Jiali Zeng, Yufan Jiang, Yongjing Yin, Yi Jing, Fandong Meng, Binghuai Lin, Yunbo Cao, Jie Zhou

    Abstract: Multilingual pre-trained language models have demonstrated impressive (zero-shot) cross-lingual transfer abilities; however, their performance is hindered when the target language has distant typology from source languages or when pre-training data is limited in size. In this paper, we propose XLM-P, which contextually retrieves prompts as flexible guidance for encoding instances conditionally. Ou…

    Submitted 13 June, 2023; originally announced June 2023.

  19. arXiv:2305.16982  [pdf, other]

    cs.CL cs.AI

    TranSFormer: Slow-Fast Transformer for Machine Translation

    Authors: Bei Li, Yi Jing, Xu Tan, Zhen Xing, Tong Xiao, Jingbo Zhu

    Abstract: Learning multiscale Transformer models has been evidenced as a viable approach to augmenting machine translation systems. Prior research has primarily focused on treating subwords as basic units in developing such systems. However, the incorporation of fine-grained character-level features into multiscale Transformer has not yet been explored. In this work, we present a Slow-Fast…

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted by Findings of ACL2023

  20. arXiv:2304.14593  [pdf, other]

    cs.CV

    Deep Graph Reprogramming

    Authors: Yongcheng Jing, Chongbin Yuan, Li Ju, Yiding Yang, Xinchao Wang, Dacheng Tao

    Abstract: In this paper, we explore a novel model reusing task tailored for graph neural networks (GNNs), termed "deep graph reprogramming". We strive to reprogram a pre-trained GNN, without amending either raw node features or model parameters, to handle a bunch of cross-level downstream tasks in various domains. To this end, we propose an innovative Data Reprogramming paradigm alongside a Model Reprogramming…

    Submitted 27 April, 2023; originally announced April 2023.

    Comments: CVPR 2023 Highlight

  21. arXiv:2304.11595  [pdf, other]

    cs.CV cs.AI cs.LG

    Segment Anything in Non-Euclidean Domains: Challenges and Opportunities

    Authors: Yongcheng Jing, Xinchao Wang, Dacheng Tao

    Abstract: The recent work known as Segment Anything (SA) has made significant strides in pushing the boundaries of semantic segmentation into the era of foundation models. The impact of SA has sparked extremely active discussions and ushered in an encouraging new wave of developing foundation models for the diverse tasks in the Euclidean domain, such as object detection and image inpainting. Despite the pro…

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: Work in progress

  22. arXiv:2304.04135  [pdf, other]

    cs.CV

    Propheter: Prophetic Teacher Guided Long-Tailed Distribution Learning

    Authors: Wenxiang Xu, Yongcheng Jing, Linyun Zhou, Wenqi Huang, Lechao Cheng, Zunlei Feng, Mingli Song

    Abstract: The problem of deep long-tailed learning, a prevalent challenge in the realm of generic visual recognition, persists in a multitude of real-world applications. To tackle the heavily-skewed dataset issue in long-tailed classification, prior efforts have sought to augment existing deep models with elaborate class-balancing strategies, such as class rebalancing, data augmentation, and module impr…

    Submitted 25 September, 2023; v1 submitted 8 April, 2023; originally announced April 2023.

    Comments: 12 pages

  23. arXiv:2304.00782  [pdf, other]

    cs.CV

    NeMF: Inverse Volume Rendering with Neural Microflake Field

    Authors: Youjia Zhang, Teng Xu, Junqing Yu, Yuteng Ye, Junle Wang, Yanqing Jing, Jingyi Yu, Wei Yang

    Abstract: Recovering the physical attributes of an object's appearance from its images captured under an unknown illumination is challenging yet essential for photo-realistic rendering. Recent approaches adopt the emerging implicit scene representations and have shown impressive results. However, they unanimously adopt a surface-based representation, and hence cannot properly handle scenes with very complex geom…

    Submitted 3 April, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

  24. arXiv:2303.10936  [pdf, other]

    cs.RO cs.CV

    Learning to Explore Informative Trajectories and Samples for Embodied Perception

    Authors: Ya Jing, Tao Kong

    Abstract: We are witnessing significant progress on perception models, specifically those trained on large-scale internet images. However, efficiently generalizing these perception models to unseen embodied tasks, which would help various relevant applications (e.g., home robots), is insufficiently studied. Unlike static perception methods trained on pre-collected images, the embodied agent can move around in…

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: To be published in IEEE International Conference on Robotics and Automation (ICRA), 2023

  25. arXiv:2212.05946  [pdf, other]

    cs.CV

    Evaluation and Improvement of Interpretability for Self-Explainable Part-Prototype Networks

    Authors: Qihan Huang, Mengqi Xue, Wenqi Huang, Haofei Zhang, Jie Song, Yongcheng Jing, Mingli Song

    Abstract: Part-prototype networks (e.g., ProtoPNet, ProtoTree, and ProtoPool) have attracted broad research interest for their intrinsic interpretability and comparable accuracy to non-interpretable counterparts. However, recent works find that the interpretability from prototypes is fragile, due to the semantic gap between the similarities in the feature space and those in the input space. In this work, we…

    Submitted 25 October, 2023; v1 submitted 12 December, 2022; originally announced December 2022.

  26. arXiv:2212.00532  [pdf, other]

    eess.IV cs.CV

    EBHI-Seg: A Novel Enteroscope Biopsy Histopathological Haematoxylin and Eosin Image Dataset for Image Segmentation Tasks

    Authors: Liyu Shi, Xiaoyan Li, Weiming Hu, Haoyuan Chen, Jing Chen, Zizhen Fan, Minghe Gao, Yujie Jing, Guotao Lu, Deguo Ma, Zhiyu Ma, Qingtao Meng, Dechao Tang, Hongzan Sun, Marcin Grzegorzek, Shouliang Qi, Yueyang Teng, Chen Li

    Abstract: Background and Purpose: Colorectal cancer is a common fatal malignancy, the fourth most common cancer in men, and the third most common cancer in women worldwide. Timely detection of cancer in its early stages is essential for treating the disease. Currently, there is a lack of datasets for histopathological image segmentation of rectal cancer, which often hampers the assessment accuracy when comp…

    Submitted 6 December, 2022; v1 submitted 1 December, 2022; originally announced December 2022.

  27. arXiv:2211.08006  [pdf, other]

    eess.IV cs.CV cs.LG stat.ME

    Auto-outlier Fusion Technique for Chest X-ray classification with Multi-head Attention Mechanism

    Authors: Yuru Jing, Zixuan Li

    Abstract: A chest X-ray is one of the most widely available radiological examinations for diagnosing and detecting various lung illnesses. The National Institutes of Health (NIH) provides extensive databases, ChestX-ray8 and ChestX-ray14, to help establish a deep learning community for analysing and predicting lung diseases. ChestX-ray14 consists of 112,120 frontal-view X-ray images of 30,805 distinct pati…

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: Accepted by the Journal of Image Processing Theory and Applications

  28. arXiv:2211.07652  [pdf]

    cs.LG cs.AI

    Machine Learning Performance Analysis to Predict Stroke Based on Imbalanced Medical Dataset

    Authors: Yuru Jing

    Abstract: Cerebral stroke, the second most substantial cause of death worldwide, has been a primary public health concern over the last few years. With the help of machine learning techniques, early detection of various stroke alerts is accessible, which can efficiently prevent or diminish the stroke. Medical datasets, however, are frequently unbalanced in their class labels, with a tendency to poorly predi…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: Accepted by CAIBDA 2022

  29. arXiv:2210.13076  [pdf, other]

    cs.CV cs.CL

    Towards Unifying Reference Expression Generation and Comprehension

    Authors: Duo Zheng, Tao Kong, Ya Jing, Jiaan Wang, Xiaojie Wang

    Abstract: Reference Expression Generation (REG) and Comprehension (REC) are two highly correlated tasks. Modeling REG and REC simultaneously for utilizing the relation between them is a promising way to improve both. However, the problem of distinct inputs, as well as building connections between them in a single model, brings challenges to the design and training of the joint model. To address the problems…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022 (main conference)

  30. arXiv:2209.07031  [pdf, other]

    cs.CL

    A semantic hierarchical graph neural network for text classification

    Authors: Shuai Hua, Xinxin Li, Yunpeng Jing, Qunfeng Liu

    Abstract: The key to the text classification task is language representation and important information extraction, and there are many related studies. In recent years, the research on graph neural networks (GNNs) in text classification has gradually emerged and shown its advantages, but the existing models mainly focus on directly inputting words as graph nodes into the GNN models, ignoring the different level…

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: 10 pages, 3 figures

  31. arXiv:2208.12145  [pdf, other]

    cs.LG math.PR

    A deep learning framework for geodesics under spherical Wasserstein-Fisher-Rao metric and its application for weighted sample generation

    Authors: Yang Jing, Jiaheng Chen, Lei Li, Jianfeng Lu

    Abstract: Wasserstein-Fisher-Rao (WFR) distance is a family of metrics to gauge the discrepancy of two Radon measures, which takes into account both transportation and weight change. Spherical WFR distance is a projected version of WFR distance for probability measures so that the space of Radon measures equipped with WFR can be viewed as a metric cone over the space of probability measures with spherical WFR…

    Submitted 25 August, 2022; originally announced August 2022.

  32. arXiv:2207.11681  [pdf, other]

    cs.CV

    Learning Graph Neural Networks for Image Style Transfer

    Authors: Yongcheng Jing, Yining Mao, Yiding Yang, Yibing Zhan, Mingli Song, Xinchao Wang, Dacheng Tao

    Abstract: State-of-the-art parametric and non-parametric style transfer approaches are prone to either distorted local style patterns due to global statistics alignment, or unpleasing artifacts resulting from patch mismatching. In this paper, we study a novel semi-parametric neural style transfer framework that alleviates the deficiency of both parametric and non-parametric stylization. The core idea of our…

    Submitted 13 February, 2023; v1 submitted 24 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022

  33. A Survey on Collaborative DNN Inference for Edge Intelligence

    Authors: Weiqing Ren, Yuben Qu, Chao Dong, Yuqian Jing, Hao Sun, Qihui Wu, Song Guo

    Abstract: With the vigorous development of artificial intelligence (AI), intelligent applications based on deep neural networks (DNNs) are changing people's lifestyles and production efficiency. However, the huge amount of computation and data generated from the network edge becomes the major bottleneck, and the traditional cloud-based computing mode has been unable to meet the requirements of real-time process…

    Submitted 15 July, 2022; originally announced July 2022.

    Journal ref: Mach. Intell. Res. 20 (2023) 370-395

  34. arXiv:2206.09337  [pdf, other]

    cs.CL

    Learning Multiscale Transformer Models for Sequence Generation

    Authors: Bei Li, Tong Zheng, Yi Jing, Chengbo Jiao, Tong Xiao, Jingbo Zhu

    Abstract: Multiscale feature hierarchies have proven successful in the computer vision area. This further motivates researchers to design multiscale Transformers for natural language processing, mostly based on the self-attention mechanism, for example by restricting the receptive field across heads or extracting local fine-grained features via convolutions. However, most existing works directly mo…

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: accepted by ICML2022

  35. arXiv:2205.11192  [pdf, other]

    cs.CV

    Active Domain Adaptation with Multi-level Contrastive Units for Semantic Segmentation

    Authors: Hao Zhang, Ruimao Zhang, Zhanglin Peng, Junle Wang, Yanqing Jing

    Abstract: To further reduce the cost of semi-supervised domain adaptation (SSDA) labeling, a more effective way is to use active learning (AL) to annotate a selected subset with specific properties. However, domain adaptation tasks are always addressed in two interactive aspects: domain transfer and the enhancement of discrimination, which requires the selected data to be both uncertain under the model and…

    Submitted 25 May, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

  36. arXiv:2205.03043  [pdf, other]

    cs.SD cs.AI cs.LG eess.AS

    Sound2Synth: Interpreting Sound via FM Synthesizer Parameters Estimation

    Authors: Zui Chen, Yansen Jing, Shengcheng Yuan, Yifei Xu, Jian Wu, Hang Zhao

    Abstract: A synthesizer is a type of electronic musical instrument that is now widely used in modern music production and sound design. Each parameter configuration of a synthesizer produces a unique timbre and can be viewed as a unique instrument. The problem of estimating the parameter configuration that best restores a sound timbre is an important yet complicated problem, i.e., the synthesizer parame…

    Submitted 28 July, 2022; v1 submitted 6 May, 2022; originally announced May 2022.

    Comments: 8 pages, 8 figures. v2: IJCAI2022 published, format revisions and bugfixes

  37. arXiv:2204.07205  [pdf]

    cs.SI

    Expanding the Reach of Research Computing: A Landscape Study

    Authors: Dhruva K. Chakravorty, Sarah K. Janes, James V. Howell, Lisa M. Perez, Amy Schultz, Marie Goldie, Austin L. Gamble, Rajiv Malkan, Honggao Liu, Daniel Mireles, Yuanqi Jing, Zhenhua He, Tim Cockerill

    Abstract: Research computing continues to play an ever-increasing role in academia. Access to computing resources, however, varies greatly between institutions. Sustaining the growing need for computing skills and access to advanced cyberinfrastructure requires that computing resources be available to students at all levels of scholarship, including community colleges. The National Science Foundation-funded…

    Submitted 18 April, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

  38. arXiv:2203.15958  [pdf, other]

    cs.CV cs.AI

    High-resolution Face Swapping via Latent Semantics Disentanglement

    Authors: Yangyang Xu, Bailin Deng, Junle Wang, Yanqing Jing, Jia Pan, Shengfeng He

    Abstract: We present a novel high-resolution face swapping method using the inherent prior knowledge of a pre-trained GAN model. Although previous research can leverage generative priors to produce high-resolution results, their quality can suffer from the entangled semantics of the latent space. We explicitly disentangle the latent semantics by utilizing the progressive nature of the generator, deriving st…

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: Paper is accepted by CVPR2022

  39. arXiv:2203.14812  [pdf]

    cs.CV eess.IV

    An attention mechanism based convolutional network for satellite precipitation downscaling over China

    Authors: Yinghong Jing, Liupeng Lin, Xinghua Li, Tongwen Li, Huanfeng Shen

    Abstract: Precipitation is a key part of hydrological circulation and is a sensitive indicator of climate change. The Integrated Multi-satellitE Retrievals for the Global Precipitation Measurement (GPM) mission (IMERG) datasets are widely used for global and regional precipitation investigations. However, their local application is limited by the relatively coarse spatial resolution. Therefore, in this pape…

    Submitted 28 March, 2022; originally announced March 2022.

  40. arXiv:2203.09176  [pdf, other]

    cs.CL

    ODE Transformer: An Ordinary Differential Equation-Inspired Model for Sequence Generation

    Authors: Bei Li, Quan Du, Tao Zhou, Yi Jing, Shuhan Zhou, Xin Zeng, Tong Xiao, JingBo Zhu, Xuebo Liu, Min Zhang

    Abstract: Residual networks are an Euler discretization of solutions to Ordinary Differential Equations (ODE). This paper explores a deeper relationship between Transformer and numerical ODE methods. We first show that a residual block of layers in Transformer can be described as a higher-order solution to ODE. Inspired by this, we design a new architecture, ODE Transformer, which is analogous to the…

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: Long paper accepted by ACL2022 main conference. arXiv admin note: substantial text overlap with arXiv:2104.02308

  41. arXiv:2201.08603  [pdf, other]

    cs.AR

    Trireme: Exploring Hierarchical Multi-Level Parallelism for Domain Specific Hardware Acceleration

    Authors: Georgios Zacharopoulos, Adel Ejjeh, Ying Jing, En-Yu Yang, Tianyu Jia, Iulian Brumar, Jeremy Intan, Muhammad Huzaifa, Sarita Adve, Vikram Adve, Gu-Yeon Wei, David Brooks

    Abstract: The design of heterogeneous systems that include domain specific accelerators is a challenging and time-consuming process. While taking into account area constraints, designers must decide which parts of an application to accelerate in hardware and which to leave in software. Moreover, applications in domains such as Extended Reality (XR) offer opportunities for various forms of parallel execution…

    Submitted 21 January, 2022; originally announced January 2022.

    Comments: 20 pages

  42. arXiv:2201.05232  [pdf, other]

    cs.AR

    FARSI: Facebook AR System Investigator for Agile Domain-Specific System-on-Chip Exploration

    Authors: Behzad Boroujerdian, Ying Jing, Amit Kumar, Lavanya Subramanian, Luke Yen, Vincent Lee, Vivek Venkatesan, Amit Jindal, Robert Shearer, Vijay Janapa Reddi

    Abstract: Domain-specific SoCs (DSSoCs) are attractive solutions for domains with stringent power/performance/area constraints; however, they suffer from two fundamental complexities. On the one hand, their many specialized hardware blocks result in complex systems and thus high development effort. On the other, their many system knobs expand the complexity of design space, making the search for the optimal…

    Submitted 17 January, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

  43. arXiv:2110.12751  [pdf, other]

    stat.ML cs.LG

    Maximum Correntropy Criterion Regression models with tending-to-zero scale parameters

    Authors: Ying Jing, Lianqiang Yang

    Abstract: Maximum correntropy criterion regression (MCCR) models have been well studied within the frame of statistical learning when the scale parameters take fixed values or go to infinity. This paper studies the MCCR models with tending-to-zero scale parameters. It is revealed that the optimal learning rate of MCCR models is ${\mathcal{O}}(n^{-1})$ in the asymptotic sense when the sample size $n$ goes to…

    Submitted 25 October, 2021; originally announced October 2021.

  44. arXiv:2109.12872  [pdf, other]

    cs.CV cs.LG

    Meta-Aggregator: Learning to Aggregate for 1-bit Graph Neural Networks

    Authors: Yongcheng Jing, Yiding Yang, Xinchao Wang, Mingli Song, Dacheng Tao

    Abstract: In this paper, we study a novel meta aggregation scheme towards binarizing graph neural networks (GNNs). We begin by developing a vanilla 1-bit GNN framework that binarizes both the GNN parameters and the graph features. Despite the lightweight architecture, we observed that this vanilla framework suffered from insufficient discriminative power in distinguishing graph topologies, leading to a dram…

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: Accepted to ICCV 2021

  45. arXiv:2109.10485  [pdf, other]

    cs.CL

    The NiuTrans Machine Translation Systems for WMT21

    Authors: Shuhan Zhou, Tao Zhou, Binghao Wei, Yingfeng Luo, Yongyu Mu, Zefan Zhou, Chenglong Wang, Xuanjun Zhou, Chuanhao Lv, Yi Jing, Laohu Wang, Jingnan Zhang, Canan Huang, Zhongxiang Yan, Chi Hu, Bei Li, Tong Xiao, Jingbo Zhu

    Abstract: This paper describes NiuTrans neural machine translation systems of the WMT 2021 news translation tasks. We made submissions to 9 language directions, including English$\leftrightarrow$$\{$Chinese, Japanese, Russian, Icelandic$\}$ and English$\rightarrow$Hausa tasks. Our primary systems are built on several effective variants of Transformer, e.g., Transformer-DLCL, ODE-Transformer. We also utilize…

    Submitted 21 September, 2021; originally announced September 2021.

  46. arXiv:2108.11082  [pdf, other]

    cs.CV cs.GR

    3D Face Recognition: A Survey

    Authors: Yaping Jing, Xuequan Lu, Shang Gao

    Abstract: Face recognition is one of the most studied research topics in the community. In recent years, the research on face recognition has shifted to using 3D facial surfaces, as more discriminating features can be represented by the 3D geometric information. This survey focuses on reviewing the 3D face recognition techniques developed in the past ten years which are generally categorized into convention…

    Submitted 25 August, 2021; originally announced August 2021.

  47. arXiv:2107.11099  [pdf, other]

    quant-ph cs.CV cs.LG

    RGB Image Classification with Quantum Convolutional Ansaetze

    Authors: Yu Jing, Xiaogang Li, Yang Yang, Chonghang Wu, Wenbing Fu, Wei Hu, Yuanyuan Li, Hua Xu

    Abstract: With the rapid growth of qubit numbers and coherence times in quantum hardware technology, implementing shallow neural networks on the so-called Noisy Intermediate-Scale Quantum (NISQ) devices has attracted a lot of interest. Many quantum (convolutional) circuit ansaetze have been proposed for grayscale image classification tasks with promising empirical results. However, when applying these ansaetze o…

    Submitted 22 February, 2022; v1 submitted 23 July, 2021; originally announced July 2021.

    Comments: https://link.springer.com/article/10.1007/s11128-022-03442-8

    Journal ref: Quantum Inf Process 21, 101 (2022)

  48. arXiv:2103.16284  [pdf, other]

    cs.CV

    Locate then Segment: A Strong Pipeline for Referring Image Segmentation

    Authors: Ya Jing, Tao Kong, Wei Wang, Liang Wang, Lei Li, Tieniu Tan

    Abstract: Referring image segmentation aims to segment the objects referred by a natural language expression. Previous methods usually focus on designing an implicit and recurrent feature interaction mechanism to fuse the visual-linguistic features to directly generate the final segmentation mask without explicitly modeling the localization information of the referent instances. To tackle these problems, we…

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: CVPR 2021

  49. arXiv:2103.14123  [pdf, other]

    cs.MA

    Preliminary Experimental Results of Context-Aware Teams of Multiple Autonomous Agents Operating under Constrained Communications

    Authors: Jose Martinez-Lorenzo, Jeff Hudack, Yutao Jing, Michael Shaham, Zixuan Liang, Abdullah Al Bashit, Yushu Wu, Weite Zhang, Matthew Skopin, Juan Heredia-Juesas, Yuntao Ma, Tristan Sweeney, Nicolas Ares, Ari Fox

    Abstract: This work presents and experimentally tests the framework used by our context-aware, distributed team of small Unmanned Aerial Systems (SUAS) capable of operating in real-time, in an autonomous fashion, and under constrained communications. Our framework relies on a three-layered approach: (1) Operational layer, where fast temporal and narrow spatial decisions are made; (2) Tactical Layer, where temp…

    Submitted 25 March, 2021; originally announced March 2021.

    Comments: 7 pages, 6 figures

  50. arXiv:2103.10478  [pdf, other]

    cs.LG

    Unsupervised Doppler Radar-Based Activity Recognition for e-Healthcare

    Authors: Yordanka Karayaneva, Sara Sharifzadeh, Wenda Li, Yanguo Jing, Bo Tan

    Abstract: Passive radio frequency (RF) sensing and monitoring of human daily activities in elderly care homes is an emerging topic. Micro-Doppler radars are an appealing solution considering their non-intrusiveness, deep penetration, and high-distance range. Unsupervised activity recognition using Doppler radar data has not received attention, in spite of its importance in case of unlabelled or poorly label…

    Submitted 2 November, 2021; v1 submitted 18 March, 2021; originally announced March 2021.