Showing 1–50 of 99 results for author: Ning, X

Searching in archive cs.
  1. arXiv:2407.13519  [pdf, other]

    cs.CV

    GPSFormer: A Global Perception and Local Structure Fitting-based Transformer for Point Cloud Understanding

    Authors: Changshuo Wang, Meiqing Wu, Siew-Kei Lam, Xin Ning, Shangshu Yu, Ruiping Wang, Weijun Li, Thambipillai Srikanthan

    Abstract: Despite the significant advancements in pre-training methods for point cloud understanding, directly capturing intricate shape information from irregular point clouds without reliance on external data remains a formidable challenge. To address this problem, we propose GPSFormer, an innovative Global Perception and Local Structure Fitting-based Transformer, which learns detailed shape information f…

    Submitted 18 July, 2024; originally announced July 2024.

  2. arXiv:2407.04629  [pdf, other]

    cs.CL cs.AI

    Entity Decomposition with Filtering: A Zero-Shot Clinical Named Entity Recognition Framework

    Authors: Reza Averly, Xia Ning

    Abstract: Clinical named entity recognition (NER) aims to retrieve important entities within clinical narratives. Recent works have demonstrated that large language models (LLMs) can achieve strong performance in this task. While previous works focus on proprietary LLMs, we investigate how open NER LLMs, trained specifically for entity recognition, perform in clinical NER. In this paper, we aim to improve t…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Preprint

  3. arXiv:2407.00945  [pdf, other]

    cs.LG

    Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs

    Authors: Enshu Liu, Junyi Zhu, Zinan Lin, Xuefei Ning, Matthew B. Blaschko, Shengen Yan, Guohao Dai, Huazhong Yang, Yu Wang

    Abstract: The rapid advancement of large language models (LLMs) has led to architectures with billions to trillions of parameters, posing significant deployment challenges due to their substantial demands on memory, processing power, and energy consumption. Sparse Mixture-of-Experts (SMoE) architectures have emerged as a solution, activating only a subset of parameters per token, thereby achieving faster in…

    Submitted 30 June, 2024; originally announced July 2024.

  4. arXiv:2406.14909  [pdf, other]

    cs.LG cs.AI cs.CL

    MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression

    Authors: Tianyu Fu, Haofeng Huang, Xuefei Ning, Genghan Zhang, Boju Chen, Tianqi Wu, Hongyi Wang, Zixiao Huang, Shiyao Li, Shengen Yan, Guohao Dai, Huazhong Yang, Yu Wang

    Abstract: Sparse attention can effectively mitigate the significant memory and throughput demands of Large Language Models (LLMs) in long contexts. Existing methods typically employ a uniform sparse attention mask, applying the same sparse pattern across different attention heads and input lengths. However, this uniform approach fails to capture the diverse attention patterns inherent in LLMs, ignoring thei…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 10 pages

    ACM Class: I.2.7

  5. arXiv:2406.14629  [pdf, other]

    cs.CL cs.AI

    Can LLMs Learn by Teaching? A Preliminary Study

    Authors: Xuefei Ning, Zifu Wang, Shiyao Li, Zinan Lin, Peiran Yao, Tianyu Fu, Matthew B. Blaschko, Guohao Dai, Huazhong Yang, Yu Wang

    Abstract: Teaching to improve student models (e.g., knowledge distillation) is an extensively studied methodology in LLMs. However, for humans, teaching not only improves students but also improves teachers. We ask: Can LLMs also learn by teaching (LbT)? If yes, we can potentially unlock the possibility of continuously advancing the models without solely relying on human-produced data or stronger models. In…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Under review

  6. arXiv:2406.08552  [pdf, other]

    cs.CV

    DiTFastAttn: Attention Compression for Diffusion Transformer Models

    Authors: Zhihang Yuan, Pu Lu, Hanling Zhang, Xuefei Ning, Linfeng Zhang, Tianchen Zhao, Shengen Yan, Guohao Dai, Yu Wang

    Abstract: Diffusion Transformers (DiT) excel at image and video generation but face computational challenges due to self-attention's quadratic complexity. We propose DiTFastAttn, a novel post-training compression method to alleviate DiT's computational bottleneck. We identify three key redundancies in the attention computation during DiT inference: 1. spatial redundancy, where many attention heads focus on…

    Submitted 12 June, 2024; originally announced June 2024.

  7. arXiv:2406.02540  [pdf, other]

    cs.CV

    ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation

    Authors: Tianchen Zhao, Tongcheng Fang, Enshu Liu, Rui Wan, Widyadewi Soedarmadji, Shiyao Li, Zinan Lin, Guohao Dai, Shengen Yan, Huazhong Yang, Xuefei Ning, Yu Wang

    Abstract: Diffusion transformers (DiTs) have exhibited remarkable performance in visual generation tasks, such as generating realistic images or videos based on textual instructions. However, larger model sizes and multi-frame processing for video generation lead to increased computational and memory costs, posing challenges for practical deployment on edge devices. Post-Training Quantization (PTQ) is an ef…

    Submitted 30 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Project Page: https://a-suozhang.xyz/viditq.github.io/

  8. arXiv:2405.20710  [pdf, other]

    cs.IR

    Information Maximization via Variational Autoencoders for Cross-Domain Recommendation

    Authors: Xuying Ning, Wujiang Xu, Xiaolei Liu, Mingming Ha, Qiongxu Ma, Youru Li, Linxun Chen, Yongfeng Zhang

    Abstract: Cross-Domain Sequential Recommendation (CDSR) methods aim to address the data sparsity and cold-start problems present in Single-Domain Sequential Recommendation (SDSR). Existing CDSR methods typically rely on overlapping users, designing complex cross-domain modules to capture users' latent interests that can propagate across different domains. However, their propagated informative information is…

    Submitted 31 May, 2024; originally announced May 2024.

  9. arXiv:2405.17890  [pdf, other]

    cs.IR cs.CL cs.LG

    SLMRec: Empowering Small Language Models for Sequential Recommendation

    Authors: Wujiang Xu, Zujie Liang, Jiaojiao Han, Xuying Ning, Wenfang Lin, Linxun Chen, Feng Wei, Yongfeng Zhang

    Abstract: The sequential recommendation (SR) task involves predicting the next item a user is likely to interact with, given their past interactions. SR models examine the sequence of a user's actions to discern more complex behavioral patterns and temporal dynamics. Recent research demonstrates the great impact of LLMs on sequential recommendation systems, either viewing sequential recommendation as la…

    Submitted 28 May, 2024; originally announced May 2024.

  10. arXiv:2405.17873  [pdf, other]

    cs.CV cs.AI

    MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization

    Authors: Tianchen Zhao, Xuefei Ning, Tongcheng Fang, Enshu Liu, Guyue Huang, Zinan Lin, Shengen Yan, Guohao Dai, Yu Wang

    Abstract: Diffusion models have achieved significant visual generation quality. However, their significant computational and memory costs pose a challenge for their application on resource-constrained mobile devices or even desktop GPUs. Recent few-step diffusion models reduce the inference time by reducing the denoising steps. However, their memory consumption is still excessive. The Post Training Quantiz…

    Submitted 29 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Project Page: https://a-suozhang.xyz/mixdq.github.io/

  11. arXiv:2405.16256  [pdf, other]

    cs.DC cs.AI

    HetHub: A Heterogeneous distributed hybrid training system for large-scale models

    Authors: Si Xu, Zixiao Huang, Yan Zeng, Shengen Yan, Xuefei Ning, Haolin Ye, Sipei Gu, Chunsheng Shui, Zhezheng Lin, Hao Zhang, Sheng Wang, Guohao Dai, Yu Wang

    Abstract: The development of large-scale models relies on a vast number of computing resources. For example, the GPT-4 model (1.8 trillion parameters) requires 25000 A100 GPUs for its training. It is a challenge to build a large-scale cluster with a single type of GPU accelerator. Using multiple types of GPU accelerators to construct a cluster is an effective way to solve the problem of insufficient homogeneous GP…

    Submitted 25 May, 2024; originally announced May 2024.

  12. arXiv:2405.14224  [pdf, other]

    cs.CV

    DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis

    Authors: Yao Teng, Yue Wu, Han Shi, Xuefei Ning, Guohao Dai, Yu Wang, Zhenguo Li, Xihui Liu

    Abstract: Diffusion models have achieved great success in image generation, with the backbone evolving from U-Net to Vision Transformers. However, the computational cost of Transformers is quadratic to the number of tokens, leading to significant challenges when dealing with high-resolution images. In this work, we propose Diffusion Mamba (DiM), which combines the efficiency of Mamba, a sequence model based…

    Submitted 10 July, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: The code of our work is available here: https://github.com/tyshiwo1/DiM-DiffusionMamba/

  13. arXiv:2404.15760  [pdf, other]

    cs.LG cs.AI stat.ML

    Debiasing Machine Unlearning with Counterfactual Examples

    Authors: Ziheng Chen, Jia Wang, Jun Zhuang, Abbavaram Gowtham Reddy, Fabrizio Silvestri, Jin Huang, Kaushiki Nag, Kun Kuang, Xin Ning, Gabriele Tolomei

    Abstract: The right to be forgotten (RTBF) seeks to safeguard individuals from the enduring effects of their historical actions by implementing machine-learning techniques. These techniques facilitate the deletion of previously acquired knowledge without requiring extensive model retraining. However, they often overlook a critical issue: unlearning processes bias. This bias emerges from two main sources: (1…

    Submitted 24 April, 2024; originally announced April 2024.

  14. arXiv:2404.15264  [pdf, other]

    cs.CV

    TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting

    Authors: Jiahe Li, Jiawei Zhang, Xiao Bai, Jin Zheng, Xin Ning, Jun Zhou, Lin Gu

    Abstract: Radiance fields have demonstrated impressive performance in synthesizing lifelike 3D talking heads. However, due to the difficulty in fitting steep appearance changes, the prevailing paradigm that presents facial motions by directly modifying point appearance may lead to distortions in dynamic regions. To tackle this challenge, we introduce TalkingGaussian, a deformation-based radiance fields fram…

    Submitted 5 July, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: Accepted at ECCV 2024. Project page: https://fictionarry.github.io/TalkingGaussian/

  15. arXiv:2404.14294  [pdf, other]

    cs.CL cs.AI

    A Survey on Efficient Inference for Large Language Models

    Authors: Zixuan Zhou, Xuefei Ning, Ke Hong, Tianyu Fu, Jiaming Xu, Shiyao Li, Yuming Lou, Luning Wang, Zhihang Yuan, Xiuhong Li, Shengen Yan, Guohao Dai, Xiao-Ping Zhang, Yuhan Dong, Yu Wang

    Abstract: Large Language Models (LLMs) have attracted extensive attention due to their remarkable performance across various tasks. However, the substantial computational and memory requirements of LLM inference pose challenges for deployment in resource-constrained scenarios. Efforts within the field have been directed towards developing techniques aimed at enhancing the efficiency of LLM inference. This p…

    Submitted 8 June, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

  16. arXiv:2404.02241  [pdf, other]

    cs.CV

    Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better

    Authors: Enshu Liu, Junyi Zhu, Zinan Lin, Xuefei Ning, Matthew B. Blaschko, Sergey Yekhanin, Shengen Yan, Guohao Dai, Huazhong Yang, Yu Wang

    Abstract: Diffusion Models (DM) and Consistency Models (CM) are two types of popular generative models with good generation quality on various tasks. When training DM and CM, intermediate weight checkpoints are not fully utilized and only the last converged checkpoint is used. In this work, we find that high-quality model weights often lie in a basin which cannot be reached by SGD but can be obtained by pro…

    Submitted 7 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  17. arXiv:2403.17491  [pdf, other]

    cs.CL

    DGoT: Dynamic Graph of Thoughts for Scientific Abstract Generation

    Authors: Xinyu Ning, Yutong Zhao, Yitong Liu, Hongwen Yang

    Abstract: The method of training language models based on domain datasets has obtained significant achievements in the task of generating scientific paper abstracts. However, such models face problems of generalization and expensive training costs. The use of large language models (LLMs) to solve the task of generating paper abstracts saves the cost of model training. However, due to the hallucination probl…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING 2024

  18. arXiv:2403.16379  [pdf, other]

    cs.CV

    FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models

    Authors: Lin Zhao, Tianchen Zhao, Zinan Lin, Xuefei Ning, Guohao Dai, Huazhong Yang, Yu Wang

    Abstract: In recent years, there has been significant progress in the development of text-to-image generative models. Evaluating the quality of the generative models is one essential step in the development process. Unfortunately, the evaluation process could consume a significant amount of computational resources, making the required periodic evaluation of model performance (e.g., monitoring training progr…

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: The paper is accepted by CVPR 2024

  19. arXiv:2403.06912  [pdf, other]

    cs.CV

    DNGaussian: Optimizing Sparse-View 3D Gaussian Radiance Fields with Global-Local Depth Normalization

    Authors: Jiahe Li, Jiawei Zhang, Xiao Bai, Jin Zheng, Xin Ning, Jun Zhou, Lin Gu

    Abstract: Radiance fields have demonstrated impressive performance in synthesizing novel views from sparse input views, yet prevailing methods suffer from high training costs and slow inference speed. This paper introduces DNGaussian, a depth-regularized framework based on 3D Gaussian radiance fields, offering real-time and high-quality few-shot novel view synthesis at low costs. Our motivation stems from t…

    Submitted 24 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted at CVPR 2024. Project page: https://fictionarry.github.io/DNGaussian/

  20. arXiv:2402.18158  [pdf, other]

    cs.CL cs.AI

    Evaluating Quantized Large Language Models

    Authors: Shiyao Li, Xuefei Ning, Luning Wang, Tengxuan Liu, Xiangsheng Shi, Shengen Yan, Guohao Dai, Huazhong Yang, Yu Wang

    Abstract: Post-training quantization (PTQ) has emerged as a promising technique to reduce the cost of large language models (LLMs). Specifically, PTQ can effectively mitigate memory consumption and reduce computational overhead in LLMs. To meet the requirements of both high efficiency and performance across diverse scenarios, a comprehensive evaluation of quantized LLMs is essential to guide the selection o…

    Submitted 6 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  21. arXiv:2402.09391  [pdf, other]

    cs.AI cs.CE cs.CL

    LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset

    Authors: Botao Yu, Frazier N. Baker, Ziqi Chen, Xia Ning, Huan Sun

    Abstract: Chemistry plays a crucial role in many domains, such as drug discovery and material science. While large language models (LLMs) such as GPT-4 exhibit remarkable capabilities on natural language processing tasks, existing research indicates that their performance on chemistry tasks is discouragingly low. In this paper, however, we demonstrate that our developed LLMs can achieve very strong results…

    Submitted 1 April, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: Added further analysis experiments. Work in progress

  22. arXiv:2402.08831  [pdf, other]

    cs.CL cs.AI cs.IR

    eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data

    Authors: Bo Peng, Xinyi Ling, Ziru Chen, Huan Sun, Xia Ning

    Abstract: With tremendous efforts on developing effective e-commerce models, conventional e-commerce models show limited success in generalist e-commerce modeling, and suffer from unsatisfactory performance on new users and new products - a typical out-of-domain generalization challenge. Meanwhile, large language models (LLMs) demonstrate outstanding performance in generalist modeling and out-of-domain gene…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Bo Peng and Xinyi Ling contributed equally to this paper

  23. arXiv:2402.05136  [pdf, other]

    cs.CL

    LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K

    Authors: Tao Yuan, Xuefei Ning, Dong Zhou, Zhijie Yang, Shiyao Li, Minghui Zhuang, Zheyue Tan, Zhuyu Yao, Dahua Lin, Boxun Li, Guohao Dai, Shengen Yan, Yu Wang

    Abstract: State-of-the-art large language models (LLMs) are now claiming remarkable supported context lengths of 256k or even more. In contrast, the average context lengths of mainstream benchmarks are insufficient (5k-21k), and they suffer from potential knowledge leakage and inaccurate metrics, resulting in biased evaluation. This paper introduces LV-Eval, a challenging long-context benchmark with five le…

    Submitted 6 February, 2024; originally announced February 2024.

  24. arXiv:2401.16299  [pdf, other]

    cs.LG cs.AI

    Enhancing Molecular Property Prediction with Auxiliary Learning and Task-Specific Adaptation

    Authors: Vishal Dey, Xia Ning

    Abstract: Pretrained Graph Neural Networks have been widely adopted for various molecular property prediction tasks. Despite their ability to encode structural and relational features of molecules, traditional fine-tuning of such pretrained GNNs on the target task can lead to poor generalization. To address this, we explore the adaptation of pretrained GNNs to the target task by jointly training them with m…

    Submitted 29 January, 2024; originally announced January 2024.

  25. arXiv:2401.14807  [pdf, other]

    cs.CV

    PL-FSCIL: Harnessing the Power of Prompts for Few-Shot Class-Incremental Learning

    Authors: Songsong Tian, Lusi Li, Weijun Li, Hang Ran, Li Li, Xin Ning

    Abstract: Few-Shot Class-Incremental Learning (FSCIL) aims to enable deep neural networks to learn new tasks incrementally from a small number of labeled samples without forgetting previously learned tasks, closely mimicking human learning patterns. In this paper, we propose a novel approach called Prompt Learning for FSCIL (PL-FSCIL), which harnesses the power of prompts in conjunction with a pre-trained V…

    Submitted 26 January, 2024; originally announced January 2024.

  26. arXiv:2401.03868  [pdf, other]

    cs.AR cs.AI

    FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs

    Authors: Shulin Zeng, Jun Liu, Guohao Dai, Xinhao Yang, Tianyu Fu, Hongyi Wang, Wenheng Ma, Hanbo Sun, Shiyao Li, Zixiao Huang, Yadong Dai, Jintao Li, Zehao Wang, Ruoyu Zhang, Kairui Wen, Xuefei Ning, Yu Wang

    Abstract: Transformer-based Large Language Models (LLMs) have made a significant impact on various domains. However, LLMs' efficiency suffers from both heavy computation and memory overheads. Compression techniques like sparsification and quantization are commonly used to mitigate the gap between LLM's computation/memory overheads and hardware capacity. However, existing GPU and transformer-based accelerato…

    Submitted 9 January, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: Accepted to FPGA'24

  27. arXiv:2401.01772  [pdf, other]

    cs.AI cs.NI

    A Novel Paradigm for Neural Computation: X-Net with Learnable Neurons and Adaptable Structure

    Authors: Yanjie Li, Weijun Li, Lina Yu, Min Wu, Jinyi Liu, Wenqiang Li, Meilan Hao, Shu Wei, Yusong Deng, Liping Zhang, Xiaoli Dong, Hong Qin, Xin Ning, Yugui Zhang, Baoli Lu, Jian Xu, Shuang Li

    Abstract: The multilayer perceptron (MLP) has permeated various disciplinary domains, ranging from bioinformatics to financial analytics, where its application has become an indispensable facet of contemporary scientific research endeavors. However, the MLP has obvious drawbacks. 1) The type of activation function is single and relatively fixed, which leads to poor 'representation ability' of the network, and it…

    Submitted 12 July, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: 35 pages

  28. arXiv:2312.07243  [pdf, other]

    cs.AI

    A Unified Sampling Framework for Solver Searching of Diffusion Probabilistic Models

    Authors: Enshu Liu, Xuefei Ning, Huazhong Yang, Yu Wang

    Abstract: Recent years have witnessed the rapid progress and broad application of diffusion probabilistic models (DPMs). Sampling from DPMs can be viewed as solving an ordinary differential equation (ODE). Despite the promising performance, the generation of DPMs usually consumes much time due to the large number of function evaluations (NFE). Though recent works have accelerated the sampling to around 20 s…

    Submitted 12 December, 2023; originally announced December 2023.

  29. arXiv:2311.17177  [pdf, other]

    cs.CV

    THInImg: Cross-modal Steganography for Presenting Talking Heads in Images

    Authors: Lin Zhao, Hongxuan Li, Xuefei Ning, Xinru Jiang

    Abstract: Cross-modal Steganography is the practice of concealing secret signals in publicly available cover signals (distinct from the modality of the secret signals) unobtrusively. While previous approaches primarily concentrated on concealing a relatively small amount of information, we propose THInImg, which manages to hide lengthy audio data (and subsequently decode talking head video) inside an identi…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Accepted at WACV 2024

  30. arXiv:2311.10996  [pdf, other]

    cs.LG

    BrainZ-BP: A Non-invasive Cuff-less Blood Pressure Estimation Approach Leveraging Brain Bio-impedance and Electrocardiogram

    Authors: Bufang Yang, Le Liu, Wenxuan Wu, Mengliang Zhou, Hongxing Liu, Xinbao Ning

    Abstract: Accurate and continuous blood pressure (BP) monitoring is essential to the early prevention of cardiovascular diseases. Non-invasive and cuff-less BP estimation algorithms have gained much attention in recent years. Previous studies have demonstrated that brain bio-impedance (BIOZ) is a promising technique for non-invasive intracranial pressure (ICP) monitoring. Clinically, treatment for patients wi…

    Submitted 23 November, 2023; v1 submitted 18 November, 2023; originally announced November 2023.

  31. arXiv:2311.07742  [pdf, other]

    cs.IR

    Modeling Sequences as Star Graphs to Address Over-smoothing in Self-attentive Sequential Recommendation

    Authors: Bo Peng, Ziqi Chen, Srinivasan Parthasarathy, Xia Ning

    Abstract: Self-attention (SA) mechanisms have been widely used in developing sequential recommendation (SR) methods, and demonstrated state-of-the-art performance. However, in this paper, we show that self-attentive SR methods substantially suffer from the over-smoothing issue that item embeddings within a sequence become increasingly similar across attention blocks. As widely demonstrated in the literature…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2209.07997

    Journal ref: ACM Transactions on Knowledge Discovery from Data (TKDD) 2024

  32. arXiv:2311.07475   

    cs.CV

    Masked Face Dataset Generation and Masked Face Recognition

    Authors: Rui Cai, Xuying Ning, Peter N. Belhumeur

    Abstract: In the post-pandemic era, wearing face masks has posed a great challenge to ordinary face recognition. In a previous study, researchers applied pretrained VGG16 and ResNet50 to extract features on the elaborately curated existing masked face recognition (MFR) datasets, RMFRD and SMFRD. To make the model more adaptable to the real-world situation where the sample size is smaller and the came…

    Submitted 25 December, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: This is not a conference paper and is just a technical report

  33. arXiv:2311.07414  [pdf, other]

    cs.CV

    FIRST: A Million-Entry Dataset for Text-Driven Fashion Synthesis and Design

    Authors: Zhen Huang, Yihao Li, Dong Pei, Jiapeng Zhou, Xuliang Ning, Jianlin Han, Xiaoguang Han, Xuejun Chen

    Abstract: Text-driven fashion synthesis and design is an extremely valuable part of artificial intelligence generative content (AIGC), which has the potential to propel a tremendous revolution in the traditional fashion industry. To advance the research on text-driven fashion synthesis and design, we introduce a new dataset comprising a million high-resolution fashion images with rich structured textual (FIRS…

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 11 pages, 8 figures

  34. arXiv:2311.04760  [pdf, other]

    cs.IR cs.LG

    Towards Open-world Cross-Domain Sequential Recommendation: A Model-Agnostic Contrastive Denoising Approach

    Authors: Wujiang Xu, Xuying Ning, Wenfang Lin, Mingming Ha, Qiongxu Ma, Qianqiao Liang, Xuewen Tao, Linxun Chen, Bing Han, Minnan Luo

    Abstract: Cross-domain sequential recommendation (CDSR) aims to address the data sparsity problems that exist in traditional sequential recommendation (SR) systems. The existing approaches aim to design a specific cross-domain unit that can transfer and propagate information across multiple domains by relying on overlapping users with abundant behaviors. However, in real-world recommender systems, CDSR sc…

    Submitted 5 June, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

  35. Deep Learning-based 3D Point Cloud Classification: A Systematic Survey and Outlook

    Authors: Huang Zhang, Changshuo Wang, Shengwei Tian, Baoli Lu, Liping Zhang, Xin Ning, Xiao Bai

    Abstract: In recent years, point cloud representation has become one of the research hotspots in the field of computer vision, and has been widely used in many fields, such as autonomous driving, virtual reality, robotics, etc. Although deep learning techniques have achieved great success in processing regular structured 2D grid image data, there are still great challenges in processing irregular, unstructu…

    Submitted 5 November, 2023; originally announced November 2023.

    Journal ref: Displays 102456 (2023)

  36. arXiv:2311.00603  [pdf, other]

    cs.CV

    Occluded Person Re-Identification with Deep Learning: A Survey and Perspectives

    Authors: Enhao Ning, Changshuo Wang, Huang Zhang, Xin Ning, Prayag Tiwari

    Abstract: Person re-identification (Re-ID) technology plays an increasingly crucial role in intelligent surveillance systems. Widespread occlusion significantly impacts the performance of person Re-ID. Occluded person Re-ID refers to a pedestrian matching method that deals with challenges such as pedestrian information loss, noise interference, and perspective misalignment. It has garnered extensive attenti…

    Submitted 1 November, 2023; originally announced November 2023.

  37. arXiv:2310.15211  [pdf, other]

    q-bio.QM cs.AI cs.LG q-bio.MN

    Modeling Path Importance for Effective Alzheimer's Disease Drug Repurposing

    Authors: Shunian Xiang, Patrick J. Lawrence, Bo Peng, ChienWei Chiang, Dokyoon Kim, Li Shen, Xia Ning

    Abstract: Recently, drug repurposing has emerged as an effective and resource-efficient paradigm for AD drug discovery. Among various methods for drug repurposing, network-based methods have shown promising results as they are capable of leveraging complex networks that integrate multiple interaction types, such as protein-protein interactions, to more effectively identify candidate drugs. However, existing…

    Submitted 27 October, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 16 pages, 3 figures, 2 tables, 1 supplementary figure, 5 supplementary tables, Preprint of an article accepted for publication in Pacific Symposium on Biocomputing ©2023 World Scientific Publishing Co., Singapore, http://psb.stanford.edu/

  38. arXiv:2310.13725  [pdf]

    cs.LG cs.AI q-bio.QM

    Enhancing drug and cell line representations via contrastive learning for improved anti-cancer drug prioritization

    Authors: Patrick J. Lawrence, Xia Ning

    Abstract: Due to cancer's complex nature and variable response to therapy, precision oncology informed by omics sequence analysis has become the current standard of care. However, the amount of data produced for each patient makes it difficult to quickly identify the best treatment regimen. Moreover, limited data availability has hindered computational methods' abilities to learn patterns associated with e…

    Submitted 27 October, 2023; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: 60 pages, 4 figures, 4 tables, 11 supplementary tables, 1 supplementary note, submitted to Nature Communications

  39. arXiv:2310.01612  [pdf, other]

    cs.IR

    Towards Efficient and Effective Adaptation of Large Language Models for Sequential Recommendation

    Authors: Bo Peng, Ben Burns, Ziqi Chen, Srinivasan Parthasarathy, Xia Ning

    Abstract: In recent years, with large language models (LLMs) achieving state-of-the-art performance in context understanding, increasing efforts have been dedicated to developing LLM-enhanced sequential recommendation (SR) methods. Considering that most existing LLMs are not specifically optimized for recommendation tasks, adapting them for SR becomes a critical step in LLM-enhanced SR methods. Though numer…

    Submitted 2 October, 2023; originally announced October 2023.

  40. arXiv:2309.10195  [pdf, other]

    cs.IR

    Multi-modality Meets Re-learning: Mitigating Negative Transfer in Sequential Recommendation

    Authors: Bo Peng, Srinivasan Parthasarathy, Xia Ning

    Abstract: Learning effective recommendation models from sparse user interactions represents a fundamental challenge in developing sequential recommendation methods. Recently, pre-training-based methods have been developed to tackle this challenge. Though promising, in this paper, we show that existing methods suffer from the notorious negative transfer issue, where the model adapted from the pre-trained mod…

    Submitted 20 September, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

  41. arXiv:2309.04967  [pdf, other]

    cs.CV

    Towards Fully Decoupled End-to-End Person Search

    Authors: Pengcheng Zhang, Xiao Bai, Jin Zheng, Xin Ning

    Abstract: End-to-end person search aims to jointly detect and re-identify a target person in raw scene images with a unified model. The detection task unifies all persons while the re-id task discriminates different identities, resulting in conflicting optimization objectives. Existing works proposed to decouple end-to-end person search to alleviate such conflict. Yet these methods are still sub-optimal on one or t…

    Submitted 10 March, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

    Comments: DICTA 2023 Best Student Paper

  42. arXiv:2309.02671  [pdf, other]

    cs.LG cs.AI

    RLSynC: Offline-Online Reinforcement Learning for Synthon Completion

    Authors: Frazier N. Baker, Ziqi Chen, Daniel Adu-Ampratwum, Xia Ning

    Abstract: Retrosynthesis is the process of determining the set of reactant molecules that can react to form a desired product. Semi-template-based retrosynthesis methods, which imitate the reverse logic of synthesis reactions, first predict the reaction centers in the products, and then complete the resulting synthons back into reactants. We develop a new offline-online reinforcement learning method RLSynC…

    Submitted 29 March, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: 32 pages, 5 figures, 4 tables

  43. arXiv:2308.11890  [pdf, other]

    cs.LG q-bio.BM

    Shape-conditioned 3D Molecule Generation via Equivariant Diffusion Models

    Authors: Ziqi Chen, Bo Peng, Srinivasan Parthasarathy, Xia Ning

    Abstract: Ligand-based drug design aims to identify novel drug candidates of similar shapes with known active molecules. In this paper, we formulated an in silico shape-conditioned molecule generation problem to generate 3D molecule structures conditioned on the shape of a given molecule. To address this problem, we developed a translation- and rotation-equivariant shape-guided generative model ShapeMol. Sh…

    Submitted 16 October, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

  44. arXiv:2307.15337  [pdf, other]

    cs.CL cs.AI

    Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation

    Authors: Xuefei Ning, Zinan Lin, Zixuan Zhou, Zifu Wang, Huazhong Yang, Yu Wang

    Abstract: This work aims at decreasing the end-to-end generation latency of large language models (LLMs). One of the major causes of the high generation latency is the sequential decoding approach adopted by almost all state-of-the-art LLMs. In this work, motivated by the thinking and writing process of humans, we propose Skeleton-of-Thought (SoT), which first guides LLMs to generate the skeleton of the ans…

    Submitted 1 March, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: In ICLR'24

  45. arXiv:2307.08209  [pdf, other]

    cs.CV

    Ada3D : Exploiting the Spatial Redundancy with Adaptive Inference for Efficient 3D Object Detection

    Authors: Tianchen Zhao, Xuefei Ning, Ke Hong, Zhongyuan Qiu, Pu Lu, Yali Zhao, Linfeng Zhang, Lipu Zhou, Guohao Dai, Huazhong Yang, Yu Wang

    Abstract: Voxel-based methods have achieved state-of-the-art performance for 3D object detection in autonomous driving. However, their significant computational and memory costs pose a challenge for their application to resource-constrained vehicles. One reason for this high resource consumption is the presence of a large number of redundant background points in Lidar point clouds, resulting in spatial redu…

    Submitted 8 August, 2023; v1 submitted 16 July, 2023; originally announced July 2023.

    Comments: Accepted at ICCV2023

  46. arXiv:2307.03416  [pdf, other]

    cs.CV

    Learning Adversarial Semantic Embeddings for Zero-Shot Recognition in Open Worlds

    Authors: Tianqi Li, Guansong Pang, Xiao Bai, Jin Zheng, Lei Zhou, Xin Ning

    Abstract: Zero-Shot Learning (ZSL) focuses on classifying samples of unseen classes with only their side semantic information presented during training. It cannot handle real-life, open-world scenarios where there are test samples of unknown classes for which neither samples (e.g., images) nor their side semantic information is known during training. Open-Set Recognition (OSR) is dedicated to addressing the…

    Submitted 7 July, 2023; originally announced July 2023.

    ACM Class: I.4; I.5

  47. arXiv:2306.17771  [pdf, other]

    cs.LG cs.IR q-bio.QM

    Precision Anti-Cancer Drug Selection via Neural Ranking

    Authors: Vishal Dey, Xia Ning

    Abstract: Personalized cancer treatment requires a thorough understanding of complex interactions between drugs and cancer cell lines in varying genetic and molecular contexts. To address this, high-throughput screening has been used to generate large-scale drug response data, facilitating data-driven computational models. Such models can capture complex drug-cell line interactions across various contexts i…

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: Accepted in BioKDD '23

  48. arXiv:2306.08860  [pdf, other]

    cs.LG

    OMS-DPM: Optimizing the Model Schedule for Diffusion Probabilistic Models

    Authors: Enshu Liu, Xuefei Ning, Zinan Lin, Huazhong Yang, Yu Wang

    Abstract: Diffusion probabilistic models (DPMs) are a new class of generative models that have achieved state-of-the-art generation quality in various domains. Despite the promise, one major drawback of DPMs is the slow generation speed due to the large number of neural network evaluations required in the generation process. In this paper, we reveal an overlooked dimension -- model schedule -- for optimizin…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted by ICML2023

  49. arXiv:2304.08130  [pdf, other]

    cs.CV

    A Survey on Few-Shot Class-Incremental Learning

    Authors: Songsong Tian, Lusi Li, Weijun Li, Hang Ran, Xin Ning, Prayag Tiwari

    Abstract: Large deep learning models are impressive, but they struggle when real-time data is not available. Few-shot class-incremental learning (FSCIL) poses a significant challenge for deep neural networks to learn new tasks from just a few labeled samples without forgetting the previously learned ones. This setup easily leads to catastrophic forgetting and overfitting problems, severely affecting model p…

    Submitted 23 October, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

  50. arXiv:2303.02162  [pdf, other]

    q-bio.QM cs.LG

    T-Cell Receptor Optimization with Reinforcement Learning and Mutation Policies for Precision Immunotherapy

    Authors: Ziqi Chen, Martin Renqiang Min, Hongyu Guo, Chao Cheng, Trevor Clancy, Xia Ning

    Abstract: T cells monitor the health status of cells by identifying foreign peptides displayed on their surface. T-cell receptors (TCRs), which are protein complexes found on the surface of T cells, are able to bind to these peptides. This process is known as TCR recognition and constitutes a key step for immune response. Optimizing TCR sequences for TCR recognition represents a fundamental step towards the…

    Submitted 2 March, 2023; originally announced March 2023.