Zum Hauptinhalt springen

Showing 1–17 of 17 results for author: Leng, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.12902  [pdf, other

    cs.AI cs.CL cs.LG

    IAA: Inner-Adaptor Architecture Empowers Frozen Large Language Model with Multimodal Capabilities

    Authors: Bin Wang, Chunyu Xie, Dawei Leng, Yuhui Yin

    Abstract: In the field of multimodal large language models (MLLMs), common methods typically involve unfreezing the language model during training to foster profound visual understanding. However, the fine-tuning of such models with vision-language data often leads to a diminution of their natural language processing (NLP) capabilities. To avoid this performance degradation, a straightforward solution is to… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

  2. arXiv:2408.10755  [pdf, other

    cs.LG cs.AI

    Generating Synthetic Fair Syntax-agnostic Data by Learning and Distilling Fair Representation

    Authors: Md Fahim Sikder, Resmi Ramachandranpillai, Daniel de Leng, Fredrik Heintz

    Abstract: Data Fairness is a crucial topic due to the recent wide usage of AI powered applications. Most of the real-world data is filled with human or machine biases and when those data are being used to train AI models, there is a chance that the model will reflect the bias in the training data. Existing bias-mitigating generative methods based on GANs, Diffusion models need in-processing fairness objecti… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  3. arXiv:2408.08189  [pdf, other

    cs.CV

    FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance

    Authors: Jiasong Feng, Ao Ma, Jing Wang, Bo Cheng, Xiaodan Liang, Dawei Leng, Yuhui Yin

    Abstract: Synthesizing motion-rich and temporally consistent videos remains a challenge in artificial intelligence, especially when dealing with extended durations. Existing text-to-video (T2V) models commonly employ spatial cross-attention for text control, equivalently guiding different frame generations without frame-specific textual guidance. Thus, the model's capacity to comprehend the temporal logic c… ▽ More

    Submitted 16 August, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

  4. TWIN V2: Scaling Ultra-Long User Behavior Sequence Modeling for Enhanced CTR Prediction at Kuaishou

    Authors: Zihua Si, Lin Guan, ZhongXiang Sun, Xiaoxue Zang, Jing Lu, Yiqun Hui, Xingchao Cao, Zeyu Yang, Yichen Zheng, Dewei Leng, Kai Zheng, Chenbin Zhang, Yanan Niu, Yang Song, Kun Gai

    Abstract: The significance of modeling long-term user interests for CTR prediction tasks in large-scale recommendation systems is progressively gaining attention among researchers and practitioners. Existing work, such as SIM and TWIN, typically employs a two-stage approach to model long-term user behavior sequences for efficiency concerns. The first stage rapidly retrieves a subset of sequences related to… ▽ More

    Submitted 16 August, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

    Comments: Accepted by CIKM 2024

  5. arXiv:2406.14281  [pdf, other

    cs.LG cs.AI

    FairX: A comprehensive benchmarking tool for model analysis using fairness, utility, and explainability

    Authors: Md Fahim Sikder, Resmi Ramachandranpillai, Daniel de Leng, Fredrik Heintz

    Abstract: We present FairX, an open-source Python-based benchmarking tool designed for the comprehensive analysis of models under the umbrella of fairness, utility, and eXplainability (XAI). FairX enables users to train benchmarking bias-removal models and evaluate their fairness using a wide array of fairness metrics, data utility metrics, and generate explanations for model predictions, all within a unifi… ▽ More

    Submitted 21 August, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  6. arXiv:2404.09520  [pdf, other

    cs.IR

    UniSAR: Modeling User Transition Behaviors between Search and Recommendation

    Authors: Teng Shi, Zihua Si, Jun Xu, Xiao Zhang, Xiaoxue Zang, Kai Zheng, Dewei Leng, Yanan Niu, Yang Song

    Abstract: Nowadays, many platforms provide users with both search and recommendation services as important tools for accessing information. The phenomenon has led to a correlation between user search and recommendation behaviors, providing an opportunity to model user interests in a fine-grained way. Existing approaches either model user search and recommendation behaviors separately or overlook the differe… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted by SIGIR 2024

  7. arXiv:2309.00952  [pdf, other

    cs.CL cs.AI

    Bridge Diffusion Model: bridge non-English language-native text-to-image diffusion model with English communities

    Authors: Shanyuan Liu, Dawei Leng, Yuhui Yin

    Abstract: Text-to-Image generation (TTI) technologies are advancing rapidly, especially in the English language communities. However, English-native TTI models inherently carry biases from English world centric training data, which creates a dilemma for development of other language-native TTI models. One common choice is fine-tuning the English-native TTI model with translated samples from non-English comm… ▽ More

    Submitted 2 September, 2023; originally announced September 2023.

  8. arXiv:2309.00227  [pdf, other

    cs.CV

    What Makes Good Open-Vocabulary Detector: A Disassembling Perspective

    Authors: Jincheng Li, Chunyu Xie, Xiaoyu Wu, Bin Wang, Dawei Leng

    Abstract: Open-vocabulary detection (OVD) is a new object detection paradigm, aiming to localize and recognize unseen objects defined by an unbounded vocabulary. This is challenging since traditional detectors can only learn from pre-defined categories and thus fail to detect and localize objects out of pre-defined vocabulary. To handle the challenge, OVD leverages pre-trained cross-modal VLM, such as CLIP,… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

    Journal ref: KDD workshop 2023

  9. KuaiSAR: A Unified Search And Recommendation Dataset

    Authors: Zhongxiang Sun, Zihua Si, Xiaoxue Zang, Dewei Leng, Yanan Niu, Yang Song, Xiao Zhang, Jun Xu

    Abstract: The confluence of Search and Recommendation (S&R) services is vital to online services, including e-commerce and video platforms. The integration of S&R modeling is a highly intuitive approach adopted by industry practitioners. However, there is a noticeable lack of research conducted in this area within academia, primarily due to the absence of publicly available datasets. Consequently, a substan… ▽ More

    Submitted 13 August, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: CIKM 2023 resource track

    Report number: 5407--5411

    Journal ref: CIKM '23: Proceedings of the 32nd ACM International Conference on Information and Knowledge Management October 2023

  10. arXiv:2302.02352  [pdf, other

    cs.IR

    TWIN: TWo-stage Interest Network for Lifelong User Behavior Modeling in CTR Prediction at Kuaishou

    Authors: Jianxin Chang, Chenbin Zhang, Zhiyi Fu, Xiaoxue Zang, Lin Guan, Jing Lu, Yiqun Hui, Dewei Leng, Yanan Niu, Yang Song, Kun Gai

    Abstract: Life-long user behavior modeling, i.e., extracting a user's hidden interests from rich historical behaviors in months or even years, plays a central role in modern CTR prediction systems. Conventional algorithms mostly follow two cascading stages: a simple General Search Unit (GSU) for fast and coarse search over tens of thousands of long-term behaviors and an Exact Search Unit (ESU) for effective… ▽ More

    Submitted 26 June, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: Accepted by KDD 2023

  11. arXiv:2302.01115  [pdf, other

    cs.IR

    PEPNet: Parameter and Embedding Personalized Network for Infusing with Personalized Prior Information

    Authors: Jianxin Chang, Chenbin Zhang, Yiqun Hui, Dewei Leng, Yanan Niu, Yang Song, Kun Gai

    Abstract: With the increase of content pages and interactive buttons in online services such as online-shopping and video-watching websites, industrial-scale recommender systems face challenges in multi-domain and multi-task recommendations. The core of multi-task and multi-domain recommendation is to accurately capture user interests in multiple scenarios given multiple user behaviors. In this paper, we pr… ▽ More

    Submitted 26 June, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: Accepted by KDD 2023

  12. CCMB: A Large-scale Chinese Cross-modal Benchmark

    Authors: Chunyu Xie, Heng Cai, Jincheng Li, Fanjing Kong, Xiaoyu Wu, Jianfei Song, Henrique Morimitsu, Lin Yao, Dexin Wang, Xiangzheng Zhang, Dawei Leng, Baochang Zhang, Xiangyang Ji, Yafeng Deng

    Abstract: Vision-language pre-training (VLP) on large-scale datasets has shown premier performance on various downstream tasks. In contrast to plenty of available benchmarks with English corpus, large-scale pre-training datasets and downstream datasets with Chinese corpus remain largely unexplored. In this work, we build a large-scale high-quality Chinese Cross-Modal Benchmark named CCMB for the research co… ▽ More

    Submitted 8 November, 2023; v1 submitted 8 May, 2022; originally announced May 2022.

    Journal ref: Proceedings of the 31st ACM International Conference on Multimedia (ACM MM), 2023, Pages 4219-4227

  13. arXiv:2103.03724  [pdf, other

    q-bio.BM cs.LG

    Sequence-based deep learning antibody design for in silico antibody affinity maturation

    Authors: Yue Kang, Dawei Leng, Jinjiang Guo, Lurong Pan

    Abstract: Antibody therapeutics has been extensively studied in drug discovery and development within the past decades. One increasingly popular focus in the antibody discovery pipeline is the optimization step for therapeutic leads. Both traditional methods and in silico approaches aim to generate candidates with high binding affinity against specific target antigens. Traditional in vitro approaches use hy… ▽ More

    Submitted 14 August, 2022; v1 submitted 20 February, 2021; originally announced March 2021.

  14. arXiv:2102.07640  [pdf, other

    cs.IR

    Real-time tracking of COVID-19 and coronavirus research updates through text mining

    Authors: Yutong Jin, Jie Li, Xinyu Wang, Peiyao Li, Jinjiang Guo, Junfeng Wu, Dawei Leng, Lurong Pan

    Abstract: The novel coronavirus (SARS-CoV-2) which causes COVID-19 is an ongoing pandemic. There are ongoing studies with up to hundreds of publications uploaded to databases daily. We are exploring the use-case of artificial intelligence and natural language processing in order to efficiently sort through these publications. We demonstrate that clinical trial information, preclinical studies, and a general… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

  15. arXiv:2102.06086  [pdf, other

    q-bio.BM cs.LG q-bio.QM

    ParaVS: A Simple, Fast, Efficient and Flexible Graph Neural Network Framework for Structure-Based Virtual Screening

    Authors: Junfeng Wu, Dawei Leng, Lurong Pan

    Abstract: Structure-based virtual screening (SBVS) is a promising in silico technique that integrates computational methods into drug design. An extensively used method in SBVS is molecular docking. However, the docking process can hardly be computationally efficient and accurate simultaneously because classic mechanics scoring function is used to approximate, but hardly reach, the quantum mechanics precisi… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

  16. arXiv:2102.04064  [pdf, other

    cs.LG cs.AI

    Enhance Information Propagation for Graph Neural Network by Heterogeneous Aggregations

    Authors: Dawei Leng, Jinjiang Guo, Lurong Pan, Jie Li, Xinyu Wang

    Abstract: Graph neural networks are emerging as continuation of deep learning success w.r.t. graph data. Tens of different graph neural network variants have been proposed, most following a neighborhood aggregation scheme, where the node features are updated via aggregating features of its neighboring nodes from layer to layer. Though related research surges, the power of GNNs are still not on-par-with thei… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

  17. arXiv:2102.01649  [pdf, other

    cs.SI cs.AI cs.IR cs.LG

    Heterogeneous Graph based Deep Learning for Biomedical Network Link Prediction

    Authors: Jinjiang Guo, Jie Li, Dawei Leng, Lurong Pan

    Abstract: Multi-scale biomedical knowledge networks are expanding with emerging experimental technologies that generates multi-scale biomedical big data. Link prediction is increasingly used especially in bipartite biomedical networks to identify hidden biological interactions and relationshipts between key entities such as compounds, targets, gene and diseases. We propose a Graph Neural Networks (GNN) meth… ▽ More

    Submitted 23 February, 2022; v1 submitted 28 January, 2021; originally announced February 2021.