Skip to main content

Showing 1–50 of 570 results for author: Shen, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.13147  [pdf, other

    cs.CV

    DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection

    Authors: Zhourui Zhang, Jun Li, Zhijian Wu, Jifeng Shen, Jianhua Xu

    Abstract: In recent years, current mainstream feature masking distillation methods mainly function by reconstructing selectively masked regions of a student network from the feature maps of a teacher network. In these methods, attention mechanisms can help to identify spatially important regions and crucial object-aware channel clues, such that the reconstructed features are encoded with sufficient discrimi… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  2. arXiv:2407.12884  [pdf, other

    cs.LG cs.AI cs.CV cs.GR cs.HC

    SurroFlow: A Flow-Based Surrogate Model for Parameter Space Exploration and Uncertainty Quantification

    Authors: Jingyi Shen, Yuhan Duan, Han-Wei Shen

    Abstract: Existing deep learning-based surrogate models facilitate efficient data generation, but fall short in uncertainty quantification, efficient parameter space exploration, and reverse prediction. In our work, we introduce SurroFlow, a novel normalizing flow-based surrogate model, to learn the invertible transformation between simulation parameters and simulation outputs. The model not only allows acc… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: To be published in Proc. IEEE VIS 2024

  3. arXiv:2407.10876  [pdf, other

    cs.CV

    RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception

    Authors: Chunliang Li, Wencheng Han, Junbo Yin, Sanyuan Zhao, Jianbing Shen

    Abstract: Concurrent processing of multiple autonomous driving 3D perception tasks within the same spatiotemporal scene poses a significant challenge, in particular due to the computational inefficiencies and feature competition between tasks when using traditional multi-task learning approaches. This paper addresses these issues by proposing a novel unified representation, RepVF, which harmonizes the repre… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV 2024

  4. arXiv:2407.07454   

    cs.LG

    CM-DQN: A Value-Based Deep Reinforcement Learning Model to Simulate Confirmation Bias

    Authors: Jiacheng Shen, Lihan Feng

    Abstract: In human decision-making tasks, individuals learn through trials and prediction errors. When individuals learn the task, some are more influenced by good outcomes, while others weigh bad outcomes more heavily. Such confirmation bias can lead to different learning effects. In this study, we propose a new algorithm in Deep Reinforcement Learning, CM-DQN, which applies the idea of different update st… ▽ More

    Submitted 16 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: We found there is a big issue regarding to the complement of the experiments where one experiment is not compelling and comprehensive

  5. arXiv:2407.05138  [pdf, other

    cs.SE cs.AI

    Vortex under Ripplet: An Empirical Study of RAG-enabled Applications

    Authors: Yuchen Shao, Yuheng Huang, Jiawei Shen, Lei Ma, Ting Su, Chengcheng Wan

    Abstract: Large language models (LLMs) enhanced by retrieval-augmented generation (RAG) provide effective solutions in various application scenarios. However, developers face challenges in integrating RAG-enhanced LLMs into software systems, due to lack of interface specification, requirements from software context, and complicated system management. In this paper, we manually studied 100 open-source applic… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  6. arXiv:2407.01436  [pdf, other

    cs.CV cs.RO

    AdaOcc: Adaptive Forward View Transformation and Flow Modeling for 3D Occupancy and Flow Prediction

    Authors: Dubing Chen, Wencheng Han, Jin Fang, Jianbing Shen

    Abstract: In this technical report, we present our solution for the Vision-Centric 3D Occupancy and Flow Prediction track in the nuScenes Open-Occ Dataset Challenge at CVPR 2024. Our innovative approach involves a dual-stage framework that enhances 3D occupancy and flow predictions by incorporating adaptive forward view transformation and flow modeling. Initially, we independently train the occupancy model,… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 2nd Place in the 3D Occupancy and Flow Prediction Challenge (CVPR24)

  7. arXiv:2406.18610  [pdf, other

    cs.CV

    Vox-UDA: Voxel-wise Unsupervised Domain Adaptation for Cryo-Electron Subtomogram Segmentation with Denoised Pseudo Labeling

    Authors: Haoran Li, Xingjian Li, Jiahua Shi, Huaming Chen, Bo Du, Daisuke Kihara, Johan Barthelemy, Jun Shen, Min Xu

    Abstract: Cryo-Electron Tomography (cryo-ET) is a 3D imaging technology facilitating the study of macromolecular structures at near-atomic resolution. Recent volumetric segmentation approaches on cryo-ET images have drawn widespread interest in biological sector. However, existing methods heavily rely on manually labeled data, which requires highly professional skills, thereby hindering the adoption of full… ▽ More

    Submitted 30 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: 11 pages

  8. arXiv:2406.16434  [pdf, other

    cs.CV

    Multi-threshold Deep Metric Learning for Facial Expression Recognition

    Authors: Wenwu Yang, Jinyi Yu, Tuo Chen, Zhenguang Liu, Xun Wang, Jianbing Shen

    Abstract: Effective expression feature representations generated by a triplet-based deep metric learning are highly advantageous for facial expression recognition (FER). The performance of triplet-based deep metric learning is contingent upon identifying the best threshold for triplet loss. Threshold validation, however, is tough and challenging, as the ideal threshold changes among datasets and even across… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: accepted by Pattern Recognition

  9. arXiv:2406.13264  [pdf, other

    cs.AI cs.LG cs.SE

    Do Multimodal Foundation Models Understand Enterprise Workflows? A Benchmark for Business Process Management Tasks

    Authors: Michael Wornow, Avanika Narayan, Ben Viggiano, Ishan S. Khare, Tathagat Verma, Tibor Thompson, Miguel Angel Fuentes Hernandez, Sudharsan Sundar, Chloe Trujillo, Krrish Chawla, Rongfei Lu, Justin Shen, Divya Nagaraj, Joshua Martinez, Vardhan Agrawal, Althea Hudson, Nigam H. Shah, Christopher Re

    Abstract: Existing ML benchmarks lack the depth and diversity of annotations needed for evaluating models on business process management (BPM) tasks. BPM is the practice of documenting, measuring, improving, and automating enterprise workflows. However, research has focused almost exclusively on one task - full end-to-end automation using agents based on multimodal foundation models (FMs) like GPT-4. This f… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  10. ViDSOD-100: A New Dataset and a Baseline Model for RGB-D Video Salient Object Detection

    Authors: Junhao Lin, Lei Zhu, Jiaxing Shen, Huazhu Fu, Qing Zhang, Liansheng Wang

    Abstract: With the rapid development of depth sensor, more and more RGB-D videos could be obtained. Identifying the foreground in RGB-D videos is a fundamental and important task. However, the existing salient object detection (SOD) works only focus on either static RGB-D images or RGB videos, ignoring the collaborating of RGB-D and video information. In this paper, we first collect a new annotated RGB-D vi… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Journal ref: International Journal of Computer Vision (2024)

  11. arXiv:2406.11409  [pdf, other

    cs.CL cs.AI

    CodeGemma: Open Code Models Based on Gemma

    Authors: CodeGemma Team, Heri Zhao, Jeffrey Hui, Joshua Howland, Nam Nguyen, Siqi Zuo, Andrea Hu, Christopher A. Choquette-Choo, Jingyue Shen, Joe Kelley, Kshitij Bansal, Luke Vilnis, Mateo Wirth, Paul Michel, Peter Choy, Pratik Joshi, Ravin Kumar, Sarmad Hashmi, Shubham Agrawal, Zhitao Gong, Jane Fine, Tris Warkentin, Ale Jakse Hartman, Bin Ni, Kathy Korevec , et al. (2 additional authors not shown)

    Abstract: This paper introduces CodeGemma, a collection of specialized open code models built on top of Gemma, capable of a variety of code and natural language generation tasks. We release three model variants. CodeGemma 7B pretrained (PT) and instruction-tuned (IT) variants have remarkably resilient natural language understanding, excel in mathematical reasoning, and match code capabilities of other open… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: v1: 11 pages, 4 figures, 5 tables. v2: Update metadata

  12. arXiv:2406.02886  [pdf, other

    cs.CL cs.AI

    PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs

    Authors: Rongzhi Zhang, Jiaming Shen, Tianqi Liu, Haorui Wang, Zhen Qin, Feng Han, Jialu Liu, Simon Baumgartner, Michael Bendersky, Chao Zhang

    Abstract: Large Language Models (LLMs) have exhibited impressive capabilities in various tasks, yet their vast parameter sizes restrict their applicability in resource-constrained settings. Knowledge distillation (KD) offers a viable solution by transferring expertise from large teacher models to compact student models. However, traditional KD techniques face specific challenges when applied to LLMs, includ… ▽ More

    Submitted 6 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: Findings of ACL 2024

  13. arXiv:2406.01883  [pdf, other

    cs.NE cs.HC

    Context Gating in Spiking Neural Networks: Achieving Lifelong Learning through Integration of Local and Global Plasticity

    Authors: Jiangrong Shen, Wenyao Ni, Qi Xu, Gang Pan, Huajin Tang

    Abstract: Humans learn multiple tasks in succession with minimal mutual interference, through the context gating mechanism in the prefrontal cortex (PFC). The brain-inspired models of spiking neural networks (SNN) have drawn massive attention for their energy efficiency and biological plausibility. To overcome catastrophic forgetting when learning multiple tasks in sequence, current SNN models for lifelong… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  14. arXiv:2406.01072  [pdf, other

    cs.NE cs.AI

    Towards Efficient Deep Spiking Neural Networks Construction with Spiking Activity based Pruning

    Authors: Yaxin Li, Qi Xu, Jiangrong Shen, Hongming Xu, Long Chen, Gang Pan

    Abstract: The emergence of deep and large-scale spiking neural networks (SNNs) exhibiting high performance across diverse complex datasets has led to a need for compressing network models due to the presence of a significant number of redundant structural units, aiming to more effectively leverage their low-power consumption and biological interpretability advantages. Currently, most model compression techn… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  15. arXiv:2406.00515  [pdf, other

    cs.CL cs.AI cs.SE

    A Survey on Large Language Models for Code Generation

    Authors: Juyong Jiang, Fan Wang, Jiasi Shen, Sungju Kim, Sunghun Kim

    Abstract: Large Language Models (LLMs) have garnered remarkable advancements across diverse code-related tasks, known as Code LLMs, particularly in code generation that generates source code with LLM from natural language descriptions. This burgeoning field has captured significant interest from both academic researchers and industry professionals due to its practical significance in software development, e… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  16. arXiv:2405.20815  [pdf, other

    cs.DC cs.NI

    Distributed Simulation for Digital Twins of Large-Scale Real-World DiffServ-Based Networks

    Authors: Zhuoyao Huang, Nan Zhang, Jingran Shen, Georgios Diamantopoulos, Zhengchang Hua, Nikos Tziritas, Georgios Theodoropoulos

    Abstract: Digital Twin technology facilitates the monitoring and online analysis of large-scale communication networks. Faster predictions of network performance thus become imperative, especially for analysing Quality of Service (QoS) parameters in large-scale city networks. Discrete Event Simulation (DES) is a standard network analysis technology, and can be further optimised with parallel and distributed… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 15 pages, 6 figures, accepted by Euro-Par 2024: 30th International European Conference on Parallel and Distributed Computing

  17. arXiv:2405.20310  [pdf, other

    cs.CV

    A Pixel Is Worth More Than One 3D Gaussians in Single-View 3D Reconstruction

    Authors: Jianghao Shen, Nan Xue, Tianfu Wu

    Abstract: Learning 3D scene representation from a single-view image is a long-standing fundamental problem in computer vision, with the inherent ambiguity in predicting contents unseen from the input view. Built on the recently proposed 3D Gaussian Splatting (3DGS), the Splatter Image method has made promising progress on fast single-image novel view synthesis via learning a single 3D Gaussian for each pixe… ▽ More

    Submitted 3 June, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: preprint, under review

  18. arXiv:2405.18361  [pdf, other

    cs.CV

    Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving?

    Authors: Yifan Bai, Dongming Wu, Yingfei Liu, Fan Jia, Weixin Mao, Ziheng Zhang, Yucheng Zhao, Jianbing Shen, Xing Wei, Tiancai Wang, Xiangyu Zhang

    Abstract: Rapid advancements in Autonomous Driving (AD) tasks turned a significant shift toward end-to-end fashion, particularly in the utilization of vision-language models (VLMs) that integrate robust logical reasoning and cognitive abilities to enable comprehensive end-to-end planning. However, these VLM-based approaches tend to integrate 2D vision tokenizers and a large language model (LLM) for ego-car… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  19. arXiv:2405.17633  [pdf, other

    cs.CL

    HEART-felt Narratives: Tracing Empathy and Narrative Style in Personal Stories with LLMs

    Authors: Jocelyn Shen, Joel Mire, Hae Won Park, Cynthia Breazeal, Maarten Sap

    Abstract: Empathy serves as a cornerstone in enabling prosocial behaviors, and can be evoked through sharing of personal experiences in stories. While empathy is influenced by narrative content, intuitively, people respond to the way a story is told as well, through narrative style. Yet the relationship between empathy and narrative style is not fully understood. In this work, we empirically examine and qua… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  20. arXiv:2405.15708  [pdf, other

    cs.CL

    EmpathicStories++: A Multimodal Dataset for Empathy towards Personal Experiences

    Authors: Jocelyn Shen, Yubin Kim, Mohit Hulse, Wazeer Zulfikar, Sharifa Alghowinem, Cynthia Breazeal, Hae Won Park

    Abstract: Modeling empathy is a complex endeavor that is rooted in interpersonal and experiential dimensions of human interaction, and remains an open problem within AI. Existing empathy datasets fall short in capturing the richness of empathy responses, often being confined to in-lab or acted scenarios, lacking longitudinal data, and missing self-reported labels. We introduce a new multimodal dataset for e… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted to ACL 2024 Findings

  21. arXiv:2405.14411  [pdf, other

    cs.AI eess.SY

    Large Language Models for Explainable Decisions in Dynamic Digital Twins

    Authors: Nan Zhang, Christian Vergara-Marcillo, Georgios Diamantopoulos, Jingran Shen, Nikos Tziritas, Rami Bahsoon, Georgios Theodoropoulos

    Abstract: Dynamic data-driven Digital Twins (DDTs) can enable informed decision-making and provide an optimisation platform for the underlying system. By leveraging principles of Dynamic Data-Driven Applications Systems (DDDAS), DDTs can formulate computational modalities for feedback loops, model updates and decision-making, including autonomous ones. However, understanding autonomous decision-making often… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 8 pages, 3 figures, under review

  22. arXiv:2405.11844  [pdf

    cs.AR cs.ET

    NeRTCAM: CAM-Based CMOS Implementation of Reference Frames for Neuromorphic Processors

    Authors: Harideep Nair, William Leyman, Agastya Sampath, Quinn Jacobson, John Paul Shen

    Abstract: Neuromorphic architectures mimicking biological neural networks have been proposed as a much more efficient alternative to conventional von Neumann architectures for the exploding compute demands of AI workloads. Recent neuroscience theory on intelligence suggests that Cortical Columns (CCs) are the fundamental compute units in the neocortex and intelligence arises from CC's ability to store, pred… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Accepted and Presented at Neuro-Inspired Computational Elements (NICE) Conference, La Jolla, CA. 2024

  23. arXiv:2405.11672  [pdf

    cs.LG

    Interpretable Machine Learning Enhances Disease Prognosis: Applications on COVID-19 and Onward

    Authors: Jinzhi Shen, Ke Ma

    Abstract: In response to the COVID-19 pandemic, the integration of interpretable machine learning techniques has garnered significant attention, offering transparent and understandable insights crucial for informed clinical decision making. This literature review delves into the applications of interpretable machine learning in predicting the prognosis of respiratory diseases, particularly focusing on COVID… ▽ More

    Submitted 20 May, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

  24. arXiv:2405.10979  [pdf, other

    cs.CR

    Private Data Leakage in Federated Human Activity Recognition for Wearable Healthcare Devices

    Authors: Kongyang Chen, Dongping Zhang, Sijia Guan, Bing Mi, Jiaxing Shen, Guoqing Wang

    Abstract: Wearable data serves various health monitoring purposes, such as determining activity states based on user behavior and providing tailored exercise recommendations. However, the individual data perception and computational capabilities of wearable devices are limited, often necessitating the joint training of models across multiple devices. Federated Human Activity Recognition (HAR) presents a via… ▽ More

    Submitted 20 June, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

  25. "Community Guidelines Make this the Best Party on the Internet": An In-Depth Study of Online Platforms' Content Moderation Policies

    Authors: Brennan Schaffner, Arjun Nitin Bhagoji, Siyuan Cheng, Jacqueline Mei, Jay L. Shen, Grace Wang, Marshini Chetty, Nick Feamster, Genevieve Lakier, Chenhao Tan

    Abstract: Moderating user-generated content on online platforms is crucial for balancing user safety and freedom of speech. Particularly in the United States, platforms are not subject to legal constraints prescribing permissible content. Each platform has thus developed bespoke content moderation policies, but there is little work towards a comparative understanding of these policies across platforms and t… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  26. arXiv:2405.00614  [pdf, other

    cs.LG

    Multigroup Robustness

    Authors: Lunjia Hu, Charlotte Peale, Judy Hanwen Shen

    Abstract: To address the shortcomings of real-world datasets, robust learning algorithms have been designed to overcome arbitrary and indiscriminate data corruption. However, practical processes of gathering data may lead to patterns of data corruption that are localized to specific partitions of the training dataset. Motivated by critical applications where the learned model is deployed to make predictions… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  27. arXiv:2404.19180  [pdf, other

    cs.AR

    MACO: Exploring GEMM Acceleration on a Loosely-Coupled Multi-core Processor

    Authors: Bingcai Sui, Junzhong Shen, Caixia Sun, Junhui Wang, Zhong Zheng, Wei Guo

    Abstract: General-purpose processor vendors have integrated customized accelerator in their products due to the widespread use of General Matrix-Matrix Multiplication (GEMM) kernels. However, it remains a challenge to further improve the flexibilityand scalability of these GEMM-enhanced processors to cater to the emerging large-scale GEMM workloads. In this paper we propose MACO, a novel loosely-coupled mul… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  28. arXiv:2404.18414  [pdf, other

    cs.LG

    Learning a Sparse Neural Network using IHT

    Authors: Saeed Damadi, Soroush Zolfaghari, Mahdi Rezaie, Jinglai Shen

    Abstract: The core of a good model is in its ability to focus only on important information that reflects the basic patterns and consistencies, thus pulling out a clear, noise-free signal from the dataset. This necessitates using a simplified model defined by fewer parameters. The importance of theoretical foundations becomes clear in this context, as this paper relies on established results from the domain… ▽ More

    Submitted 17 July, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  29. arXiv:2404.17811  [pdf

    cs.RO

    Efficient Bi-manipulation using RGBD Multi-model Fusion based on Attention Mechanism

    Authors: Jian Shen, Jiaxin Huang, Zhigong Song

    Abstract: Dual-arm robots have great application prospects in intelligent manufacturing due to their human-like structure when deployed with advanced intelligence algorithm. However, the previous visuomotor policy suffers from perception deficiencies in environments where features of images are impaired by the various conditions, such as abnormal lighting, occlusion and shadow etc. The Focal CVAE framework… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: 14 pages,5 figures

  30. arXiv:2404.17554  [pdf

    cs.HC eess.SP eess.SY stat.AP

    A Novel Context driven Critical Integrative Levels (CIL) Approach: Advancing Human-Centric and Integrative Lighting Asset Management in Public Libraries with Practical Thresholds

    Authors: Jing Lin, Nina Mylly, Per Olof Hedekvist, Jingchun Shen

    Abstract: This paper proposes the context driven Critical Integrative Levels (CIL), a novel approach to lighting asset management in public libraries that aligns with the transformative vision of human-centric and integrative lighting. This approach encompasses not only the visual aspects of lighting performance but also prioritizes the physiological and psychological well-being of library users. Incorporat… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  31. arXiv:2404.16841  [pdf, other

    cs.CR

    Machine Unlearning in Large Language Models

    Authors: Kongyang Chen, Zixin Wang, Bing Mi, Waixi Liu, Shaowei Wang, Xiaojun Ren, Jiaxing Shen

    Abstract: Recently, large language models (LLMs) have emerged as a notable field, attracting significant attention for its ability to automatically generate intelligent contents for various application domains. However, LLMs still suffer from significant security and privacy issues. For example, LLMs might expose user privacy from hacking attacks or targeted prompts. To address this problem, this paper intr… ▽ More

    Submitted 3 February, 2024; originally announced April 2024.

  32. arXiv:2404.16687  [pdf, other

    cs.CV

    NTIRE 2024 Quality Assessment of AI-Generated Content Challenge

    Authors: Xiaohong Liu, Xiongkuo Min, Guangtao Zhai, Chunyi Li, Tengchuan Kou, Wei Sun, Haoning Wu, Yixuan Gao, Yuqin Cao, Zicheng Zhang, Xiele Wu, Radu Timofte, Fei Peng, Huiyuan Fu, Anlong Ming, Chuanming Wang, Huadong Ma, Shuai He, Zifei Dou, Shu Chen, Huacong Zhang, Haiyi Xie, Chengwei Wang, Baoying Chen, Jishen Zeng , et al. (89 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2024 Quality Assessment of AI-Generated Content Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2024. This challenge is to address a major challenge in the field of image and video processing, namely, Image Quality Assessment (IQA) and Video Quality Assessment (VQA) for AI-Generated Conte… ▽ More

    Submitted 7 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  33. arXiv:2404.15312  [pdf, other

    eess.SP cs.CV

    Realtime Person Identification via Gait Analysis

    Authors: Shanmuga Venkatachalam, Harideep Nair, Prabhu Vellaisamy, Yongqi Zhou, Ziad Youssfi, John Paul Shen

    Abstract: Each person has a unique gait, i.e., walking style, that can be used as a biometric for personal identification. Recent works have demonstrated effective gait recognition using deep neural networks, however most of these works predominantly focus on classification accuracy rather than model efficiency. In order to perform gait recognition using wearable devices on the edge, it is imperative to dev… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  34. arXiv:2404.11996  [pdf, other

    cs.AI

    DST-GTN: Dynamic Spatio-Temporal Graph Transformer Network for Traffic Forecasting

    Authors: Songtao Huang, Hongjin Song, Tianqi Jiang, Akbar Telikani, Jun Shen, Qingguo Zhou, Binbin Yong, Qiang Wu

    Abstract: Accurate traffic forecasting is essential for effective urban planning and congestion management. Deep learning (DL) approaches have gained colossal success in traffic forecasting but still face challenges in capturing the intricacies of traffic dynamics. In this paper, we identify and address this challenges by emphasizing that spatial features are inherently dynamic and change over time. A novel… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  35. Inductive Cognitive Diagnosis for Fast Student Learning in Web-Based Online Intelligent Education Systems

    Authors: Shuo Liu, Junhao Shen, Hong Qian, Aimin Zhou

    Abstract: Cognitive diagnosis aims to gauge students' mastery levels based on their response logs. Serving as a pivotal module in web-based online intelligent education systems (WOIESs), it plays an upstream and fundamental role in downstream tasks like learning item recommendation and computerized adaptive testing. WOIESs are open learning environment where numerous new students constantly register and com… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: WWW 2024

  36. arXiv:2404.09724  [pdf, other

    cs.CR

    Privacy-Preserving Federated Unlearning with Certified Client Removal

    Authors: Ziyao Liu, Huanyi Ye, Yu Jiang, Jiyuan Shen, Jiale Guo, Ivan Tjuawinata, Kwok-Yan Lam

    Abstract: In recent years, Federated Unlearning (FU) has gained attention for addressing the removal of a client's influence from the global model in Federated Learning (FL) systems, thereby ensuring the ``right to be forgotten" (RTBF). State-of-the-art methods for unlearning use historical data from FL clients, such as gradients or locally trained models. However, studies have revealed significant informat… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  37. arXiv:2404.07229  [pdf, other

    cs.CL cs.AI

    Personality-affected Emotion Generation in Dialog Systems

    Authors: Zhiyuan Wen, Jiannong Cao, Jiaxing Shen, Ruosong Yang, Shuaiqi Liu, Maosong Sun

    Abstract: Generating appropriate emotions for responses is essential for dialog systems to provide human-like interaction in various application scenarios. Most previous dialog systems tried to achieve this goal by learning empathetic manners from anonymous conversational data. However, emotional responses generated by those methods may be inconsistent, which will decrease user engagement and service qualit… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Accepted by ACM Transactions on Information Systems

  38. arXiv:2404.06486  [pdf, other

    cs.LG cs.CV

    GO4Align: Group Optimization for Multi-Task Alignment

    Authors: Jiayi Shen, Cheems Wang, Zehao Xiao, Nanne Van Noord, Marcel Worring

    Abstract: This paper proposes \textit{GO4Align}, a multi-task optimization approach that tackles task imbalance by explicitly aligning the optimization across tasks. To achieve this, we design an adaptive group risk minimization strategy, compromising two crucial techniques in implementation: (i) dynamical group assignment, which clusters similar tasks based on task interactions; (ii) risk-guided group indi… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  39. arXiv:2404.03764  [pdf, other

    cs.LG stat.ME stat.ML

    CONCERT: Covariate-Elaborated Robust Local Information Transfer with Conditional Spike-and-Slab Prior

    Authors: Ruqian Zhang, Yijiao Zhang, Annie Qu, Zhongyi Zhu, Juan Shen

    Abstract: The popularity of transfer learning stems from the fact that it can borrow information from useful auxiliary datasets. Existing statistical transfer learning methods usually adopt a global similarity measure between the source data and the target data, which may lead to inefficiency when only local information is shared. In this paper, we propose a novel Bayesian transfer learning method named "CO… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 31 pages, 22 figures

  40. arXiv:2404.03659  [pdf, other

    cs.LG cs.CR

    Federated Unlearning for Human Activity Recognition

    Authors: Kongyang Chen, Dongping zhang, Yaping Chai, Weibin Zhang, Shaowei Wang, Jiaxing Shen

    Abstract: The rapid evolution of Internet of Things (IoT) technology has spurred the widespread adoption of Human Activity Recognition (HAR) in various daily life domains. Federated Learning (FL) is frequently utilized to build a global HAR model by aggregating user contributions without transmitting raw individual data. Despite substantial progress in user privacy protection with FL, challenges persist. Re… ▽ More

    Submitted 17 January, 2024; originally announced April 2024.

  41. arXiv:2404.02491  [pdf, other

    cs.CL cs.AI cs.LG

    Measuring Social Norms of Large Language Models

    Authors: Ye Yuan, Kexin Tang, Jianhao Shen, Ming Zhang, Chenguang Wang

    Abstract: We present a new challenge to examine whether large language models understand social norms. In contrast to existing datasets, our dataset requires a fundamental understanding of social norms to solve. Our dataset features the largest set of social norm skills, consisting of 402 skills and 12,383 questions covering a wide set of social norms ranging from opinions and arguments to culture and laws.… ▽ More

    Submitted 22 May, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  42. arXiv:2404.01693  [pdf, other

    cs.LG

    HeMeNet: Heterogeneous Multichannel Equivariant Network for Protein Multitask Learning

    Authors: Rong Han, Wenbing Huang, Lingxiao Luo, Xinyan Han, Jiaming Shen, Zhiqiang Zhang, Jun Zhou, Ting Chen

    Abstract: Understanding and leveraging the 3D structures of proteins is central to a variety of biological and drug discovery tasks. While deep learning has been applied successfully for structure-based protein function prediction tasks, current methods usually employ distinct training for each task. However, each of the tasks is of small size, and such a single-task strategy hinders the models' performance… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  43. arXiv:2403.15241  [pdf, other

    cs.CV

    IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection

    Authors: Junbo Yin, Jianbing Shen, Runnan Chen, Wei Li, Ruigang Yang, Pascal Frossard, Wenguan Wang

    Abstract: Bird's eye view (BEV) representation has emerged as a dominant solution for describing 3D space in autonomous driving scenarios. However, objects in the BEV representation typically exhibit small sizes, and the associated point cloud context is inherently sparse, which leads to great challenges for reliable 3D perception. In this paper, we propose IS-Fusion, an innovative multimodal fusion framewo… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024; Code: https://github.com/yinjunbo/IS-Fusion

  44. arXiv:2403.13380  [pdf, other

    cs.CE

    A characteristics-based method for shock-ramp data analysis

    Authors: Jingxiang Shen, Wei Kang

    Abstract: For the data analysis problem of shock-ramp compression, i.e., ramp compression after a relatively strong initial shock, a characteristics-based method that strictly deals with the initial hydrodynamic shock is described in detail. Validation of this analysis method using simulated shock-ramp data generated by molecular dynamics and one-dimensional radiation hydrodynamic code is also presented.

    Submitted 20 March, 2024; originally announced March 2024.

  45. arXiv:2403.11886  [pdf, other

    cs.CL cs.AI

    QueryAgent: A Reliable and Efficient Reasoning Framework with Environmental Feedback-based Self-Correction

    Authors: Xiang Huang, Sitao Cheng, Shanshan Huang, Jiayu Shen, Yong Xu, Chaoyun Zhang, Yuzhong Qu

    Abstract: Employing Large Language Models (LLMs) for semantic parsing has achieved remarkable success. However, we find existing methods fall short in terms of reliability and efficiency when hallucinations are encountered. In this paper, we address these challenges with a framework called QueryAgent, which solves a question step-by-step and performs step-wise self-correction. We introduce an environmental… ▽ More

    Submitted 13 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted by ACL 2024 main conference. 22 pages,7 figures, 13 tables

  46. arXiv:2403.10616  [pdf, other

    cs.LG cs.CL

    DiPaCo: Distributed Path Composition

    Authors: Arthur Douillard, Qixuan Feng, Andrei A. Rusu, Adhiguna Kuncoro, Yani Donchev, Rachita Chhaparia, Ionel Gog, Marc'Aurelio Ranzato, Jiajun Shen, Arthur Szlam

    Abstract: Progress in machine learning (ML) has been fueled by scaling neural network models. This scaling has been enabled by ever more heroic feats of engineering, necessary for accommodating ML approaches that require high bandwidth communication between devices working in parallel. In this work, we propose a co-designed modular architecture and training approach for ML models, dubbed DIstributed PAth CO… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  47. arXiv:2403.09369  [pdf, other

    cs.NI

    PreConfig: A Pretrained Model for Automating Network Configuration

    Authors: Fuliang Li, Haozhi Lang, Jiajie Zhang, Jiaxing Shen, Xingwei Wang

    Abstract: Manual network configuration automation (NCA) tools face significant challenges in versatility and flexibility due to their reliance on extensive domain expertise and manual design, limiting their adaptability to diverse scenarios and complex application needs. This paper introduces PreConfig, an innovative NCA tool that leverages a pretrained language model for automating network configuration ta… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  48. arXiv:2403.08948  [pdf, ps, other

    eess.SY cs.GT

    Model-free Resilient Controller Design based on Incentive Feedback Stackelberg Game and Q-learning

    Authors: Jiajun Shen, Fengjun Li, Morteza Hashemi, Huazhen Fang

    Abstract: In the swift evolution of Cyber-Physical Systems (CPSs) within intelligent environments, especially in the industrial domain shaped by Industry 4.0, the surge in development brings forth unprecedented security challenges. This paper explores the intricate security issues of Industrial CPSs (ICPSs), with a specific focus on the unique threats presented by intelligent attackers capable of directly c… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 8 pages

  49. arXiv:2403.07187  [pdf, other

    cs.LG

    UPS: Efficiently Building Foundation Models for PDE Solving via Cross-Modal Adaptation

    Authors: Junhong Shen, Tanya Marwah, Ameet Talwalkar

    Abstract: We present Unified PDE Solvers (UPS), a data- and compute-efficient approach to developing unified neural operators for diverse families of spatiotemporal PDEs from various domains, dimensions, and resolutions. UPS embeds different PDEs into a shared representation space and processes them using a FNO-transformer architecture. Rather than training the network from scratch, which is data-demanding… ▽ More

    Submitted 23 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  50. arXiv:2403.00165  [pdf, other

    cs.CL cs.LG

    TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision

    Authors: Yunyi Zhang, Ruozhen Yang, Xueqiang Xu, Rui Li, Jinfeng Xiao, Jiaming Shen, Jiawei Han

    Abstract: Hierarchical text classification aims to categorize each document into a set of classes in a label taxonomy. Most earlier works focus on fully or semi-supervised methods that require a large amount of human annotated data which is costly and time-consuming to acquire. To alleviate human efforts, in this paper, we work on hierarchical text classification with the minimal amount of supervision: usin… ▽ More

    Submitted 16 June, 2024; v1 submitted 29 February, 2024; originally announced March 2024.