Zum Hauptinhalt springen

Showing 1–50 of 93 results for author: Mei, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.09181  [pdf, other

    cs.CV cs.CR cs.LG

    PADetBench: Towards Benchmarking Physical Attacks against Object Detection

    Authors: Jiawei Lian, Jianhong Pan, Lefan Wang, Yi Wang, Lap-Pui Chau, Shaohui Mei

    Abstract: Physical attacks against object detection have gained increasing attention due to their significant practical implications. However, conducting physical experiments is extremely time-consuming and labor-intensive. Moreover, physical dynamics and cross-domain transformation are challenging to strictly regulate in the real world, leading to unaligned evaluation and comparison, severely hindering the… ▽ More

    Submitted 17 August, 2024; originally announced August 2024.

  2. Flexible 3D Lane Detection by Hierarchical Shape MatchingFlexible 3D Lane Detection by Hierarchical Shape Matching

    Authors: Zhihao Guan, Ruixin Liu, Zejian Yuan, Ao Liu, Kun Tang, Tong Zhou, Erlong Li, Chao Zheng, Shuqi Mei

    Abstract: As one of the basic while vital technologies for HD map construction, 3D lane detection is still an open problem due to varying visual conditions, complex typologies, and strict demands for precision. In this paper, an end-to-end flexible and hierarchical lane detector is proposed to precisely predict 3D lane lines from point clouds. Specifically, we design a hierarchical network predicting flexib… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

  3. arXiv:2406.10261  [pdf, other

    cs.CL cs.AI

    FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination

    Authors: Pengfei Zhou, Weiqing Min, Chaoran Fu, Ying Jin, Mingyu Huang, Xiangyang Li, Shuhuan Mei, Shuqiang Jiang

    Abstract: Food is foundational to human life, serving not only as a source of nourishment but also as a cornerstone of cultural identity and social interaction. As the complexity of global dietary needs and preferences grows, food intelligence is needed to enable food perception and reasoning for various tasks, ranging from recipe generation and dietary recommendation to diet-disease correlation discovery a… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 32 pages, 19 figures

  4. arXiv:2406.08654  [pdf, other

    stat.ML cs.LG math.OC

    Large Stepsize Gradient Descent for Non-Homogeneous Two-Layer Networks: Margin Improvement and Fast Optimization

    Authors: Yuhang Cai, Jingfeng Wu, Song Mei, Michael Lindsey, Peter L. Bartlett

    Abstract: The typical training of neural networks using large stepsize gradient descent (GD) under the logistic loss often involves two distinct phases, where the empirical risk oscillates in the first phase but decreases monotonically in the second phase. We investigate this phenomenon in two-layer networks that satisfy a near-homogeneity condition. We show that the second phase begins once the empirical r… ▽ More

    Submitted 26 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: Clarify our results on sigmoid neural networks

  5. arXiv:2405.19079  [pdf, other

    eess.IV cs.CV

    On the Influence of Smoothness Constraints in Computed Tomography Motion Compensation

    Authors: Mareike Thies, Fabian Wagner, Noah Maul, Siyuan Mei, Mingxuan Gu, Laura Pfaff, Nastassia Vysotskaya, Haijun Yu, Andreas Maier

    Abstract: Computed tomography (CT) relies on precise patient immobilization during image acquisition. Nevertheless, motion artifacts in the reconstructed images can persist. Motion compensation methods aim to correct such artifacts post-acquisition, often incorporating temporal smoothness constraints on the estimated motion patterns. This study analyzes the influence of a spline-based motion model within an… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  6. arXiv:2405.03239  [pdf, other

    cs.LG cs.AI

    Deep Learning for Detecting and Early Predicting Chronic Obstructive Pulmonary Disease from Spirogram Time Series: A UK Biobank Study

    Authors: Shuhao Mei, Yuxi Zhou, Jiahao Xu, Yuxuan Wan, Shan Cao, Qinghao Zhao, Shijia Geng, Junqing Xie, Shenda Hong

    Abstract: Chronic Obstructive Pulmonary Disease (COPD) is a chronic inflammatory lung condition that causes airflow obstruction. The existing methods can only detect patients who already have COPD based on obvious features shown in the spirogram (In this article, the spirogram specifically involves measuring Volume-Flow curve time series). Early prediction of COPD risk is vital for monitoring COPD disease p… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  7. arXiv:2404.18444  [pdf, other

    cs.LG cs.AI math.ST stat.ML

    U-Nets as Belief Propagation: Efficient Classification, Denoising, and Diffusion in Generative Hierarchical Models

    Authors: Song Mei

    Abstract: U-Nets are among the most widely used architectures in computer vision, renowned for their exceptional performance in applications such as image segmentation, denoising, and diffusion modeling. However, a theoretical explanation of the U-Net architecture design has not yet been fully established. This paper introduces a novel interpretation of the U-Net architecture by studying certain generativ… ▽ More

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: v2 updated discussions of related literature

  8. arXiv:2404.14807  [pdf, other

    cs.CV

    Reference-Free Multi-Modality Volume Registration of X-Ray Microscopy and Light-Sheet Fluorescence Microscopy

    Authors: Siyuan Mei, Fuxin Fan, Mareike Thies, Mingxuan Gu, Fabian Wagner, Oliver Aust, Ina Erceg, Zeynab Mirzaei, Georgiana Neag, Yipeng Sun, Yixing Huang, Andreas Maier

    Abstract: Recently, X-ray microscopy (XRM) and light-sheet fluorescence microscopy (LSFM) have emerged as two pivotal imaging tools in preclinical research on bone remodeling diseases, offering micrometer-level resolution. Integrating these complementary modalities provides a holistic view of bone microstructures, facilitating function-oriented volume analysis across different disease cycles. However, regis… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  9. arXiv:2404.14747  [pdf, other

    cs.CV

    Differentiable Score-Based Likelihoods: Learning CT Motion Compensation From Clean Images

    Authors: Mareike Thies, Noah Maul, Siyuan Mei, Laura Pfaff, Nastassia Vysotskaya, Mingxuan Gu, Jonas Utz, Dennis Possart, Lukas Folle, Fabian Wagner, Andreas Maier

    Abstract: Motion artifacts can compromise the diagnostic value of computed tomography (CT) images. Motion correction approaches require a per-scan estimation of patient-specific motion patterns. In this work, we train a score-based model to act as a probability density estimator for clean head CT images. Given the trained model, we quantify the deviation of a given motion-affected CT image from the ideal di… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  10. arXiv:2404.07771  [pdf, other

    cs.LG math.ST stat.ML

    An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization

    Authors: Minshuo Chen, Song Mei, Jianqing Fan, Mengdi Wang

    Abstract: Diffusion models, a powerful and universal generative AI technology, have achieved tremendous success in computer vision, audio, reinforcement learning, and computational biology. In these applications, diffusion models provide flexible high-dimensional data modeling, and act as a sampler for generating new samples under active guidance towards task-desired properties. Despite the significant empi… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  11. arXiv:2404.05868  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Negative Preference Optimization: From Catastrophic Collapse to Effective Unlearning

    Authors: Ruiqi Zhang, Licong Lin, Yu Bai, Song Mei

    Abstract: Large Language Models (LLMs) often memorize sensitive, private, or copyrighted data during pre-training. LLM unlearning aims to eliminate the influence of undesirable data from the pre-trained model while preserving the model's utilities on other tasks. Several practical methods have recently been proposed for LLM unlearning, mostly based on gradient ascent (GA) on the loss of undesirable data. Ho… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  12. arXiv:2404.03541  [pdf, other

    eess.IV cs.CV

    Segmentation-Guided Knee Radiograph Generation using Conditional Diffusion Models

    Authors: Siyuan Mei, Fuxin Fan, Fabian Wagner, Mareike Thies, Mingxuan Gu, Yipeng Sun, Andreas Maier

    Abstract: Deep learning-based medical image processing algorithms require representative data during development. In particular, surgical data might be difficult to obtain, and high-quality public datasets are limited. To overcome this limitation and augment datasets, a widely adopted solution is the generation of synthetic images. In this work, we employ conditional diffusion models to generate knee radiog… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  13. arXiv:2403.14440  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Analysing Diffusion Segmentation for Medical Images

    Authors: Mathias Öttl, Siyuan Mei, Frauke Wilm, Jana Steenpass, Matthias Rübner, Arndt Hartmann, Matthias Beckmann, Peter Fasching, Andreas Maier, Ramona Erber, Katharina Breininger

    Abstract: Denoising Diffusion Probabilistic models have become increasingly popular due to their ability to offer probabilistic modeling and generate diverse outputs. This versatility inspired their adaptation for image segmentation, where multiple predictions of the model can produce segmentation results that not only achieve high quality but also capture the uncertainty inherent in the model. Here, powerf… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  14. arXiv:2403.10695  [pdf, other

    eess.IV cs.CV

    EAGLE: An Edge-Aware Gradient Localization Enhanced Loss for CT Image Reconstruction

    Authors: Yipeng Sun, Yixing Huang, Linda-Sophie Schneider, Mareike Thies, Mingxuan Gu, Siyuan Mei, Siming Bayer, Andreas Maier

    Abstract: Computed Tomography (CT) image reconstruction is crucial for accurate diagnosis and deep learning approaches have demonstrated significant potential in improving reconstruction quality. However, the choice of loss function profoundly affects the reconstructed images. Traditional mean squared error loss often produces blurry images lacking fine details, while alternatives designed to improve may in… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Preprint

  15. arXiv:2402.19456  [pdf, other

    quant-ph cs.DS math.PR math.ST stat.ML

    Statistical Estimation in the Spiked Tensor Model via the Quantum Approximate Optimization Algorithm

    Authors: Leo Zhou, Joao Basso, Song Mei

    Abstract: The quantum approximate optimization algorithm (QAOA) is a general-purpose algorithm for combinatorial optimization. In this paper, we analyze the performance of the QAOA on a statistical estimation problem, namely, the spiked tensor model, which exhibits a statistical-computational gap classically. We prove that the weak recovery threshold of $1$-step QAOA matches that of $1$-step tensor power it… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 51 pages, 4 figures, 1 table

  16. arXiv:2402.19161  [pdf, other

    cs.CV cs.AI cs.RO

    MemoNav: Working Memory Model for Visual Navigation

    Authors: Hongxin Li, Zeyu Wang, Xu Yang, Yuran Yang, Shuqi Mei, Zhaoxiang Zhang

    Abstract: Image-goal navigation is a challenging task that requires an agent to navigate to a goal indicated by an image in unfamiliar environments. Existing methods utilizing diverse scene memories suffer from inefficient exploration since they use all historical observations for decision-making without considering the goal-relevant fraction. To address this limitation, we present MemoNav, a novel memory m… ▽ More

    Submitted 28 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted to CVPR 2024. Code: https://github.com/ZJULiHongxin/MemoNav

  17. arXiv:2402.15688  [pdf, other

    cs.LG

    Anchor-free Clustering based on Anchor Graph Factorization

    Authors: Shikun Mei, Fangfang Li, Quanxue Gao, Ming Yang

    Abstract: Anchor-based methods are a pivotal approach in handling clustering of large-scale data. However, these methods typically entail two distinct stages: selecting anchor points and constructing an anchor graph. This bifurcation, along with the initialization of anchor points, significantly influences the overall performance of the algorithm. To mitigate these issues, we introduce a novel method termed… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  18. arXiv:2401.16039  [pdf, other

    eess.IV cs.CV cs.LG

    Data-Driven Filter Design in FBP: Transforming CT Reconstruction with Trainable Fourier Series

    Authors: Yipeng Sun, Linda-Sophie Schneider, Fuxin Fan, Mareike Thies, Mingxuan Gu, Siyuan Mei, Yuzhong Zhou, Siming Bayer, Andreas Maier

    Abstract: In this study, we introduce a Fourier series-based trainable filter for computed tomography (CT) reconstruction within the filtered backprojection (FBP) framework. This method overcomes the limitation in noise reduction, inherent in conventional FBP methods, by optimizing Fourier series coefficients to construct the filter. This method enables robust performance across different resolution scales… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Preprint

  19. arXiv:2401.09283  [pdf, other

    eess.IV cs.CV

    A gradient-based approach to fast and accurate head motion compensation in cone-beam CT

    Authors: Mareike Thies, Fabian Wagner, Noah Maul, Haijun Yu, Manuela Meier, Linda-Sophie Schneider, Mingxuan Gu, Siyuan Mei, Lukas Folle, Andreas Maier

    Abstract: Cone-beam computed tomography (CBCT) systems, with their portability, present a promising avenue for direct point-of-care medical imaging, particularly in critical scenarios such as acute stroke assessment. However, the integration of CBCT into clinical workflows faces challenges, primarily linked to long scan duration resulting in patient motion during scanning and leading to image quality degrad… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  20. arXiv:2311.12320  [pdf, other

    cs.AI

    A Survey on Multimodal Large Language Models for Autonomous Driving

    Authors: Can Cui, Yunsheng Ma, Xu Cao, Wenqian Ye, Yang Zhou, Kaizhao Liang, Jintai Chen, Juanwu Lu, Zichong Yang, Kuei-Da Liao, Tianren Gao, Erlong Li, Kun Tang, Zhipeng Cao, Tong Zhou, Ao Liu, Xinrui Yan, Shuqi Mei, Jianguo Cao, Ziran Wang, Chao Zheng

    Abstract: With the emergence of Large Language Models (LLMs) and Vision Foundation Models (VFMs), multimodal AI systems benefiting from large models have the potential to equally perceive the real world, make decisions, and control tools as humans. In recent months, LLMs have shown widespread attention in autonomous driving and map systems. Despite its immense potential, there is still a lack of a comprehen… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  21. arXiv:2311.08442  [pdf, other

    math.ST cs.LG stat.ML

    Mean-field variational inference with the TAP free energy: Geometric and statistical properties in linear models

    Authors: Michael Celentano, Zhou Fan, Licong Lin, Song Mei

    Abstract: We study mean-field variational inference in a Bayesian linear model when the sample size n is comparable to the dimension p. In high dimensions, the common approach of minimizing a Kullback-Leibler divergence from the posterior distribution, or maximizing an evidence lower bound, may deviate from the true posterior mean and underestimate posterior uncertainty. We study instead minimization of the… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 79 pages, 5 figures

  22. arXiv:2311.01753  [pdf, other

    cs.MA cs.AI cs.LG

    RiskQ: Risk-sensitive Multi-Agent Reinforcement Learning Value Factorization

    Authors: Siqi Shen, Chennan Ma, Chao Li, Weiquan Liu, Yongquan Fu, Songzhu Mei, Xinwang Liu, Cheng Wang

    Abstract: Multi-agent systems are characterized by environmental uncertainty, varying policies of agents, and partial observability, which result in significant risks. In the context of Multi-Agent Reinforcement Learning (MARL), learning coordinated and decentralized policies that are sensitive to risk is challenging. To formulate the coordination requirements in risk-sensitive MARL, we introduce the Risk-s… ▽ More

    Submitted 21 March, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: Accepted at NeurIPS 2023

  23. arXiv:2310.14037  [pdf, other

    cs.IR

    MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module Plugin

    Authors: Tianshuo Zhou, Sen Mei, Xinze Li, Zhenghao Liu, Chenyan Xiong, Zhiyuan Liu, Yu Gu, Ge Yu

    Abstract: This paper proposes Multi-modAl Retrieval model via Visual modulE pLugin (MARVEL), which learns an embedding space for queries and multi-modal documents to conduct retrieval. MARVEL encodes queries and multi-modal documents with a unified encoder model, which helps to alleviate the modality gap between images and texts. Specifically, we enable the image understanding ability of the well-trained de… ▽ More

    Submitted 15 June, 2024; v1 submitted 21 October, 2023; originally announced October 2023.

  24. arXiv:2310.10616  [pdf, other

    cs.LG

    How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations

    Authors: Tianyu Guo, Wei Hu, Song Mei, Huan Wang, Caiming Xiong, Silvio Savarese, Yu Bai

    Abstract: While large language models based on the transformer architecture have demonstrated remarkable in-context learning (ICL) capabilities, understandings of such capabilities are still in an early stage, where existing theory and mechanistic understanding focus mostly on simple scenarios such as learning simple function classes. This paper takes initial steps on understanding ICL in more complex scena… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  25. arXiv:2310.08566  [pdf, other

    cs.LG cs.AI cs.CL math.ST stat.ML

    Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining

    Authors: Licong Lin, Yu Bai, Song Mei

    Abstract: Large transformer models pretrained on offline reinforcement learning datasets have demonstrated remarkable in-context reinforcement learning (ICRL) capabilities, where they can make good decisions when prompted with interaction trajectories from unseen environments. However, when and how transformers can be trained to perform ICRL have not been theoretically well-understood. In particular, it is… ▽ More

    Submitted 26 May, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  26. arXiv:2310.03845  [pdf, other

    astro-ph.EP astro-ph.IM cs.LG

    Euclid: Identification of asteroid streaks in simulated images using deep learning

    Authors: M. Pöntinen, M. Granvik, A. A. Nucita, L. Conversi, B. Altieri, B. Carry, C. M. O'Riordan, D. Scott, N. Aghanim, A. Amara, L. Amendola, N. Auricchio, M. Baldi, D. Bonino, E. Branchini, M. Brescia, S. Camera, V. Capobianco, C. Carbone, J. Carretero, M. Castellano, S. Cavuoti, A. Cimatti, R. Cledassou, G. Congedo , et al. (92 additional authors not shown)

    Abstract: Up to 150000 asteroids will be visible in the images of the ESA Euclid space telescope, and the instruments of Euclid offer multiband visual to near-infrared photometry and slitless spectra of these objects. Most asteroids will appear as streaks in the images. Due to the large number of images and asteroids, automated detection methods are needed. A non-machine-learning approach based on the Strea… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: 18 pages, 11 figures

    Journal ref: A&A 679, A135 (2023)

  27. arXiv:2309.14241  [pdf, other

    cs.CV

    Informative Data Mining for One-Shot Cross-Domain Semantic Segmentation

    Authors: Yuxi Wang, Jian Liang, Jun Xiao, Shuqi Mei, Yuran Yang, Zhaoxiang Zhang

    Abstract: Contemporary domain adaptation offers a practical solution for achieving cross-domain transfer of semantic segmentation between labeled source data and unlabeled target data. These solutions have gained significant popularity; however, they require the model to be retrained when the test environment changes. This can result in unbearable costs in certain applications due to the time-consuming trai… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted by ICCV 2023

  28. arXiv:2309.11420  [pdf, ps, other

    cs.LG math.ST stat.ML

    Deep Networks as Denoising Algorithms: Sample-Efficient Learning of Diffusion Models in High-Dimensional Graphical Models

    Authors: Song Mei, Yuchen Wu

    Abstract: We investigate the approximation efficiency of score functions by deep neural networks in diffusion-based generative modeling. While existing approximation theories utilize the smoothness of score functions, they suffer from the curse of dimensionality for intrinsically high-dimensional data. This limitation is pronounced in graphical models such as Markov random fields, common for image distribut… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: 41 pages

  29. arXiv:2308.14029  [pdf, other

    cs.IR cs.AI

    Text Matching Improves Sequential Recommendation by Reducing Popularity Biases

    Authors: Zhenghao Liu, Sen Mei, Chenyan Xiong, Xiaohua Li, Shi Yu, Zhiyuan Liu, Yu Gu, Ge Yu

    Abstract: This paper proposes Text mAtching based SequenTial rEcommendation model (TASTE), which maps items and users in an embedding space and recommends items by matching their text representations. TASTE verbalizes items and user-item interactions using identifiers and attributes of items. To better characterize user behaviors, TASTE additionally proposes an attention sparsity method, which enables TASTE… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: Accepted by CIKM 2023

  30. arXiv:2308.05967  [pdf, other

    cs.CV

    YOLOrtho -- A Unified Framework for Teeth Enumeration and Dental Disease Detection

    Authors: Shenxiao Mei, Chenglong Ma, Feihong Shen, Huikai Wu

    Abstract: Detecting dental diseases through panoramic X-rays images is a standard procedure for dentists. Normally, a dentist need to identify diseases and find the infected teeth. While numerous machine learning models adopting this two-step procedure have been developed, there has not been an end-to-end model that can identify teeth and their associated diseases at the same time. To fill the gap, we devel… ▽ More

    Submitted 4 September, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

  31. arXiv:2307.11411  [pdf, other

    cs.CV cs.AI

    Deep Directly-Trained Spiking Neural Networks for Object Detection

    Authors: Qiaoyi Su, Yuhong Chou, Yifan Hu, Jianing Li, Shijie Mei, Ziyang Zhang, Guoqi Li

    Abstract: Spiking neural networks (SNNs) are brain-inspired energy-efficient models that encode information in spatiotemporal dynamics. Recently, deep SNNs trained directly have shown great success in achieving high performance on classification tasks with very few time steps. However, how to design a directly-trained SNN for the regression task of object detection still remains a challenging problem. To ad… ▽ More

    Submitted 26 July, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: Accepted by ICCV2023

  32. arXiv:2307.11353  [pdf, other

    cs.LG math.ST stat.ML

    What can a Single Attention Layer Learn? A Study Through the Random Features Lens

    Authors: Hengyu Fu, Tianyu Guo, Yu Bai, Song Mei

    Abstract: Attention layers -- which map a sequence of inputs to a sequence of outputs -- are core building blocks of the Transformer architecture which has achieved significant breakthroughs in modern artificial intelligence. This paper presents a rigorous theoretical study on the learning and generalization of a single multi-head attention layer, with a sequence of key vectors and a separate query vector a… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

    Comments: 41pages, 5 figures

  33. arXiv:2306.12111  [pdf, other

    cs.CV

    A Comprehensive Study on the Robustness of Image Classification and Object Detection in Remote Sensing: Surveying and Benchmarking

    Authors: Shaohui Mei, Jiawei Lian, Xiaofei Wang, Yuru Su, Mingyang Ma, Lap-Pui Chau

    Abstract: Deep neural networks (DNNs) have found widespread applications in interpreting remote sensing (RS) imagery. However, it has been demonstrated in previous works that DNNs are vulnerable to different types of noises, particularly adversarial noises. Surprisingly, there has been a lack of comprehensive studies on the robustness of RS tasks, prompting us to undertake a thorough survey and benchmark on… ▽ More

    Submitted 15 September, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

  34. arXiv:2306.04637  [pdf, other

    cs.LG cs.AI cs.CL math.ST stat.ML

    Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection

    Authors: Yu Bai, Fan Chen, Huan Wang, Caiming Xiong, Song Mei

    Abstract: Neural sequence models based on the transformer architecture have demonstrated remarkable \emph{in-context learning} (ICL) abilities, where they can perform new tasks when prompted with training and test examples, without any parameter update to the model. This work first provides a comprehensive statistical theory for transformers to perform ICL. Concretely, we show that transformers can implemen… ▽ More

    Submitted 6 July, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: V2 releases code

  35. arXiv:2303.00449  [pdf, other

    cs.CV

    Exploring Epipolar Consistency Conditions for Rigid Motion Compensation in In-vivo X-ray Microscopy

    Authors: Mareike Thies, Fabian Wagner, Mingxuan Gu, Siyuan Mei, Yixing Huang, Sabrina Pechmann, Oliver Aust, Daniela Weidner, Georgiana Neag, Stefan Uderhardt, Georg Schett, Silke Christiansen, Andreas Maier

    Abstract: Intravital X-ray microscopy (XRM) in preclinical mouse models is of vital importance for the identification of microscopic structural pathological changes in the bone which are characteristic of osteoporosis. The complexity of this method stems from the requirement for high-quality 3D reconstructions of the murine bones. However, respiratory motion and muscle relaxation lead to inconsistencies in… ▽ More

    Submitted 28 February, 2024; v1 submitted 1 March, 2023; originally announced March 2023.

  36. CBA: Contextual Background Attack against Optical Aerial Detection in the Physical World

    Authors: Jiawei Lian, Xiaofei Wang, Yuru Su, Mingyang Ma, Shaohui Mei

    Abstract: Patch-based physical attacks have increasingly aroused concerns. However, most existing methods focus on obscuring targets captured on the ground, and some of these methods are simply extended to deceive aerial detectors. They smear the targeted objects in the physical world with the elaborated adversarial patches, which can only slightly sway the aerial detectors' prediction and with weak att… ▽ More

    Submitted 23 March, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  37. arXiv:2302.13487  [pdf, other

    cs.CV

    Contextual adversarial attack against aerial detection in the physical world

    Authors: Jiawei Lian, Xiaofei Wang, Yuru Su, Mingyang Ma, Shaohui Mei

    Abstract: Deep Neural Networks (DNNs) have been extensively utilized in aerial detection. However, DNNs' sensitivity and vulnerability to maliciously elaborated adversarial examples have progressively garnered attention. Recently, physical attacks have gradually become a hot issue due to they are more practical in the real world, which poses great threats to some security-critical applications. In this pape… ▽ More

    Submitted 26 February, 2023; originally announced February 2023.

  38. arXiv:2302.01333  [pdf, other

    cs.LG cs.IT math.ST stat.ML

    Lower Bounds for Learning in Revealing POMDPs

    Authors: Fan Chen, Huan Wang, Caiming Xiong, Song Mei, Yu Bai

    Abstract: This paper studies the fundamental limits of reinforcement learning (RL) in the challenging \emph{partially observable} setting. While it is well-established that learning in Partially Observable Markov Decision Processes (POMDPs) requires exponentially many samples in the worst case, a surge of recent work shows that polynomial sample complexities are achievable under the \emph{revealing conditio… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

  39. arXiv:2301.02060  [pdf, ps, other

    math.OC cs.LG math.NA stat.ML

    A first-order augmented Lagrangian method for constrained minimax optimization

    Authors: Zhaosong Lu, Sanyou Mei

    Abstract: In this paper we study a class of constrained minimax problems. In particular, we propose a first-order augmented Lagrangian method for solving them, whose subproblems turn out to be a much simpler structured minimax problem and are suitably solved by a first-order method recently developed in [26] by the authors. Under some suitable assumptions, an \emph{operation complexity} of… ▽ More

    Submitted 17 April, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: added a new section. arXiv admin note: substantial text overlap with arXiv:2301.01716

    MSC Class: 90C26; 90C30; 90C47; 90C99; 65K05

  40. arXiv:2301.01716  [pdf, ps, other

    math.OC cs.LG math.NA stat.ML

    First-order penalty methods for bilevel optimization

    Authors: Zhaosong Lu, Sanyou Mei

    Abstract: In this paper we study a class of unconstrained and constrained bilevel optimization problems in which the lower level is a possibly nonsmooth convex optimization problem, while the upper level is a possibly nonconvex optimization problem. We introduce a notion of $\varepsilon$-KKT solution for them and show that an $\varepsilon$-KKT solution leads to an $O(\sqrt{\varepsilon})$- or… ▽ More

    Submitted 7 March, 2024; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: Accepted by SIAM Journal on Optimization

    MSC Class: 90C26; 90C30; 90C47; 90C99; 65K05

  41. arXiv:2212.11123  [pdf, other

    cs.CV cs.AI cs.RO

    THMA: Tencent HD Map AI System for Creating HD Map Annotations

    Authors: Kun Tang, Xu Cao, Zhipeng Cao, Tong Zhou, Erlong Li, Ao Liu, Shengtao Zou, Chang Liu, Shuqi Mei, Elena Sizikova, Chao Zheng

    Abstract: Nowadays, autonomous vehicle technology is becoming more and more mature. Critical to progress and safety, high-definition (HD) maps, a type of centimeter-level map collected using a laser sensor, provide accurate descriptions of the surrounding environment. The key challenge of HD map production is efficient, high-quality collection and annotation of large-volume datasets. Due to the demand for h… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: IAAI 2023

  42. arXiv:2212.06604  [pdf, ps, other

    cs.CR

    Plausible deniability for privacy-preserving data synthesis

    Authors: Song Mei, Zhiqiang Ye

    Abstract: In the field of privacy protection, publishing complete data (especially high-dimensional data sets) is one of the most challenging problems. The common encryption technology can not deal with the attacker to take differential attack to obtain sensitive information, while the existing differential privacy protection algorithm model takes a long time for high-dimensional calculation and needs to ad… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

  43. arXiv:2212.06449  [pdf, other

    cs.SI cs.PF

    A Novel Location Free Link Prediction in Multiplex Social Networks

    Authors: Song Mei, Cong Zhen

    Abstract: In recent decades, the emergence of social networks has enabled internet service providers (e.g., Facebook, Twitter and Uber) to achieve great commercial success. Link prediction is recognized as a common practice to build the topology of social networks and keep them evolving. Conventionally, link prediction methods are dependent of location information of users, which suffers from information le… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

  44. arXiv:2211.02778  [pdf, other

    math.ST cs.LG

    Near-optimal multiple testing in Bayesian linear models with finite-sample FDR control

    Authors: Taejoo Ahn, Licong Lin, Song Mei

    Abstract: In high dimensional variable selection problems, statisticians often seek to design multiple testing procedures that control the False Discovery Rate (FDR), while concurrently identifying a greater number of relevant variables. Model-X methods, such as Knockoffs and conditional randomization tests, achieve the primary goal of finite-sample FDR control, assuming a known distribution of covariates.… ▽ More

    Submitted 21 July, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: V3 releases code

  45. Benchmarking Adversarial Patch Against Aerial Detection

    Authors: Jiawei Lian, Shaohui Mei, Shun Zhang, Mingyang Ma

    Abstract: DNNs are vulnerable to adversarial examples, which poses great security concerns for security-critical systems. In this paper, a novel adaptive-patch-based physical attack (AP-PA) framework is proposed, which aims to generate adversarial patches that are adaptive in both physical dynamics and varying scales, and by which the particular targets can be hidden from being detected. Furthermore, the ad… ▽ More

    Submitted 30 October, 2022; originally announced October 2022.

    Comments: 14 pages, 14 figures

  46. arXiv:2209.14990  [pdf, ps, other

    cs.LG cs.AI math.ST stat.ML

    Partially Observable RL with B-Stability: Unified Structural Condition and Sharp Sample-Efficient Algorithms

    Authors: Fan Chen, Yu Bai, Song Mei

    Abstract: Partial Observability -- where agents can only observe partial information about the true underlying state of the system -- is ubiquitous in real-world applications of Reinforcement Learning (RL). Theoretically, learning a near-optimal policy under partial observability is known to be hard in the worst case due to an exponential sample complexity lower bound. Recent work has identified several tra… ▽ More

    Submitted 16 December, 2022; v1 submitted 29 September, 2022; originally announced September 2022.

  47. arXiv:2209.11745  [pdf, other

    cs.LG cs.AI math.ST stat.ML

    Unified Algorithms for RL with Decision-Estimation Coefficients: PAC, Reward-Free, Preference-Based Learning, and Beyond

    Authors: Fan Chen, Song Mei, Yu Bai

    Abstract: Modern Reinforcement Learning (RL) is more than just learning the optimal policy; Alternative learning goals such as exploring the environment, estimating the underlying model, and learning from preference feedback are all of practical importance. While provably sample-efficient algorithms for each specific goal have been proposed, these algorithms often depend strongly on the particular learning… ▽ More

    Submitted 28 April, 2024; v1 submitted 23 September, 2022; originally announced September 2022.

  48. arXiv:2209.10770  [pdf, other

    cs.CV cs.AI

    Multi-level Adversarial Spatio-temporal Learning for Footstep Pressure based FoG Detection

    Authors: Kun Hu, Shaohui Mei, Wei Wang, Kaylena A. Ehgoetz Martens, Liang Wang, Simon J. G. Lewis, David D. Feng, Zhiyong Wang

    Abstract: Freezing of gait (FoG) is one of the most common symptoms of Parkinson's disease, which is a neurodegenerative disorder of the central nervous system impacting millions of people around the world. To address the pressing need to improve the quality of treatment for FoG, devising a computer-aided detection and quantification tool for FoG has been increasingly important. As a non-invasive technique… ▽ More

    Submitted 22 September, 2022; originally announced September 2022.

  49. arXiv:2209.09733  [pdf, other

    eess.IV cs.CV

    Metal Inpainting in CBCT Projections Using Score-based Generative Model

    Authors: Siyuan Mei, Fuxin Fan, Andreas Maier

    Abstract: During orthopaedic surgery, the inserting of metallic implants or screws are often performed under mobile C-arm systems. Due to the high attenuation of metals, severe metal artifacts occur in 3D reconstructions, which degrade the image quality greatly. To reduce the artifacts, many metal artifact reduction algorithms have been developed and metal inpainting in projection domain is an essential ste… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

  50. arXiv:2209.05996  [pdf, other

    cs.CV

    M$^2$-3DLaneNet: Exploring Multi-Modal 3D Lane Detection

    Authors: Yueru Luo, Xu Yan, Chaoda Zheng, Chao Zheng, Shuqi Mei, Tang Kun, Shuguang Cui, Zhen Li

    Abstract: Estimating accurate lane lines in 3D space remains challenging due to their sparse and slim nature. Previous works mainly focused on using images for 3D lane detection, leading to inherent projection error and loss of geometry information. To address these issues, we explore the potential of leveraging LiDAR for 3D lane detection, either as a standalone method or in combination with existing monoc… ▽ More

    Submitted 8 August, 2023; v1 submitted 13 September, 2022; originally announced September 2022.

    Comments: update