Showing 151–200 of 623 results for author: Qi, X

  1. arXiv:2304.08480

    cs.CV cs.AI cs.CL

    DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training

    Authors: Yihao Chen, Xianbiao Qi, Jianan Wang, Lei Zhang

    Abstract: We propose DisCo-CLIP, a distributed memory-efficient CLIP training approach, to reduce the memory consumption of contrastive loss when training contrastive learning models. Our approach decomposes the contrastive loss and its gradient computation into two parts, one to calculate the intra-GPU gradients and the other to compute the inter-GPU gradients. According to our decomposition, only the intr…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: To appear in CVPR 2023 as a highlight, our code will be public at https://github.com/IDEA-Research/DisCo-CLIP

  2. arXiv:2304.07051

    cs.CV cs.AI

    The Second Monocular Depth Estimation Challenge

    Authors: Jaime Spencer, C. Stella Qian, Michaela Trescakova, Chris Russell, Simon Hadfield, Erich W. Graf, Wendy J. Adams, Andrew J. Schofield, James Elder, Richard Bowden, Ali Anwar, Hao Chen, Xiaozhi Chen, Kai Cheng, Yuchao Dai, Huynh Thai Hoa, Sadat Hossain, Jianmian Huang, Mohan Jing, Bo Li, Chao Li, Baojun Li, Zhiwen Liu, Stefano Mattoccia, Siegfried Mercelis, et al. (18 additional authors not shown)

    Abstract: This paper discusses the results for the second edition of the Monocular Depth Estimation Challenge (MDEC). This edition was open to methods using any form of supervision, including fully-supervised, self-supervised, multi-task or proxy depth. The challenge was based around the SYNS-Patches dataset, which features a wide diversity of environments with high-quality dense ground-truth. This includes…

    Submitted 26 April, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: Published at CVPRW2023

  3. arXiv:2304.06219

    physics.plasm-ph

    Transport of intense ion beams in plasmas: collimation and energy-loss reduction

    Authors: Yongtao Zhao, Benzheng Chen, Dong Wu, Rui Cheng, Xianming Zhou, Yu Lei, Yuyu Wang, Xin Qi, Guoqing Xiao, Jieru Ren, Xing Wang, Dieter H. H. Hoffmann, Fei Gao, Zhanghu Hu, Younian Wang, Wei Yu, Stephan Fritzsche, Xiantu He

    Abstract: We compare the transport properties of a well-characterized hydrogen plasma for low and high current ion beams. The energy-loss of low current beams can be well understood, within the framework of current stopping power models. However, for high current proton beams, significant energy-loss reduction and collimation is observed in the experiment. We have developed a new particle-in-cell code, whic…

    Submitted 12 April, 2023; originally announced April 2023.

  4. arXiv:2304.00962

    cs.CV cs.AI

    RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding

    Authors: Jihan Yang, Runyu Ding, Weipeng Deng, Zhe Wang, Xiaojuan Qi

    Abstract: We propose a lightweight and scalable Regional Point-Language Contrastive learning framework, namely RegionPLC, for open-world 3D scene understanding, aiming to identify and recognize open-set objects and categories. Specifically, based on our empirical studies, we introduce a 3D-aware SFusion strategy that fuses 3D vision-language pairs derived from multiple 2D foundation models, yieldin…

    Submitted 5 May, 2024; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: To appear in CVPR 2024. Project page: https://jihanyang.github.io/projects/RegionPLC

  5. arXiv:2303.15713

    cond-mat.mtrl-sci

    Robust 3.7 V-Na$_{2/3}$[Cu$_{1/3}$Mn$_{2/3}$]O$_2$ Cathode for Na-ion Batteries

    Authors: Xiaohui Rong, Xingguo Qi, Quan Zhou, Libin Kang, Dongdong Xiao, Ruijuan Xiao, Feixiang Ding, Yang Yang, Yuan Liu, Yun Su, Shiguang Zhang, Lunhua He, Yaxiang Lu, Liquan Chen, Yong-Sheng Hu

    Abstract: Na-ion batteries (NIBs), which are recognized as a next-generation alternative technology for energy storage, still suffer from commercialization constraints due to the lack of low-cost, high-performance cathode materials. Since our first discovery of Cu$^{3+}$/Cu$^{2+}$ electrochemistry in 2014, numerous Cu-substituted/doped materials have been designed for NIBs. However for almost ten years, the…

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: 15 pages, 3 figures, 1 table

  6. arXiv:2303.15181

    cs.CV

    DreamStone: Image as Stepping Stone for Text-Guided 3D Shape Generation

    Authors: Zhengzhe Liu, Peng Dai, Ruihui Li, Xiaojuan Qi, Chi-Wing Fu

    Abstract: In this paper, we present a new text-guided 3D shape generation approach DreamStone that uses images as a stepping stone to bridge the gap between text and shape modalities for generating 3D shapes without requiring paired text and 3D data. The core of our approach is a two-stage feature-space alignment strategy that leverages a pre-trained single-view reconstruction (SVR) model to map CLIP featur…

    Submitted 23 September, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

  7. Inflation and Dark Matter in the $Z_5$ Model

    Authors: XinXin Qi, Hao Sun

    Abstract: We discuss the possibility of unifying dark matter physics and inflation in the $Z_5$ model of the two-component dark matter. Inflation driven by the two-component dark matter fields can be divided into two cases, singlet dark matter inflation and mixed dark matter inflation, where both two-component play the role of inflaton in the latter case. For dark matter, we focus on the mixed dark matter i…

    Submitted 28 April, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Journal ref: JCAP05(2023)051

  8. arXiv:2303.14893

    cs.CV

    Context-Aware Transformer for 3D Point Cloud Automatic Annotation

    Authors: Xiaoyan Qian, Chang Liu, Xiaojuan Qi, Siew-Chong Tan, Edmund Lam, Ngai Wong

    Abstract: 3D automatic annotation has received increased attention since manually annotating 3D point clouds is laborious. However, existing methods are usually complicated, e.g., pipelined training for 3D foreground/background segmentation, cylindrical object proposals, and point completion. Furthermore, they often overlook the inter-object feature relation that is particularly informative to hard samples…

    Submitted 26 March, 2023; originally announced March 2023.

  9. arXiv:2303.14727

    cs.CV

    You Only Need One Thing One Click: Self-Training for Weakly Supervised 3D Scene Understanding

    Authors: Zhengzhe Liu, Xiaojuan Qi, Chi-Wing Fu

    Abstract: 3D scene understanding, e.g., point cloud semantic and instance segmentation, often requires large-scale annotated training data, but clearly, point-wise labels are too tedious to prepare. While some recent methods propose to train a 3D network with small percentages of point labels, we take the approach to an extreme and propose "One Thing One Click," meaning that the annotator only needs to la…

    Submitted 9 September, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

    Comments: Extension of One Thing One Click (CVPR'2021) arXiv:2104.02246

  10. arXiv:2303.13479

    cs.CV

    IST-Net: Prior-free Category-level Pose Estimation with Implicit Space Transformation

    Authors: Jianhui Liu, Yukang Chen, Xiaoqing Ye, Xiaojuan Qi

    Abstract: Category-level 6D pose estimation aims to predict the poses and sizes of unseen objects from a specific category. Thanks to prior deformation, which explicitly adapts a category-specific 3D prior (i.e., a 3D template) to a given object instance, prior-based methods attained great success and have become a major research stream. However, obtaining category-specific priors requires collecting a larg…

    Submitted 19 July, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted by ICCV2023

  11. arXiv:2303.11633

    cs.CV

    Learning Context-aware Classifier for Semantic Segmentation

    Authors: Zhuotao Tian, Jiequan Cui, Li Jiang, Xiaojuan Qi, Xin Lai, Yixin Chen, Shu Liu, Jiaya Jia

    Abstract: Semantic segmentation is still a challenging task for parsing diverse contexts in different scenes, thus the fixed classifier might not be able to well address varying feature distributions during testing. Different from the mainstream literature where the efficacy of strong backbones and effective decoder heads has been well studied, in this paper, additional contextual hints are instead exploite…

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: AAAI 2023. Code and models are available at https://github.com/tianzhuotao/CAC

  12. arXiv:2303.11301

    cs.CV

    VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking

    Authors: Yukang Chen, Jianhui Liu, Xiangyu Zhang, Xiaojuan Qi, Jiaya Jia

    Abstract: 3D object detectors usually rely on hand-crafted proxies, e.g., anchors or centers, and translate well-studied 2D frameworks to 3D. Thus, sparse voxel features need to be densified and processed by dense prediction heads, which inevitably costs extra computation. In this paper, we instead propose VoxelNeXt for fully sparse 3D object detection. Our core insight is to predict objects directly based…

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: In CVPR 2023, Code and models are available at https://github.com/dvlab-research/VoxelNeXt

  13. arXiv:2303.09152

    cs.CV

    Learning a Room with the Occ-SDF Hybrid: Signed Distance Function Mingled with Occupancy Aids Scene Representation

    Authors: Xiaoyang Lyu, Peng Dai, Zizhang Li, Dongyu Yan, Yi Lin, Yifan Peng, Xiaojuan Qi

    Abstract: Implicit neural rendering, which uses signed distance function (SDF) representation with geometric priors (such as depth or surface normal), has led to impressive progress in the surface reconstruction of large-scale scenes. However, applying this method to reconstruct a room-level scene from images may miss structures in low-intensity areas or small and thin objects. We conducted experiments on t…

    Submitted 16 March, 2023; originally announced March 2023.

  14. arXiv:2303.07939

    physics.atom-ph

    Measurement of hyperfine structure and the Zemach radius in $\rm^6Li^+$ using optical Ramsey technique

    Authors: Wei Sun, Pei-Pei Zhang, Peng-peng Zhou, Shao-long Chen, Zhi-qiang Zhou, Yao Huang, Xiao-Qiu Qi, Zong-Chao Yan, Ting-Yun Shi, G. W. F. Drake, Zhen-Xiang Zhong, Hua Guan, Ke-lin Gao

    Abstract: We investigate the $2\,^3\!S_1$--$2\,^3\!P_J$ ($J = 0, 1, 2$) transitions in $\rm^6Li^+$ using the optical Ramsey technique and achieve the most precise values of the hyperfine splittings of the $2\,^3\!S_1$ and $2\,^3\!P_J$ states, with smallest uncertainty of about 10 kHz. The present results reduce the uncertainties of previous experiments by a factor of 5 for the $2\,^3\!S_1$ state and a facto…

    Submitted 18 March, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: 6 pages, 6 figures

  15. arXiv:2303.03910

    hep-ex physics.ins-det

    JUNO sensitivity to $^7$Be, $pep$, and CNO solar neutrinos

    Authors: Angel Abusleme, Thomas Adam, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Muhammad Akram, Abid Aleem, Tsagkarakis Alexandros, Fengpeng An, Qi An, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, Burin Asavapibhop, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Wander Baldini, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Bellato, Marco Beretta, et al. (592 additional authors not shown)

    Abstract: The Jiangmen Underground Neutrino Observatory (JUNO), the first multi-kton liquid scintillator detector, which is under construction in China, will have a unique potential to perform a real-time measurement of solar neutrinos well below the few MeV threshold typical for Water Cherenkov detectors. JUNO's large target mass and excellent energy resolution are prerequisites for reaching unprecedented…

    Submitted 7 March, 2023; originally announced March 2023.

  16. arXiv:2303.01765

    cs.CV

    Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement

    Authors: Xingqun Qi, Chen Liu, Muyi Sun, Lincheng Li, Changjie Fan, Xin Yu

    Abstract: Predicting natural and diverse 3D hand gestures from the upper body dynamics is a practical yet challenging task in virtual avatar creation. Previous works usually overlook the asymmetric motions between two hands and generate two hands in a holistic manner, leading to unnatural results. In this work, we introduce a novel bilateral hand disentanglement based two-stage 3D hand generation method to…

    Submitted 20 March, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  17. arXiv:2303.00369

    cs.CV eess.IV

    Indescribable Multi-modal Spatial Evaluator

    Authors: Lingke Kong, X. Sharon Qi, Qijin Shen, Jiacheng Wang, Jingyi Zhang, Yanle Hu, Qichao Zhou

    Abstract: Multi-modal image registration spatially aligns two images with different distributions. One of its major challenges is that images acquired from different imaging machines have different imaging distributions, making it difficult to focus only on the spatial aspect of the images and ignore differences in distributions. In this study, we developed a self-supervised approach, Indescribable Multi-mo…

    Submitted 1 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR2023

  18. arXiv:2302.10050

    hep-ph hep-ex

    Pionic transitions from $Z_c(4020)$ to $D$ wave charmonia

    Authors: Xiao-Yu Qi, Qi Wu, Dian-Yong Chen

    Abstract: In the present work, we investigate the charmed meson loops contributions to the pionic transitions from $Z_c(4020)^+$ to the $D$ wave triplets charmonia by using an effective Lagrangian approach. Our estimations indicate that the predicted branching fraction of $Z_c(4020)^+ \to \pi^+ \psi(1^3D_J),\ J=(1,2,3)$ are much smaller than the one of $Z_c(4020)^+ \to \pi^+ h_c$. Thus, searching…

    Submitted 16 October, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: 9 pages, 5 figures, accepted for publication in EPJC

  19. arXiv:2302.09621

    eess.IV

    Augmenting endometriosis analysis from ultrasound data with deep learning

    Authors: Adrian Balica, Jennifer Dai, Kayla Piiwaa, Xiao Qi, Ashlee N. Green, Nancy Phillips, Susan Egan, Ilker Hacihaliloglu

    Abstract: Endometriosis is a non-malignant disorder that affects 176 million women globally. Diagnostic delays result in severe dysmenorrhea, dyspareunia, chronic pelvic pain, and infertility. Therefore, there is a significant need to diagnose patients at an early stage. Our objective in this work is to investigate the potential of deep learning methods to classify endometriosis from ultrasound data. Retros…

    Submitted 19 February, 2023; originally announced February 2023.

    Comments: Accepted to 2023 SPIE Medical Imaging Conference

  20. arXiv:2302.06107

    math.RT math.CO math.RA

    Representation type of blocks of cyclotomic Hecke algebras of type $G(r, 1, n)$

    Authors: Yanbo Li, Xiangyu Qi

    Abstract: Let $K$ be an algebraically closed field with $Char K\neq 2$ and $(s_1, s_2, \cdots, s_r)\in \mathbb{Z}^r$ a multicharge with $r>2$. Let $\mathcal {H}_n(q, Q)$ be a cyclotomic Hecke algebra of type $G(r, 1, n)$, where $q\neq 0, 1$ and $Q=(q^{s_1}, q^{s_2}, \cdots, q^{s_r})$. For each block $B$ of $\mathcal {H}_n(q, Q)$, we introduce a new invariant, called block move vector, which can be considere…

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: 66 pages

  21. arXiv:2301.13007

    cs.CV cs.AI cs.LG

    EuclidNet: Deep Visual Reasoning for Constructible Problems in Geometry

    Authors: Man Fai Wong, Xintong Qi, Chee Wei Tan

    Abstract: In this paper, we present a deep learning-based framework for solving geometric construction problems through visual reasoning, which is useful for automated geometry theorem proving. Constructible problems in geometry often ask for the sequence of straightedge-and-compass constructions to construct a given goal given some initial setup. Our EuclidNet framework leverages the neural network archite…

    Submitted 27 December, 2022; originally announced January 2023.

    Comments: Accepted by 2nd MATH-AI Workshop at NeurIPS'22

    Journal ref: Adv. Artif. Intell. Mach. Learn.(2023), 3(1):839-852

  22. arXiv:2301.12576

    cs.LG cs.CR

    Uncovering Adversarial Risks of Test-Time Adaptation

    Authors: Tong Wu, Feiran Jia, Xiangyu Qi, Jiachen T. Wang, Vikash Sehwag, Saeed Mahloujifar, Prateek Mittal

    Abstract: Recently, test-time adaptation (TTA) has been proposed as a promising solution for addressing distribution shifts. It allows a base model to adapt to an unforeseen distribution during inference by leveraging the information from the batch of (unlabeled) test data. However, we uncover a novel security vulnerability of TTA based on the insight that predictions on benign samples can be impacted by ma…

    Submitted 4 February, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

  23. arXiv:2301.09544

    cs.RO cs.CV

    Learning to View: Decision Transformers for Active Object Detection

    Authors: Wenhao Ding, Nathalie Majcherczyk, Mohit Deshpande, Xuewei Qi, Ding Zhao, Rajasimman Madhivanan, Arnie Sen

    Abstract: Active perception describes a broad class of techniques that couple planning and perception systems to move the robot in a way to give the robot more information about the environment. In most robotic systems, perception is typically independent of motion planning. For example, traditional object detection is passive: it operates only on the images it receives. However, we have a chance to improve…

    Submitted 23 January, 2023; originally announced January 2023.

    Comments: Accepted to ICRA 2023

  24. arXiv:2301.01100

    cs.CV cs.LG

    Understanding Imbalanced Semantic Segmentation Through Neural Collapse

    Authors: Zhisheng Zhong, Jiequan Cui, Yibo Yang, Xiaoyang Wu, Xiaojuan Qi, Xiangyu Zhang, Jiaya Jia

    Abstract: A recent study has shown a phenomenon called neural collapse in that the within-class means of features and the classifier weight vectors converge to the vertices of a simplex equiangular tight frame at the terminal phase of training for classification. In this paper, we explore the corresponding structures of the last-layer feature centers and classifiers in semantic segmentation. Based on our em…

    Submitted 3 January, 2023; originally announced January 2023.

    Comments: Technical Report

  25. arXiv:2301.00145

    cs.CV

    Attentional Graph Convolutional Network for Structure-aware Audio-Visual Scene Classification

    Authors: Liguang Zhou, Yuhongze Zhou, Xiaonan Qi, Junjie Hu, Tin Lun Lam, Yangsheng Xu

    Abstract: Audio-Visual scene understanding is a challenging problem due to the unstructured spatial-temporal relations that exist in the audio signals and spatial layouts of different objects and various texture patterns in the visual images. Recently, many studies have focused on abstracting features from convolutional neural networks while the learning of explicit semantically relevant frames of sound sig…

    Submitted 31 December, 2022; originally announced January 2023.

  26. arXiv:2212.13771

    cs.CV

    Exploring Vision Transformers as Diffusion Learners

    Authors: He Cao, Jianan Wang, Tianhe Ren, Xianbiao Qi, Yihao Chen, Yuan Yao, Lei Zhang

    Abstract: Score-based diffusion models have captured widespread attention and funded fast progress of recent vision generative tasks. In this paper, we focus on diffusion model backbone which has been much neglected before. We systematically explore vision Transformers as diffusion learners for various generative tasks. With our improvements the performance of vanilla ViT-based backbone (IU-ViT) is boosted…

    Submitted 28 December, 2022; originally announced December 2022.

  27. Unconventionally Fast Transport through Sliding Dynamics of Rodlike Particles in Macromolecular Networks

    Authors: Xuanyu Zhang, Xiaobin Dai, Md Ahsan Habib, Ziyang Xu, Lijuan Gao, Wenlong Chen, Wenjie Wei, Zhongqiu Tang, Xianyu Qi, Xiangjun Gong, Lingxiang Jiang, Li-Tang Yan

    Abstract: Transport of rodlike particles in confinement environments of macromolecular networks plays crucial roles in many important biological processes and technological applications. The relevant understanding has been limited to thin rods with diameter much smaller than network mesh size, although the opposite case, of which the dynamical behaviors and underlying physical mechanisms remain unclear, is…

    Submitted 19 November, 2023; v1 submitted 26 December, 2022; originally announced December 2022.

  28. arXiv:2212.05537

    cs.SE

    Technical Debt Management in OSS Projects: An Empirical Study on GitHub

    Authors: Zengyang Li, Yilin Peng, Peng Liang, Apostolos Ampatzoglou, Ran Mo, Hui Liu, Xiaoxiao Qi

    Abstract: Technical debt (TD) refers to delayed tasks and immature artifacts that may bring short-term benefits but incur extra costs of change during maintenance and evolution in the long term. TD has been extensively studied in the past decade, and numerous open source software (OSS) projects were used to explore specific aspects of TD and validate various approaches for TD management (TDM). However, ther…

    Submitted 11 December, 2022; originally announced December 2022.

    Comments: 15 pages, 8 images, 10 tables, Manuscript submitted to a Journal (2022)

  29. arXiv:2212.05326

    cs.LG

    Vertical Layering of Quantized Neural Networks for Heterogeneous Inference

    Authors: Hai Wu, Ruifei He, Haoru Tan, Xiaojuan Qi, Kaibin Huang

    Abstract: Although considerable progress has been obtained in neural network quantization for efficient inference, existing methods are not scalable to heterogeneous devices as one dedicated model needs to be trained, transmitted, and stored for one specific hardware setting, incurring considerable costs in model training and maintenance. In this paper, we study a new vertical-layered representation of neur…

    Submitted 10 December, 2022; originally announced December 2022.

    Comments: Submitted to IEEE for possible publication

  30. arXiv:2212.01749

    cs.LG

    Semantic Graph Neural Network with Multi-measure Learning for Semi-supervised Classification

    Authors: Junchao Lin, Yuan Wan, Jingwen Xu, Xingchen Qi

    Abstract: Graph Neural Networks (GNNs) have attracted increasing attention in recent years and have achieved excellent performance in semi-supervised node classification tasks. The success of most GNNs relies on one fundamental assumption, i.e., the original graph structure data is available. However, recent studies have shown that GNNs are vulnerable to the complex underlying structure of the graph, making…

    Submitted 4 December, 2022; originally announced December 2022.

  31. arXiv:2211.16312

    cs.CV

    PLA: Language-Driven Open-Vocabulary 3D Scene Understanding

    Authors: Runyu Ding, Jihan Yang, Chuhui Xue, Wenqing Zhang, Song Bai, Xiaojuan Qi

    Abstract: Open-vocabulary scene understanding aims to localize and recognize unseen categories beyond the annotated label space. The recent breakthrough of 2D open-vocabulary perception is largely driven by Internet-scale paired image-text data with rich vocabulary concepts. However, this success cannot be directly transferred to 3D scenarios due to the inaccessibility of large-scale 3D-text pairs. To this…

    Submitted 22 March, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: CVPR2023

  32. arXiv:2211.15098

    cs.CV

    MGFN: Magnitude-Contrastive Glance-and-Focus Network for Weakly-Supervised Video Anomaly Detection

    Authors: Yingxian Chen, Zhengzhe Liu, Baoheng Zhang, Wilton Fok, Xiaojuan Qi, Yik-Chung Wu

    Abstract: Weakly supervised detection of anomalies in surveillance videos is a challenging task. Going beyond existing works that have deficient capabilities to localize anomalies in long videos, we propose a novel glance and focus network to effectively integrate spatial-temporal information for accurate anomaly detection. In addition, we empirically found that existing approaches that use feature magnitud…

    Submitted 28 November, 2022; originally announced November 2022.

    Report number: AAAI2023

  33. arXiv:2211.11727

    cs.CV cs.LG

    Parametric Classification for Generalized Category Discovery: A Baseline Study

    Authors: Xin Wen, Bingchen Zhao, Xiaojuan Qi

    Abstract: Generalized Category Discovery (GCD) aims to discover novel categories in unlabelled datasets using knowledge learned from labelled samples. Previous studies argued that parametric classifiers are prone to overfitting to seen categories, and endorsed using a non-parametric classifier formed with semi-supervised k-means. However, in this study, we investigate the failure of parametric classifiers,…

    Submitted 15 December, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: v3: ICCV'23 version; v4: updated the dataset table

  34. A $(k+1)$-partite entanglement measure of $N$-partite quantum states

    Authors: Yan Hong, Xianfei Qi, Ting Gao, Fengli Yan

    Abstract: The concept of "the permutationally invariant part of a density matrix" constitutes an important tool for entanglement characterization of multiqubit systems. In this paper, we first present $(k+1)$-partite entanglement measure of $N$-partite quantum system, which possesses desirable properties of an entanglement measure. Moreover, we give strong bounds on this mea…

    Submitted 6 November, 2022; originally announced November 2022.

    Journal ref: Eur. Phys. J. Plus (2023) 138:1081

  35. arXiv:2211.00899

    eess.IV cs.CV

    LightVessel: Exploring Lightweight Coronary Artery Vessel Segmentation via Similarity Knowledge Distillation

    Authors: Hao Dang, Yuekai Zhang, Xingqun Qi, Wanting Zhou, Muyi Sun

    Abstract: In recent years, deep convolution neural networks (DCNNs) have achieved great prospects in coronary artery vessel segmentation. However, it is difficult to deploy complicated models in clinical scenarios since high-performance approaches have excessive parameters and high computation costs. To tackle this problem, we propose LightVessel, a Similarity Knowledge Distillation Framework, for…

    Submitted 25 February, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: 5 pages, 7 figures, conference

  36. arXiv:2210.17002

    cond-mat.mes-hall quant-ph

    Two-band description of the strong `spin'-orbit coupled one-dimensional hole gas in a cylindrical Ge nanowire

    Authors: Rui Li, Xin-Yu Qi

    Abstract: The low-energy effective Hamiltonian of the strong `spin'-orbit coupled one-dimensional hole gas in a cylindrical Ge nanowire in the presence of a strong magnetic field is studied both numerically and analytically. Based on the Luttinger-Kohn Hamiltonian in the spherical approximation, we show this strong `spin'-orbit coupled one-dimensional hole gas can be accurately described by an effective tw…

    Submitted 10 February, 2023; v1 submitted 30 October, 2022; originally announced October 2022.

    Comments: 8 pages, 7 figures

    Journal ref: J. Phys.: Condens. Matter 35, 135302 (2023)

  37. arXiv:2210.16810

    cs.CV

    SL3D: Self-supervised-Self-labeled 3D Recognition

    Authors: Fernando Julio Cendra, Lan Ma, Jiajun Shen, Xiaojuan Qi

    Abstract: Deep learning has attained remarkable success in many 3D visual recognition tasks, including shape classification, object detection, and semantic segmentation. However, many of these results rely on manually collecting densely annotated real-world 3D data, which is highly time-consuming and expensive to obtain, limiting the scalability of 3D recognition tasks. Thus, we study unsupervised 3D recogn…

    Submitted 16 December, 2022; v1 submitted 30 October, 2022; originally announced October 2022.

    Comments: This paper has already been accepted by Neural Information Processing Systems (NeurIPS 2022) Workshop on Self-Supervised Learning: Theory and Practice

  38. arXiv:2210.12262

    cs.LG

    Group Distributionally Robust Reinforcement Learning with Hierarchical Latent Variables

    Authors: Mengdi Xu, Peide Huang, Yaru Niu, Visak Kumar, Jielin Qiu, Chao Fang, Kuan-Hui Lee, Xuewei Qi, Henry Lam, Bo Li, Ding Zhao

    Abstract: One key challenge for multi-task Reinforcement learning (RL) in practice is the absence of task indicators. Robust RL has been applied to deal with task ambiguity, but may result in over-conservative policies. To balance the worst-case (robustness) and average performance, we propose Group Distributionally Robust Markov Decision Process (GDR-MDP), a flexible hierarchical MDP formulation that encod…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 27 pages, 10 figures

  39. arXiv:2210.09509

    cs.CV

    Deep Data Augmentation for Weed Recognition Enhancement: A Diffusion Probabilistic Model and Transfer Learning Based Approach

    Authors: Dong Chen, Xinda Qi, Yu Zheng, Yuzhen Lu, Zhaojian Li

    Abstract: Weed management plays an important role in many modern agricultural applications. Conventional weed control methods mainly rely on chemical herbicides or hand weeding, which are often cost-ineffective, environmentally unfriendly, or even posing a threat to food safety and human health. Recently, automated/robotic weeding using machine vision systems has seen increased research attention with its p…

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: 15 pages, 9 figures

  40. arXiv:2210.07574

    cs.CV

    Is synthetic data from generative models ready for image recognition?

    Authors: Ruifei He, Shuyang Sun, Xin Yu, Chuhui Xue, Wenqing Zhang, Philip Torr, Song Bai, Xiaojuan Qi

    Abstract: Recent text-to-image generation models have shown promising results in generating high-fidelity photo-realistic images. Though the results are astonishing to human eyes, how applicable these generated images are for recognition tasks remains under-explored. In this work, we extensively study whether and how synthetic images generated from state-of-the-art text-to-image generation models can be use…

    Submitted 15 February, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: ICLR 2023, spotlight

  41. arXiv:2210.05593

    cs.CV

    Prototypical VoteNet for Few-Shot 3D Point Cloud Object Detection

    Authors: Shizhen Zhao, Xiaojuan Qi

    Abstract: Most existing 3D point cloud object detection approaches heavily rely on large amounts of labeled training data. However, the labeling process is costly and time-consuming. This paper considers few-shot 3D point cloud object detection, where only a few annotated samples of novel classes are needed with abundant samples of base classes. To this end, we propose Prototypical VoteNet to recognize and…

    Submitted 21 December, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  42. arXiv:2210.03555  [pdf]

    cs.IT cs.AI

    In-situ Model Downloading to Realize Versatile Edge AI in 6G Mobile Networks

    Authors: Kaibin Huang, Hai Wu, Zhiyan Liu, Xiaojuan Qi

    Abstract: The sixth-generation (6G) mobile networks are expected to feature the ubiquitous deployment of machine learning and AI algorithms at the network edge. With rapid advancements in edge AI, the time has come to realize intelligence downloading onto edge devices (e.g., smartphones and sensors). To materialize this vision, we propose a novel technology in this article, called in-situ model downloading…

    Submitted 2 April, 2023; v1 submitted 7 October, 2022; originally announced October 2022.

    Comments: To appear in IEEE Wireless Communications

  43. arXiv:2209.14201  [pdf, other]

    cs.CV

    Spatial Pruned Sparse Convolution for Efficient 3D Object Detection

    Authors: Jianhui Liu, Yukang Chen, Xiaoqing Ye, Zhuotao Tian, Xiao Tan, Xiaojuan Qi

    Abstract: 3D scenes are dominated by a large number of background points, which are redundant for the detection task that mainly needs to focus on foreground objects. In this paper, we analyze major components of existing sparse 3D CNNs and find that 3D CNNs ignore the redundancy of data and further amplify it in the down-sampling process, which brings a huge amount of extra and unnecessary computational ove…

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: Accepted by NeurIPS 2022

  44. arXiv:2209.13103  [pdf]

    cs.SI stat.ME

    A Review: Random Walk in Graph Sampling

    Authors: Xiao Qi

    Abstract: Graph sampling is a technique to pick a subset of vertices and/or edges from the original graph. Among various graph sampling approaches, Traversal Based Sampling (TBS) methods are widely used due to their low cost and feasibility in many cases, and Simple Random Walk (SRW) and its variants account for a large proportion of TBS. We illustrate the foundation of SRW and present the problems of SRW. Based on the proble…

    Submitted 26 September, 2022; originally announced September 2022.

  45. arXiv:2209.12804  [pdf]

    stat.ME stat.AP

    Efficient Random Walk based Sampling with Inverse Degree

    Authors: Xiao Qi

    Abstract: Random walk sampling methods have been widely used in graph sampling in recent years, but they are biased towards higher-degree nodes in the sample. To overcome this deficiency, classical methods such as MHRW design weighted walking by repeating low-degree nodes while rejecting high-degree nodes, so that the long-term behavior of the Markov chain can achieve a uniform distribution. This modification, howe…

    Submitted 26 September, 2022; originally announced September 2022.

  46. arXiv:2209.12797  [pdf, other]

    cs.CV

    Rethinking Resolution in the Context of Efficient Video Recognition

    Authors: Chuofan Ma, Qiushan Guo, Yi Jiang, Zehuan Yuan, Ping Luo, Xiaojuan Qi

    Abstract: In this paper, we empirically study how to make the most of low-resolution frames for efficient video recognition. Existing methods mainly focus on developing compact networks or alleviating temporal redundancy of video inputs to increase efficiency, whereas compressing frame resolution has rarely been considered a promising solution. A major concern is the poor recognition accuracy on low-resolut…

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: Accepted by NIPS2022

  47. arXiv:2209.12767  [pdf]

    stat.ME stat.AP

    Weighted Jump in Random Walk Graph Sampling

    Authors: Xiao Qi

    Abstract: Random walk based sampling methods have been widely used in graph sampling in recent years, but they are biased towards higher-degree nodes in the sample. To overcome this deficiency, classical methods such as GMD modify the topology of target graphs so that the long-term behavior of the Markov chain can achieve a uniform distribution. This modification, however, reduces the conductance of graphs, thus ma…

    Submitted 26 September, 2022; originally announced September 2022.

  48. arXiv:2209.04145  [pdf, other]

    cs.CV

    ISS: Image as Stepping Stone for Text-Guided 3D Shape Generation

    Authors: Zhengzhe Liu, Peng Dai, Ruihui Li, Xiaojuan Qi, Chi-Wing Fu

    Abstract: Text-guided 3D shape generation remains challenging due to the absence of large paired text-shape data, the substantial semantic gap between these two modalities, and the structural complexity of 3D shapes. This paper presents a new framework called Image as Stepping Stone (ISS) for the task by introducing 2D image as a stepping stone to connect the two modalities and to eliminate the need for pai…

    Submitted 23 February, 2023; v1 submitted 9 September, 2022; originally announced September 2022.

    Comments: ICLR 2023 spotlight

  49. arXiv:2209.02940  [pdf, other]

    hep-th

    Emergent bulk gauge field in random tensor networks

    Authors: Xiao-Liang Qi

    Abstract: Random tensor network states are toy models for holographic duality, which have entanglement properties determined by graph geometry. In this paper, we propose a generalization of the random tensor network states which describe an ensemble of states preserving a given global symmetry. We show that Renyi entropy for this family of states can be described by a quantum extremal surface formula, with…

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: A paper contributed to "A Festschrift in Honor of the C. N. Yang Centenary". 17 pages. 3 figures

  50. arXiv:2208.13953  [pdf, other]

    physics.optics

    Imaginary coupling induced Dirac points and group velocity control in non-reciprocal Hermitian Lattice

    Authors: Yuandan Wang, Junhao Yang, Yu Dang, Haohao Wang, Guoguo Xin, Xinyuan Qi

    Abstract: We propose a mechanism to achieve the group velocity control of bifurcation light via an imaginary coupling effect in the non-reciprocal lattice. The physical model is composed of two-layer photonic lattices with non-reciprocal coupling in each unit cell, which can support a real energy spectrum with a pair of Dirac points in the first Brillouin zone due to the Hermiticity. Furthermore, we show that…

    Submitted 2 September, 2022; v1 submitted 29 August, 2022; originally announced August 2022.