Skip to main content

Showing 1–18 of 18 results for author: Xian, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10900  [pdf, other

    cs.CV cs.CL

    AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models

    Authors: Xiyang Wu, Tianrui Guan, Dianqi Li, Shuaiyi Huang, Xiaoyu Liu, Xijun Wang, Ruiqi Xian, Abhinav Shrivastava, Furong Huang, Jordan Lee Boyd-Graber, Tianyi Zhou, Dinesh Manocha

    Abstract: Large vision-language models (LVLMs) hallucinate: certain context cues in an image may trigger the language module's overconfident and incorrect reasoning on abnormal or hypothetical objects. Though a few benchmarks have been developed to investigate LVLM hallucinations, they mainly rely on hand-crafted corner cases whose fail patterns may hardly generalize, and finetuning on them could undermine… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2405.04034  [pdf, other

    cs.LG cs.CR cs.CY

    Differentially Private Post-Processing for Fair Regression

    Authors: Ruicheng Xian, Qiaobo Li, Gautam Kamath, Han Zhao

    Abstract: This paper describes a differentially private post-processing algorithm for learning fair regressors satisfying statistical parity, addressing privacy concerns of machine learning models trained on sensitive data, as well as fairness concerns of their potential to propagate historical biases. Our algorithm can be applied to post-process any given regressor to improve fairness by remapping its outp… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: ICML 2024. Code is at https://github.com/rxian/fair-regression

  3. arXiv:2405.04025  [pdf, other

    cs.LG cs.CY

    Optimal Group Fair Classifiers from Linear Post-Processing

    Authors: Ruicheng Xian, Han Zhao

    Abstract: We propose a post-processing algorithm for fair classification that mitigates model bias under a unified family of group fairness criteria covering statistical parity, equal opportunity, and equalized odds, applicable to multi-class problems and both attribute-aware and attribute-blind settings. It achieves fairness by re-calibrating the output score of the given base model with a "fairness cost"… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Code is at https://github.com/rxian/fair-classification

  4. arXiv:2404.03187  [pdf, other

    cs.CV

    AGL-NET: Aerial-Ground Cross-Modal Global Localization with Varying Scales

    Authors: Tianrui Guan, Ruiqi Xian, Xijun Wang, Xiyang Wu, Mohamed Elnoor, Daeun Song, Dinesh Manocha

    Abstract: We present AGL-NET, a novel learning-based method for global localization using LiDAR point clouds and satellite maps. AGL-NET tackles two critical challenges: bridging the representation gap between image and points modalities for robust feature matching, and handling inherent scale discrepancies between global view and local view. To address these challenges, AGL-NET leverages a unified network… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  5. arXiv:2402.10527  [pdf, other

    cs.CL cs.CR stat.AP

    Zero-shot sampling of adversarial entities in biomedical question answering

    Authors: R. Patrick Xian, Alex J. Lee, Vincent Wang, Qiming Cui, Russell Ro, Reza Abbasi-Asl

    Abstract: The increasing depth of parametric domain knowledge in large language models (LLMs) is fueling their rapid deployment in real-world applications. In high-stakes and knowledge-intensive tasks, understanding model vulnerabilities is essential for quantifying the trustworthiness of model predictions and regulating their use. The recent discovery of named entities as adversarial examples in natural la… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 20 pages incl. appendix, under review

  6. arXiv:2402.10340  [pdf, other

    cs.RO cs.AI

    Highlighting the Safety Concerns of Deploying LLMs/VLMs in Robotics

    Authors: Xiyang Wu, Souradip Chakraborty, Ruiqi Xian, Jing Liang, Tianrui Guan, Fuxiao Liu, Brian M. Sadler, Dinesh Manocha, Amrit Singh Bedi

    Abstract: In this paper, we highlight the critical issues of robustness and safety associated with integrating large language models (LLMs) and vision-language models (VLMs) into robotics applications. Recent works focus on using LLMs and VLMs to improve the performance of robotics tasks, such as manipulation and navigation. Despite these improvements, analyzing the safety of such systems remains underexplo… ▽ More

    Submitted 16 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  7. arXiv:2310.14566  [pdf, other

    cs.CV cs.CL

    HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models

    Authors: Tianrui Guan, Fuxiao Liu, Xiyang Wu, Ruiqi Xian, Zongxia Li, Xiaoyu Liu, Xijun Wang, Lichang Chen, Furong Huang, Yaser Yacoob, Dinesh Manocha, Tianyi Zhou

    Abstract: We introduce HallusionBench, a comprehensive benchmark designed for the evaluation of image-context reasoning. This benchmark presents significant challenges to advanced large visual-language models (LVLMs), such as GPT-4V(Vision), Gemini Pro Vision, Claude 3, and LLaVA-1.5, by emphasizing nuanced understanding and interpretation of visual data. The benchmark comprises 346 images paired with 1129… ▽ More

    Submitted 25 March, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Accepted to CVPR 2024

  8. arXiv:2308.13985  [pdf, other

    cs.LG cs.AI

    Revisiting Scalarization in Multi-Task Learning: A Theoretical Perspective

    Authors: Yuzheng Hu, Ruicheng Xian, Qilong Wu, Qiuling Fan, Lang Yin, Han Zhao

    Abstract: Linear scalarization, i.e., combining all loss functions by a weighted sum, has been the default choice in the literature of multi-task learning (MTL) since its inception. In recent years, there is a surge of interest in developing Specialized Multi-Task Optimizers (SMTOs) that treat MTL as a multi-objective optimization problem. However, it remains open whether there is a fundamental advantage of… ▽ More

    Submitted 22 September, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

    Comments: Accepted at NeurIPS 2023

  9. arXiv:2306.12272  [pdf, other

    cond-mat.mtrl-sci cs.CE cs.LG math.CO

    From structure mining to unsupervised exploration of atomic octahedral networks

    Authors: R. Patrick Xian, Ryan J. Morelock, Ido Hadar, Charles B. Musgrave, Christopher Sutton

    Abstract: Networks of atom-centered coordination octahedra commonly occur in inorganic and hybrid solid-state materials. Characterizing their spatial arrangements and characteristics is crucial for relating structures to properties for many materials families. The traditional method using case-by-case inspection becomes prohibitive for discovering trends and similarities in large datasets. Here, we operatio… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: 56 pages

  10. arXiv:2305.12437  [pdf, other

    cs.CV

    PLAR: Prompt Learning for Action Recognition

    Authors: Xijun Wang, Ruiqi Xian, Tianrui Guan, Dinesh Manocha

    Abstract: We present a new general learning approach, Prompt Learning for Action Recognition (PLAR), which leverages the strengths of prompt learning to guide the learning process. Our approach is designed to predict the action label by helping the models focus on the descriptions or instructions associated with actions in the input videos. Our formulation uses various prompts, including learnable prompts,… ▽ More

    Submitted 14 November, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

  11. arXiv:2304.06866  [pdf, other

    cs.CV

    PMI Sampler: Patch Similarity Guided Frame Selection for Aerial Action Recognition

    Authors: Ruiqi Xian, Xijun Wang, Divya Kothandaraman, Dinesh Manocha

    Abstract: We present a new algorithm for selection of informative frames in video action recognition. Our approach is designed for aerial videos captured using a moving camera where human actors occupy a small spatial resolution of video frames. Our algorithm utilizes the motion bias within aerial videos, which enables the selection of motion-salient frames. We introduce the concept of patch mutual informat… ▽ More

    Submitted 15 November, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

  12. arXiv:2303.02575  [pdf, other

    cs.CV cs.RO

    MITFAS: Mutual Information based Temporal Feature Alignment and Sampling for Aerial Video Action Recognition

    Authors: Ruiqi Xian, Xijun Wang, Dinesh Manocha

    Abstract: We present a novel approach for action recognition in UAV videos. Our formulation is designed to handle occlusion and viewpoint changes caused by the movement of a UAV. We use the concept of mutual information to compute and align the regions corresponding to human action or motion in the temporal domain. This enables our recognition model to learn from the key features associated with the motion.… ▽ More

    Submitted 15 November, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

  13. AZTR: Aerial Video Action Recognition with Auto Zoom and Temporal Reasoning

    Authors: Xijun Wang, Ruiqi Xian, Tianrui Guan, Celso M. de Melo, Stephen M. Nogar, Aniket Bera, Dinesh Manocha

    Abstract: We propose a novel approach for aerial video action recognition. Our method is designed for videos captured using UAVs and can run on edge or mobile devices. We present a learning-based approach that uses customized auto zoom to automatically identify the human target and scale it appropriately. This makes it easier to extract the key features and reduces the computational overhead. We also presen… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: Accepted for publication at ICRA 2023

  14. arXiv:2212.10764  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    Learning List-Level Domain-Invariant Representations for Ranking

    Authors: Ruicheng Xian, Honglei Zhuang, Zhen Qin, Hamed Zamani, Jing Lu, Ji Ma, Kai Hui, Han Zhao, Xuanhui Wang, Michael Bendersky

    Abstract: Domain adaptation aims to transfer the knowledge learned on (data-rich) source domains to (low-resource) target domains, and a popular method is invariant representation learning, which matches and aligns the data distributions on the feature space. Although this method is studied extensively and applied on classification and regression problems, its adoption on ranking problems is sporadic, and t… ▽ More

    Submitted 31 October, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: NeurIPS 2023. Comparison to v1: revised presentation and proof of Corollary 4.9

  15. arXiv:2211.01528  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    Fair and Optimal Classification via Post-Processing

    Authors: Ruicheng Xian, Lang Yin, Han Zhao

    Abstract: To mitigate the bias exhibited by machine learning models, fairness criteria can be integrated into the training process to ensure fair treatment across all demographics, but it often comes at the expense of model performance. Understanding such tradeoffs, therefore, underlies the design of fair algorithms. To this end, this paper provides a complete characterization of the inherent tradeoff of de… ▽ More

    Submitted 5 June, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: ICML 2023. Code is at https://github.com/rxian/fair-classification. Comparison to v2: corrected proof of Theorem 4.4

  16. arXiv:1910.06956  [pdf, ps, other

    cs.LG stat.ML

    Neural tangent kernels, transportation mappings, and universal approximation

    Authors: Ziwei Ji, Matus Telgarsky, Ruicheng Xian

    Abstract: This paper establishes rates of universal approximation for the shallow neural tangent kernel (NTK): network weights are only allowed microscopic changes from random initialization, which entails that activations are mostly unchanged, and the network is nearly equivalent to its linearization. Concretely, the paper has two main contributions: a generic scheme to approximate functions with the NTK b… ▽ More

    Submitted 14 February, 2020; v1 submitted 15 October, 2019; originally announced October 2019.

  17. arXiv:1906.07709   

    cs.LG stat.ML

    Approximation power of random neural networks

    Authors: Bolton Bailey, Ziwei Ji, Matus Telgarsky, Ruicheng Xian

    Abstract: This paper investigates the approximation power of three types of random neural networks: (a) infinite width networks, with weights following an arbitrary distribution; (b) finite width networks obtained by subsampling the preceding infinite width networks; (c) finite width networks obtained by starting with standard Gaussian initialization, and then adding a vanishingly small correction to the we… ▽ More

    Submitted 17 October, 2019; v1 submitted 18 June, 2019; originally announced June 2019.

    Comments: This submission constitutes a poor approach to the problem, and has no scientific purpose. A superior (different) approach (and stronger final result, also treating the NTK) has appeared in arXiv:1910.06956 ; please see that work instead

  18. arXiv:1901.07064  [pdf, other

    cs.DB

    Predictive Indexing

    Authors: Joy Arulraj, Ran Xian, Lin Ma, Andrew Pavlo

    Abstract: There has been considerable research on automated index tuning in database management systems (DBMSs). But the majority of these solutions tune the index configuration by retrospectively making computationally expensive physical design changes all at once. Such changes degrade the DBMS's performance during the process, and have reduced utility during subsequent query processing due to the delay be… ▽ More

    Submitted 21 January, 2019; originally announced January 2019.

    Comments: 12 pages

    ACM Class: H.2.2; H.2.4