Showing 1–50 of 56 results for author: Ke, Q

Searching in archive cs.
  1. Weakly Contrastive Learning via Batch Instance Discrimination and Feature Clustering for Small Sample SAR ATR

    Authors: Yikui Zhai, Wenlve Zhou, Bing Sun, Jingwen Li, Qirui Ke, Zilu Ying, Junying Gan, Chaoyun Mai, Ruggero Donida Labati, Vincenzo Piuri, Fabio Scotti

    Abstract: In recent years, impressive performance of deep learning technology has been recognized in Synthetic Aperture Radar (SAR) Automatic Target Recognition (ATR). Since a large amount of annotated data is required in this technique, it poses a trenchant challenge to the issue of obtaining a high recognition rate through less labeled data. To overcome this problem, inspired by the contrastive learning,…

    Submitted 7 August, 2024; originally announced August 2024.

  2. arXiv:2407.06064  [pdf, other]

    eess.IV cs.CV

    Pan-denoising: Guided Hyperspectral Image Denoising via Weighted Represent Coefficient Total Variation

    Authors: Shuang Xu, Qiao Ke, Jiangjun Peng, Xiangyong Cao, Zixiang Zhao

    Abstract: This paper introduces a novel paradigm for hyperspectral image (HSI) denoising, which is termed pan-denoising. In a given scene, panchromatic (PAN) images capture similar structures and textures to HSIs but with less noise. This enables the utilization of PAN images to guide the HSI denoising process. Consequently, pan-denoising, which incorporates an additional prior, has the potential t…

    Submitted 8 July, 2024; originally announced July 2024.

  3. arXiv:2406.13327  [pdf, other]

    cs.CV

    Part-aware Unified Representation of Language and Skeleton for Zero-shot Action Recognition

    Authors: Anqi Zhu, Qiuhong Ke, Mingming Gong, James Bailey

    Abstract: While remarkable progress has been made on supervised skeleton-based action recognition, the challenge of zero-shot recognition remains relatively unexplored. In this paper, we argue that relying solely on aligning label-level semantics and global skeleton features is insufficient to effectively transfer locally consistent visual knowledge from seen to unseen classes. To address this limitation, w…

    Submitted 19 June, 2024; originally announced June 2024.

  4. arXiv:2405.20633  [pdf, other]

    cs.CV

    Action-OOD: An End-to-End Skeleton-Based Model for Robust Out-of-Distribution Human Action Detection

    Authors: Jing Xu, Anqi Zhu, Jingyu Lin, Qiuhong Ke, Cunjian Chen

    Abstract: Human action recognition is a crucial task in computer vision systems. However, in real-world scenarios, human actions often fall outside the distribution of training data, requiring a model to both recognize in-distribution (ID) actions and reject out-of-distribution (OOD) ones. Despite its importance, there has been limited research on OOD detection in human actions. Existing works on OOD detect…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Under consideration at Computer Vision and Image Understanding

  5. arXiv:2405.11336  [pdf, other]

    cs.CV

    UPAM: Unified Prompt Attack in Text-to-Image Generation Models Against Both Textual Filters and Visual Checkers

    Authors: Duo Peng, Qiuhong Ke, Jun Liu

    Abstract: Text-to-Image (T2I) models have raised security concerns due to their potential to generate inappropriate or harmful images. In this paper, we propose UPAM, a novel framework that investigates the robustness of T2I models from the attack perspective. Unlike most existing attack methods that focus on deceiving textual defenses, UPAM aims to deceive both textual and visual defenses in T2I models. UP…

    Submitted 25 May, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML2024

    ACM Class: I.2.6

  6. arXiv:2405.05791  [pdf, other]

    cs.CV

    Sequential Amodal Segmentation via Cumulative Occlusion Learning

    Authors: Jiayang Ao, Qiuhong Ke, Krista A. Ehinger

    Abstract: To fully understand the 3D context of a single image, a visual system must be able to segment both the visible and occluded regions of objects, while discerning their occlusion order. Ideally, the system should be able to handle any object and not be restricted to segmenting a limited set of object classes, especially in robotic applications. Addressing this need, we introduce a diffusion model wi…

    Submitted 9 May, 2024; originally announced May 2024.

  7. arXiv:2401.06445  [pdf, other]

    physics.soc-ph cs.SI

    Directed network comparison using motifs

    Authors: Chenwei Xie, Qiao Ke, Haoyu Chen, Chuang Liu, Xiu-Xiu Zhan

    Abstract: Analyzing and characterizing the differences between networks is a fundamental and challenging problem in network science. Previously, most network comparison methods that rely on topological properties have been restricted to measuring differences between two undirected networks. However, many networks, such as biological networks, social networks, and transportation networks, exhibit inherent di…

    Submitted 12 January, 2024; originally announced January 2024.

  8. arXiv:2401.01510  [pdf, other]

    cs.CV

    Answering from Sure to Uncertain: Uncertainty-Aware Curriculum Learning for Video Question Answering

    Authors: Haopeng Li, Qiuhong Ke, Mingming Gong, Tom Drummond

    Abstract: While significant advancements have been made in video question answering (VideoQA), the potential benefits of enhancing model generalization through tailored difficulty scheduling have been largely overlooked in existing research. This paper seeks to bridge that gap by incorporating VideoQA into a curriculum learning (CL) framework that progressively trains models from simpler to more complex dat…

    Submitted 2 January, 2024; originally announced January 2024.

  9. arXiv:2401.01505  [pdf, other]

    cs.CV

    Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports

    Authors: Haopeng Li, Andong Deng, Qiuhong Ke, Jun Liu, Hossein Rahmani, Yulan Guo, Bernt Schiele, Chen Chen

    Abstract: Reasoning over sports videos for question answering is an important task with numerous applications, such as player training and information retrieval. However, this task has not been explored due to the lack of relevant datasets and the challenging nature it presents. Most datasets for video question answering (VideoQA) focus mainly on general and coarse-grained understanding of daily-life videos…

    Submitted 14 February, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

  10. arXiv:2311.03943  [pdf, other]

    cs.CV

    CLIP Guided Image-perceptive Prompt Learning for Image Enhancement

    Authors: Weiwen Chen, Qiuhong Ke, Zinuo Li

    Abstract: Image enhancement is a significant research area in the fields of computer vision and image processing. In recent years, many learning-based methods for image enhancement have been developed, where the Look-up-table (LUT) has proven to be an effective tool. In this paper, we delve into the potential of Contrastive Language-Image Pre-Training (CLIP) Guided Prompt Learning, proposing a simple struct…

    Submitted 22 November, 2023; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: A trial work to the image enhancement

  11. arXiv:2308.13893  [pdf, other]

    cs.CV

    Unsupervised Domain Adaptation via Domain-Adaptive Diffusion

    Authors: Duo Peng, Qiuhong Ke, Yinjie Lei, Jun Liu

    Abstract: Unsupervised Domain Adaptation (UDA) is quite challenging due to the large distribution discrepancy between the source domain and the target domain. Inspired by diffusion models which have strong capability to gradually convert data distributions across a large gap, we consider to explore the diffusion technique to handle the challenging UDA task. However, using diffusion models to convert data di…

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: 11 pages, 4 figures

  12. arXiv:2308.12350  [pdf, other]

    cs.CV

    Diffusion-based Image Translation with Label Guidance for Domain Adaptive Semantic Segmentation

    Authors: Duo Peng, Ping Hu, Qiuhong Ke, Jun Liu

    Abstract: Translating images from a source domain to a target domain for learning target models is one of the most common strategies in domain adaptive semantic segmentation (DASS). However, existing methods still struggle to preserve semantically-consistent local details between the original and translated images. In this work, we present an innovative approach that addresses this challenge by using source…

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV2023

  13. arXiv:2306.16643  [pdf]

    cs.DL cs.SI physics.soc-ph

    Cautious explorers generate more future academic impact

    Authors: Xingsheng Yang, Zhaoru Ke, Qing Ke, Haipeng Zhang, Fengnan Gao

    Abstract: Some scientists are more likely to explore unfamiliar research topics while others tend to exploit existing ones. In previous work, correlations have been found between scientists' topic choices and their career performances. However, literature has yet to untangle the intricate interplay between scientific impact and research topic choices, where scientific exploration and exploitation intertwine…

    Submitted 29 June, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: 16 pages of main text and 94 pages of supplementary information. v2: Added page number and fixed typo in author list

  14. arXiv:2306.13897  [pdf, other]

    cs.CY cs.AI

    ICN: Interactive Convolutional Network for Forecasting Travel Demand of Shared Micromobility

    Authors: Yiming Xu, Qian Ke, Xiaojian Zhang, Xilei Zhao

    Abstract: Accurate shared micromobility demand predictions are essential for transportation planning and management. Although deep learning models provide powerful tools to deal with demand prediction problems, studies on forecasting highly-accurate spatiotemporal shared micromobility demand are still lacking. This paper proposes a deep learning model named Interactive Convolutional Network (ICN) to forecas…

    Submitted 24 June, 2023; originally announced June 2023.

  15. arXiv:2304.06724  [pdf, other]

    cs.CR cs.CV cs.LG

    GradMDM: Adversarial Attack on Dynamic Networks

    Authors: Jianhong Pan, Lin Geng Foo, Qichen Zheng, Zhipeng Fan, Hossein Rahmani, Qiuhong Ke, Jun Liu

    Abstract: Dynamic neural networks can greatly reduce computation redundancy without compromising accuracy by adapting their structures based on the input. In this paper, we explore the robustness of dynamic neural networks against energy-oriented attacks targeted at reducing their efficiency. Specifically, we attack dynamic models with our novel algorithm GradMDM. GradMDM is a technique that adjusts the dir…

    Submitted 1 April, 2023; originally announced April 2023.

    Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

  16. arXiv:2304.00280  [pdf, other]

    cs.CV

    Progressive Channel-Shrinking Network

    Authors: Jianhong Pan, Siyuan Yang, Lin Geng Foo, Qiuhong Ke, Hossein Rahmani, Zhipeng Fan, Jun Liu

    Abstract: Currently, salience-based channel pruning makes continuous breakthroughs in network compression. In the realization, the salience mechanism is used as a metric of channel salience to guide pruning. Therefore, salience-based channel pruning can dynamically adjust the channel width at run-time, which provides a flexible pruning scheme. However, there are two problems emerging: a gating function is o…

    Submitted 1 April, 2023; originally announced April 2023.

  17. arXiv:2303.06596  [pdf, other]

    cs.CV cs.LG

    Amodal Intra-class Instance Segmentation: Synthetic Datasets and Benchmark

    Authors: Jiayang Ao, Qiuhong Ke, Krista A. Ehinger

    Abstract: Images of realistic scenes often contain intra-class objects that are heavily occluded from each other, making the amodal perception task that requires parsing the occluded parts of the objects challenging. Although important for downstream tasks such as robotic grasping systems, the lack of large-scale amodal datasets with detailed annotations makes it difficult to model intra-class occlusions ex…

    Submitted 7 November, 2023; v1 submitted 12 March, 2023; originally announced March 2023.

    Comments: Accepted at WACV 2024. Datasets are available at https://github.com/saraao/amodal-dataset

  18. arXiv:2303.01692  [pdf, other]

    cs.LG cs.AI cs.CY

    Travel Demand Forecasting: A Fair AI Approach

    Authors: Xiaojian Zhang, Qian Ke, Xilei Zhao

    Abstract: Artificial Intelligence (AI) and machine learning have been increasingly adopted for travel demand forecasting. The AI-based travel demand forecasting models, though generate accurate predictions, may produce prediction biases and raise fairness issues. Using such biased models for decision-making may lead to transportation policies that exacerbate social inequalities. However, limited studies hav…

    Submitted 25 September, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: improved the methodology; updated new contents

  19. arXiv:2211.16940  [pdf, other]

    cs.CV

    DiffPose: Toward More Reliable 3D Pose Estimation

    Authors: Jia Gong, Lin Geng Foo, Zhipeng Fan, Qiuhong Ke, Hossein Rahmani, Jun Liu

    Abstract: Monocular 3D human pose estimation is quite challenging due to the inherent ambiguity and occlusion, which often lead to high uncertainty and indeterminacy. On the other hand, diffusion models have recently emerged as an effective tool for generating high-quality images from noise. Inspired by their capability, we explore a novel pose estimation framework (DiffPose) that formulates 3D pose estimat…

    Submitted 9 April, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: Accepted to CVPR 2023

  20. arXiv:2211.02883  [pdf, other]

    cs.CV

    Unified Multi-View Orthonormal Non-Negative Graph Based Clustering Framework

    Authors: Liangchen Liu, Qiuhong Ke, Chaojie Li, Feiping Nie, Yingying Zhu

    Abstract: Spectral clustering is an effective methodology for unsupervised learning. Most traditional spectral clustering algorithms involve a separate two-step procedure and apply the transformed new representations for the final clustering results. Recently, much progress has been made to utilize the non-negative feature property in real-world data and to jointly learn the representation and clustering re…

    Submitted 1 December, 2022; v1 submitted 3 November, 2022; originally announced November 2022.

  21. arXiv:2209.13204  [pdf, other]

    cs.CV cs.GR

    NEURAL MARIONETTE: A Transformer-based Multi-action Human Motion Synthesis System

    Authors: Weiqiang Wang, Xuefei Zhe, Qiuhong Ke, Di Kang, Tingguang Li, Ruizhi Chen, Linchao Bao

    Abstract: We present a neural network-based system for long-term, multi-action human motion synthesis. The system, dubbed as NEURAL MARIONETTE, can produce high-quality and meaningful motions with smooth transitions from simple user input, including a sequence of action tags with expected action duration, and optionally a hand-drawn moving trajectory if the user specifies. The core of our system is a novel…

    Submitted 27 November, 2023; v1 submitted 27 September, 2022; originally announced September 2022.

  22. arXiv:2209.10073  [pdf, other]

    cs.CV

    Adaptive Local-Component-aware Graph Convolutional Network for One-shot Skeleton-based Action Recognition

    Authors: Anqi Zhu, Qiuhong Ke, Mingming Gong, James Bailey

    Abstract: Skeleton-based action recognition receives increasing attention because the skeleton representations reduce the amount of training data by eliminating visual information irrelevant to actions. To further improve the sample efficiency, meta-learning-based one-shot learning solutions were developed for skeleton-based action recognition. These methods find the nearest neighbor according to the simila…

    Submitted 20 September, 2022; originally announced September 2022.

  23. arXiv:2209.01425  [pdf, other]

    cs.CV

    Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition

    Authors: Tianjiao Li, Lin Geng Foo, Qiuhong Ke, Hossein Rahmani, Anran Wang, Jinghua Wang, Jun Liu

    Abstract: The goal of fine-grained action recognition is to successfully discriminate between action categories with subtle differences. To tackle this, we derive inspiration from the human visual system which contains specialized regions in the brain that are dedicated towards handling specific tasks. We design a novel Dynamic Spatio-Temporal Specialization (DSTS) module, which consists of specialized neur…

    Submitted 3 September, 2022; originally announced September 2022.

    Comments: Accepted to ECCV 2022

  24. arXiv:2207.12100  [pdf, other]

    cs.CV

    IGFormer: Interaction Graph Transformer for Skeleton-based Human Interaction Recognition

    Authors: Yunsheng Pang, Qiuhong Ke, Hossein Rahmani, James Bailey, Jun Liu

    Abstract: Human interaction recognition is very important in many applications. One crucial cue in recognizing an interaction is the interactive body parts. In this work, we propose a novel Interaction Graph Transformer (IGFormer) network for skeleton-based interaction recognition via modeling the interactive body parts as graphs. More specifically, the proposed IGFormer constructs interaction graphs accord…

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted by ECCV 2022

  25. arXiv:2207.09675  [pdf, other]

    cs.CV

    ERA: Expert Retrieval and Assembly for Early Action Prediction

    Authors: Lin Geng Foo, Tianjiao Li, Hossein Rahmani, Qiuhong Ke, Jun Liu

    Abstract: Early action prediction aims to successfully predict the class label of an action before it is completely performed. This is a challenging task because the beginning stages of different actions can be very similar, with only minor subtle differences for discrimination. In this paper, we propose a novel Expert Retrieval and Assembly (ERA) module that retrieves and assembles a set of experts most sp…

    Submitted 22 July, 2022; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022

  26. Image Amodal Completion: A Survey

    Authors: Jiayang Ao, Qiuhong Ke, Krista A. Ehinger

    Abstract: Existing computer vision systems can compete with humans in understanding the visible parts of objects, but still fall far short of humans when it comes to depicting the invisible parts of partially occluded objects. Image amodal completion aims to equip computers with human-like amodal completion functions to understand an intact object despite it being partially occluded. The main purpose of thi…

    Submitted 7 November, 2023; v1 submitted 5 July, 2022; originally announced July 2022.

    Comments: Accepted at Computer Vision and Image Understanding. See https://doi.org/10.1016/j.cviu.2023.103661 for the final version

  27. arXiv:2206.06544  [pdf, ps, other]

    cs.CV

    A Survey of Automated Data Augmentation Algorithms for Deep Learning-based Image Classification Tasks

    Authors: Zihan Yang, Richard O. Sinnott, James Bailey, Qiuhong Ke

    Abstract: In recent years, one of the most popular techniques in the computer vision community has been the deep learning technique. As a data-driven technique, deep model requires enormous amounts of accurately labelled training data, which is often inaccessible in many real-world applications. A data-space solution is Data Augmentation (DA), that can artificially generate new images out of original sample…

    Submitted 6 October, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: 68 pages, 9 figures. Submitted to Knowledge and Information Systems (KAIS)

    MSC Class: A.1; I.4.3; I.5.2

  28. arXiv:2205.03825  [pdf, other]

    cs.CV

    Iterative Geometry-Aware Cross Guidance Network for Stereo Image Inpainting

    Authors: Ang Li, Shanshan Zhao, Qingjie Zhang, Qiuhong Ke

    Abstract: Currently, single image inpainting has achieved promising results based on deep convolutional neural networks. However, inpainting on stereo images with missing regions has not been explored thoroughly, which is also a significant but different problem. One crucial requirement for stereo image inpainting is stereo consistency. To achieve it, we propose an Iterative Geometry-Aware Cross Guidance Ne…

    Submitted 10 May, 2022; v1 submitted 8 May, 2022; originally announced May 2022.

    Comments: Accepted by IJCAI 2022

  29. arXiv:2110.09783  [pdf, other]

    cs.CV

    Spatial-Temporal Transformer for 3D Point Cloud Sequences

    Authors: Yimin Wei, Hao Liu, Tingting Xie, Qiuhong Ke, Yulan Guo

    Abstract: Effective learning of spatial-temporal information within a point cloud sequence is highly important for many down-stream tasks such as 4D semantic segmentation and 3D action recognition. In this paper, we propose a novel framework named Point Spatial-Temporal Transformer (PST2) to learn spatial-temporal representations from dynamic 3D point cloud sequences. Our PST2 consists of two major modules:…

    Submitted 19 October, 2021; originally announced October 2021.

    Journal ref: WACV2022

  30. arXiv:2108.08344  [pdf, other]

    cs.CV cs.AI

    The Multi-Modal Video Reasoning and Analyzing Competition

    Authors: Haoran Peng, He Huang, Li Xu, Tianjiao Li, Jun Liu, Hossein Rahmani, Qiuhong Ke, Zhicheng Guo, Cong Wu, Rongchang Li, Mang Ye, Jiahao Wang, Jiaxu Zhang, Yuanzhong Liu, Tao He, Fuwei Zhang, Xianbin Liu, Tao Lin

    Abstract: In this paper, we introduce the Multi-Modal Video Reasoning and Analyzing Competition (MMVRAC) workshop in conjunction with ICCV 2021. This competition is composed of four different tracks, namely, video question answering, skeleton-based action recognition, fisheye video-based action recognition, and person re-identification, which are based on two datasets: SUTD-TrafficQA and UAV-Human. We summa…

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021 Workshops

    ACM Class: I.2.10; I.2.6

  31. arXiv:2107.09176  [pdf]

    cs.DL cs.CY

    Temporal search in the scientific space predicts breakthrough inventions

    Authors: Chao Min, Qing Ke

    Abstract: The development of inventions is theorized as a process of searching and recombining existing knowledge components. Previous studies under this theory have examined myriad characteristics of recombined knowledge and their performance implications. One feature that has received much attention is technological knowledge age. Yet, little is known about how the age of scientific knowledge influences t…

    Submitted 19 July, 2021; originally announced July 2021.

  32. arXiv:2106.06487  [pdf, other]

    cs.DL cs.CY

    A dataset of mentorship in science with semantic and demographic estimations

    Authors: Qing Ke, Lizhen Liang, Ying Ding, Stephen V. David, Daniel E. Acuna

    Abstract: Mentorship in science is crucial for topic choice, career decisions, and the success of mentees and mentors. Typically, researchers who study mentorship use article co-authorship and doctoral dissertation datasets. However, available datasets of this type focus on narrow selections of fields and miss out on early career and non-publication-related interactions. Here, we describe MENTORSHIP, a crow…

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: Data can be found at https://doi.org/10.5281/zenodo.4917086

  33. arXiv:2106.01532  [pdf, other]

    cs.CV

    Noise Doesn't Lie: Towards Universal Detection of Deep Inpainting

    Authors: Ang Li, Qiuhong Ke, Xingjun Ma, Haiqin Weng, Zhiyuan Zong, Feng Xue, Rui Zhang

    Abstract: Deep image inpainting aims to restore damaged or missing regions in an image with realistic contents. While having a wide range of applications such as object removal and image recovery, deep inpainting techniques also have the risk of being manipulated for image forgery. A promising countermeasure against such forgeries is deep inpainting detection, which aims to locate the inpainted regions in a…

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted by IJCAI 2021

  34. arXiv:2105.11537  [pdf, other]

    cs.SI cs.AI cs.LG

    Graph Neural Network Based VC Investment Success Prediction

    Authors: Shiwei Lyu, Shuai Ling, Kaihao Guo, Haipeng Zhang, Kunpeng Zhang, Suting Hong, Qing Ke, Jinjie Gu

    Abstract: Predicting the start-ups that will eventually succeed is essentially important for the venture capital business and worldwide policy makers, especially at an early stage such that rewards can possibly be exponential. Though various empirical studies and data-driven modeling work have been done, the predictive power of the complex networks of stakeholders including venture capital investors, star…

    Submitted 25 May, 2021; originally announced May 2021.

    Comments: 11 pages, 5 figures

  35. arXiv:2103.04778  [pdf, other]

    cs.CV

    Bridging the Distribution Gap of Visible-Infrared Person Re-identification with Modality Batch Normalization

    Authors: Wenkang Li, Qi Ke, Wenbin Chen, Yicong Zhou

    Abstract: Visible-infrared cross-modality person re-identification (VI-ReID), whose aim is to match person images between visible and infrared modality, is a challenging cross-modality image retrieval task. Most existing works integrate batch normalization layers into their neural network, but we found out that batch normalization layers would lead to two types of distribution gap: 1) inter-mini-batch distr…

    Submitted 8 March, 2021; originally announced March 2021.

  36. arXiv:2101.10897  [pdf, other]

    cs.CV

    HexCNN: A Framework for Native Hexagonal Convolutional Neural Networks

    Authors: Yunxiang Zhao, Qiuhong Ke, Flip Korn, Jianzhong Qi, Rui Zhang

    Abstract: Hexagonal CNN models have shown superior performance in applications such as IACT data analysis and aerial scene classification due to their better rotation symmetry and reduced anisotropy. In order to realize hexagonal processing, existing studies mainly use the ZeroOut method to imitate hexagonal processing, which causes substantial memory and computation overheads. We address this deficiency wi…

    Submitted 25 January, 2021; originally announced January 2021.

  37. arXiv:2101.06704  [pdf, other]

    cs.AI cs.CR cs.CV cs.LG

    Adversarial Interaction Attack: Fooling AI to Misinterpret Human Intentions

    Authors: Nodens Koren, Qiuhong Ke, Yisen Wang, James Bailey, Xingjun Ma

    Abstract: Understanding the actions of both humans and artificial intelligence (AI) agents is important before modern AI systems can be fully integrated into our daily life. In this paper, we show that, despite their current huge success, deep learning based AI systems can be easily fooled by subtle adversarial noise to misinterpret the intention of an action in interaction scenarios. Based on a case study…

    Submitted 17 January, 2021; originally announced January 2021.

    Comments: Preprint

  38. Human Action Recognition from Various Data Modalities: A Review

    Authors: Zehua Sun, Qiuhong Ke, Hossein Rahmani, Mohammed Bennamoun, Gang Wang, Jun Liu

    Abstract: Human Action Recognition (HAR) aims to understand human behavior and assign a label to each action. It has a wide range of applications, and therefore has been attracting increasing attention in the field of computer vision. Human actions can be represented using various data modalities, such as RGB, skeleton, depth, infrared, point cloud, event stream, audio, acceleration, radar, and WiFi signal,…

    Submitted 21 June, 2022; v1 submitted 22 December, 2020; originally announced December 2020.

  39. arXiv:2010.09925  [pdf, other]

    cs.CV cs.LG cs.MM

    Hierarchical Paired Channel Fusion Network for Street Scene Change Detection

    Authors: Yinjie Lei, Duo Peng, Pingping Zhang, Qiuhong Ke, Haifeng Li

    Abstract: Street Scene Change Detection (SSCD) aims to locate the changed regions between a given street-view image pair captured at different times, which is an important yet challenging task in the computer vision community. The intuitive way to solve the SSCD task is to fuse the extracted image feature pairs, and then directly measure the dissimilarity parts for producing a change map. Therefore, the key…

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: To appear in Transactions on Image Processing, including 13 pages, 13 figures, 9 tables

  40. arXiv:2009.01142  [pdf, other]

    cs.CV

    Long-Term Anticipation of Activities with Cycle Consistency

    Authors: Yazan Abu Farha, Qiuhong Ke, Bernt Schiele, Juergen Gall

    Abstract: With the success of deep learning methods in analyzing activities in videos, more attention has recently been focused towards anticipating future activities. However, most of the work on anticipation either analyzes a partially observed activity or predicts the next action class. Recently, new approaches have been proposed to extend the prediction horizon up to several minutes in the future and th…

    Submitted 2 September, 2020; originally announced September 2020.

    Comments: GCPR 2020

  41. arXiv:2006.15383  [pdf, other]

    cs.DL cs.CY physics.soc-ph

    Interdisciplinary research and technological impact: Evidence from biomedicine

    Authors: Qing Ke

    Abstract: Interdisciplinary research (IDR) has been considered as an important source for scientific breakthroughs and as a solution to today's complex societal challenges. While ample empirical evidence has suggested its benefits within the academia such as better creativity and higher scientific impact and visibility, its societal benefits -- a key argument originally used for promoting IDR -- remain rela…

    Submitted 4 January, 2023; v1 submitted 27 June, 2020; originally announced June 2020.

    Journal ref: Scientometrics 128, 2035-2077 (2023)

  42. Technological impact of biomedical research: the role of basicness and novelty

    Authors: Qing Ke

    Abstract: An ongoing interest in innovation studies is to understand how knowledge generated from scientific research can be used in the development of technologies. While previous inquiries have devoted to studying the scientific capacity of technologies and institutional factors facilitating technology transfer, little is known about the intrinsic characteristics of scientific publications that gain direc…

    Submitted 3 June, 2020; originally announced June 2020.

    Journal ref: Research Policy 49, 104071 (2020)

  43. arXiv:2001.08199  [pdf, other]

    cs.DL cs.SI physics.soc-ph

    Neural Embeddings of Scholarly Periodicals Reveal Complex Disciplinary Organizations

    Authors: Hao Peng, Qing Ke, Ceren Budak, Daniel M. Romero, Yong-Yeol Ahn

    Abstract: Understanding the structure of knowledge domains is one of the foundational challenges in science of science. Here, we propose a neural embedding technique that leverages the information contained in the citation network to obtain continuous vector representations of scientific periodicals. We demonstrate that our periodical embeddings encode nuanced relationships between periodicals as well as th…

    Submitted 20 February, 2021; v1 submitted 22 January, 2020; originally announced January 2020.

  44. The citation disadvantage of clinical research

    Authors: Qing Ke

    Abstract: Biomedical research encompasses diverse types of activities, from basic science ("bench") to clinical medicine ("bedside") to bench-to-bedside translational research. It, however, remains unclear whether different types of research receive citations at varying rates. Here we aim to answer this question by using a newly proposed paper-level indicator that quantifies the extent to which a paper is b…

    Submitted 3 December, 2019; originally announced December 2019.

    Journal ref: Journal of Informetrics 14, 100998 (2020)

  45. arXiv:1903.10610  [pdf, other]

    cs.DL cs.CY physics.soc-ph

    An analysis of the evolution of science-technology linkage in biomedicine

    Authors: Qing Ke

    Abstract: Demonstrating the practical value of public research has been an important subject in science policy. Here we present a detailed study on the evolution of the citation linkage between life science related patents and biomedical research over a 37-year period. Our analysis relies on a newly-created dataset that systematically links millions of non-patent references to biomedical papers. We find a l…

    Submitted 5 June, 2020; v1 submitted 25 March, 2019; originally announced March 2019.

    Comments: 13 pages, 6 figures, 7 tables

    Journal ref: Journal of Informetrics 14, 101074 (2020)

  46. arXiv:1812.10609  [pdf]

    cs.DL cs.CY physics.soc-ph

    Identifying translational science through embeddings of controlled vocabularies

    Authors: Qing Ke

    Abstract: Objective: Translational science aims at "translating" basic scientific discoveries into clinical applications. The identification of translational science has practicality such as evaluating the effectiveness of investments made into large programs like the Clinical and Translational Science Awards. Despite several proposed methods that group publications---the primary unit of research output---i…

    Submitted 26 December, 2018; originally announced December 2018.

    Comments: Accepted at JAMIA; Supporting Information at http://qke.github.io/assets/pdf/trans_supp.pdf

    Journal ref: Journal of the American Medical Informatics Association 26, 516-523 (2019)

  47. arXiv:1804.04105  [pdf, other]

    cs.DL physics.soc-ph

    Comparing scientific and technological impact of biomedical research

    Authors: Qing Ke

    Abstract: Traditionally, the number of citations that a scholarly paper receives from other papers is used as the proxy of its scientific impact. Yet citations can come from domains outside the scientific community, and one such example is through patented technologies---paper can be cited by patents, achieving technological impact. While the scientific impact of papers has been extensively studied, the tec…

    Submitted 3 July, 2018; v1 submitted 11 April, 2018; originally announced April 2018.

    Journal ref: Journal of Informetrics 12, 706-717 (2018)

  48. arXiv:1709.07580  [pdf, other]

    cs.CY physics.soc-ph

    Service Providers of the Sharing Economy: Who Joins and Who Benefits?

    Authors: Qing Ke

    Abstract: Many "sharing economy" platforms, such as Uber and Airbnb, have become increasingly popular, providing consumers with more choices and suppliers a chance to make profit. They, however, have also brought about emerging issues regarding regulation, tax obligation, and impact on urban environment, and have generated heated debates from various interest groups. Empirical studies regarding these issues…

    Submitted 21 September, 2017; originally announced September 2017.

    Comments: CSCW 2018 Online First

  49. A New Representation of Skeleton Sequences for 3D Action Recognition

    Authors: Qiuhong Ke, Mohammed Bennamoun, Senjian An, Ferdous Sohel, Farid Boussaid

    Abstract: This paper presents a new method for 3D action recognition with skeleton sequences (i.e., 3D trajectories of human skeleton joints). The proposed method first transforms each skeleton sequence into three clips each consisting of several frames for spatial temporal feature learning using deep neural networks. Each clip is generated from one channel of the cylindrical coordinates of the skeleton seq…

    Submitted 4 June, 2017; v1 submitted 9 March, 2017; originally announced March 2017.

    Comments: CVPR 2017

  50. arXiv:1701.01645  [pdf, other]

    cs.CY physics.soc-ph

    Sharing Means Renting?: An Entire-marketplace Analysis of Airbnb

    Authors: Qing Ke

    Abstract: Airbnb, an online marketplace for accommodations, has experienced a staggering growth accompanied by intense debates and scattered regulations around the world. Current discourses, however, are largely focused on opinions rather than empirical evidences. Here, we aim to bridge this gap by presenting the first large-scale measurement study on Airbnb, using a crawled data set containing 2.3 million…

    Submitted 12 May, 2017; v1 submitted 6 January, 2017; originally announced January 2017.

    Comments: WebSci '17