Search | arXiv e-print repository

doi 10.1038/s41586-023-05810-5

Successful Kinetic Impact into an Asteroid for Planetary Defense

Authors: R. Terik Daly, Carolyn M. Ernst, Olivier S. Barnouin, Nancy L. Chabot, Andrew S. Rivkin, Andrew F. Cheng, Elena Y. Adams, Harrison F. Agrusa, Elisabeth D. Abel, Amy L. Alford, Erik I. Asphaug, Justin A. Atchison, Andrew R. Badger, Paul Baki, Ronald-L. Ballouz, Dmitriy L. Bekker, Julie Bellerose, Shyam Bhaskaran, Bonnie J. Buratti, Saverio Cambioni, Michelle H. Chen, Steven R. Chesley, George Chiu, Gareth S. Collins, Matthew W. Cox, et al. (76 additional authors not shown)

Abstract: While no known asteroid poses a threat to Earth for at least the next century, the catalog of near-Earth asteroids is incomplete for objects whose impacts would produce regional devastation. Several approaches have been proposed to potentially prevent an asteroid impact with Earth by deflecting or disrupting an asteroid. A test of kinetic impact technology was identified as the highest priority space mission related to asteroid mitigation. NASA's Double Asteroid Redirection Test (DART) mission is the first full-scale test of kinetic impact technology. The mission's target asteroid was Dimorphos, the secondary member of the S-type binary near-Earth asteroid (65803) Didymos. This binary asteroid system was chosen to enable ground-based telescopes to quantify the asteroid deflection caused by DART's impact. While past missions have utilized impactors to investigate the properties of small bodies, those earlier missions were not intended to deflect their targets and did not achieve measurable deflections. Here we report the DART spacecraft's autonomous kinetic impact into Dimorphos and reconstruct the impact event, including the timeline leading to impact, the location and nature of the DART impact site, and the size and shape of Dimorphos. The successful impact of the DART spacecraft with Dimorphos and the resulting change in Dimorphos's orbit demonstrates that kinetic impactor technology is a viable technique to potentially defend Earth if necessary.

Submitted 3 March, 2023; originally announced March 2023.

Comments: Accepted by Nature

arXiv:2303.02077 [pdf]

doi 10.1038/s41586-023-05805-2

Orbital Period Change of Dimorphos Due to the DART Kinetic Impact

Authors: Cristina A. Thomas, Shantanu P. Naidu, Peter Scheirich, Nicholas A. Moskovitz, Petr Pravec, Steven R. Chesley, Andrew S. Rivkin, David J. Osip, Tim A. Lister, Lance A. M. Benner, Marina Brozović, Carlos Contreras, Nidia Morrell, Agata Rożek, Peter Kušnirák, Kamil Hornoch, Declan Mages, Patrick A. Taylor, Andrew D. Seymour, Colin Snodgrass, Uffe G. Jørgensen, Martin Dominik, Brian Skiff, Tom Polakis, Matthew M. Knight, et al. (24 additional authors not shown)

Abstract: The Double Asteroid Redirection Test (DART) spacecraft successfully performed the first test of a kinetic impactor for asteroid deflection by impacting Dimorphos, the secondary of near-Earth binary asteroid (65803) Didymos, and changing the orbital period of Dimorphos. A change in orbital period of approximately 7 minutes was expected if the incident momentum from the DART spacecraft was directly transferred to the asteroid target in a perfectly inelastic collision, but studies of the probable impact conditions and asteroid properties indicated that a considerable momentum enhancement ($β$) was possible. In the years prior to impact, we used lightcurve observations to accurately determine the pre-impact orbit parameters of Dimorphos with respect to Didymos. Here we report the change in the orbital period of Dimorphos as a result of the DART kinetic impact to be -33.0 +/- 1.0 (3$σ$) minutes. Using new Earth-based lightcurve and radar observations, two independent approaches determined identical values for the change in the orbital period. This large orbit period change suggests that ejecta contributed a significant amount of momentum to the asteroid beyond what the DART spacecraft carried.

Submitted 3 March, 2023; originally announced March 2023.

Comments: Accepted by Nature

arXiv:2303.01700 [pdf]

doi 10.1038/s41586-023-05811-4

Ejecta from the DART-produced active asteroid Dimorphos

Authors: Jian-Yang Li, Masatoshi Hirabayashi, Tony L. Farnham, Jessica M. Sunshine, Matthew M. Knight, Gonzalo Tancredi, Fernando Moreno, Brian Murphy, Cyrielle Opitom, Steve Chesley, Daniel J. Scheeres, Cristina A. Thomas, Eugene G. Fahnestock, Andrew F. Cheng, Linda Dressel, Carolyn M. Ernst, Fabio Ferrari, Alan Fitzsimmons, Simone Ieva, Stavro L. Ivanovski, Teddy Kareta, Ludmilla Kolokolova, Tim Lister, Sabina D. Raducan, Andrew S. Rivkin, et al. (39 additional authors not shown)

Abstract: Some active asteroids have been proposed to be the result of impact events. Because active asteroids are generally discovered serendipitously only after their tail formation, the process of the impact ejecta evolving into a tail has never been directly observed. NASA's Double Asteroid Redirection Test (DART) mission, apart from having successfully changed the orbital period of Dimorphos, demonstrated the activation process of an asteroid from an impact under precisely known impact conditions. Here we report the observations of the DART impact ejecta with the Hubble Space Telescope (HST) from impact time T+15 minutes to T+18.5 days at spatial resolutions of ~2.1 km per pixel. Our observations reveal a complex evolution of ejecta, which is first dominated by the gravitational interaction between the Didymos binary system and the ejected dust and later by solar radiation pressure. The lowest-speed ejecta dispersed via a sustained tail that displayed a consistent morphology with previously observed asteroid tails thought to be produced by impact. The ejecta evolution following DART's controlled impact experiment thus provides a framework for understanding the fundamental mechanisms acting on asteroids disrupted by natural impact.

Submitted 2 March, 2023; originally announced March 2023.

Comments: accepted by Nature

arXiv:2301.13574 [pdf, other]

doi 10.1007/JHEP04(2023)110

The mixed-state entanglement in holographic p-wave superconductor model

Authors: Zhe Yang, Fang-Jing Cheng, Chao Niu, Cheng-Yong Zhang, Peng Liu

Abstract: In this paper, we investigate the mixed-state entanglement in a model of p-wave superconductivity phase transition using holographic methods. We calculate several entanglement measures, including holographic entanglement entropy (HEE), mutual information (MI), and entanglement wedge cross-section (EWCS). Our results show that these measures display critical behavior at the phase transition points, with the EWCS exhibiting opposite temperature behavior compared to the HEE. Additionally, we find that the critical exponents of all entanglement measures are twice those of the condensate. Moreover, we find that the EWCS is a more sensitive indicator of the critical behavior of phase transitions than the HEE. Furthermore, we uncover a universal inequality in the growth rates of EWCS and MI near critical points in thermal phase transitions, such as p-wave and s-wave superconductivity, suggesting that MI captures more information than EWCS when a phase transition first occurs.

Submitted 8 February, 2023; v1 submitted 31 January, 2023; originally announced January 2023.

Comments: 24 pages, 13 figures; several refs added and the text improved

arXiv:2301.13337

DAFD: Domain Adaptation via Feature Disentanglement for Image Classification

Authors: Zhize Wu, Changjiang Du, Le Zou, Ming Tan, Tong Xu, Fan Cheng, Fudong Nian, Thomas Weise

Abstract: A good feature representation is the key to image classification. In practice, image classifiers may be applied in scenarios different from what they have been trained on. This so-called domain shift leads to a significant performance drop in image classification. Unsupervised domain adaptation (UDA) reduces the domain shift by transferring the knowledge learned from a labeled source domain to an unlabeled target domain. We perform feature disentanglement for UDA by distilling category-relevant features and excluding category-irrelevant features from the global feature maps. This disentanglement prevents the network from overfitting to category-irrelevant information and makes it focus on information useful for classification. This reduces the difficulty of domain alignment and improves the classification accuracy on the target domain. We propose a coarse-to-fine domain adaptation method called Domain Adaptation via Feature Disentanglement (DAFD), which has two components: (1) the Category-Relevant Feature Selection (CRFS) module, which disentangles the category-relevant features from the category-irrelevant features, and (2) the Dynamic Local Maximum Mean Discrepancy (DLMMD) module, which achieves fine-grained alignment by reducing the discrepancy within the category-relevant features from different domains. Combined with the CRFS, the DLMMD module can align the category-relevant features properly. We conduct comprehensive experiments on four standard datasets. Our results clearly demonstrate the robustness and effectiveness of our approach in domain adaptive image classification tasks and its competitiveness to the state of the art.

Submitted 9 January, 2024; v1 submitted 30 January, 2023; originally announced January 2023.

Comments: Update the experimental results

arXiv:2301.08237 [pdf, other]

LoCoNet: Long-Short Context Network for Active Speaker Detection

Authors: Xizi Wang, Feng Cheng, Gedas Bertasius, David Crandall

Abstract: Active Speaker Detection (ASD) aims to identify who is speaking in each frame of a video. ASD reasons from audio and visual information from two contexts: long-term intra-speaker context and short-term inter-speaker context. Long-term intra-speaker context models the temporal dependencies of the same speaker, while short-term inter-speaker context models the interactions of speakers in the same scene. These two contexts are complementary to each other and can help infer the active speaker. Motivated by these observations, we propose LoCoNet, a simple yet effective Long-Short Context Network that models the long-term intra-speaker context and short-term inter-speaker context. We use self-attention to model long-term intra-speaker context due to its effectiveness in modeling long-range dependencies, and convolutional blocks that capture local patterns to model short-term inter-speaker context. Extensive experiments show that LoCoNet achieves state-of-the-art performance on multiple datasets, achieving an mAP of 95.2% (+1.1%) on AVA-ActiveSpeaker, 68.1% (+22%) on the Columbia dataset, 97.2% (+2.8%) on the Talkies dataset and 59.7% (+8.0%) on the Ego4D dataset. Moreover, in challenging cases where multiple speakers are present, or the face of the active speaker is much smaller than other faces in the same scene, LoCoNet outperforms previous state-of-the-art methods by 3.4% on the AVA-ActiveSpeaker dataset. The code will be released at https://github.com/SJTUwxz/LoCoNet_ASD.

Submitted 29 March, 2024; v1 submitted 19 January, 2023; originally announced January 2023.

Comments: accepted by CVPR 2024

arXiv:2212.14193 [pdf, other]

A Unified Object Counting Network with Object Occupation Prior

Authors: Shengqin Jiang, Qing Wang, Fengna Cheng, Yuankai Qi, Qingshan Liu

Abstract: The counting task, which plays a fundamental role in numerous applications (e.g., crowd counting, traffic statistics), aims to predict the number of objects with various densities. Existing object counting tasks are designed for a single object class. However, it is inevitable to encounter newly arriving data with new classes in the real world. We name this scenario evolving object counting. In this paper, we build the first evolving object counting dataset and propose a unified object counting network as the first attempt to address this task. The proposed model consists of two key components: a class-agnostic mask module and a class-incremental module. The class-agnostic mask module learns a generic object occupation prior by predicting a class-agnostic binary mask (e.g., 1 denotes that an object exists at the considered position in an image and 0 otherwise). The class-incremental module is used to handle newly arriving classes and provides discriminative class guidance for density map prediction. The combined outputs of the class-agnostic mask module and the image feature extractor are used to predict the final density map. When new classes arrive, we first add new neural nodes into the last regression and classification layers of the class-incremental module. Then, instead of retraining the model from scratch, we utilize knowledge distillation to help the model remember what it has already learned about previous object classes. We also employ a support sample bank to store a small number of typical training samples of each class, which are used to prevent the model from forgetting key information of old data. With this design, our model can efficiently and effectively adapt to newly arriving classes while keeping good performance on already seen data without large-scale retraining. Extensive experiments on the collected dataset demonstrate the favorable performance.

Submitted 30 June, 2023; v1 submitted 29 December, 2022; originally announced December 2022.

Comments: Accepted by IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY; The dataset and code are available at: https://github.com/Tanyjiang/EOCO

arXiv:2212.05051 [pdf, other]

VindLU: A Recipe for Effective Video-and-Language Pretraining

Authors: Feng Cheng, Xizi Wang, Jie Lei, David Crandall, Mohit Bansal, Gedas Bertasius

Abstract: The last several years have witnessed remarkable progress in video-and-language (VidL) understanding. However, most modern VidL approaches use complex and specialized model architectures and sophisticated pretraining protocols, making the reproducibility, analysis and comparisons of these frameworks difficult. Hence, instead of proposing yet another new VidL model, this paper conducts a thorough empirical study demystifying the most important factors in the VidL model design. Among the factors that we investigate are (i) the spatiotemporal architecture design, (ii) the multimodal fusion schemes, (iii) the pretraining objectives, (iv) the choice of pretraining data, (v) pretraining and finetuning protocols, and (vi) dataset and model scaling. Our empirical study reveals that the most important design factors include: temporal modeling, video-to-text multimodal fusion, masked modeling objectives, and joint training on images and videos. Using these empirical insights, we then develop a step-by-step recipe, dubbed VindLU, for effective VidL pretraining. Our final model trained using our recipe achieves comparable or better than state-of-the-art results on several VidL tasks without relying on external CLIP pretraining. In particular, on the text-to-video retrieval task, our approach obtains 61.2% on DiDeMo, and 55.0% on ActivityNet, outperforming current SOTA by 7.8% and 6.1% respectively. Furthermore, our model also obtains state-of-the-art video question-answering results on ActivityNet-QA, MSRVTT-QA, MSRVTT-MC and TVQA. Our code and pretrained models are publicly available at: https://github.com/klauscc/VindLU.

Submitted 5 April, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

Comments: CVPR 2023. Project page: https://klauscc.github.io/vindlu.html

arXiv:2211.16032 [pdf, other]

Dimensionality-Varying Diffusion Process

Authors: Han Zhang, Ruili Feng, Zhantao Yang, Lianghua Huang, Yu Liu, Yifei Zhang, Yujun Shen, Deli Zhao, Jingren Zhou, Fan Cheng

Abstract: Diffusion models, which learn to reverse a signal destruction process to generate new data, typically require the signal at each step to have the same dimension. We argue that, considering the spatial redundancy in image signals, there is no need to maintain a high dimensionality in the evolution process, especially in the early generation phase. To this end, we make a theoretical generalization of the forward diffusion process via signal decomposition. Concretely, we manage to decompose an image into multiple orthogonal components and control the attenuation of each component when perturbing the image. That way, as the noise strength increases, we are able to diminish those inconsequential components and thus use a lower-dimensional signal to represent the source, barely losing information. Such a reformulation allows varying dimensions in both training and inference of diffusion models. Extensive experiments on a range of datasets suggest that our approach substantially reduces the computational cost and achieves on-par or even better synthesis performance compared to baseline methods. We also show that our strategy facilitates high-resolution image synthesis and improves the FID of a diffusion model trained on FFHQ at $1024\times1024$ resolution from 52.40 to 10.46. Code and models will be made publicly available.

Submitted 29 November, 2022; originally announced November 2022.

arXiv:2211.16022 [pdf, other]

Textual Enhanced Contrastive Learning for Solving Math Word Problems

Authors: Yibin Shen, Qianying Liu, Zhuoyuan Mao, Fei Cheng, Sadao Kurohashi

Abstract: Solving math word problems is a task that analyses the relation of quantities and requires an accurate understanding of contextual natural language information. Recent studies show that current models rely on shallow heuristics to predict solutions and could be easily misled by small textual perturbations. To address this problem, we propose a Textual Enhanced Contrastive Learning framework, which forces the models to distinguish examples that are semantically similar but hold different mathematical logic. We adopt a self-supervised strategy to enrich examples with subtle textual variance by textual reordering or problem re-construction. We then retrieve the hardest-to-differentiate samples from both equation and textual perspectives and guide the model to learn their representations. Experimental results show that our method achieves state-of-the-art performance on both widely used benchmark datasets and also exquisitely designed challenge datasets in English and Chinese. Our code and data are available at https://github.com/yiyunya/Textual_CL_MWP

Submitted 29 November, 2022; originally announced November 2022.

Comments: Findings of EMNLP 2022

arXiv:2211.07466 [pdf, other]

Reinforcement Learning Based Resource Allocation for Network Slices in O-RAN Midhaul

Authors: Nien Fang Cheng, Turgay Pamuklu, Melike Erol-Kantarci

Abstract: Network slicing envisions the 5th generation (5G) mobile network resource allocation to be based on different requirements for different services, such as Ultra-Reliable Low Latency Communication (URLLC) and Enhanced Mobile Broadband (eMBB). Open Radio Access Network (O-RAN) proposes an open and disaggregated concept of RAN by modularizing the functionalities into independent components. Network slicing for O-RAN can significantly improve performance. Therefore, an advanced resource allocation solution for network slicing in O-RAN is proposed in this study by applying Reinforcement Learning (RL). This research demonstrates an RL-compatible simplified edge network simulator with three components: user equipment (UE), Edge O-Cloud, and Regional O-Cloud. This simulator is later used to discover how to improve throughput for targeted network slice(s) by dynamically allocating unused bandwidth from other slices. Increasing the throughput for certain network slices can also benefit the end users with a higher average data rate, peak rate, or shorter transmission time. The results show that the RL model can provide eMBB traffic with a high peak rate and shorter transmission time for URLLC compared to balanced and eMBB focus baselines.

Submitted 14 November, 2022; originally announced November 2022.

Comments: Accepted Paper for IEEE CCNC 2023

arXiv:2210.11800 [pdf, other]

Rescue Implicit and Long-tail Cases: Nearest Neighbor Relation Extraction

Authors: Zhen Wan, Qianying Liu, Zhuoyuan Mao, Fei Cheng, Sadao Kurohashi, Jiwei Li

Abstract: Relation extraction (RE) has achieved remarkable progress with the help of pre-trained language models. However, existing RE models are usually incapable of handling two situations: implicit expressions and long-tail relation types, caused by language complexity and data sparsity. In this paper, we introduce a simple enhancement of RE using $k$ nearest neighbors ($k$NN-RE). $k$NN-RE allows the model to consult training relations at test time through a nearest-neighbor search and provides a simple yet effective means to tackle the two issues above. Additionally, we observe that $k$NN-RE serves as an effective way to leverage distant supervision (DS) data for RE. Experimental results show that the proposed $k$NN-RE achieves state-of-the-art performances on a variety of supervised RE datasets, i.e., ACE05, SciERC, and Wiki80, along with outperforming the best model to date on the i2b2 and Wiki80 datasets in the setting of allowing using DS. Our code and models are available at: https://github.com/YukinoWan/kNN-RE.

Submitted 30 January, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

Comments: EMNLP 2022 (short paper)

arXiv:2210.10421 [pdf]

Multi-view Gait Recognition based on Siamese Vision Transformer

Authors: Yanchen Yang, Lijun Yun, Ruoyu Li, Feiyan Cheng

Abstract: While the Vision Transformer has been used in gait recognition, its application in multi-view gait recognition is still limited. Different views significantly affect the extraction and identification accuracy of the characteristics of gait contour. To address this, this paper proposes a Siamese Mobile Vision Transformer (SMViT). This model not only focuses on the local characteristics of the human gait space but also considers the characteristics of long-distance attention associations, which can extract multi-dimensional step status characteristics. In addition, it describes how different perspectives affect gait characteristics and generate reliable perspective feature relationship factors. The average recognition rate of SMViT on the CASIA B data set reached 96.4%. The experimental results show that SMViT can attain state-of-the-art performance compared to advanced step recognition models such as GaitGAN, Multi_view GAN, Posegait and other gait recognition models.

Submitted 19 October, 2022; originally announced October 2022.

Comments: 13 pages, 9 figures, 1 table

arXiv:2210.07017 [pdf, other]

ComSearch: Equation Searching with Combinatorial Strategy for Solving Math Word Problems with Weak Supervision

Authors: Qianying Liu, Wenyu Guan, Jianhao Shen, Fei Cheng, Sadao Kurohashi

Abstract: Previous studies have introduced a weakly-supervised paradigm for solving math word problems requiring only the answer value annotation. While these methods search for correct value equation candidates as pseudo labels, they search among a narrow sub-space of the enormous equation space. To address this problem, we propose a novel search algorithm with combinatorial strategy, ComSearch, which can compress the search space by excluding mathematically equivalent equations. The compression allows the searching algorithm to enumerate all possible equations and obtain high-quality data. We investigate the noise in the pseudo labels that hold wrong mathematical logic, which we refer to as the false-matching problem, and propose a ranking model to denoise the pseudo labels. Our approach holds a flexible framework to utilize two existing supervised math word problem solvers to train pseudo labels, and both achieve state-of-the-art performance in the weak supervision task.

Submitted 7 March, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

Comments: EACL 2023 long paper, 14 pages

arXiv:2209.11873 [pdf]

After DART: Using the first full-scale test of a kinetic impactor to inform a future planetary defense mission

Authors: Thomas S. Statler, Sabina D. Raducan, Olivier S. Barnouin, Mallory E. DeCoster, Steven R. Chesley, Brent Barbee, Harrison F. Agrusa, Saverio Cambioni, Andrew F. Cheng, Elisabetta Dotto, Siegfried Eggl, Eugene G. Fahnestock, Fabio Ferrari, Dawn Graninger, Alain Herique, Isabel Herreros, Masatoshi Hirabayashi, Stavro Ivanovski, Martin Jutzi, Özgür Karatekin, Alice Lucchetti, Robert Luther, Rahil Makadia, Francesco Marzari, Patrick Michel, et al. (16 additional authors not shown)

Abstract: NASA's Double Asteroid Redirection Test (DART) is the first full-scale test of an asteroid deflection technology. Results from the hypervelocity kinetic impact and Earth-based observations, coupled with LICIACube and the later Hera mission, will result in measurement of the momentum transfer efficiency accurate to ~10% and characterization of the Didymos binary system. But DART is a single experiment; how could these results be used in a future planetary defense necessity involving a different asteroid? We examine what aspects of Dimorphos's response to kinetic impact will be constrained by DART results; how these constraints will help refine knowledge of the physical properties of asteroidal materials and predictive power of impact simulations; what information about a potential Earth impactor could be acquired before a deflection effort; and how design of a deflection mission should be informed by this understanding. We generalize the momentum enhancement factor $β$, showing that a particular direction-specific $β$ will be directly determined by the DART results, and that a related direction-specific $β$ is a figure of merit for a kinetic impact mission. The DART $β$ determination constrains the ejecta momentum vector, which, with hydrodynamic simulations, constrains the physical properties of Dimorphos's near-surface. In a hypothetical planetary defense exigency, extrapolating these constraints to a newly discovered asteroid will require Earth-based observations and benefit from in-situ reconnaissance. We show representative predictions for momentum transfer based on different levels of reconnaissance and discuss strategic targeting to optimize the deflection and reduce the risk of a counterproductive deflection in the wrong direction.

Submitted 23 September, 2022; originally announced September 2022.

Comments: 30 pages, 7 figures. Planetary Science Journal, in press, accepted 2022 September 22

arXiv:2209.10310 [pdf, other]

Seeking Diverse Reasoning Logic: Controlled Equation Expression Generation for Solving Math Word Problems

Authors: Yibin Shen, Qianying Liu, Zhuoyuan Mao, Zhen Wan, Fei Cheng, Sadao Kurohashi

Abstract: To solve Math Word Problems, human students leverage diverse reasoning logic that reaches different possible equation solutions. However, the mainstream sequence-to-sequence approach of automatic solvers aims to decode a fixed solution equation supervised by human annotation. In this paper, we propose a controlled equation generation solver by leveraging a set of control codes to guide the model to consider certain reasoning logic and decode the corresponding equation expressions transformed from the human reference. The empirical results suggest that our method universally improves the performance on single-unknown (Math23K) and multiple-unknown (DRAW1K, HMWP) benchmarks, with substantial improvements up to 13.2% accuracy on the challenging multiple-unknown datasets.

Submitted 29 November, 2022; v1 submitted 21 September, 2022; originally announced September 2022.

Comments: AACL 2022 short paper

arXiv:2209.06659 [pdf]

Effects of impact and target parameters on the results of a kinetic impactor: predictions for the Double Asteroid Redirection Test (DART) mission

Authors: Angela M. Stickle, Mallory E. DeCoster, Christoph Burger, Wendy K. Caldwell, Dawn Graninger, Kathryn M. Kumamoto, Robert Luther, Jens Ormö, Sabina Raducan, Emma Rainey, Christoph M. Schäfer, James D. Walker, Yun Zhang, Patrick Michel, J. Michael Owen, Olivier Barnouin, Andy F. Cheng, Sidney Cochron, Gareth S. Collins, Thomas M. Davison, Elisabetta Dotto, Fabio Ferrari, M. Isabel Herreros, Stavro L. Ivanovski, Martin Jutzi, et al. (8 additional authors not shown)

Abstract: The Double Asteroid Redirection Test (DART) spacecraft will impact into the asteroid Dimorphos on September 26, 2022 as a test of the kinetic impactor technique for planetary defense. The efficiency of the deflection following a kinetic impactor can be represented using the momentum enhancement factor, Beta, which is dependent on factors such as impact geometry and the specific target material properties. Currently, very little is known about Dimorphos and its material properties, which introduces uncertainty in the results of the deflection efficiency observables, including crater formation, ejecta distribution, and Beta. The DART Impact Modeling Working Group (IWG) is responsible for using impact simulations to better understand the results of the DART impact. Pre-impact simulation studies also provide considerable insight into how different properties and impact scenarios affect momentum enhancement following a kinetic impact. This insight provides a basis for predicting the effects of the DART impact and the first understanding of how to interpret results following the encounter. Following the DART impact, the knowledge gained from these studies will inform the initial simulations that will recreate the impact conditions, including providing estimates for potential material properties of Dimorphos and Beta resulting from DART's impact. This paper summarizes, at a high level, what has been learned from the IWG simulations and experiments in preparation for the DART impact. While unknown, estimates for reasonable potential material properties of Dimorphos provide predictions for Beta of 1-5, depending on end-member cases in the strength regime.

Submitted 14 September, 2022; originally announced September 2022.

Comments: Accepted to PSJ Didymos-DART Focus Issue

arXiv:2208.13108 [pdf, ps, other]

A Reformulation of Gaussian Completely Monotone Conjecture: A Hodge Structure on the Fisher Information along Heat Flow

Authors: Fan Cheng

Abstract: In the past decade, J. Huh solved several long-standing open problems on log-concave sequences in combinatorics. The ground-breaking techniques developed in those works are from algebraic geometry: "We believe that behind any log-concave sequence that appears in nature there is such a Hodge structure responsible for the log-concavity". A function is called completely monotone if its derivatives alternate in signs; e.g., $e^{-t}$. A fundamental conjecture in mathematical physics and Shannon information theory is on the complete monotonicity of Gaussian distribution (GCMC), which states that $I(X+Z_t)$\footnote{The probability density function of $X+Z_t$ is called "heat flow" in mathematical physics.} is completely monotone in $t$, where $I$ is Fisher information, random variables $X$ and $Z_t$ are independent and $Z_t\sim\mathcal{N}(0,t)$ is Gaussian. Inspired by the algebraic geometry method introduced by J. Huh, GCMC is reformulated in the form of a log-convex sequence. In general, a completely monotone function can admit a log-convex sequence and a log-convex sequence can further induce a log-concave sequence. The new formulation may guide GCMC to the marvelous temple of algebraic geometry. Moreover, to make GCMC more accessible to researchers from both information theory and mathematics\footnote{The author was not familiar with algebraic geometry. The paper is also aimed at providing people outside information theory with the necessary background on the history of GCMC in theory and application.}, together with some new findings, a thorough summary of the origin, the implication and further study on GCMC is presented.

Submitted 27 August, 2022; originally announced August 2022.

arXiv:2208.08010 [pdf, other]

doi 10.1109/TVCG.2023.3236380

ShortcutLens: A Visual Analytics Approach for Exploring Shortcuts in Natural Language Understanding Dataset

Authors: Zhihua Jin, Xingbo Wang, Furui Cheng, Chunhui Sun, Qun Liu, Huamin Qu

Abstract: Benchmark datasets play an important role in evaluating Natural Language Understanding (NLU) models. However, shortcuts -- unwanted biases in the benchmark datasets -- can damage the effectiveness of benchmark datasets in revealing models' real capabilities. Since shortcuts vary in coverage, productivity, and semantic meaning, it is challenging for NLU experts to systematically understand and avoid them when creating benchmark datasets. In this paper, we develop a visual analytics system, ShortcutLens, to help NLU experts explore shortcuts in NLU benchmark datasets. The system allows users to conduct multi-level exploration of shortcuts. Specifically, Statistics View helps users grasp the statistics such as coverage and productivity of shortcuts in the benchmark dataset. Template View employs hierarchical and interpretable templates to summarize different types of shortcuts. Instance View allows users to check the corresponding instances covered by the shortcuts. We conduct case studies and expert interviews to evaluate the effectiveness and usability of the system. The results demonstrate that ShortcutLens supports users in gaining a better understanding of benchmark dataset issues through shortcuts, inspiring them to create challenging and pertinent benchmark datasets.

Submitted 16 August, 2022; originally announced August 2022.

Comments: 15 pages, 6 figures

arXiv:2207.06998 [pdf]

doi 10.3847/PSJ/ac76c9

Predictions for the Dynamical States of the Didymos System before and after the Planned DART Impact

Authors: Derek C. Richardson, Harrison F. Agrusa, Brent Barbee, William F. Bottke, Andrew F. Cheng, Siegfried Eggl, Fabio Ferrari, Masatoshi Hirabayashi, Özgür Karatekin, Jay McMahon, Stephen R. Schwartz, Ronald-Louis Ballouz, Adriano Campo Bagatin, Elisabetta Dotto, Eugene G. Fahnestock, Oscar Fuentes-Muñoz, Ioannis Gkolias, Douglas P. Hamilton, Seth A. Jacobson, Martin Jutzi, Josh Lyzhoft, Rahil Makadia, Alex J. Meyer, Patrick Michel, Ryota Nakano, et al. (11 additional authors not shown)

Abstract: NASA's Double Asteroid Redirection Test (DART) spacecraft is planned to impact the natural satellite of (65803) Didymos, Dimorphos, around 23:14 UTC on 26 September 2022, causing a reduction in its orbital period that will be measurable with ground-based observations. This test of kinetic impactor technology will provide the first estimate of the momentum transfer enhancement factor $β$ at a realistic scale, wherein ejecta from the impact provides an additional deflection to the target. Earth-based observations, the LICIACube spacecraft (to be detached from DART prior to impact), and ESA's follow-up Hera mission to launch in 2024, will provide additional characterization of the deflection test. Together Hera and DART comprise the Asteroid Impact and Deflection Assessment (AIDA) cooperation between NASA and ESA. Here the predicted dynamical states of the binary system upon arrival and after impact are presented. The assumed dynamically relaxed state of the system will be excited by the impact, leading to an increase in eccentricity and slight tilt of the orbit together with enhanced libration of Dimorphos with amplitude dependent on the currently poorly known target shape. Free rotation around the moon's long axis may also be triggered and the orbital period will experience variations from seconds to minutes over timescales of days to months. Shape change of either body due to cratering or mass wasting triggered by crater formation and ejecta may affect $β$ but can be constrained through additional measurements. Both BYORP and gravity tides may cause measurable orbital changes on the timescale of Hera's rendezvous.

Submitted 14 July, 2022; originally announced July 2022.

Comments: 23 pages, 13 figures, published in PSJ

Journal ref: Planet. Sci. J. 3 157 (2022)

arXiv:2206.13042 [pdf, other]

A Strategy Optimized Pix2pix Approach for SAR-to-Optical Image Translation Task

Authors: Fujian Cheng, Yashu Kang, Chunlei Chen, Kezhao Jiang

Abstract: This technical report summarizes the analysis and approach on the image-to-image translation task in the Multimodal Learning for Earth and Environment Challenge (MultiEarth 2022). In terms of strategy optimization, cloud classification is utilized to filter optical images with dense cloud coverage to aid the supervised-learning-like approach. The commonly used pix2pix framework with a few optimizations is applied to build the model. A weighted combination of mean squared error and mean absolute error is incorporated in the loss function. As for evaluation, peak signal-to-noise ratio and structural similarity were both considered in our preliminary analysis. Lastly, our method achieved second place with a final error score of 0.0412. The results indicate great potential towards SAR-to-optical translation in remote sensing tasks, specifically for the support of long-term environmental monitoring and protection.

Submitted 4 July, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

arXiv:2206.01375 [pdf]

doi 10.1021/acs.nanolett.2c02269

Exploiting dynamic nonlinearity in upconversion nanoparticles for super-resolution imaging

Authors: Chaohao Chen, Lei Ding, Baolei Liu, Ziqin Du, Yongtao Liu, Xiangjun Di, Xuchen Shan, Chenxiao Lin, Min Zhang, Xiaoxue Xu, Xiaolan Zhong, Jianfeng Wang, Lingqian Chang, Ben J. Halkon, Xin Chen, Faliang Cheng, Fan Wang

Abstract: Single-beam super-resolution microscopy, also known as superlinear microscopy, exploits the nonlinear response of fluorescent probes in confocal microscopy. The technique requires no complex purpose-built system, light field modulation, or beam shaping. Here, we present a strategy to enhance spatial resolution of superlinear microscopy by modulating excitation intensity during image acquisition. This modulation induces dynamic optical nonlinearity in upconversion nanoparticles (UCNPs), resulting in variations of higher spatial-frequency information in the obtained images. The high-order information can be extracted with a proposed weighted finite difference imaging algorithm from raw fluorescence images, to generate an image with a higher resolution than superlinear microscopy images. We apply this approach to resolve two adjacent nanoparticles within a diffraction-limited area, improving the resolution to 130 nm. This work suggests a new scope for developing dynamic nonlinear fluorescent probes in super-resolution nanoscopy.

Submitted 2 June, 2022; originally announced June 2022.

Comments: 26 pages with 4 figures

arXiv:2205.08770 [pdf, other]

Relation Extraction with Weighted Contrastive Pre-training on Distant Supervision

Authors: Zhen Wan, Fei Cheng, Qianying Liu, Zhuoyuan Mao, Haiyue Song, Sadao Kurohashi

Abstract: Contrastive pre-training on distant supervision has shown remarkable effectiveness in improving supervised relation extraction tasks. However, the existing methods ignore the intrinsic noise of distant supervision during the pre-training stage. In this paper, we propose a weighted contrastive learning method by leveraging the supervised data to estimate the reliability of pre-training instances and explicitly reduce the effect of noise. Experimental results on three supervised datasets demonstrate the advantages of our proposed weighted contrastive learning approach compared to two state-of-the-art non-weighted baselines. Our code and models are available at: https://github.com/YukinoWan/WCL

Submitted 10 February, 2023; v1 submitted 18 May, 2022; originally announced May 2022.

Comments: EACL 2023 (Findings)

arXiv:2205.04232 [pdf, other]

doi 10.1093/mnras/stac1305

Arecibo and FAST Timing Follow-up of twelve Millisecond Pulsars Discovered in Commensal Radio Astronomy FAST Survey

Authors: C. C. Miao, W. W. Zhu, D. Li, P. C. C. Freire, J. R. Niu, P. Wang, J. P. Yuan, M. Y. Xue, A. D. Cameron, D. J. Champion, M. Cruces, Y. T. Chen, M. M. Chi, X. F. Cheng, S. J. Dang, M. F. Ding, Y. Feng, Z. Y. Gan, G. Hobbs, M. Kramer, Z. J. Liu, Y. X. Li, Z. K. Luo, X. L. Miao, L. Q. Meng, et al. (24 additional authors not shown)

Abstract: We report the phase-connected timing ephemeris, polarization pulse profiles, Faraday rotation measurements, and Rotating-Vector-Model (RVM) fitting results of twelve millisecond pulsars (MSPs) discovered with the Five-hundred-meter Aperture Spherical radio Telescope (FAST) in the Commensal radio Astronomy FAST survey (CRAFTS). The timing campaigns were carried out with FAST and Arecibo over three years. Eleven of the twelve pulsars are in neutron star - white dwarf binary systems, with orbital periods between 2.4 and 100 d. Ten of them have spin periods, companion masses, and orbital eccentricities that are consistent with the theoretical expectations for MSP - Helium white dwarf (He WD) systems. The last binary pulsar (PSR J1912$-$0952) has a significantly smaller spin frequency and a smaller companion mass; the latter could be caused by a low orbital inclination for the system. Its orbital period of 29 days is well within the range of orbital periods where some MSP - He WD systems have shown anomalous eccentricities; however, the eccentricity of PSR J1912$-$0952 is typical of what one finds for the remaining MSP - He WD systems.

Submitted 9 May, 2022; originally announced May 2022.

Comments: 11 pages, 5 figures, MNRAS accepted

arXiv:2205.01843 [pdf]

doi 10.1093/nsr/nwab225

Direct observation of nodeless superconductivity and phonon modes in electron-doped copper oxide Sr$_{1-x}$Nd$_x$CuO$_2$

Authors: Jia-Qi Fan, Xue-Qing Yu, Fang-Jun Cheng, Heng Wang, Ruifeng Wang, Xiaobing Ma, Xiao-Peng Hu, Ding Zhang, Xu-Cun Ma, Qi-Kun Xue, Can-Li Song

Abstract: The microscopic understanding of high-temperature superconductivity in cuprates has been hindered by the apparent complexity of crystal structures in these materials. We used scanning tunneling microscopy and spectroscopy to study an electron-doped copper oxide compound Sr$_{1-x}$Nd$_x$CuO$_2$ that has only bare cations separating the CuO$_2$ planes and thus the simplest infinite-layer structure among all cuprate superconductors. Tunneling conductance spectra of the major CuO$_2$ planes in the superconducting state revealed direct evidence for a nodeless pairing gap, regardless of variation of its magnitude with the local doping of trivalent neodymium. Furthermore, three distinct bosonic modes are observed as multiple peak-dip-hump features outside the superconducting gaps and their respective energies depend little on the spatially varying gaps. Along with the bosonic modes with energies identical to those of the external, bending and stretching phonons of copper oxides, our findings indicate their origin from lattice vibrations rather than spin excitations.

Submitted 3 May, 2022; originally announced May 2022.

Comments: 16 Pages, 4 figures, 5 supplemental figures

Journal ref: Natl. Sci. Rev. 9, nwab225 (2022)

arXiv:2204.04777 [pdf]

Multimodal Machine Learning in Precision Health

Authors: Adrienne Kline, Hanyin Wang, Yikuan Li, Saya Dennis, Meghan Hutch, Zhenxing Xu, Fei Wang, Feixiong Cheng, Yuan Luo

Abstract: As machine learning and artificial intelligence are more frequently being leveraged to tackle problems in the health sector, there has been increased interest in utilizing them in clinical decision-support. This has historically been the case in single modal data such as electronic health record data. Attempts to improve prediction and resemble the multimodal nature of clinical expert decision-making have been met in the computational field of machine learning by a fusion of disparate data. This review was conducted to summarize this field and identify topics ripe for future research. We conducted this review in accordance with the PRISMA (Preferred Reporting Items for Systematic reviews and Meta-Analyses) extension for Scoping Reviews to characterize multi-modal data fusion in health. We used a combination of content analysis and literature searches to establish search strings and databases of PubMed, Google Scholar, and IEEEXplore from 2011 to 2021. A final set of 125 articles was included in the analysis. The most common health areas utilizing multi-modal methods were neurology and oncology. However, there exists a wide breadth of current applications. The most common form of information fusion was early fusion. Notably, there was an improvement in predictive performance when performing heterogeneous data fusion. Lacking from the papers were clear clinical deployment strategies and pursuit of FDA-approved tools. These findings provide a map of the current literature on multimodal data fusion as applied to health diagnosis/prognosis problems. Multi-modal machine learning, while more robust in its estimations over unimodal methods, has drawbacks in its scalability and the time-consuming nature of information concatenation.

Submitted 10 April, 2022; originally announced April 2022.

arXiv:2204.03855 [pdf, other]

Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition

Authors: Qianying Liu, Zhuo Gong, Zhengdong Yang, Yuhang Yang, Sheng Li, Chenchen Ding, Nobuaki Minematsu, Hao Huang, Fei Cheng, Chenhui Chu, Sadao Kurohashi

Abstract: Low-resource speech recognition has long suffered from insufficient training data. In this paper, we propose an approach that leverages neighboring languages to improve low-resource scenario performance, founded on the hypothesis that similar linguistic units in neighboring languages exhibit comparable term frequency distributions, which enables us to construct a Huffman tree for performing multilingual hierarchical Softmax decoding. This hierarchical structure enables cross-lingual knowledge sharing among similar tokens, thereby enhancing low-resource training outcomes. Empirical analyses demonstrate that our method is effective in improving the accuracy and efficiency of low-resource speech recognition.

Submitted 30 April, 2023; v1 submitted 8 April, 2022; originally announced April 2022.

Comments: 7 pages, ICASSP 2023

arXiv:2204.01680 [pdf, other]

TALLFormer: Temporal Action Localization with a Long-memory Transformer

Authors: Feng Cheng, Gedas Bertasius

Abstract: Most modern approaches in temporal action localization divide this problem into two parts: (i) short-term feature extraction and (ii) long-range temporal boundary localization. Due to the high GPU memory cost caused by processing long untrimmed videos, many methods sacrifice the representational power of the short-term feature extractor by either freezing the backbone or using a small spatial video resolution. This issue becomes even worse with the recent video transformer models, many of which have quadratic memory complexity. To address these issues, we propose TALLFormer, a memory-efficient and end-to-end trainable Temporal Action Localization Transformer with Long-term memory. Our long-term memory mechanism eliminates the need for processing hundreds of redundant video frames during each training iteration, thus significantly reducing the GPU memory consumption and training time. These efficiency savings allow us (i) to use a powerful video transformer feature extractor without freezing the backbone or reducing the spatial video resolution, while (ii) also maintaining long-range temporal boundary localization capability. With only RGB frames as input and no external action recognition classifier, TALLFormer outperforms previous state-of-the-art methods by a large margin, achieving an average mAP of 59.1% on THUMOS14 and 35.6% on ActivityNet-1.3. The code is publicly available: https://github.com/klauscc/TALLFormer.

Submitted 26 July, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

Comments: Accepted by ECCV 2022

arXiv:2203.16755 [pdf, other]

Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models

Authors: Feng Cheng, Mingze Xu, Yuanjun Xiong, Hao Chen, Xinyu Li, Wei Li, Wei Xia

Abstract: We propose a memory-efficient method, named Stochastic Backpropagation (SBP), for training deep neural networks on videos. It is based on the finding that gradients from incomplete execution for backpropagation can still effectively train the models with minimal accuracy loss, which is attributed to the high redundancy of video. SBP keeps all forward paths but randomly and independently removes the backward paths for each network layer in each training step. It reduces the GPU memory cost by eliminating the need to cache activation values corresponding to the dropped backward paths, whose amount can be controlled by an adjustable keep-ratio. Experiments show that SBP can be applied to a wide range of models for video tasks, leading to up to 80.0% GPU memory saving and 10% training speedup with less than 1% accuracy drop on action recognition and temporal action detection.

Submitted 30 March, 2022; originally announced March 2022.

Comments: CVPR 2022 Oral

arXiv:2203.14476 [pdf]

A Novel Remote Sensing Approach to Recognize and Monitor Red Palm Weevil in Date Palm Trees

Authors: Yashu Kang, Chunlei Chen, Fujian Cheng, Jianyong Zhang

Abstract: The spread of the Red Palm Weevil (RPW) has become an existential threat for palm trees around the world. In the Middle East, RPW is causing wide-spread damage to date palm Phoenix dactylifera L., having both agricultural impacts on the palm production and environmental impacts. Early detection of RPW is very challenging, especially at large scale. This research proposes a novel remote sensing approach to recognize and monitor red palm weevil in date palm trees, using a combination of vegetation indices, object detection and semantic segmentation techniques. The study area consists of date palm trees with three classes, including healthy palms, smallish palms and severely infected palms. This proposed method achieved a promising 0.947 F1 score on the test data set. This work paves the way for deploying artificial intelligence approaches to monitor RPW at large scale as well as provide guidance for practitioners.

Submitted 27 March, 2022; originally announced March 2022.

arXiv:2203.08888 [pdf]

A Predicted Dearth of Majority Hypervolatile Ices in Oort Cloud Comets

Authors: C. M. Lisse, G. R. Gladstone, L. A. Young, D. P. Cruikshank, S. A. Sandford, B. Schmitt, S. A. Stern, H. A. Weaver, O. Umurhan, Y. J. Pendleton, J. T. Keane, J. M. Parker, R. P. Binzel, A. M. Earle, M. Horanyi, M. El-Maarry, A. F. Cheng, J. M. Moore, W. B. McKinnon, W. M. Grundy, J. J. Kavelaars, I. R. Linscott, W. Lyra, B. L. Lewis, D. T. Britt, et al. (8 additional authors not shown)

Abstract: We present new, ice species-specific New Horizons/Alice upper gas coma production limits from the 01 Jan 2019 MU69/Arrokoth flyby of Gladstone et al. (2021) and use them to make predictions about the rarity of majority hypervolatile (CO, N$_2$, CH$_4$) ices in KBOs and Oort Cloud comets. These predictions have a number of important implications for the study of the Oort Cloud, including: determination of hypervolatile rich comets as the first objects emplaced into the Oort Cloud; measurement of CO/N$_2$/CH$_4$ abundance ratios in the proto-planetary disk from hypervolatile rich comets; and population statistical constraints on early (< 20 Myr) planetary aggregation driven versus later (> 50 Myr) planetary migration driven emplacement of objects into the Oort Cloud. They imply that the phenomenon of ultra-distant active comets like C/2017K2 (Jewitt et al. 2017, Hui et al. 2018) should be rare, and thus not a general characteristic of all comets. They also suggest that interstellar object 2I/Borisov did not originate in a planetary system that was inordinately CO rich (Bodewits et al. 2020), but rather could have been ejected onto an interstellar trajectory very early in its natal system's history.

Submitted 2 May, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

Comments: 16 Pages, 2 Figures, 1 Table; accepted for Publication in PSJ 14-Mar-2022

arXiv:2203.02901 [pdf, other]

A Robust Framework of Chromosome Straightening with ViT-Patch GAN

Authors: Sifan Song, Jinfeng Wang, Fengrui Cheng, Qirui Cao, Yihan Zuo, Yongteng Lei, Ruomai Yang, Chunxiao Yang, Frans Coenen, Jia Meng, Kang Dang, Jionglong Su

Abstract: Chromosomes carry the genetic information of humans. They exhibit non-rigid and non-articulated nature with varying degrees of curvature. Chromosome straightening is an important step for subsequent karyotype construction, pathological diagnosis and cytogenetic map development. However, robust chromosome straightening remains challenging, due to the unavailability of training images, distorted chromosome details and shapes after straightening, as well as poor generalization capability. In this paper, we propose a novel architecture, ViT-Patch GAN, consisting of a self-learned motion transformation generator and a Vision Transformer-based patch (ViT-Patch) discriminator. The generator learns the motion representation of chromosomes for straightening. With the help of the ViT-Patch discriminator, the straightened chromosomes retain more shape and banding pattern details. The experimental results show that the proposed method achieves better performance on Fréchet Inception Distance (FID), Learned Perceptual Image Patch Similarity (LPIPS) and downstream chromosome classification accuracy, and shows excellent generalization capability on a large dataset.

Submitted 16 May, 2023; v1 submitted 6 March, 2022; originally announced March 2022.

Comments: Camera-ready version for IEEE ISBI2023

arXiv:2203.02676 [pdf, other]

ReGraph: Scaling Graph Processing on HBM-enabled FPGAs with Heterogeneous Pipelines

Authors: Xinyu Chen, Yao Chen, Feng Cheng, Hongshi Tan, Bingsheng He, Weng-Fai Wong

Abstract: The use of FPGAs for efficient graph processing has attracted significant interest. Recent memory subsystem upgrades including the introduction of HBM in FPGAs promise to further alleviate memory bottlenecks. However, modern multi-channel HBM requires many more processing pipelines to fully utilize its bandwidth potential. Existing designs do not scale well, resulting in underutilization of the HBM facilities even when all other resources are fully consumed. In this paper, we re-examined the graph processing workloads and found much diversity in processing. We also found that the diverse workloads can be easily classified into two types, namely dense and sparse partitions. This motivates us to propose a resource-efficient heterogeneous pipeline architecture. Our heterogeneous architecture comprises two types of pipelines: Little pipelines to process dense partitions with good locality and Big pipelines to process sparse partitions with extremely poor locality. Unlike traditional monolithic pipeline designs, the heterogeneous pipelines are tailored for more specific memory access patterns, and hence are more lightweight, allowing the architecture to scale up more effectively with limited resources. In addition, we propose a model-guided task scheduling method that schedules partitions to the right pipeline types, generates the most efficient pipeline combination and balances workloads. Furthermore, we develop an automated open-source framework, called ReGraph, which automates the entire development process. ReGraph outperforms state-of-the-art FPGA accelerators by up to 5.9 times in terms of performance and 12 times in terms of resource efficiency.

Submitted 5 March, 2022; originally announced March 2022.

arXiv:2202.05145 [pdf]

Deep learning for drug repurposing: methods, databases, and applications

Authors: Xiaoqin Pan, Xuan Lin, Dongsheng Cao, Xiangxiang Zeng, Philip S. Yu, Lifang He, Ruth Nussinov, Feixiong Cheng

Abstract: Drug development is time-consuming and expensive. Repurposing existing drugs for new therapies is an attractive solution that accelerates drug development at reduced experimental costs, specifically for Coronavirus Disease 2019 (COVID-19), an infectious disease caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). However, comprehensively obtaining and productively integrating available knowledge and big biomedical data to effectively advance deep learning models is still challenging for drug repurposing in other complex diseases. In this review, we introduce guidelines on how to utilize deep learning methodologies and tools for drug repurposing. We first summarize the commonly used bioinformatics and pharmacogenomics databases for drug repurposing. Next, we discuss recently developed sequence-based and graph-based representation approaches as well as state-of-the-art deep learning-based methods. Finally, we present applications of drug repurposing to fight the COVID-19 pandemic, and outline its future challenges.

Submitted 8 February, 2022; originally announced February 2022.

Comments: Accepted by WIREs Computational Molecular Science

arXiv:2202.04273 [pdf, other]

doi 10.3847/2041-8213/ac573d

Anomalous Flux in the Cosmic Optical Background Detected With New Horizons Observations

Authors: Tod R. Lauer, Marc Postman, John R. Spencer, Harold A. Weaver, S. Alan Stern, G. Randall Gladstone, Richard P. Binzel, Daniel T. Britt, Marc W. Buie, Bonnie J. Buratti, Andrew F. Cheng, W. M. Grundy, Mihaly Horányi, J. J. Kavelaars, Ivan R. Linscott, Carey M. Lisse, William B. McKinnon, Ralph L. McNutt, Jeffrey M. Moore, Jorge I. Núñez, Catherine B. Olkin, Joel W. Parker, Simon B. Porter, Dennis C. Reuter, Stuart J. Robbins, et al. (5 additional authors not shown)

Abstract: We used New Horizons LORRI images to measure the optical-band ($0.4\lesssim\lambda\lesssim0.9~{\rm \mu m}$) sky brightness within a high galactic-latitude field selected to have reduced diffuse scattered light from the Milky Way galaxy (DGL), as inferred from the IRIS all-sky $100~\mu$m map. We also selected the field to significantly reduce the scattered light from bright stars (SSL) outside the LORRI field. Suppression of DGL and SSL reduced the large uncertainties in the background flux levels present in our earlier New Horizons COB results. The raw total sky level, measured when New Horizons was 51.3 AU from the Sun, is $24.22\pm0.80{\rm ~nW ~m^{-2} ~sr^{-1}}.$ Isolating the COB contribution to the raw total required subtracting scattered light from bright stars and galaxies, faint stars below the photometric detection-limit within the field, and the hydrogen plus ionized-helium two-photon continua. This yielded a highly significant detection of the COB at ${\rm 16.37\pm 1.47 ~nW ~m^{-2} ~sr^{-1}}$ at the LORRI pivot wavelength of 0.608 $\mu$m. This result is in strong tension with the hypothesis that the COB only comprises the integrated light of external galaxies (IGL) presently known from deep HST counts. Subtraction of the estimated IGL flux from the total COB level leaves a flux component of unknown origin at ${\rm 8.06\pm1.92 ~nW ~m^{-2} ~sr^{-1}}.$ Its amplitude is equal to the IGL.

Submitted 20 February, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

Comments: Accepted for publication in the Astrophysical Journal Letters

arXiv:2201.12737 [pdf]

Enhancing Innate and Adaptive Immune Systems by Cold Atmospheric Plasma (CAP) and Its Antitumor Immunity

Authors: Fengdong Cheng, Dayun Yan, Jie Chen, Zi Wang, Alex Horkowitz, Michael Keidar, Eduardo M. Sotomayor

Abstract: Cold atmospheric plasma (CAP) is a near room temperature ionized gas, generated under non-equilibrium discharge conditions. Here we show that a short exposure of rat peritoneal exudate macrophages and T-cells to CAP in vitro triggered an inflammatory phenotype leading to better antigen-presenting and effector cell function respectively. Different from previous studies mainly using immortalized cell lines, both macrophage and T-cells in this study were primary cells isolated from mice. Furthermore, ex-vivo exposure of T-cells to CAP, followed by their adoptive transfer into tumor-bearing mice resulted in a strong antitumor effect in vivo. Mechanistically, CAP seems to disrupt tolerogenic pathways leading to enhanced production of pro-inflammatory cytokines while limiting the production of anti-inflammatory cytokines and the expression of inhibitory molecules such as programmed death-ligand 1 (PD-L1). CAP represents therefore a novel, non-toxic and easy to deliver technology to augment the function of immune cells and enhance antitumor responses when used as a component of T-cell adoptive immunotherapy strategies or, potentially, in combination with other cancer immunotherapeutic approaches.

Submitted 30 January, 2022; originally announced January 2022.

arXiv:2201.11024 [pdf]

doi 10.1016/j.actaastro.2021.11.030

Operating Spacecraft Around Comets: Evaluation of the Near-Nucleus Environment

Authors: C. M. Lisse, M. R. Combi, T. L. Farnham, N. Dello Russo, S. Sandford, A. F. Cheng, U. Fink, W. M. Harris, J. McMahon, D. J. Scheeres, H. A. Weaver, J. Leary

Abstract: We present a study of the current state of knowledge concerning spacecraft operations and potential hazards while operating near a comet nucleus. Starting from simple calculations comparing the cometary coma environment to benign conditions on Earth, we progress to sophisticated engineering models of spacecraft behavior, and then confront these models with recent spacecraft proximity operations experience. Finally, we make recommendations from lessons learned for future spacecraft missions that enter into orbit around a comet for long-term operations. All of these considerations indicate that, with a proper spacecraft design and operations planning, the near-nucleus environment can be a relatively safe region in which to operate, even for an active short period comet near perihelion with gas production rates as high as 1e29 molecules/s. With gas densities similar to those found in good laboratory vacuums, dust densities similar to Class 100 cleanrooms, dust particle velocities of 10s of m/s, and microgravity forces that permit slow and deliberate operations, the conditions around a comet are generally more benign than a typical day on Mars. Even in strong dust jets near the nucleus surface, dust densities tend to be only a few grains/cm3, about the same as in a typical interior room on Earth. Stochastic forces on a modern spacecraft with tens of square meters of projected surface area can be accounted for using modern Attitude Control Systems to within tens of meters navigation error; surface contamination issues are only important for spacecraft spending months to years within a few kilometers of the nucleus surface; and the issues the Rosetta spacecraft faced, confusion of celestial star trackers by sunlit dust particles flying past the spacecraft, will be addressed using the next generation of star trackers implementing improved transient rejection algorithms.

Submitted 26 January, 2022; originally announced January 2022.

Comments: 38 Pages, 15 Figures, 1 Table; accepted for publication in Acta Astronautica 25-Nov-2021

arXiv:2201.09772 [pdf, other]

In Defence of Visual Analytics Systems: Replies to Critics

Authors: Aoyu Wu, Dazhen Deng, Furui Cheng, Yingcai Wu, Shixia Liu, Huamin Qu

Abstract: The last decade has witnessed many visual analytics (VA) systems that make successful applications to wide-ranging domains like urban analytics and explainable AI. However, their research rigor and contributions have been extensively challenged within the visualization community. We come in defence of VA systems by contributing two interview studies for gathering criticisms and responses to those criticisms. First, we interview 24 researchers to collect criticisms from the review comments on their VA work. Through an iterative coding and refinement process, the interview feedback is summarized into a list of 36 common criticisms. Second, we interview 17 researchers to validate our list and collect their responses, thereby discussing implications for defending and improving the scientific values and rigor of VA systems. We highlight that the presented knowledge is deep, extensive, but also imperfect, provocative, and controversial, and thus recommend reading with an inclusive and critical eye. We hope our work can provide thoughts and foundations for conducting VA research and spark discussions to promote the research field forward more rigorously and vibrantly.

Submitted 5 August, 2022; v1 submitted 24 January, 2022; originally announced January 2022.

Comments: 9+2 pages, 4 figures. Accepted to IEEE VIS 2022

arXiv:2201.04868 [pdf, other]

Interactive Data Analysis with Next-step Natural Language Query Recommendation

Authors: Xingbo Wang, Furui Cheng, Yong Wang, Ke Xu, Jiang Long, Hong Lu, Huamin Qu

Abstract: Natural language interfaces (NLIs) provide users with a convenient way to interactively analyze data through natural language queries. Nevertheless, interactive data analysis is a demanding process, especially for novice data analysts. When exploring large and complex SQL databases from different domains, data analysts do not necessarily have sufficient knowledge about different data tables and application domains. It makes them unable to systematically elicit a series of topically-related and meaningful queries for insight discovery in target domains. We develop an NLI with a step-wise query recommendation module to assist users in choosing appropriate next-step exploration actions. The system adopts a data-driven approach to suggest semantically relevant and context-aware queries for application domains of users' interest based on their query logs. Also, the system helps users organize query histories and results into a dashboard to communicate the discovered data insights. With a comparative user study, we show that our system can facilitate a more effective and systematic data analysis process than a baseline without the recommendation module.

Submitted 1 November, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

Comments: 14 pages, 6 figures

arXiv:2201.04392 [pdf]

doi 10.1038/s41467-021-27811-6

A bimodal distribution of haze in Pluto's atmosphere

Authors: Siteng Fan, Peter Gao, Xi Zhang, Danica J. Adams, Nicholas W. Kutsop, Carver J. Bierson, Chao Liu, Jiani Yang, Leslie A. Young, Andrew F. Cheng, Yuk L. Yung

Abstract: Pluto, Titan, and Triton make up a unique class of solar system bodies, with icy surfaces and chemically reducing atmospheres rich in organic photochemistry and haze formation. Hazes play important roles in these atmospheres, with physical and chemical processes highly dependent on particle sizes, but the haze size distribution in reducing atmospheres is currently poorly understood. Here we report observational evidence that Pluto's haze particles are bimodally distributed, which successfully reproduces the full phase scattering observations from New Horizons. Combined with previous simulations of Titan's haze, this result suggests that haze particles in reducing atmospheres undergo rapid shape change near pressure levels ~0.5 Pa and favors a photochemical rather than a dynamical origin for the formation of Titan's detached haze. It also demonstrates that both oxidizing and reducing atmospheres can produce multi-modal hazes, and encourages reanalysis of observations of hazes on Titan and Triton.

Submitted 12 January, 2022; originally announced January 2022.

Comments: Published in Nature Communications, 26 pages, 12 figures

Journal ref: Nat Commun 13, 240 (2022)

arXiv:2112.06250 [pdf, other]

Boosting the Capability of Intelligent Vulnerability Detection by Training in a Human-Learning Manner

Authors: Shihan Dou, Yueming Wu, Wenxuan Li, Feng Cheng, Wei Yang, Yang Liu

Abstract: Due to its powerful automatic feature extraction, deep learning (DL) has been widely used in source code vulnerability detection. However, although it performs well on artificial datasets, its performance is not satisfactory when detecting real-world vulnerabilities due to the high complexity of real-world samples. In this paper, we propose to train DL-based vulnerability detection models in a human-learning manner, that is, start with the simplest samples and then gradually transition to difficult knowledge. Specifically, we design a novel framework (Humer) that can enhance the detection ability of DL-based vulnerability detectors. To validate the effectiveness of Humer, we select five state-of-the-art DL-based vulnerability detection models (TokenCNN, VulDeePecker, StatementGRU, ASTGRU, and Devign) to complete our evaluations. Through the results, we find that the use of Humer can increase the F1 of these models by an average of 10.5%. Moreover, Humer can make the model detect up to 16.7% more real-world vulnerabilities. Meanwhile, we also conduct a case study to uncover vulnerabilities from real-world open source products by using these enhanced DL-based vulnerability detectors. Through the results, we finally discover 281 unreported vulnerabilities in NVD, of which 98 have been silently patched by vendors in the latest version of corresponding products, but 159 still exist in the products.

Submitted 12 December, 2021; originally announced December 2021.

arXiv:2111.05805 [pdf, other]

Cross-lingual Adaption Model-Agnostic Meta-Learning for Natural Language Understanding

Authors: Qianying Liu, Fei Cheng, Sadao Kurohashi

Abstract: Meta learning with auxiliary languages has demonstrated promising improvements for cross-lingual natural language processing. However, previous studies sample the meta-training and meta-testing data from the same language, which limits the ability of the model for cross-lingual transfer. In this paper, we propose XLA-MAML, which performs direct cross-lingual adaption in the meta-learning stage. We conduct zero-shot and few-shot experiments on Natural Language Inference and Question Answering. The experimental results demonstrate the effectiveness of our method across different languages, tasks, and pretrained models. We also give analysis on various cross-lingual specific settings for meta-learning including sampling strategy and parallelism.

Submitted 10 November, 2021; originally announced November 2021.

Comments: 11 pages

arXiv:2111.04261 [pdf, other]

JaMIE: A Pipeline Japanese Medical Information Extraction System

Authors: Fei Cheng, Shuntaro Yada, Ribeka Tanaka, Eiji Aramaki, Sadao Kurohashi

Abstract: We present an open-access natural language processing toolkit for Japanese medical information extraction. We first propose a novel relation annotation schema for investigating the medical and temporal relations between medical entities in Japanese medical reports. We experiment with the practical annotation scenarios by separately annotating two different types of reports. We design a pipeline system with three components for recognizing medical entities, classifying entity modalities, and extracting relations. The empirical results show accurate analyzing performance and suggest the satisfactory annotation quality, the effective annotation strategy for targeting report types, and the superiority of the latest contextual embedding models.

Submitted 7 November, 2021; originally announced November 2021.

Comments: 8 pages

arXiv:2110.03529 [pdf]

Using Single-Trial Representational Similarity Analysis with EEG to track semantic similarity in emotional word processing

Authors: Feng Cheng

Abstract: Electroencephalography (EEG) is a powerful non-invasive brain imaging technique with a high temporal resolution that has seen extensive use across multiple areas of cognitive science research. This thesis adapts representational similarity analysis (RSA) to single-trial EEG datasets and introduces its principles to EEG researchers unfamiliar with multivariate analyses. We have two separate aims: 1. we want to explore the effectiveness of single-trial RSA on EEG datasets; 2. we want to utilize single-trial RSA and computational semantic models to investigate the role of semantic meaning in emotional word processing. We report two primary findings: 1. single-trial RSA on EEG datasets can produce meaningful and interpretable results given a high number of trials and subjects; 2. single-trial RSA reveals that emotional processing in the 500-800 ms time window is associated with additional semantic analysis.

Submitted 4 October, 2021; originally announced October 2021.

arXiv:2110.03165 [pdf, other]

Offline RL With Resource Constrained Online Deployment

Authors: Jayanth Reddy Regatti, Aniket Anand Deshmukh, Frank Cheng, Young Hun Jung, Abhishek Gupta, Urun Dogan

Abstract: Offline reinforcement learning is used to train policies in scenarios where real-time access to the environment is expensive or impossible. As a natural consequence of these harsh conditions, an agent may lack the resources to fully observe the online environment before taking an action. We dub this situation the resource-constrained setting. This leads to situations where the offline dataset (available for training) can contain fully processed features (using powerful language models, image models, complex sensors, etc.) which are not available when actions are actually taken online. This disconnect leads to an interesting and unexplored problem in offline RL: Is it possible to use a richly processed offline dataset to train a policy which has access to fewer features in the online environment? In this work, we introduce and formalize this novel resource-constrained problem setting. We highlight the performance gap between policies trained using the full offline dataset and policies trained using limited features. We address this performance gap with a policy transfer algorithm which first trains a teacher agent using the offline dataset where features are fully available, and then transfers this knowledge to a student agent that only uses the resource-constrained features. To better capture the challenge of this setting, we propose a data collection procedure: Resource Constrained-Datasets for RL (RC-D4RL). We evaluate our transfer algorithm on RC-D4RL and the popular D4RL benchmarks and observe consistent improvement over the baseline (TD3+BC without transfer). The code for the experiments is available at https://github.com/JayanthRR/RC-OfflineRL.

Submitted 7 December, 2021; v1 submitted 6 October, 2021; originally announced October 2021.

Comments: Added experiments on discrete control and real world datasets along with more analyses on continuous control tasks

arXiv:2109.07323 [pdf, other]

FORTAP: Using Formulas for Numerical-Reasoning-Aware Table Pretraining

Authors: Zhoujun Cheng, Haoyu Dong, Ran Jia, Pengfei Wu, Shi Han, Fan Cheng, Dongmei Zhang

Abstract: Tables store rich numerical data, but numerical reasoning over tables is still a challenge. In this paper, we find that the spreadsheet formula, which performs calculations on numerical values in tables, is naturally a strong supervision of numerical reasoning. More importantly, large amounts of spreadsheets with expert-made formulae are available on the web and can be obtained easily. FORTAP is the first method for numerical-reasoning-aware table pretraining by leveraging a large corpus of spreadsheet formulae. We design two formula pretraining tasks to explicitly guide FORTAP to learn numerical reference and calculation in semi-structured tables. FORTAP achieves state-of-the-art results on two representative downstream tasks, cell type classification and formula prediction, showing great potential of numerical-reasoning-aware pretraining.

Submitted 25 March, 2022; v1 submitted 15 September, 2021; originally announced September 2021.

Comments: Accepted by ACL'22 main track

arXiv:2109.03696 [pdf, other]

Entanglement Wedge Minimum Cross-Section in Holographic Axion Gravity Theories

Authors: Fang-Jing Cheng, Zhe Yang, Chao Niu, Cheng-Yong Zhang, Peng Liu

Abstract: We study the mixed state entanglement properties in two holographic axion models by examining the behavior of the entanglement wedge minimum cross section (EWCS), and comparing it with the holographic entanglement entropy (HEE) and mutual information (MI). We find that the behavior of HEE, MI and EWCS with Hawking temperature is monotonic, while the behavior with the axion parameter $k$ is richer and depends on the size of the configuration and the values of the other two parameters. Interestingly, the EWCS monotonically increases with the coupling constant $\kappa$ between the axion field and the Maxwell field, while HEE and MI can be non-monotonic. It suggests that the EWCS, as a mixed state entanglement measure, indeed captures distinct degrees of freedom from the HEE and MI. We also provide analytical understandings for most of the numerical results.

Submitted 8 February, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

Comments: 25 pages, 15 figures; refs added, writing improved

arXiv:2108.02550 [pdf, other]

doi 10.1109/TVCG.2021.3114836

VBridge: Connecting the Dots Between Features and Data to Explain Healthcare Models

Authors: Furui Cheng, Dongyu Liu, Fan Du, Yanna Lin, Alexandra Zytek, Haomin Li, Huamin Qu, Kalyan Veeramachaneni

Abstract: Machine learning (ML) is increasingly applied to Electronic Health Records (EHRs) to solve clinical prediction tasks. Although many ML models perform promisingly, issues with model transparency and interpretability limit their adoption in clinical practice. Directly using existing explainable ML techniques in clinical settings can be challenging. Through literature surveys and collaborations with six clinicians with an average of 17 years of clinical experience, we identified three key challenges, including clinicians' unfamiliarity with ML features, lack of contextual information, and the need for cohort-level evidence. Following an iterative design process, we further designed and developed VBridge, a visual analytics tool that seamlessly incorporates ML explanations into clinicians' decision-making workflow. The system includes a novel hierarchical display of contribution-based feature explanations and enriched interactions that connect the dots between ML features, explanations, and data. We demonstrated the effectiveness of VBridge through two case studies and expert interviews with four clinicians, showing that visually associating model explanations with patients' situational records can help clinicians better interpret and use model predictions when making clinical decisions. We further derived a list of design implications for developing future explainable ML tools to support clinical decision-making.

Submitted 22 September, 2021; v1 submitted 4 August, 2021; originally announced August 2021.

Comments: Accepted to IEEE VIS 2021, to appear in IEEE Transactions on Visualization and Computer Graphics

ACM Class: H.4.2; I.2.6; J.3

arXiv:2107.07996 [pdf, other]

doi 10.1016/j.icarus.2021.114624

The Excited Spin State of Dimorphos Resulting from the DART Impact

Authors: Harrison F. Agrusa, Ioannis Gkolias, Kleomenis Tsiganis, Derek C. Richardson, Alex J. Meyer, Daniel J. Scheeres, Matija Ćuk, Seth A. Jacobson, Patrick Michel, Özgür Karatekin, Andrew F. Cheng, Masatoshi Hirabayashi, Yun Zhang, Eugene G. Fahnestock, Alex B. Davis

Abstract: The NASA Double Asteroid Redirection Test (DART) mission is a planetary defense-driven test of a kinetic impactor on Dimorphos, the satellite of the binary asteroid 65803 Didymos. DART will intercept Dimorphos at a relative speed of ${\sim}6.5 \text{ km s}^{-1}$, perturbing Dimorphos's orbital velocity and changing the binary orbital period. We present three independent methods (one analytic and two numerical) to investigate the post-impact attitude stability of Dimorphos as a function of its axial ratios, $a/b$ and $b/c$ ($a \ge b \ge c$), and the momentum transfer efficiency $β$. The first method uses a novel analytic approach in which we assume a circular orbit and a point-mass primary that identifies four fundamental frequencies of motion corresponding to the secondary's mean motion, libration, precession, and nutation frequencies. At resonance locations among these four frequencies, we find that attitude instabilities are possible. Using two independent numerical codes, we recover many of the resonances predicted by the analytic model and indeed show attitude instability. With one code, we use fast Lyapunov indicators to show that the secondary's attitude can evolve chaotically near the resonance locations. Then, using a high-fidelity numerical model, we find that Dimorphos enters a chaotic tumbling state near the resonance locations and is especially prone to unstable rotation about its long axis, which can be confirmed by ESA's Hera mission arriving at Didymos in late 2026. We also show that a fully coupled treatment of the spin and orbital evolution of both bodies is crucial to accurately model the long-term evolution of the secondary's spin state and libration amplitude. Finally, we discuss the implications of a post-impact tumbling or rolling state, including the possibility of terminating BYORP evolution if Dimorphos is no longer in synchronous rotation.

Submitted 29 July, 2021; v1 submitted 16 July, 2021; originally announced July 2021.

Comments: 38 pages, 13 figures, Accepted for publication in Icarus

arXiv:2105.05535 [pdf, other]

OCHADAI-KYOTO at SemEval-2021 Task 1: Enhancing Model Generalization and Robustness for Lexical Complexity Prediction

Authors: Yuki Taya, Lis Kanashiro Pereira, Fei Cheng, Ichiro Kobayashi

Abstract: We propose an ensemble model for predicting the lexical complexity of words and multiword expressions (MWEs). The model receives as input a sentence with a target word or MWE and outputs its complexity score. Given that a key challenge with this task is the limited size of annotated data, our model relies on pretrained contextual representations from different state-of-the-art transformer-based language models (i.e., BERT and RoBERTa), and on a variety of training methods for further enhancing model generalization and robustness: multi-step fine-tuning and multi-task learning, and adversarial training. Additionally, we propose to enrich contextual representations by adding hand-crafted features during training. Our model achieved competitive results and ranked among the top-10 systems in both sub-tasks.

Submitted 15 June, 2021; v1 submitted 12 May, 2021; originally announced May 2021.

Showing 51–100 of 230 results for author: Cheng, F