Zum Hauptinhalt springen

Showing 1–50 of 74 results for author: Duan, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.17383  [pdf, other

    cs.LG cs.AI

    MoRe Fine-Tuning with 10x Fewer Parameters

    Authors: Wenxuan Tan, Nicholas Roberts, Tzu-Heng Huang, Jitian Zhao, John Cooper, Samuel Guo, Chengyu Duan, Frederic Sala

    Abstract: Parameter-efficient fine-tuning (PEFT) techniques have unlocked the potential to cheaply and easily specialize large pretrained models. However, the most prominent approaches, like low-rank adapters (LoRA), depend on heuristics or rules-of-thumb for their architectural choices -- potentially limiting their performance for new models and architectures. This limitation suggests that techniques from… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

  2. arXiv:2408.12840  [pdf, other

    cs.LG

    HGNAS: Hardware-Aware Graph Neural Architecture Search for Edge Devices

    Authors: Ao Zhou, Jianlei Yang, Yingjie Qi, Tong Qiao, Yumeng Shi, Cenlin Duan, Weisheng Zhao, Chunming Hu

    Abstract: Graph Neural Networks (GNNs) are becoming increasingly popular for graph-based learning tasks such as point cloud processing due to their state-of-the-art (SOTA) performance. Nevertheless, the research community has primarily focused on improving model expressiveness, lacking consideration of how to design efficient GNN models for edge scenarios with real-time requirements and limited resources. E… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

    Comments: Accepted by IEEE Transactions on Computers

  3. arXiv:2408.08533  [pdf, ps, other

    stat.ML cs.LG

    Unsupervised Transfer Learning via Adversarial Contrastive Training

    Authors: Chenguang Duan, Yuling Jiao, Huazhen Lin, Wensen Ma, Jerry Zhijian Yang

    Abstract: Learning a data representation for downstream supervised learning tasks under unlabeled scenario is both critical and challenging. In this paper, we propose a novel unsupervised transfer learning approach using adversarial contrastive training (ACT). Our experimental results demonstrate outstanding classification accuracy with both fine-tuned linear probe and K-NN protocol across various datasets,… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  4. arXiv:2407.08725  [pdf, other

    cs.CV cs.AI cs.RO

    MetaUrban: A Simulation Platform for Embodied AI in Urban Spaces

    Authors: Wayne Wu, Honglin He, Yiran Wang, Chenda Duan, Jack He, Zhizheng Liu, Quanyi Li, Bolei Zhou

    Abstract: Public urban spaces like streetscapes and plazas serve residents and accommodate social life in all its vibrant variations. Recent advances in Robotics and Embodied AI make public urban spaces no longer exclusive to humans. Food delivery bots and electric wheelchairs have started sharing sidewalks with pedestrians, while diverse robot dogs and humanoids have recently emerged in the street. Ensurin… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Technical report. Project page: https://metadriverse.github.io/metaurban/

  5. arXiv:2406.16976  [pdf, other

    cs.NE cs.AI cs.LG physics.chem-ph

    Efficient Evolutionary Search Over Chemical Space with Large Language Models

    Authors: Haorui Wang, Marta Skreta, Cher-Tian Ser, Wenhao Gao, Lingkai Kong, Felix Strieth-Kalthoff, Chenru Duan, Yuchen Zhuang, Yue Yu, Yanqiao Zhu, Yuanqi Du, Alán Aspuru-Guzik, Kirill Neklyudov, Chao Zhang

    Abstract: Molecular discovery, when formulated as an optimization problem, presents significant computational challenges because optimization objectives can be non-differentiable. Evolutionary Algorithms (EAs), often used to optimize black-box objectives in molecular discovery, traverse chemical space by performing random mutations and crossovers, leading to a large number of expensive objective evaluations… ▽ More

    Submitted 2 July, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  6. arXiv:2406.00894  [pdf, other

    cs.LG cs.AI cs.CL

    Pretrained Hybrids with MAD Skills

    Authors: Nicholas Roberts, Samuel Guo, Zhiqi Gao, Satya Sai Srinath Namburi GNVV, Sonia Cromp, Chengjun Wu, Chengyu Duan, Frederic Sala

    Abstract: While Transformers underpin modern large language models (LMs), there is a growing list of alternative architectures with new capabilities, promises, and tradeoffs. This makes choosing the right LM architecture challenging. Recently-proposed $\textit{hybrid architectures}$ seek a best-of-all-worlds approach that reaps the benefits of all architectures. Hybrid design is difficult for two reasons: i… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  7. arXiv:2405.05512  [pdf, other

    cs.LG cs.AI math.NA math.ST

    Characteristic Learning for Provable One Step Generation

    Authors: Zhao Ding, Chenguang Duan, Yuling Jiao, Ruoxuan Li, Jerry Zhijian Yang, Pingwen Zhang

    Abstract: We propose the characteristic generator, a novel one-step generative model that combines the efficiency of sampling in Generative Adversarial Networks (GANs) with the stable performance of flow-based models. Our model is driven by characteristics, along which the probability density transport can be described by ordinary differential equations (ODEs). Specifically, We estimate the velocity field t… ▽ More

    Submitted 16 July, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  8. arXiv:2405.03987  [pdf, other

    cs.LG physics.chem-ph

    Navigating Chemical Space with Latent Flows

    Authors: Guanghao Wei, Yining Huang, Chenru Duan, Yue Song, Yuanqi Du

    Abstract: Recent progress of deep generative models in the vision and language domain has stimulated significant interest in more structured data generation such as molecules. However, beyond generating new random molecules, efficient exploration and a comprehensive understanding of the vast chemical space are of great importance to molecular science and applications in drug design and materials discovery.… ▽ More

    Submitted 7 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  9. arXiv:2404.17398  [pdf, other

    stat.ML cs.LG

    Online Policy Learning and Inference by Matrix Completion

    Authors: Congyuan Duan, Jingyang Li, Dong Xia

    Abstract: Making online decisions can be challenging when features are sparse and orthogonal to historical ones, especially when the optimal policy is learned through collaborative filtering. We formulate the problem as a matrix completion bandit (MCB), where the expected reward under each arm is characterized by an unknown low-rank matrix. The $ε$-greedy bandit and the online gradient descent algorithm are… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  10. arXiv:2404.13430  [pdf, other

    physics.chem-ph cs.LG

    React-OT: Optimal Transport for Generating Transition State in Chemical Reactions

    Authors: Chenru Duan, Guan-Horng Liu, Yuanqi Du, Tianrong Chen, Qiyuan Zhao, Haojun Jia, Carla P. Gomes, Evangelos A. Theodorou, Heather J. Kulik

    Abstract: Transition states (TSs) are transient structures that are key in understanding reaction mechanisms and designing catalysts but challenging to be captured in experiments. Alternatively, many optimization algorithms have been developed to search for TSs computationally. Yet the cost of these algorithms driven by quantum chemistry methods (usually density functional theory) is still high, posing chal… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 5 figures, 1 table

  11. Multi-view X-ray Image Synthesis with Multiple Domain Disentanglement from CT Scans

    Authors: Lixing Tan, Shuang Song, Kangneng Zhou, Chengbo Duan, Lanying Wang, Huayang Ren, Linlin Liu, Wei Zhang, Ruoxiu Xiao

    Abstract: X-ray images play a vital role in the intraoperative processes due to their high resolution and fast imaging speed and greatly promote the subsequent segmentation, registration and reconstruction. However, over-dosed X-rays superimpose potential risks to human health to some extent. Data-driven algorithms from volume scans to X-ray images are restricted by the scarcity of paired X-ray and volume d… ▽ More

    Submitted 30 July, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: 13 pages, 10 figures, ACM MM2024

  12. arXiv:2404.09497  [pdf, other

    cs.AR

    Towards Efficient SRAM-PIM Architecture Design by Exploiting Unstructured Bit-Level Sparsity

    Authors: Cenlin Duan, Jianlei Yang, Yiou Wang, Yikun Wang, Yingjie Qi, Xiaolin He, Bonan Yan, Xueyan Wang, Xiaotao Jia, Weisheng Zhao

    Abstract: Bit-level sparsity in neural network models harbors immense untapped potential. Eliminating redundant calculations of randomly distributed zero-bits significantly boosts computational efficiency. Yet, traditional digital SRAM-PIM architecture, limited by rigid crossbar architecture, struggles to effectively exploit this unstructured sparsity. To address this challenge, we propose Dyadic Block PIM… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted by DAC'24

  13. arXiv:2403.00303  [pdf, other

    cs.CV

    ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting

    Authors: Chen Duan, Pei Fu, Shan Guo, Qianyi Jiang, Xiaoming Wei

    Abstract: In recent years, text-image joint pre-training techniques have shown promising results in various tasks. However, in Optical Character Recognition (OCR) tasks, aligning text instances with their corresponding text regions in images poses a challenge, as it requires effective alignment between text and OCR-Text (referring to the text in images as OCR-Text to distinguish from the text in natural lan… ▽ More

    Submitted 17 April, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024

  14. arXiv:2402.16901  [pdf, other

    q-bio.GN cs.AI cs.LG

    FGBERT: Function-Driven Pre-trained Gene Language Model for Metagenomics

    Authors: ChenRui Duan, Zelin Zang, Yongjie Xu, Hang He, Zihan Liu, Zijia Song, Ju-Sheng Zheng, Stan Z. Li

    Abstract: Metagenomic data, comprising mixed multi-species genomes, are prevalent in diverse environments like oceans and soils, significantly impacting human health and ecological functions. However, current research relies on K-mer representations, limiting the capture of structurally relevant gene contexts. To address these limitations and further our understanding of complex relationships between metage… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  15. arXiv:2402.01115  [pdf, other

    cs.CL eess.SP

    Interpretation of Intracardiac Electrograms Through Textual Representations

    Authors: William Jongwon Han, Diana Gomez, Avi Alok, Chaojing Duan, Michael A. Rosenberg, Douglas Weber, Emerson Liu, Ding Zhao

    Abstract: Understanding the irregular electrical activity of atrial fibrillation (AFib) has been a key challenge in electrocardiography. For serious cases of AFib, catheter ablations are performed to collect intracardiac electrograms (EGMs). EGMs offer intricately detailed and localized electrical activity of the heart and are an ideal modality for interpretable cardiac studies. Recent advancements in artif… ▽ More

    Submitted 11 April, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 18 pages, 9 figures; Accepted to CHIL 2024

    ACM Class: I.2.7; J.3

  16. arXiv:2401.07543  [pdf, other

    cs.CE cs.AI

    Must: Maximizing Latent Capacity of Spatial Transcriptomics Data

    Authors: Zelin Zang, Liangyu Li, Yongjie Xu, Chenrui Duan, Kai Wang, Yang You, Yi Sun, Stan Z. Li

    Abstract: Spatial transcriptomics (ST) technologies have revolutionized the study of gene expression patterns in tissues by providing multimodality data in transcriptomic, spatial, and morphological, offering opportunities for understanding tissue biology beyond transcriptomics. However, we identify the modality bias phenomenon in ST data species, i.e., the inconsistent contribution of different modalities… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 30 pages and 6 figures, plus 27 pages and 14 figures in appendices

  17. arXiv:2401.04535  [pdf, other

    stat.ML cs.LG

    Semi-Supervised Deep Sobolev Regression: Estimation, Variable Selection and Beyond

    Authors: Zhao Ding, Chenguang Duan, Yuling Jiao, Jerry Zhijian Yang

    Abstract: We propose SDORE, a semi-supervised deep Sobolev regressor, for the nonparametric estimation of the underlying regression function and its gradient. SDORE employs deep neural networks to minimize empirical risk with gradient norm regularization, allowing computation of the gradient norm on unlabeled data. We conduct a comprehensive analysis of the convergence rates of SDORE and establish a minimax… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    MSC Class: 62G05; 62G08; 65N21

  18. arXiv:2310.20424  [pdf, other

    cs.AR cs.LG

    DDC-PIM: Efficient Algorithm/Architecture Co-design for Doubling Data Capacity of SRAM-based Processing-In-Memory

    Authors: Cenlin Duan, Jianlei Yang, Xiaolin He, Yingjie Qi, Yikun Wang, Yiou Wang, Ziyan He, Bonan Yan, Xueyan Wang, Xiaotao Jia, Weitao Pan, Weisheng Zhao

    Abstract: Processing-in-memory (PIM), as a novel computing paradigm, provides significant performance benefits from the aspect of effective data movement reduction. SRAM-based PIM has been demonstrated as one of the most promising candidates due to its endurance and compatibility. However, the integration density of SRAM-based PIM is much lower than other non-volatile memory-based ones, due to its inherent… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: 14 pages, to be published in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD)

  19. arXiv:2310.18917  [pdf, other

    cs.CV

    TivNe-SLAM: Dynamic Mapping and Tracking via Time-Varying Neural Radiance Fields

    Authors: Chengyao Duan, Zhiliu Yang

    Abstract: Previous attempts to integrate Neural Radiance Fields (NeRF) into the Simultaneous Localization and Mapping (SLAM) framework either rely on the assumption of static scenes or require the ground truth camera poses, which impedes their application in real-world scenarios. In this paper, we propose a time-varying representation to track and reconstruct the dynamic scenes. Firstly, two processes, trac… ▽ More

    Submitted 17 March, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

  20. Enhancing Cross-Dataset Performance of Distracted Driving Detection With Score-Softmax Classifier

    Authors: Cong Duan, Zixuan Liu, Jiahao Xia, Minghai Zhang, Jiacai Liao, Libo Cao

    Abstract: Deep neural networks enable real-time monitoring of in-vehicle driver, facilitating the timely prediction of distractions, fatigue, and potential hazards. This technology is now integral to intelligent transportation systems. Recent research has exposed unreliable cross-dataset end-to-end driver behavior recognition due to overfitting, often referred to as ``shortcut learning", resulting from limi… ▽ More

    Submitted 20 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  21. arXiv:2308.11257  [pdf, other

    cs.CL

    HopPG: Self-Iterative Program Generation for Multi-Hop Question Answering over Heterogeneous Knowledge

    Authors: Yingyao Wang, Yongwei Zhou, Chaoqun Duan, Junwei Bao, Tiejun Zhao

    Abstract: The semantic parsing-based method is an important research branch for knowledge-based question answering. It usually generates executable programs lean upon the question and then conduct them to reason answers over a knowledge base. Benefit from this inherent mechanism, it has advantages in the performance and the interpretability. However, traditional semantic parsing methods usually generate a c… ▽ More

    Submitted 10 September, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

  22. arXiv:2307.09748  [pdf, other

    cs.CV

    Watch out Venomous Snake Species: A Solution to SnakeCLEF2023

    Authors: Feiran Hu, Peng Wang, Yangyang Li, Chenlong Duan, Zijian Zhu, Fei Wang, Faen Zhang, Yong Li, Xiu-Shen Wei

    Abstract: The SnakeCLEF2023 competition aims to the development of advanced algorithms for snake species identification through the analysis of images and accompanying metadata. This paper presents a method leveraging utilization of both images and metadata. Modern CNN models and strong data augmentation are utilized to learn better representation of images. To relieve the challenge of long-tailed distribut… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: This work was the winner solution of the SnakeCLEF2023 challenge

  23. arXiv:2307.05378  [pdf, other

    cond-mat.mtrl-sci cs.LG

    M$^2$Hub: Unlocking the Potential of Machine Learning for Materials Discovery

    Authors: Yuanqi Du, Yingheng Wang, Yining Huang, Jianan Canal Li, Yanqiao Zhu, Tian Xie, Chenru Duan, John M. Gregoire, Carla P. Gomes

    Abstract: We introduce M$^2$Hub, a toolkit for advancing machine learning in materials discovery. Machine learning has achieved remarkable progress in modeling molecular structures, especially biomolecules for drug discovery. However, the development of machine learning approaches for modeling materials structures lag behind, which is partly due to the lack of an integrated platform that enables access to d… ▽ More

    Submitted 14 June, 2023; originally announced July 2023.

  24. arXiv:2306.13881  [pdf, other

    math.NA cs.AI cs.LG

    Current density impedance imaging with PINNs

    Authors: Chenguang Duan, Yuling Jiao, Xiliang Lu, Jerry Zhijian Yang

    Abstract: In this paper, we introduce CDII-PINNs, a computationally efficient method for solving CDII using PINNs in the framework of Tikhonov regularization. This method constructs a physics-informed loss function by merging the regularized least-squares output functional with an underlying differential equation, which describes the relationship between the conductivity and voltage. A pair of neural networ… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

  25. arXiv:2306.12241  [pdf, other

    cs.RO

    ScenarioNet: Open-Source Platform for Large-Scale Traffic Scenario Simulation and Modeling

    Authors: Quanyi Li, Zhenghao Peng, Lan Feng, Zhizheng Liu, Chenda Duan, Wenjie Mo, Bolei Zhou

    Abstract: Large-scale driving datasets such as Waymo Open Dataset and nuScenes substantially accelerate autonomous driving research, especially for perception tasks such as 3D detection and trajectory forecasting. Since the driving logs in these datasets contain HD maps and detailed object annotations which accurately reflect the real-world complexity of traffic behaviors, we can harvest a massive number of… ▽ More

    Submitted 30 October, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

  26. Physics-constrained Attack against Convolution-based Human Motion Prediction

    Authors: Chengxu Duan, Zhicheng Zhang, Xiaoli Liu, Yonghao Dang, Jianqin Yin

    Abstract: Human motion prediction has achieved a brilliant performance with the help of convolution-based neural networks. However, currently, there is no work evaluating the potential risk in human motion prediction when facing adversarial attacks. The adversarial attack will encounter problems against human motion prediction in naturalness and data scale. To solve the problems above, we propose a new adve… ▽ More

    Submitted 14 January, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

  27. arXiv:2306.11977  [pdf

    eess.IV cs.CV

    Encoding Enhanced Complex CNN for Accurate and Highly Accelerated MRI

    Authors: Zimeng Li, Sa Xiao, Cheng Wang, Haidong Li, Xiuchao Zhao, Caohui Duan, Qian Zhou, Qiuchen Rao, Yuan Fang, Junshuai Xie, Lei Shi, Fumin Guo, Chaohui Ye, Xin Zhou

    Abstract: Magnetic resonance imaging (MRI) using hyperpolarized noble gases provides a way to visualize the structure and function of human lung, but the long imaging time limits its broad research and clinical applications. Deep learning has demonstrated great potential for accelerating MRI by reconstructing images from undersampled data. However, most existing deep conventional neural networks (CNN) direc… ▽ More

    Submitted 13 November, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

  28. arXiv:2306.09375  [pdf, other

    cs.LG physics.chem-ph q-bio.QM

    Symmetry-Informed Geometric Representation for Molecules, Proteins, and Crystalline Materials

    Authors: Shengchao Liu, Weitao Du, Yanjing Li, Zhuoxinran Li, Zhiling Zheng, Chenru Duan, Zhiming Ma, Omar Yaghi, Anima Anandkumar, Christian Borgs, Jennifer Chayes, Hongyu Guo, Jian Tang

    Abstract: Artificial intelligence for scientific discovery has recently generated significant interest within the machine learning and scientific communities, particularly in the domains of chemistry, biology, and material discovery. For these scientific problems, molecules serve as the fundamental building blocks, and machine learning has emerged as a highly effective and powerful tool for modeling their g… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  29. arXiv:2305.15241  [pdf, other

    cs.CV cs.CR cs.LG

    Robust Classification via a Single Diffusion Model

    Authors: Huanran Chen, Yinpeng Dong, Zhengyi Wang, Xiao Yang, Chengqi Duan, Hang Su, Jun Zhu

    Abstract: Diffusion models have been applied to improve adversarial robustness of image classifiers by purifying the adversarial noises or generating realistic data for adversarial training. However, diffusion-based purification can be evaded by stronger adaptive attacks while adversarial training does not perform well under unseen threats, exhibiting inevitable limitations of these methods. To better harne… ▽ More

    Submitted 21 May, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted by ICML 2024

    Journal ref: ICML 2024

  30. arXiv:2304.06286  [pdf, other

    eess.SP cs.CV

    Automated Cardiovascular Record Retrieval by Multimodal Learning between Electrocardiogram and Clinical Report

    Authors: Jielin Qiu, Jiacheng Zhu, Shiqi Liu, William Han, Jingqi Zhang, Chaojing Duan, Michael Rosenberg, Emerson Liu, Douglas Weber, Ding Zhao

    Abstract: Automated interpretation of electrocardiograms (ECG) has garnered significant attention with the advancements in machine learning methodologies. Despite the growing interest, most current studies focus solely on classification or regression tasks, which overlook a crucial aspect of clinical cardio-disease diagnosis: the diagnostic report generated by experienced human clinicians. In this paper, we… ▽ More

    Submitted 6 November, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: Accepted to the ML4H 2023 Proceedings track

  31. arXiv:2304.06174  [pdf, other

    physics.chem-ph cs.LG

    Accurate transition state generation with an object-aware equivariant elementary reaction diffusion model

    Authors: Chenru Duan, Yuanqi Du, Haojun Jia, Heather J. Kulik

    Abstract: Transition state (TS) search is key in chemistry for elucidating reaction mechanisms and exploring reaction networks. The search for accurate 3D TS structures, however, requires numerous computationally intensive quantum chemistry calculations due to the complexity of potential energy surfaces. Here, we developed an object-aware SE(3) equivariant diffusion model that satisfies all physical symmetr… ▽ More

    Submitted 30 October, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Comments: 5 figures and 1 table

  32. arXiv:2212.03412  [pdf, other

    cs.CR cs.AI cs.CV cs.LG

    Artificial Intelligence Security Competition (AISC)

    Authors: Yinpeng Dong, Peng Chen, Senyou Deng, Lianji L, Yi Sun, Hanyu Zhao, Jiaxing Li, Yunteng Tan, Xinyu Liu, Yangyi Dong, Enhui Xu, Jincai Xu, Shu Xu, Xuelin Fu, Changfeng Sun, Haoliang Han, Xuchong Zhang, Shen Chen, Zhimin Sun, Junyi Cao, Taiping Yao, Shouhong Ding, Yu Wu, Jian Lin, Tianpeng Wu , et al. (27 additional authors not shown)

    Abstract: The security of artificial intelligence (AI) is an important research area towards safe, reliable, and trustworthy AI systems. To accelerate the research on AI security, the Artificial Intelligence Security Competition (AISC) was organized by the Zhongguancun Laboratory, China Industrial Control Systems Cyber Emergency Response Team, Institute for Artificial Intelligence, Tsinghua University, and… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: Technical report of AISC

  33. arXiv:2210.14191  [pdf

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    A Database of Ultrastable MOFs Reassembled from Stable Fragments with Machine Learning Models

    Authors: Aditya Nandy, Shuwen Yue, Changhwan Oh, Chenru Duan, Gianmarco G. Terrones, Yongchul G. Chung, Heather J. Kulik

    Abstract: High-throughput screening of large hypothetical databases of metal-organic frameworks (MOFs) can uncover new materials, but their stability in real-world applications is often unknown. We leverage community knowledge and machine learning (ML) models to identify MOFs that are thermally stable and stable upon activation. We separate these MOFs into their building blocks and recombine them to make a… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

  34. arXiv:2210.10350  [pdf, other

    cs.CL

    MuGER$^2$: Multi-Granularity Evidence Retrieval and Reasoning for Hybrid Question Answering

    Authors: Yingyao Wang, Junwei Bao, Chaoqun Duan, Youzheng Wu, Xiaodong He, Tiejun Zhao

    Abstract: Hybrid question answering (HQA) aims to answer questions over heterogeneous data, including tables and passages linked to table cells. The heterogeneous data can provide different granularity evidence to HQA models, e.t., column, row, cell, and link. Conventional HQA models usually retrieve coarse- or fine-grained evidence to reason the answer. Through comparison, we find that coarse-grained evide… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP(Findings) 2022

  35. arXiv:2210.08249  [pdf, other

    cs.CL

    UniRPG: Unified Discrete Reasoning over Table and Text as Program Generation

    Authors: Yongwei Zhou, Junwei Bao, Chaoqun Duan, Youzheng Wu, Xiaodong He, Tiejun Zhao

    Abstract: Question answering requiring discrete reasoning, e.g., arithmetic computing, comparison, and counting, over knowledge is a challenging task. In this paper, we propose UniRPG, a semantic-parsing-based approach advanced in interpretability and scalability, to perform unified discrete reasoning over heterogeneous knowledge resources, i.e., table and text, as program generation. Concretely, UniRPG con… ▽ More

    Submitted 15 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022

  36. arXiv:2209.08595  [pdf

    physics.chem-ph cond-mat.mtrl-sci cs.LG

    Low-cost machine learning approach to the prediction of transition metal phosphor excited state properties

    Authors: Gianmarco Terrones, Chenru Duan, Aditya Nandy, Heather J. Kulik

    Abstract: Photoactive iridium complexes are of broad interest due to their applications ranging from lighting to photocatalysis. However, the excited state property prediction of these complexes challenges ab initio methods such as time-dependent density functional theory (TDDFT) both from an accuracy and a computational cost perspective, complicating high throughput virtual screening (HTVS). We instead lev… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

  37. arXiv:2208.14657  [pdf, other

    cs.CV cs.MM

    EViT: Privacy-Preserving Image Retrieval via Encrypted Vision Transformer in Cloud Computing

    Authors: Qihua Feng, Peiya Li, Zhixun Lu, Chaozhuo Li, Zefang Wang, Zhiquan Liu, Chunhui Duan, Feiran Huang

    Abstract: Image retrieval systems help users to browse and search among extensive images in real-time. With the rise of cloud computing, retrieval tasks are usually outsourced to cloud servers. However, the cloud scenario brings a daunting challenge of privacy protection as cloud servers cannot be fully trusted. To this end, image-encryption-based privacy-preserving image retrieval schemes have been develop… ▽ More

    Submitted 31 August, 2022; originally announced August 2022.

    Comments: 29 pages

  38. arXiv:2208.05444  [pdf

    physics.chem-ph cond-mat.mtrl-sci cs.LG q-bio.BM

    Active Learning Exploration of Transition Metal Complexes to Discover Method-Insensitive and Synthetically Accessible Chromophores

    Authors: Chenru Duan, Aditya Nandy, Gianmarco Terrones, David W. Kastner, Heather J. Kulik

    Abstract: Transition metal chromophores with earth-abundant transition metals are an important design target for their applications in lighting and non-toxic bioimaging, but their design is challenged by the scarcity of complexes that simultaneously have optimal target absorption energies in the visible region as well as well-defined ground states. Machine learning (ML) accelerated discovery could overcome… ▽ More

    Submitted 15 September, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

  39. arXiv:2207.10747  [pdf

    physics.chem-ph cond-mat.mtrl-sci cs.LG

    A Transferable Recommender Approach for Selecting the Best Density Functional Approximations in Chemical Discovery

    Authors: Chenru Duan, Aditya Nandy, Ralf Meyer, Naveen Arunachalam, Heather J. Kulik

    Abstract: Approximate density functional theory (DFT) has become indispensable owing to its cost-accuracy trade-off in comparison to more computationally demanding but accurate correlated wavefunction theory. To date, however, no single density functional approximation (DFA) with universal accuracy has been identified, leading to uncertainty in the quality of data generated from DFT. With electron density f… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

  40. arXiv:2205.02967  [pdf

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    Putting Density Functional Theory to the Test in Machine-Learning-Accelerated Materials Discovery

    Authors: Chenru Duan, Fang Liu, Aditya Nandy, Heather J. Kulik

    Abstract: Accelerated discovery with machine learning (ML) has begun to provide the advances in efficiency needed to overcome the combinatorial challenge of computational materials design. Nevertheless, ML-accelerated discovery both inherits the biases of training data derived from density functional theory (DFT) and leads to many attempted calculations that are doomed to fail. Many compelling functional ma… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

    Journal ref: Journal of Physical Chemistry Letters, 2021, 12, 19, 4628-4637

  41. arXiv:2205.02879  [pdf

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    Exploiting Ligand Additivity for Transferable Machine Learning of Multireference Character Across Known Transition Metal Complex Ligands

    Authors: Chenru Duan, Adriana J. Ladera, Julian C. -L. Liu, Michael G. Taylor, Isuru R. Ariyarathna, Heather J. Kulik

    Abstract: Accurate virtual high-throughput screening (VHTS) of transition metal complexes (TMCs) remains challenging due to the possibility of high multi-reference (MR) character that complicates property evaluation. We compute MR diagnostics for over 5,000 ligands present in previously synthesized transition metal complexes in the Cambridge Structural Database (CSD). To accomplish this task, we introduce a… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

  42. arXiv:2205.02550  [pdf, other

    cs.CL

    LUNA: Learning Slot-Turn Alignment for Dialogue State Tracking

    Authors: Yifan Wang, Jing Zhao, Junwei Bao, Chaoqun Duan, Youzheng Wu, Xiaodong He

    Abstract: Dialogue state tracking (DST) aims to predict the current dialogue state given the dialogue history. Existing methods generally exploit the utterances of all dialogue turns to assign value for each slot. This could lead to suboptimal results due to the information introduced from irrelevant utterances in the dialogue history, which may be useless and can even cause confusion. To address this probl… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

    Comments: Accepted to NAACL 2022

  43. arXiv:2204.14166  [pdf, other

    cs.CL

    OPERA:Operation-Pivoted Discrete Reasoning over Text

    Authors: Yongwei Zhou, Junwei Bao, Chaoqun Duan, Haipeng Sun, Jiahui Liang, Yifan Wang, Jing Zhao, Youzheng Wu, Xiaodong He, Tiejun Zhao

    Abstract: Machine reading comprehension (MRC) that requires discrete reasoning involving symbolic operations, e.g., addition, sorting, and counting, is a challenging task. According to this nature, semantic parsing-based methods predict interpretable but complex logical forms. However, logical form generation is nontrivial and even a little perturbation in a logical form will lead to wrong answers. To allev… ▽ More

    Submitted 3 May, 2022; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: Accepted to NAACL 2022

  44. arXiv:2203.01276  [pdf

    physics.chem-ph cond-mat.mtrl-sci cs.LG

    Machine learning models predict calculation outcomes with the transferability necessary for computational catalysis

    Authors: Chenru Duan, Aditya Nandy, Husain Adamji, Yuriy Roman-Leshkov, Heather J. Kulik

    Abstract: Virtual high throughput screening (VHTS) and machine learning (ML) have greatly accelerated the design of single-site transition-metal catalysts. VHTS of catalysts, however, is often accompanied with high calculation failure rate and wasted computational resources due to the difficulty of simultaneously converging all mechanistically relevant reactive intermediates to expected geometries and elect… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

  45. arXiv:2201.04243  [pdf

    physics.chem-ph cond-mat.mtrl-sci cs.LG

    Two Wrongs Can Make a Right: A Transfer Learning Approach for Chemical Discovery with Chemical Accuracy

    Authors: Chenru Duan, Daniel B. K. Chu, Aditya Nandy, Heather J. Kulik

    Abstract: Appropriately identifying and treating molecules and materials with significant multi-reference (MR) character is crucial for achieving high data fidelity in virtual high throughput screening (VHTS). Nevertheless, most VHTS is carried out with approximate density functional theory (DFT) using a single functional. Despite development of numerous MR diagnostics, the extent to which a single value of… ▽ More

    Submitted 11 January, 2022; originally announced January 2022.

  46. arXiv:2111.01905  [pdf

    physics.chem-ph cond-mat.mtrl-sci cs.LG

    Audacity of huge: overcoming challenges of data scarcity and data quality for machine learning in computational materials discovery

    Authors: Aditya Nandy, Chenru Duan, Heather J. Kulik

    Abstract: Machine learning (ML)-accelerated discovery requires large amounts of high-fidelity data to reveal predictive structure-property relationships. For many properties of interest in materials discovery, the challenging nature and high cost of data generation has resulted in a data landscape that is both scarcely populated and of dubious quality. Data-driven techniques starting to overcome these limit… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

  47. arXiv:2111.00203  [pdf, other

    cs.CV cs.GR cs.MM

    Imitating Arbitrary Talking Style for Realistic Audio-DrivenTalking Face Synthesis

    Authors: Haozhe Wu, Jia Jia, Haoyu Wang, Yishun Dou, Chao Duan, Qingshan Deng

    Abstract: People talk with diversified styles. For one piece of speech, different talking styles exhibit significant differences in the facial and head pose movements. For example, the "excited" style usually talks with the mouth wide open, while the "solemn" style is more standardized and seldomly exhibits exaggerated motions. Due to such huge differences between different styles, it is necessary to incorp… ▽ More

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: Accepted by MM2021, code available at https://github.com/wuhaozhe/style_avatar

    ACM Class: I.1.4

  48. UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene Imagery

    Authors: Libo Wang, Rui Li, Ce Zhang, Shenghui Fang, Chenxi Duan, Xiaoliang Meng, Peter M. Atkinson

    Abstract: Semantic segmentation of remotely sensed urban scene images is required in a wide range of practical applications, such as land cover mapping, urban change detection, environmental protection, and economic assessment.Driven by rapid developments in deep learning technologies, the convolutional neural network (CNN) has dominated semantic segmentation for many years. CNN adopts hierarchical feature… ▽ More

    Submitted 26 June, 2022; v1 submitted 18 September, 2021; originally announced September 2021.

    Comments: Accepted by ISPRS

    Journal ref: journal = {ISPRS Journal of Photogrammetry and Remote Sensing},volume = {190},pages = {196-214},year = {2022},issn = {0924-2716},

  49. arXiv:2109.08098  [pdf

    cond-mat.mtrl-sci cs.LG physics.chem-ph

    MOFSimplify: Machine Learning Models with Extracted Stability Data of Three Thousand Metal-Organic Frameworks

    Authors: A. Nandy, G. Terrones, N. Arunachalam, C. Duan, D. W. Kastner, H. J. Kulik

    Abstract: We report a workflow and the output of a natural language processing (NLP)-based procedure to mine the extant metal-organic framework (MOF) literature describing structurally characterized MOFs and their solvent removal and thermal stabilities. We obtain over 2,000 solvent removal stability measures from text mining and 3,000 thermal decomposition temperatures from thermogravimetric analysis data.… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

  50. arXiv:2107.08766  [pdf, other

    cs.CV

    VisDrone-CC2020: The Vision Meets Drone Crowd Counting Challenge Results

    Authors: Dawei Du, Longyin Wen, Pengfei Zhu, Heng Fan, Qinghua Hu, Haibin Ling, Mubarak Shah, Junwen Pan, Ali Al-Ali, Amr Mohamed, Bakour Imene, Bin Dong, Binyu Zhang, Bouchali Hadia Nesma, Chenfeng Xu, Chenzhen Duan, Ciro Castiello, Corrado Mencar, Dingkang Liang, Florian Krüger, Gennaro Vessio, Giovanna Castellano, Jieru Wang, Junyu Gao, Khalid Abualsaud , et al. (30 additional authors not shown)

    Abstract: Crowd counting on the drone platform is an interesting topic in computer vision, which brings new challenges such as small object inference, background clutter and wide viewpoint. However, there are few algorithms focusing on crowd counting on the drone-captured data due to the lack of comprehensive datasets. To this end, we collect a large-scale dataset and organize the Vision Meets Drone Crowd C… ▽ More

    Submitted 19 July, 2021; originally announced July 2021.

    Comments: The method description of A7 Mutil-Scale Aware based SFANet (M-SFANet) is updated and missing references are added

    Journal ref: European Conference on Computer Vision. Springer, Cham, 2020: 675-691