Showing 1–50 of 55 results for author: Cong, W

  1. arXiv:2408.09101  [pdf, other]

    cs.DC

    Heterogeneity-Aware Memory Efficient Federated Learning via Progressive Layer Freezing

    Authors: Wu Yebo, Li Li, Tian Chunlin, Chang Tao, Lin Chi, Wang Cong, Xu Cheng-Zhong

    Abstract: In this paper, we propose SmartFreeze, a framework that effectively reduces the memory footprint by conducting the training in a progressive manner. Instead of updating the full model in each training round, SmartFreeze divides the shared model into blocks consisting of a specified number of layers. It first trains the front block with a well-designed output module, safely freezes it after converg…

    Submitted 17 August, 2024; originally announced August 2024.

    Comments: Published as a conference paper at IWQoS 2024

  2. arXiv:2407.09047  [pdf, other]

    cs.CV

    Cs2K: Class-specific and Class-shared Knowledge Guidance for Incremental Semantic Segmentation

    Authors: Wei Cong, Yang Cong, Yuyang Liu, Gan Sun

    Abstract: Incremental semantic segmentation endeavors to segment newly encountered classes while maintaining knowledge of old classes. However, existing methods either 1) lack guidance from class-specific knowledge (i.e., old class prototypes), leading to a bias towards new classes, or 2) constrain class-shared knowledge (i.e., old model weights) excessively without discrimination, resulting in a preference…

    Submitted 12 July, 2024; originally announced July 2024.

  3. arXiv:2403.20309  [pdf, other]

    cs.CV

    InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds

    Authors: Zhiwen Fan, Wenyan Cong, Kairun Wen, Kevin Wang, Jian Zhang, Xinghao Ding, Danfei Xu, Boris Ivanovic, Marco Pavone, Georgios Pavlakos, Zhangyang Wang, Yue Wang

    Abstract: While novel view synthesis (NVS) from a sparse set of images has advanced significantly in 3D computer vision, it relies on precise initial estimation of camera parameters using Structure-from-Motion (SfM). For instance, the recently developed Gaussian Splatting depends heavily on the accuracy of SfM-derived points and poses. However, SfM processes are time-consuming and often prove unreliable in…

    Submitted 20 August, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

    Comments: Project Page: https://instantsplat.github.io/

  4. arXiv:2403.11234  [pdf, other]

    cs.CV

    Universal Semi-Supervised Domain Adaptation by Mitigating Common-Class Bias

    Authors: Wenyu Zhang, Qingmu Liu, Felix Ong Wei Cong, Mohamed Ragab, Chuan-Sheng Foo

    Abstract: Domain adaptation is a critical task in machine learning that aims to improve model performance on a target domain by leveraging knowledge from a related source domain. In this work, we introduce Universal Semi-Supervised Domain Adaptation (UniSSDA), a practical yet challenging setting where the target domain is partially labeled, and the source and target label space may not strictly match. UniSS…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  5. arXiv:2402.16387  [pdf, other]

    cs.LG cs.AI

    On the Generalization Capability of Temporal Graph Learning Algorithms: Theoretical Insights and a Simpler Method

    Authors: Weilin Cong, Jian Kang, Hanghang Tong, Mehrdad Mahdavi

    Abstract: Temporal Graph Learning (TGL) has become a prevalent technique across diverse real-world applications, especially in domains where data can be represented as a graph and evolves over time. Although TGL has recently seen notable progress in algorithmic solutions, its theoretical foundations remain largely unexplored. This paper aims at bridging this gap by investigating the generalization ability o…

    Submitted 26 February, 2024; originally announced February 2024.

  6. arXiv:2312.04572  [pdf]

    cs.RO

    Harnessing LSTM for Nonlinear Ship Deck Motion Prediction in UAV Autonomous Landing amidst High Sea States

    Authors: Feifan Yu, Wenyuan Cong, Xinmin Chen, Yue Lin, Jiqiang Wang

    Abstract: Autonomous landing of UAVs in high sea states requires the UAV to land exclusively during the ship deck's "rest period," coinciding with minimal movement. Given this scenario, determining the ship's "rest period" based on its movement patterns becomes a fundamental prerequisite for addressing this challenge. This study employs the Long Short-Term Memory (LSTM) neural network to predict the ship's…

    Submitted 15 November, 2023; originally announced December 2023.

    Comments: 11 pages, 7 figures, accepted by ICANDVC2023

  7. arXiv:2312.02660  [pdf, other]

    econ.GN cs.CE cs.CR cs.CY stat.AP

    Uniswap Daily Transaction Indices by Network

    Authors: Nir Chemaya, Lin William Cong, Emma Jorgensen, Dingyue Liu, Luyao Zhang

    Abstract: DeFi is transforming financial services by removing intermediaries and producing a wealth of open-source data. This transformation is propelled by Layer 2 (L2) solutions, aimed at boosting network efficiency and scalability beyond current Layer 1 (L1) capabilities. This study addresses the lack of detailed L2 impact analysis by examining over 50 million transactions from Uniswap. Our dataset, feat…

    Submitted 5 December, 2023; originally announced December 2023.

  8. arXiv:2310.14541  [pdf, other]

    cs.CL

    Continual Named Entity Recognition without Catastrophic Forgetting

    Authors: Duzhen Zhang, Wei Cong, Jiahua Dong, Yahan Yu, Xiuyi Chen, Yonggang Zhang, Zhen Fang

    Abstract: Continual Named Entity Recognition (CNER) is a burgeoning area, which involves updating an existing model by incorporating new entity types sequentially. Nevertheless, continual learning approaches are often severely afflicted by catastrophic forgetting. This issue is intensified in CNER due to the consolidation of old entity types from previous steps into the non-entity type at each step, leading…

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: Accepted by EMNLP2023 main conference as a long paper

  9. arXiv:2310.06949  [pdf, other]

    eess.IV cs.LG physics.med-ph

    Diffusion Prior Regularized Iterative Reconstruction for Low-dose CT

    Authors: Wenjun Xia, Yongyi Shi, Chuang Niu, Wenxiang Cong, Ge Wang

    Abstract: Computed tomography (CT) involves a patient's exposure to ionizing radiation. To reduce the radiation dose, we can either lower the X-ray photon count or down-sample projection views. However, either of the ways often compromises image quality. To address this challenge, here we introduce an iterative reconstruction algorithm regularized by a diffusion prior. Drawing on the exceptional imaging pro…

    Submitted 10 October, 2023; originally announced October 2023.

  10. arXiv:2308.11793  [pdf, other]

    cs.CV

    Enhancing NeRF akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-Experts

    Authors: Wenyan Cong, Hanxue Liang, Peihao Wang, Zhiwen Fan, Tianlong Chen, Mukund Varma, Yi Wang, Zhangyang Wang

    Abstract: Cross-scene generalizable NeRF models, which can directly synthesize novel views of unseen scenes, have become a new spotlight of the NeRF field. Several existing attempts rely on increasingly end-to-end "neuralized" architectures, i.e., replacing scene representation and/or rendering modules with performant neural networks such as transformers, and turning novel view synthesis into a feed-forward…

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV2023

  11. arXiv:2308.08793  [pdf, other]

    cs.CL

    Task Relation Distillation and Prototypical Pseudo Label for Incremental Named Entity Recognition

    Authors: Duzhen Zhang, Hongliu Li, Wei Cong, Rongtao Xu, Jiahua Dong, Xiuyi Chen

    Abstract: Incremental Named Entity Recognition (INER) involves the sequential learning of new entity types without accessing the training data of previously learned types. However, INER faces the challenge of catastrophic forgetting specific for incremental learning, further aggravated by background shift (i.e., old and future entity types are labeled as the non-entity type in the current task). To address…

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: Accepted by CIKM2023 as a long paper with an oral presentation

  12. arXiv:2308.00376  [pdf, other]

    cs.CV

    Deep Image Harmonization with Learnable Augmentation

    Authors: Li Niu, Junyan Cao, Wenyan Cong, Liqing Zhang

    Abstract: The goal of image harmonization is adjusting the foreground appearance in a composite image to make the whole image harmonious. To construct paired training images, existing datasets adopt different ways to adjust the illumination statistics of foregrounds of real images to produce synthetic composite images. However, different datasets have considerable domain gap and the performances on small-sc…

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV 2023

  13. arXiv:2307.10845  [pdf, other]

    cs.LG cs.CV

    Self-paced Weight Consolidation for Continual Learning

    Authors: Wei Cong, Yang Cong, Gan Sun, Yuyang Liu, Jiahua Dong

    Abstract: Continual learning algorithms which keep the parameters of new tasks close to that of previous tasks, are popular in preventing catastrophic forgetting in sequential task learning settings. However, 1) the performance for the new continual learner will be degraded without distinguishing the contributions of previously learned tasks; 2) the computational cost will be greatly increased with the numb…

    Submitted 20 July, 2023; originally announced July 2023.

  14. arXiv:2307.10822  [pdf, other]

    cs.CV

    Gradient-Semantic Compensation for Incremental Semantic Segmentation

    Authors: Wei Cong, Yang Cong, Jiahua Dong, Gan Sun, Henghui Ding

    Abstract: Incremental semantic segmentation aims to continually learn the segmentation of new coming classes without accessing the training data of previously learned classes. However, most current methods fail to address catastrophic forgetting and background shift since they 1) treat all previous classes equally without considering different forgetting paces caused by imbalanced gradient back-propagation;…

    Submitted 20 July, 2023; originally announced July 2023.

  15. arXiv:2307.10584  [pdf, other]

    cs.CV

    Reference-based Painterly Inpainting via Diffusion: Crossing the Wild Reference Domain Gap

    Authors: Dejia Xu, Xingqian Xu, Wenyan Cong, Humphrey Shi, Zhangyang Wang

    Abstract: Have you ever imagined how it would look if we placed new objects into paintings? For example, what would it look like if we placed a basketball into Claude Monet's "Water Lilies, Evening Effect"? We propose Reference-based Painterly Inpainting, a novel task that crosses the wild reference domain gap and implants novel objects into artworks. Although previous works have examined reference-based…

    Submitted 20 July, 2023; originally announced July 2023.

  16. arXiv:2306.04107  [pdf, other]

    cs.LG cs.AI cs.SI

    BeMap: Balanced Message Passing for Fair Graph Neural Network

    Authors: Xiao Lin, Jian Kang, Weilin Cong, Hanghang Tong

    Abstract: Fairness in graph neural networks has been actively studied recently. However, existing works often do not explicitly consider the role of message passing in introducing or amplifying the bias. In this paper, we first investigate the problem of bias amplification in message passing. We empirically and theoretically demonstrate that message passing could amplify the bias when the 1-hop neighbors fr…

    Submitted 8 March, 2024; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: Accepted at the Second Learning on Graphs Conference (LoG 2023)

  17. arXiv:2304.04620  [pdf, other]

    cs.CV

    Federated Incremental Semantic Segmentation

    Authors: Jiahua Dong, Duzhen Zhang, Yang Cong, Wei Cong, Henghui Ding, Dengxin Dai

    Abstract: Federated learning-based semantic segmentation (FSS) has drawn widespread attention via decentralized training on local clients. However, most FSS models assume categories are fixed in advance, thus heavily undergoing forgetting on old categories in practical applications where local clients receive new categories incrementally while having no memory storage to access old classes. Moreover, new clie…

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPR2023

  18. arXiv:2303.12861  [pdf, other]

    eess.IV cs.LG eess.SP physics.bio-ph

    Parallel Diffusion Model-based Sparse-view Cone-beam Breast CT

    Authors: Wenjun Xia, Hsin Wu Tseng, Chuang Niu, Wenxiang Cong, Xiaohua Zhang, Shaohua Liu, Ruola Ning, Srinivasan Vedantham, Ge Wang

    Abstract: Breast cancer is the most prevalent cancer among women worldwide, and early detection is crucial for reducing its mortality rate and improving quality of life. Dedicated breast computed tomography (CT) scanners offer better image quality than mammography and tomosynthesis in general but at higher radiation dose. To enable breast CT for cancer screening, the challenge is to minimize the radiation d…

    Submitted 28 January, 2024; v1 submitted 22 March, 2023; originally announced March 2023.

  19. arXiv:2302.11636  [pdf, other]

    cs.LG cs.AI

    Do We Really Need Complicated Model Architectures For Temporal Networks?

    Authors: Weilin Cong, Si Zhang, Jian Kang, Baichuan Yuan, Hao Wu, Xin Zhou, Hanghang Tong, Mehrdad Mahdavi

    Abstract: Recurrent neural network (RNN) and self-attention mechanism (SAM) are the de facto methods to extract spatial-temporal information for temporal graph learning. Interestingly, we found that although both RNN and SAM could lead to a good performance, in practice neither of them is always necessary. In this paper, we propose GraphMixer, a conceptually and technically simple architecture that consists…

    Submitted 22 February, 2023; originally announced February 2023.

  20. arXiv:2302.08990  [pdf, other]

    cs.LG cs.AI

    Efficiently Forgetting What You Have Learned in Graph Representation Learning via Projection

    Authors: Weilin Cong, Mehrdad Mahdavi

    Abstract: As privacy protection receives much attention, unlearning the effect of a specific node from a pre-trained graph learning model has become equally important. However, due to the node dependency in the graph-structured data, representation unlearning in Graph Neural Networks (GNNs) is challenging and less well explored. In this paper, we fill in this gap by first studying the unlearning problem in…

    Submitted 17 February, 2023; originally announced February 2023.

  21. arXiv:2211.10388  [pdf, other]

    eess.IV cs.LG eess.SP physics.med-ph

    Patch-Based Denoising Diffusion Probabilistic Model for Sparse-View CT Reconstruction

    Authors: Wenjun Xia, Wenxiang Cong, Ge Wang

    Abstract: Sparse-view computed tomography (CT) can be used to reduce radiation dose greatly but suffers from severe image artifacts. Recently, the deep learning based method for sparse-view CT reconstruction has attracted major attention. However, neural networks often have a limited ability to remove the artifacts when they only work in the image domain. Deep learning-based sinogram processing can ach…

    Submitted 18 November, 2022; originally announced November 2022.

  22. arXiv:2210.13852  [pdf, other]

    cs.LG

    TabMixer: Excavating Label Distribution Learning with Small-scale Features

    Authors: Weiyi Cong, Zhuoran Zheng, Xiuyi Jia

    Abstract: Label distribution learning (LDL) differs from multi-label learning which aims at representing the polysemy of instances by transforming single-label values into descriptive degrees. Unfortunately, the feature space of the label distribution dataset is affected by human factors and the inductive bias of the feature extractor causing uncertainty in the feature space. Especially, for datasets with s…

    Submitted 25 October, 2022; originally announced October 2022.

  23. arXiv:2206.08401  [pdf, other]

    econ.GN cs.CR q-fin.ST stat.CO

    Is decentralized finance actually decentralized? A social network analysis of the Aave protocol on the Ethereum blockchain

    Authors: Ziqiao Ao, Lin William Cong, Gergely Horvath, Luyao Zhang

    Abstract: Decentralized finance (DeFi) has the potential to disrupt centralized finance by validating peer-to-peer transactions through tamper-proof smart contracts, thus significantly lowering the transaction cost charged by financial intermediaries. However, the actual realization of peer-to-peer transactions and the levels and effects of decentralization are largely unknown. Our research pioneers a block…

    Submitted 30 November, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: Accepted at 29th Annual Global Finance Conference featuring Professor Robert Engle, The 2003 Nobel Laureate in Economic Sciences

    ACM Class: E.0; G.1; G.3; I.6; J.4; J.6

  24. arXiv:2205.00687  [pdf, other]

    cs.CV cs.AI

    Deep Video Harmonization with Color Mapping Consistency

    Authors: Xinyuan Lu, Shengyuan Huang, Li Niu, Wenyan Cong, Liqing Zhang

    Abstract: Video harmonization aims to adjust the foreground of a composite video to make it compatible with the background. So far, video harmonization has only received limited attention and there is no public dataset for video harmonization. In this work, we construct a new video harmonization dataset HYouTube by adjusting the foreground of real videos to create synthetic composite videos. Moreover, we co…

    Submitted 2 May, 2022; originally announced May 2022.

  25. arXiv:2111.10447  [pdf, other]

    cs.LG

    DyFormer: A Scalable Dynamic Graph Transformer with Provable Benefits on Generalization Ability

    Authors: Weilin Cong, Yanhong Wu, Yuandong Tian, Mengting Gu, Yinglong Xia, Chun-cheng Jason Chen, Mehrdad Mahdavi

    Abstract: Transformers have achieved great success in several domains, including Natural Language Processing and Computer Vision. However, its application to real-world graphs is less explored, mainly due to its high computation cost and its poor generalizability caused by the lack of enough training data in the graph domain. To fill in this gap, we propose a scalable Transformer-like dynamic graph learning…

    Submitted 29 January, 2023; v1 submitted 19 November, 2021; originally announced November 2021.

  26. arXiv:2111.08227  [pdf, other]

    cs.LG physics.med-ph

    Phase function estimation from a diffuse optical image via deep learning

    Authors: Yuxuan Liang, Chuang Niu, Chen Wei, Shenghan Ren, Wenxiang Cong, Ge Wang

    Abstract: The phase function is a key element of a light propagation model for Monte Carlo (MC) simulation, which is usually fitted with an analytic function with associated parameters. In recent years, machine learning methods were reported to estimate the parameters of the phase function of a particular form such as the Henyey-Greenstein phase function but, to our knowledge, no studies have been performed…

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: 16 pages, 8 figures

  27. arXiv:2111.08202  [pdf, other]

    cs.LG

    Learn Locally, Correct Globally: A Distributed Algorithm for Training Graph Neural Networks

    Authors: Morteza Ramezani, Weilin Cong, Mehrdad Mahdavi, Mahmut T. Kandemir, Anand Sivasubramaniam

    Abstract: Despite the recent success of Graph Neural Networks (GNNs), training GNNs on large graphs remains challenging. The limited resource capacities of the existing servers, the dependency between nodes in a graph, and the privacy concern due to the centralized storage and model learning have spurred the need to design an effective distributed algorithm for GNN training. However, existing distributed GN…

    Submitted 13 March, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: The Tenth International Conference on Learning Representations (ICLR 2022)

  28. arXiv:2110.15174  [pdf, other]

    cs.LG

    On Provable Benefits of Depth in Training Graph Convolutional Networks

    Authors: Weilin Cong, Morteza Ramezani, Mehrdad Mahdavi

    Abstract: Graph Convolutional Networks (GCNs) are known to suffer from performance degradation as the number of layers increases, which is usually attributed to over-smoothing. Despite the apparent consensus, we observe that there exists a discrepancy between the theoretical understanding of over-smoothing and the practical capabilities of GCNs. Specifically, we argue that over-smoothing does not necessaril…

    Submitted 28 October, 2021; originally announced October 2021.

  29. arXiv:2109.10009  [pdf]

    econ.GN cs.SI

    An AI-assisted Economic Model of Endogenous Mobility and Infectious Diseases: The Case of COVID-19 in the United States

    Authors: Lin William Cong, Ke Tang, Bing Wang, Jingyuan Wang

    Abstract: We build a deep-learning-based SEIR-AIM model integrating the classical Susceptible-Exposed-Infectious-Removed epidemiology model with forecast modules of infection, community mobility, and unemployment. Through linking Google's multi-dimensional mobility index to economic activities, public health status, and mitigation policies, our AI-assisted model captures the populace's endogenous response t…

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: Preprint, not peer reviewed

  30. arXiv:2109.08809  [pdf, other]

    cs.CV

    HYouTube: Video Harmonization Dataset

    Authors: Xinyuan Lu, Shengyuan Huang, Li Niu, Wenyan Cong, Liqing Zhang

    Abstract: Video composition aims to generate a composite video by combining the foreground of one video with the background of another video, but the inserted foreground may be incompatible with the background in terms of color and illumination. Video harmonization aims to adjust the foreground of a composite video to make it compatible with the background. So far, video harmonization has only received limi…

    Submitted 17 September, 2021; originally announced September 2021.

  31. arXiv:2109.06671  [pdf, other]

    cs.CV

    High-Resolution Image Harmonization via Collaborative Dual Transformations

    Authors: Wenyan Cong, Xinhao Tao, Li Niu, Jing Liang, Xuesong Gao, Qihao Sun, Liqing Zhang

    Abstract: Given a composite image, image harmonization aims to adjust the foreground to make it compatible with the background. High-resolution image harmonization is in high demand, but still remains unexplored. Conventional image harmonization methods learn global RGB-to-RGB transformation which could effortlessly scale to high resolution, but ignore diverse local context. Recent deep learning methods lea…

    Submitted 23 March, 2022; v1 submitted 14 September, 2021; originally announced September 2021.

    Comments: Accepted by CVPR2022

  32. Deep Sequence Modeling: Development and Applications in Asset Pricing

    Authors: Lin William Cong, Ke Tang, Jingyuan Wang, Yang Zhang

    Abstract: We predict asset returns and measure risk premia using a prominent technique from artificial intelligence -- deep sequence modeling. Because asset returns often exhibit sequential dependence that may not be effectively captured by conventional time series models, sequence modeling offers a promising path with its data-driven approach and superior performance. In this paper, we first overview the d…

    Submitted 20 August, 2021; originally announced August 2021.

  33. arXiv:2106.14490  [pdf, other]

    cs.CV

    Making Images Real Again: A Comprehensive Survey on Deep Image Composition

    Authors: Li Niu, Wenyan Cong, Liu Liu, Yan Hong, Bo Zhang, Jing Liang, Liqing Zhang

    Abstract: As a common image editing operation, image composition aims to combine the foreground from one image and another background image, resulting in a composite image. However, there are many issues that could make the composite images unrealistic. These issues can be summarized as the inconsistency between foreground and background, which includes appearance inconsistency (e.g., incompatible illuminat…

    Submitted 22 April, 2024; v1 submitted 28 June, 2021; originally announced June 2021.

  34. arXiv:2103.17104  [pdf, other]

    cs.CV

    Deep Image Harmonization by Bridging the Reality Gap

    Authors: Junyan Cao, Wenyan Cong, Li Niu, Jianfu Zhang, Liqing Zhang

    Abstract: Image harmonization has been significantly advanced with large-scale harmonization dataset. However, the current way to build dataset is still labor-intensive, which adversely affects the extendability of dataset. To address this problem, we propose to construct rendered harmonization dataset with fewer human efforts to augment the existing real-world dataset. To leverage both real-world images an…

    Submitted 11 October, 2022; v1 submitted 31 March, 2021; originally announced March 2021.

    Comments: Accepted by BMVC2022

  35. arXiv:2103.02696  [pdf, other]

    cs.LG cs.AI cs.CV

    On the Importance of Sampling in Training GCNs: Tighter Analysis and Variance Reduction

    Authors: Weilin Cong, Morteza Ramezani, Mehrdad Mahdavi

    Abstract: Graph Convolutional Networks (GCNs) have achieved impressive empirical advancement across a wide variety of semi-supervised node classification tasks. Despite their great success, training GCNs on large graphs suffers from computational and memory issues. A potential path to circumvent these obstacles is sampling-based methods, where at each layer a subset of nodes is sampled. Although recent stud…

    Submitted 1 November, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

  36. arXiv:2011.14873  [pdf, other]

    eess.IV cs.CV

    Deep Interactive Denoiser (DID) for X-Ray Computed Tomography

    Authors: Ti Bai, Biling Wang, Dan Nguyen, Bao Wang, Bin Dong, Wenxiang Cong, Mannudeep K. Kalra, Steve Jiang

    Abstract: Low dose computed tomography (LDCT) is desirable for both diagnostic imaging and image guided interventions. Denoisers are openly used to improve the quality of LDCT. Deep learning (DL)-based denoisers have shown state-of-the-art performance and are becoming one of the mainstream methods. However, there exist two challenges regarding the DL-based denoisers: 1) a trained model typically does not g…

    Submitted 6 December, 2020; v1 submitted 30 November, 2020; originally announced November 2020.

    Comments: under review

  37. arXiv:2009.09169  [pdf, other]

    cs.CV

    BargainNet: Background-Guided Domain Translation for Image Harmonization

    Authors: Wenyan Cong, Li Niu, Jianfu Zhang, Jing Liang, Liqing Zhang

    Abstract: Image composition is a fundamental operation in image editing field. However, unharmonious foreground and background downgrade the quality of composite image. Image harmonization, which adjusts the foreground to improve the consistency, is an essential yet challenging task. Previous deep learning based methods mainly focus on directly learning the mapping from composite image to real image, while…

    Submitted 3 April, 2021; v1 submitted 19 September, 2020; originally announced September 2020.

    Comments: Accepted by ICME2021 as Oral

  38. arXiv:2008.01846  [pdf]

    eess.IV cs.CV cs.LG

    Stabilizing Deep Tomographic Reconstruction

    Authors: Weiwen Wu, Dianlin Hu, Wenxiang Cong, Hongming Shan, Shaoyu Wang, Chuang Niu, Pingkun Yan, Hengyong Yu, Varut Vardhanabhuti, Ge Wang

    Abstract: Tomographic image reconstruction with deep learning is an emerging field, but a recent landmark study reveals that several deep reconstruction networks are unstable for computed tomography (CT) and magnetic resonance imaging (MRI). Specifically, three kinds of instabilities were reported: (1) strong image artefacts from tiny perturbations, (2) small features missing in a deeply reconstructed image…

    Submitted 13 September, 2021; v1 submitted 4 August, 2020; originally announced August 2020.

    Comments: 78 pages, 30 figures, 149 references

  39. arXiv:2007.03882  [pdf, other]

    eess.IV cs.CV

    Low-dimensional Manifold Constrained Disentanglement Network for Metal Artifact Reduction

    Authors: Chuang Niu, Wenxiang Cong, Fenglei Fan, Hongming Shan, Mengzhou Li, Jimin Liang, Ge Wang

    Abstract: Deep neural network based methods have achieved promising results for CT metal artifact reduction (MAR), most of which use many synthesized paired images for training. As synthesized metal artifacts in CT images may not accurately reflect the clinical counterparts, an artifact disentanglement network (ADN) was proposed with unpaired clinical images directly, producing promising results on clinical…

    Submitted 7 July, 2020; originally announced July 2020.

  40. arXiv:2006.13866  [pdf, other]

    cs.LG stat.ML

    Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

    Authors: Weilin Cong, Rana Forsati, Mahmut Kandemir, Mehrdad Mahdavi

    Abstract: Sampling methods (e.g., node-wise, layer-wise, or subgraph) have become an indispensable strategy to speed up training large-scale Graph Neural Networks (GNNs). However, existing sampling methods are mostly based on the graph structural information and ignore the dynamicity of optimization, which leads to high variance in estimating the stochastic gradients. The high variance issue can be very pron…

    Submitted 5 September, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

  41. arXiv:2005.07627  [pdf, other]

    cs.CR

    Blockchain Architecture for Auditing Automation and Trust Building in Public Markets

    Authors: Sean Cao, Lin William Cong, Meng Han, Qixuan Hou, Baozhong Yang

    Abstract: Business transactions by public firms are required to be reported, verified, and audited periodically, which is traditionally a labor-intensive and time-consuming process. To streamline this procedure, we design FutureAB (Future Auditing Blockchain) which aims to automate the reporting and auditing process, thereby allowing auditors to focus on discretionary accounts to better detect and prevent f…

    Submitted 15 May, 2020; originally announced May 2020.

  42. arXiv:1912.04278  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Deep Efficient End-to-end Reconstruction (DEER) Network for Few-view Breast CT Image Reconstruction

    Authors: Huidong Xie, Hongming Shan, Wenxiang Cong, Chi Liu, Xiaohua Zhang, Shaohua Liu, Ruola Ning, Ge Wang

    Abstract: Breast CT provides image volumes with isotropic resolution in high contrast, enabling detection of small calcification (down to a few hundred microns in size) and subtle density differences. Since breast is sensitive to x-ray radiation, dose reduction of breast CT is an important topic, and for this purpose, few-view scanning is a main approach. In this article, we propose a Deep Efficient End-to-…

    Submitted 3 November, 2020; v1 submitted 8 December, 2019; originally announced December 2019.

  43. arXiv:1911.13239  [pdf, other]

    cs.CV

    DoveNet: Deep Image Harmonization via Domain Verification

    Authors: Wenyan Cong, Jianfu Zhang, Li Niu, Liu Liu, Zhixin Ling, Weiyuan Li, Liqing Zhang

    Abstract: Image composition is an important operation in image processing, but the inconsistency between foreground and background significantly degrades the quality of composite image. Image harmonization, aiming to make the foreground compatible with the background, is a promising yet challenging task. However, the lack of high-quality publicly available dataset for image harmonization greatly hinders the…

    Submitted 30 October, 2020; v1 submitted 27 November, 2019; originally announced November 2019.

    Comments: Accepted by CVPR2020. arXiv admin note: text overlap with arXiv:1908.10526

  44. arXiv:1909.11721  [pdf]

    physics.med-ph cs.CV eess.IV

    Deep-learning-based Breast CT for Radiation Dose Reduction

    Authors: Wenxiang Cong, Hongming Shan, Xiaohua Zhang, Shaohua Liu, Ruola Ning, Ge Wang

    Abstract: Cone-beam breast computed tomography (CT) provides true 3D breast images with isotropic resolution and high-contrast information, detecting calcifications as small as a few hundred microns and revealing subtle tissue differences. However, breast is highly sensitive to x-ray radiation. It is critically important for healthcare to reduce radiation dose. Few-view cone-beam CT only uses a fraction of…

    Submitted 25 September, 2019; originally announced September 2019.

    Comments: 7 pages, 4 figures

  45. arXiv:1908.10526  [pdf, other]

    cs.CV

    Image Harmonization Dataset iHarmony4: HCOCO, HAdobe5k, HFlickr, and Hday2night

    Authors: Wenyan Cong, Jianfu Zhang, Li Niu, Liu Liu, Zhixin Ling, Weiyuan Li, Liqing Zhang

    Abstract: Image composition is an important operation in image processing, but the inconsistency between foreground and background significantly degrades the quality of composite image. Image harmonization, which aims to make the foreground compatible with the background, is a promising yet challenging task. However, the lack of high-quality public dataset for image harmonization, which significantly hinder…

    Submitted 20 March, 2020; v1 submitted 27 August, 2019; originally announced August 2019.

    Comments: Our full paper arXiv:1911.13239 "DoveNet: Deep Image Harmonization via Domain Verification" is accepted by CVPR2020

  46. arXiv:1907.01262  [pdf, ps, other]

    eess.IV cs.CV cs.LG

    Dual Network Architecture for Few-view CT -- Trained on ImageNet Data and Transferred for Medical Imaging

    Authors: Huidong Xie, Hongming Shan, Wenxiang Cong, Xiaohua Zhang, Shaohua Liu, Ruola Ning, Ge Wang

    Abstract: X-ray computed tomography (CT) reconstructs cross-sectional images from projection data. However, ionizing X-ray radiation associated with CT scanning might induce cancer and genetic damage. Therefore, the reduction of radiation dose has attracted major attention. Few-view CT image reconstruction is an important topic to reduce the radiation dose. Recently, data-driven algorithms have shown great…

    Submitted 12 September, 2019; v1 submitted 2 July, 2019; originally announced July 2019.

    Comments: 11 pages, 5 figures, 2019 SPIE Optical Engineering + Applications

  47. arXiv:1811.08075

    cs.CV

    Scene Graph Generation via Conditional Random Fields

    Authors: Weilin Cong, William Wang, Wang-Chien Lee

    Abstract: Despite the great success object detection and segmentation models have achieved in recognizing individual objects in images, performance on cognitive tasks such as image caption, semantic image retrieval, and visual QA is far from satisfactory. To achieve better performance on these cognitive tasks, merely recognizing individual object instances is insufficient. Instead, the interactions between…

    Submitted 23 January, 2024; v1 submitted 19 November, 2018; originally announced November 2018.

    Comments: Need to withdraw this draft as requested by collaborators

  48. arXiv:1808.04256  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    CT Super-resolution GAN Constrained by the Identical, Residual, and Cycle Learning Ensemble(GAN-CIRCLE)

    Authors: Chenyu You, Guang Li, Yi Zhang, Xiaoliu Zhang, Hongming Shan, Shenghong Ju, Zhen Zhao, Zhuiyang Zhang, Wenxiang Cong, Michael W. Vannier, Punam K. Saha, Ge Wang

    Abstract: Computed tomography (CT) is widely used in screening, diagnosis, and image-guided therapy for both clinical and research purposes. Since CT involves ionizing radiation, an overarching thrust of related technical research is development of novel methods enabling ultrahigh quality imaging with fine structural details while reducing the X-ray radiation. In this paper, we present a semi-supervised dee…

    Submitted 6 September, 2018; v1 submitted 10 August, 2018; originally announced August 2018.

    Report number: TMI-2019-0250

    Journal ref: IEEE Transactions on Medical Imaging 2019

  49. Structure-sensitive Multi-scale Deep Neural Network for Low-Dose CT Denoising

    Authors: Chenyu You, Qingsong Yang, Hongming Shan, Lars Gjesteby, Guang Li, Shenghong Ju, Zhuiyang Zhang, Zhen Zhao, Yi Zhang, Wenxiang Cong, Ge Wang

    Abstract: Computed tomography (CT) is a popular medical imaging modality in clinical applications. At the same time, the x-ray radiation dose associated with CT scans raises public concerns due to its potential risks to the patients. Over the past years, major efforts have been dedicated to the development of Low-Dose CT (LDCT) methods. However, the radiation dose reduction compromises the signal-to-noise r…

    Submitted 10 August, 2018; v1 submitted 1 May, 2018; originally announced May 2018.

    Comments: IEEE Access 2018

  50. 3D Convolutional Encoder-Decoder Network for Low-Dose CT via Transfer Learning from a 2D Trained Network

    Authors: Hongming Shan, Yi Zhang, Qingsong Yang, Uwe Kruger, Mannudeep K. Kalra, Ling Sun, Wenxiang Cong, Ge Wang

    Abstract: Low-dose computed tomography (CT) has attracted a major attention in the medical imaging field, since CT-associated x-ray radiation carries health risks for patients. The reduction of CT radiation dose, however, compromises the signal-to-noise ratio, and may compromise the image quality and the diagnostic performance. Recently, deep-learning-based algorithms have achieved promising results in low-…

    Submitted 29 April, 2018; v1 submitted 15 February, 2018; originally announced February 2018.

    Comments: To be published in the IEEE TMI

    Journal ref: IEEE Transactions on Medical Imaging 37(6) (2018) 1522-1534