Zum Hauptinhalt springen

Showing 1–21 of 21 results for author: Huo, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.01291  [pdf, other

    cs.CV

    TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling

    Authors: Dong Huo, Zixin Guo, Xinxin Zuo, Zhihao Shi, Juwei Lu, Peng Dai, Songcen Xu, Li Cheng, Yee-Hong Yang

    Abstract: Given a 3D mesh, we aim to synthesize 3D textures that correspond to arbitrary textual descriptions. Current methods for generating and assembling textures from sampled views often result in prominent seams or excessive smoothing. To tackle these issues, we present TexGen, a novel multi-view sampling and resampling framework for texture generation leveraging a pre-trained text-to-image diffusion m… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: European Conference on Computer Vision (ECCV) 2024

  2. arXiv:2407.01211  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Efficient Cutting Tool Wear Segmentation Based on Segment Anything Model

    Authors: Zongshuo Li, Ding Huo, Markus Meurer, Thomas Bergs

    Abstract: Tool wear conditions impact the surface quality of the workpiece and its final geometric precision. In this research, we propose an efficient tool wear segmentation approach based on Segment Anything Model, which integrates U-Net as an automated prompt generator to streamline the processes of tool wear detection. Our evaluation covered three Point-of-Interest generation methods and further investi… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2405.16732  [pdf, ps, other

    stat.ML cs.LG math.OC math.ST

    The Collusion of Memory and Nonlinearity in Stochastic Approximation With Constant Stepsize

    Authors: Dongyan Huo, Yixuan Zhang, Yudong Chen, Qiaomin Xie

    Abstract: In this work, we investigate stochastic approximation (SA) with Markovian data and nonlinear updates under constant stepsize $α>0$. Existing work has primarily focused on either i.i.d. data or linear update rules. We take a new perspective and carefully examine the simultaneous presence of Markovian dependency of data and nonlinear update rules, delineating how the interplay between these two stru… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  4. arXiv:2404.06023  [pdf, other

    stat.ML cs.LG math.OC math.PR

    Prelimit Coupling and Steady-State Convergence of Constant-stepsize Nonsmooth Contractive SA

    Authors: Yixuan Zhang, Dongyan Huo, Yudong Chen, Qiaomin Xie

    Abstract: Motivated by Q-learning, we study nonsmooth contractive stochastic approximation (SA) with constant stepsize. We focus on two important classes of dynamics: 1) nonsmooth contractive SA with additive noise, and 2) synchronous and asynchronous Q-learning, which features both additive and multiplicative noise. For both dynamics, we establish weak convergence of the iterates to a stationary limit dist… ▽ More

    Submitted 24 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: ACM SIGMETRICS 2024. 71 pages, 3 figures

  5. arXiv:2312.10894  [pdf, other

    stat.ML cs.LG stat.ME

    Effectiveness of Constant Stepsize in Markovian LSA and Statistical Inference

    Authors: Dongyan Huo, Yudong Chen, Qiaomin Xie

    Abstract: In this paper, we study the effectiveness of using a constant stepsize in statistical inference via linear stochastic approximation (LSA) algorithms with Markovian data. After establishing a Central Limit Theorem (CLT), we outline an inference procedure that uses averaged LSA iterates to construct confidence intervals (CIs). Our procedure leverages the fast mixing property of constant-stepsize LSA… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: AAAI 2024

  6. arXiv:2308.10195  [pdf, other

    cs.MM cs.CL cs.CV eess.IV

    WMFormer++: Nested Transformer for Visible Watermark Removal via Implict Joint Learning

    Authors: Dongjian Huo, Zehong Zhang, Hanjing Su, Guanbin Li, Chaowei Fang, Qingyao Wu

    Abstract: Watermarking serves as a widely adopted approach to safeguard media copyright. In parallel, the research focus has extended to watermark removal techniques, offering an adversarial means to enhance watermark robustness and foster advancements in the watermarking field. Existing watermark removal methods mainly rely on UNet with task-specific decoder branches--one for watermark localization and the… ▽ More

    Submitted 21 August, 2023; v1 submitted 20 August, 2023; originally announced August 2023.

  7. MVA2023 Small Object Detection Challenge for Spotting Birds: Dataset, Methods, and Results

    Authors: Yuki Kondo, Norimichi Ukita, Takayuki Yamaguchi, Hao-Yu Hou, Mu-Yi Shen, Chia-Chi Hsu, En-Ming Huang, Yu-Chen Huang, Yu-Cheng Xia, Chien-Yao Wang, Chun-Yi Lee, Da Huo, Marc A. Kastner, Tingwei Liu, Yasutomo Kawanishi, Takatsugu Hirayama, Takahiro Komamizu, Ichiro Ide, Yosuke Shinya, Xinyao Liu, Guang Liang, Syusuke Yasui

    Abstract: Small Object Detection (SOD) is an important machine vision topic because (i) a variety of real-world applications require object detection for distant objects and (ii) SOD is a challenging task due to the noisy, blurred, and less-informative image appearances of small objects. This paper proposes a new SOD dataset consisting of 39,070 images including 137,121 bird instances, which is called the S… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: This paper is included in the proceedings of the 18th International Conference on Machine Vision Applications (MVA2023). It will be officially published at a later date. Project page : https://www.mva-org.jp/mva2023/challenge

    Journal ref: 2023 18th International Conference on Machine Vision and Applications (MVA)

  8. Learning to Recover Spectral Reflectance from RGB Images

    Authors: Dong Huo, Jian Wang, Yiming Qian, Yee-Hong Yang

    Abstract: This paper tackles spectral reflectance recovery (SRR) from RGB images. Since capturing ground-truth spectral reflectance and camera spectral sensitivity are challenging and costly, most existing approaches are trained on synthetic images and utilize the same parameters for all unseen testing images, which are suboptimal especially when the trained models are tested on real images because they nev… ▽ More

    Submitted 22 April, 2024; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: IEEE Transactions on Image Processing (TIP), 2024

  9. arXiv:2210.00953  [pdf, other

    stat.ML cs.LG math.OC

    Bias and Extrapolation in Markovian Linear Stochastic Approximation with Constant Stepsizes

    Authors: Dongyan Huo, Yudong Chen, Qiaomin Xie

    Abstract: We consider Linear Stochastic Approximation (LSA) with a constant stepsize and Markovian data. Viewing the joint process of the data and LSA iterate as a time-homogeneous Markov chain, we prove its convergence to a unique limiting and stationary distribution in Wasserstein distance and establish non-asymptotic, geometric convergence rates. Furthermore, we show that the bias vector of this limit ad… ▽ More

    Submitted 21 August, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: SIGMETRICS 2023

  10. arXiv:2209.15211  [pdf, other

    cs.CV

    Dual Progressive Transformations for Weakly Supervised Semantic Segmentation

    Authors: Dongjian Huo, Yukun Su, Qingyao Wu

    Abstract: Weakly supervised semantic segmentation (WSSS), which aims to mine the object regions by merely using class-level labels, is a challenging task in computer vision. The current state-of-the-art CNN-based methods usually adopt Class-Activation-Maps (CAMs) to highlight the potential areas of the object, however, they may suffer from the part-activated issues. To this end, we try an early attempt to e… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

  11. arXiv:2209.09005   

    cs.GT

    Learning Optimal Deterministic Auctions with Correlated Valuation Distributions

    Authors: Da Huo, Zhilin Zhang, Zhenzhe Zheng, Chuan Yu, Jian Xu, Fan Wu

    Abstract: In mechanism design, it is challenging to design the optimal auction with correlated values in general settings. Although value distribution can be further exploited to improve revenue, the complex correlation structure makes it hard to acquire in practice. Data-driven auction mechanisms, powered by machine learning, enable to design auctions directly from historical auction data, without relying… ▽ More

    Submitted 18 February, 2023; v1 submitted 19 September, 2022; originally announced September 2022.

    Comments: The proof of the epxressiveness of CAN is wrong. We made some unnecessary assumptions. We need to correct this idea and resubmit it later

  12. arXiv:2206.09243  [pdf, other

    cs.CV eess.IV

    Structured Light with Redundancy Codes

    Authors: Zhanghao Sun, Yu Zhang, Yicheng Wu, Dong Huo, Yiming Qian, Jian Wang

    Abstract: Structured light (SL) systems acquire high-fidelity 3D geometry with active illumination projection. Conventional systems exhibit challenges when working in environments with strong ambient illumination, global illumination and cross-device interference. This paper proposes a general-purposed technique to improve the robustness of SL by projecting redundant optical signals in addition to the nativ… ▽ More

    Submitted 18 June, 2022; originally announced June 2022.

  13. Glass Segmentation with RGB-Thermal Image Pairs

    Authors: Dong Huo, Jian Wang, Yiming Qian, Yee-Hong Yang

    Abstract: This paper proposes a new glass segmentation method utilizing paired RGB and thermal images. Due to the large difference between the transmission property of visible light and that of the thermal energy through the glass where most glass is transparent to the visible light but opaque to thermal energy, glass regions of a scene are made more distinguishable with a pair of RGB and thermal images tha… ▽ More

    Submitted 16 March, 2023; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: IEEE Transactions on Image Processing (TIP), 2023

  14. arXiv:2202.00179  [pdf, other

    eess.IV cs.CV

    Blind Image Deconvolution Using Variational Deep Image Prior

    Authors: Dong Huo, Abbas Masoumzadeh, Rafsanjany Kushol, Yee-Hong Yang

    Abstract: Conventional deconvolution methods utilize hand-crafted image priors to constrain the optimization. While deep-learning-based methods have simplified the optimization by end-to-end training, they fail to generalize well to blurs unseen in the training dataset. Thus, training image-specific models is important for higher generalization. Deep image prior (DIP) provides an approach to optimize the we… ▽ More

    Submitted 5 June, 2023; v1 submitted 31 January, 2022; originally announced February 2022.

    Comments: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023

  15. arXiv:2106.14336  [pdf, other

    cs.CV

    Blind Non-Uniform Motion Deblurring using Atrous Spatial Pyramid Deformable Convolution and Deblurring-Reblurring Consistency

    Authors: Dong Huo, Abbas Masoumzadeh, Yee-Hong Yang

    Abstract: Many deep learning based methods are designed to remove non-uniform (spatially variant) motion blur caused by object motion and camera shake without knowing the blur kernel. Some methods directly output the latent sharp image in one stage, while others utilize a multi-stage strategy (\eg multi-scale, multi-patch, or multi-temporal) to gradually restore the sharp image. However, these methods have… ▽ More

    Submitted 21 April, 2022; v1 submitted 27 June, 2021; originally announced June 2021.

    Comments: CVPRW 2022

  16. arXiv:2106.03593  [pdf, other

    cs.GT cs.AI cs.LG

    Neural Auction: End-to-End Learning of Auction Mechanisms for E-Commerce Advertising

    Authors: Xiangyu Liu, Chuan Yu, Zhilin Zhang, Zhenzhe Zheng, Yu Rong, Hongtao Lv, Da Huo, Yiqing Wang, Dagui Chen, Jian Xu, Fan Wu, Guihai Chen, Xiaoqiang Zhu

    Abstract: In e-commerce advertising, it is crucial to jointly consider various performance metrics, e.g., user experience, advertiser utility, and platform revenue. Traditional auction mechanisms, such as GSP and VCG auctions, can be suboptimal due to their fixed allocation rules to optimize a single performance metric (e.g., revenue or social welfare). Recently, data-driven auctions, learned directly from… ▽ More

    Submitted 13 July, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    Comments: To appear in the Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2021

  17. arXiv:2009.12461  [pdf, other

    eess.IV cs.CV

    Blind Image Super-Resolution with Spatial Context Hallucination

    Authors: Dong Huo, Yee-Hong Yang

    Abstract: Deep convolution neural networks (CNNs) play a critical role in single image super-resolution (SISR) since the amazing improvement of high performance computing. However, most of the super-resolution (SR) methods only focus on recovering bicubic degradation. Reconstructing high-resolution (HR) images from randomly blurred and noisy low-resolution (LR) images is still a challenging problem. In this… ▽ More

    Submitted 25 September, 2020; originally announced September 2020.

    Comments: 14 pages, 6 figures

  18. arXiv:2009.00774  [pdf, other

    cs.LG cs.CR stat.ML

    Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown Dynamics

    Authors: Yanchao Sun, Da Huo, Furong Huang

    Abstract: Poisoning attacks on Reinforcement Learning (RL) systems could take advantage of RL algorithm's vulnerabilities and cause failure of the learning. However, prior works on poisoning RL usually either unrealistically assume the attacker knows the underlying Markov Decision Process (MDP), or directly apply the poisoning methods in supervised learning to RL. In this work, we build a generic poisoning… ▽ More

    Submitted 15 February, 2022; v1 submitted 1 September, 2020; originally announced September 2020.

    Journal ref: The Ninth International Conference on Learning Representations (ICLR 2021)

  19. arXiv:2007.04298  [pdf, other

    cs.CL cs.LG

    Building Interpretable Interaction Trees for Deep NLP Models

    Authors: Die Zhang, Huilin Zhou, Hao Zhang, Xiaoyi Bao, Da Huo, Ruizhao Chen, Xu Cheng, Mengyue Wu, Quanshi Zhang

    Abstract: This paper proposes a method to disentangle and quantify interactions among words that are encoded inside a DNN for natural language processing. We construct a tree to encode salient interactions extracted by the DNN. Six metrics are proposed to analyze properties of interactions between constituents in a sentence. The interaction is defined based on Shapley values of words, which are considered a… ▽ More

    Submitted 16 January, 2021; v1 submitted 29 June, 2020; originally announced July 2020.

  20. arXiv:1912.13410  [pdf, other

    cs.CR

    Logic Bugs in IoT Platforms and Systems: A Review

    Authors: Wei Zhou, Chen Cao, Dongdong Huo, Kai Cheng, Lan Zhang, Le Guan, Tao Liu, Yaowen Zheng, Yuqing Zhang, Limin Sun, Yazhe Wang, Peng Liu

    Abstract: In recent years, IoT platforms and systems have been rapidly emerging. Although IoT is a new technology, new does not mean simpler (than existing networked systems). Contrarily, the complexity (of IoT platforms and systems) is actually being increased in terms of the interactions between the physical world and cyberspace. The increased complexity indeed results in new vulnerabilities. This paper s… ▽ More

    Submitted 2 March, 2020; v1 submitted 31 December, 2019; originally announced December 2019.

  21. arXiv:0804.2699  [pdf, ps, other

    cs.CC cs.DS

    A Critique of a Polynomial-time SAT Solver Devised by Sergey Gubin

    Authors: Ian Christopher, Dennis Huo, Bryan Jacobs

    Abstract: This paper refutes the validity of the polynomial-time algorithm for solving satisfiability proposed by Sergey Gubin. Gubin introduces the algorithm using 3-SAT and eventually expands it to accept a broad range of forms of the Boolean satisfiability problem. Because 3-SAT is NP-complete, the algorithm would have implied P = NP, had it been correct. Additionally, this paper refutes the correctnes… ▽ More

    Submitted 16 April, 2008; originally announced April 2008.