Zum Hauptinhalt springen

Showing 1–30 of 30 results for author: Xiang, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.11251  [pdf, other

    cs.CV

    Irregularity Inspection using Neural Radiance Field

    Authors: Tianqi Ding, Dawei Xiang

    Abstract: With the increasing growth of industrialization, more and more industries are relying on machine automation for production. However, defect detection in large-scale production machinery is becoming increasingly important. Due to their large size and height, it is often challenging for professionals to conduct defect inspections on such large machinery. For example, the inspection of aging and misa… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  2. arXiv:2407.10988  [pdf, other

    cs.LG

    Residual resampling-based physics-informed neural network for neutron diffusion equations

    Authors: Heng Zhang, Yun-Ling He, Dong Liu, Qin Hang, He-Min Yao, Di Xiang

    Abstract: The neutron diffusion equation plays a pivotal role in the analysis of nuclear reactors. Nevertheless, employing the Physics-Informed Neural Network (PINN) method for its solution entails certain limitations. Traditional PINN approaches often utilize fully connected network (FCN) architecture, which is susceptible to overfitting, training instability, and gradient vanishing issues as the network d… ▽ More

    Submitted 23 June, 2024; originally announced July 2024.

  3. arXiv:2407.10310  [pdf, other

    cs.CY eess.SY

    Impact of Different Infrastructures and Traffic Scenarios on Behavioral and Physiological Responses of E-scooter Users

    Authors: Dong Chen, Arman Hosseini, Arik Smith, David Xiang, Arsalan Heydarian, Omid Shoghli, Bradford Campbell

    Abstract: As micromobility devices such as e-scooters gain global popularity, emergency departments around the world have observed a rising trend in related injuries. However, the majority of current research on e-scooter safety relies heavily on surveys, news reports, and data from vendors, with a noticeable scarcity of naturalistic studies examining the effects of riders' behaviors and physiological respo… ▽ More

    Submitted 5 May, 2024; originally announced July 2024.

    Comments: 6 pages, 8 figures

  4. arXiv:2407.06612  [pdf

    eess.IV cs.CV cs.LG

    AI-based Automatic Segmentation of Prostate on Multi-modality Images: A Review

    Authors: Rui Jin, Derun Li, Dehui Xiang, Lei Zhang, Hailing Zhou, Fei Shi, Weifang Zhu, Jing Cai, Tao Peng, Xinjian Chen

    Abstract: Prostate cancer represents a major threat to health. Early detection is vital in reducing the mortality rate among prostate cancer patients. One approach involves using multi-modality (CT, MRI, US, etc.) computer-aided diagnosis (CAD) systems for the prostate region. However, prostate segmentation is challenging due to imperfections in the images and the prostate's complex tissue structure. The ad… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  5. arXiv:2407.03187  [pdf

    cs.CY

    Holistic view of the road transportation system based on real-time data sharing mechanism

    Authors: Li Tao, Dong Xiang, Hao Junfeng, Yin Ping, Xu Xiaoxue, Lai Maokai, Li Yuan, Peng Ting

    Abstract: Traditional manual driving and single-vehicle-based intelligent driving have limitations in real-time and accurate acquisition of the current driving status and intentions of surrounding vehicles, leading to vehicles typically maintaining appropriate safe distances from each other. Yet, accidents still frequently occur, especially in merging areas; meanwhile, it is difficult to comprehensively obt… ▽ More

    Submitted 3 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

  6. arXiv:2404.04421  [pdf, other

    cs.GR cs.CV

    PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations

    Authors: Yang Zheng, Qingqing Zhao, Guandao Yang, Wang Yifan, Donglai Xiang, Florian Dubost, Dmitry Lagun, Thabo Beeler, Federico Tombari, Leonidas Guibas, Gordon Wetzstein

    Abstract: Modeling and rendering photorealistic avatars is of crucial importance in many applications. Existing methods that build a 3D avatar from visual observations, however, struggle to reconstruct clothed humans. We introduce PhysAvatar, a novel framework that combines inverse rendering with inverse physics to automatically estimate the shape and appearance of a human from multi-view video data along w… ▽ More

    Submitted 9 April, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

    Comments: Project Page: https://qingqing-zhao.github.io/PhysAvatar

  7. arXiv:2402.09434  [pdf, other

    eess.SP cs.LG

    Disentangling Imperfect: A Wavelet-Infused Multilevel Heterogeneous Network for Human Activity Recognition in Flawed Wearable Sensor Data

    Authors: Mengna Liu, Dong Xiang, Xu Cheng, Xiufeng Liu, Dalin Zhang, Shengyong Chen, Christian S. Jensen

    Abstract: The popularity and diffusion of wearable devices provides new opportunities for sensor-based human activity recognition that leverages deep learning-based algorithms. Although impressive advances have been made, two major challenges remain. First, sensor data is often incomplete or noisy due to sensor placement and other issues as well as data transmission failure, calling for imputation of missin… ▽ More

    Submitted 26 January, 2024; originally announced February 2024.

    Comments: 14 pages, 7 figures

  8. arXiv:2401.17268  [pdf, other

    cs.CL cs.AI cs.LG

    Weaver: Foundation Models for Creative Writing

    Authors: Tiannan Wang, Jiamin Chen, Qingrui Jia, Shuai Wang, Ruoyu Fang, Huilin Wang, Zhaowei Gao, Chunzhao Xie, Chuou Xu, Jihong Dai, Yibin Liu, Jialong Wu, Shengwei Ding, Long Li, Zhiwei Huang, Xinle Deng, Teng Yu, Gangan Ma, Han Xiao, Zixin Chen, Danjun Xiang, Yunxia Wang, Yuanyuan Zhu, Yi Xiao, Jing Wang , et al. (21 additional authors not shown)

    Abstract: This work introduces Weaver, our first family of large language models (LLMs) dedicated to content creation. Weaver is pre-trained on a carefully selected corpus that focuses on improving the writing capabilities of large language models. We then fine-tune Weaver for creative and professional writing purposes and align it to the preference of professional writers using a suit of novel methods for… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  9. arXiv:2311.05828  [pdf, other

    cs.CV

    Diffusion Shape Prior for Wrinkle-Accurate Cloth Registration

    Authors: Jingfan Guo, Fabian Prada, Donglai Xiang, Javier Romero, Chenglei Wu, Hyun Soo Park, Takaaki Shiratori, Shunsuke Saito

    Abstract: Registering clothes from 4D scans with vertex-accurate correspondence is challenging, yet important for dynamic appearance modeling and physics parameter estimation from real-world data. However, previous methods either rely on texture information, which is not always reliable, or achieve only coarse-level alignment. In this work, we present a novel approach to enabling accurate surface registrati… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Project page: https://www-users.cse.umn.edu/~guo00109/projects/3dv2024/

  10. Drivable Avatar Clothing: Faithful Full-Body Telepresence with Dynamic Clothing Driven by Sparse RGB-D Input

    Authors: Donglai Xiang, Fabian Prada, Zhe Cao, Kaiwen Guo, Chenglei Wu, Jessica Hodgins, Timur Bagautdinov

    Abstract: Clothing is an important part of human appearance but challenging to model in photorealistic avatars. In this work we present avatars with dynamically moving loose clothing that can be faithfully driven by sparse RGB-D inputs as well as body and face motion. We propose a Neural Iterative Closest Point (N-ICP) algorithm that can efficiently track the coarse garment shape given sparse depth input. G… ▽ More

    Submitted 11 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: SIGGRAPH Asia 2023 Conference Paper. Project website: https://xiangdonglai.github.io/www-sa23-drivable-clothing/

  11. arXiv:2309.02318  [pdf, other

    cs.CV eess.IV

    TiAVox: Time-aware Attenuation Voxels for Sparse-view 4D DSA Reconstruction

    Authors: Zhenghong Zhou, Huangxuan Zhao, Jiemin Fang, Dongqiao Xiang, Lei Chen, Lingxia Wu, Feihong Wu, Wenyu Liu, Chuansheng Zheng, Xinggang Wang

    Abstract: Four-dimensional Digital Subtraction Angiography (4D DSA) plays a critical role in the diagnosis of many medical diseases, such as Arteriovenous Malformations (AVM) and Arteriovenous Fistulas (AVF). Despite its significant application value, the reconstruction of 4D DSA demands numerous views to effectively model the intricate vessels and radiocontrast flow, thereby implying a significant radiatio… ▽ More

    Submitted 19 December, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: 10 pages, 8 figures

  12. Dressing Avatars: Deep Photorealistic Appearance for Physically Simulated Clothing

    Authors: Donglai Xiang, Timur Bagautdinov, Tuur Stuyck, Fabian Prada, Javier Romero, Weipeng Xu, Shunsuke Saito, Jingfan Guo, Breannan Smith, Takaaki Shiratori, Yaser Sheikh, Jessica Hodgins, Chenglei Wu

    Abstract: Despite recent progress in developing animatable full-body avatars, realistic modeling of clothing - one of the core aspects of human self-expression - remains an open challenge. State-of-the-art physical simulation methods can generate realistically behaving clothing geometry at interactive rates. Modeling photorealistic appearance, however, usually requires physically-based rendering which is to… ▽ More

    Submitted 19 September, 2022; v1 submitted 30 June, 2022; originally announced June 2022.

    Comments: SIGGRAPH Asia 2022 (ACM ToG) camera ready. The supplementary video can be found on https://research.facebook.com/publications/dressing-avatars-deep-photorealistic-appearance-for-physically-simulated-clothing/

  13. arXiv:2206.03373  [pdf, other

    cs.CV

    Garment Avatars: Realistic Cloth Driving using Pattern Registration

    Authors: Oshri Halimi, Fabian Prada, Tuur Stuyck, Donglai Xiang, Timur Bagautdinov, He Wen, Ron Kimmel, Takaaki Shiratori, Chenglei Wu, Yaser Sheikh

    Abstract: Virtual telepresence is the future of online communication. Clothing is an essential part of a person's identity and self-expression. Yet, ground truth data of registered clothes is currently unavailable in the required resolution and accuracy for training telepresence models for realistic cloth animation. Here, we propose an end-to-end pipeline for building drivable representations for clothing.… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

  14. arXiv:2108.01440  [pdf, other

    cs.IR cs.AI cs.LG

    Adaptively Optimize Content Recommendation Using Multi Armed Bandit Algorithms in E-commerce

    Authors: Ding Xiang, Becky West, Jiaqi Wang, Xiquan Cui, Jinzhou Huang

    Abstract: E-commerce sites strive to provide users the most timely relevant information in order to reduce shopping frictions and increase customer satisfaction. Multi armed bandit models (MAB) as a type of adaptive optimization algorithms provide possible approaches for such purposes. In this paper, we analyze using three classic MAB algorithms, epsilon-greedy, Thompson sampling (TS), and upper confidence… ▽ More

    Submitted 19 August, 2021; v1 submitted 30 July, 2021; originally announced August 2021.

  15. Modeling Clothing as a Separate Layer for an Animatable Human Avatar

    Authors: Donglai Xiang, Fabian Prada, Timur Bagautdinov, Weipeng Xu, Yuan Dong, He Wen, Jessica Hodgins, Chenglei Wu

    Abstract: We have recently seen great progress in building photorealistic animatable full-body codec avatars, but generating high-fidelity animation of clothing is still difficult. To address these difficulties, we propose a method to build an animatable clothed body avatar with an explicit representation of the clothing on the upper body from multi-view captured videos. We use a two-layer mesh representati… ▽ More

    Submitted 4 October, 2021; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: Camera ready for SIGGRAPH Asia 2021 Technical Papers. https://research.fb.com/publications/modeling-clothing-as-a-separate-layer-for-an-animatable-human-avatar/

  16. arXiv:2105.13965  [pdf, other

    cs.CV cs.RO

    Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation

    Authors: Taosha Fan, Kalyan Vasudev Alwala, Donglai Xiang, Weipeng Xu, Todd Murphey, Mustafa Mukadam

    Abstract: We propose a novel sparse constrained formulation and from it derive a real-time optimization method for 3D human pose and shape estimation. Our optimization method, SCOPE (Sparse Constrained Optimization for 3D human Pose and shapE estimation), is orders of magnitude faster (avg. 4 ms convergence) than existing optimization methods, while being mathematically equivalent to their dense unconstrain… ▽ More

    Submitted 4 October, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: 21 pages, including appendix

  17. arXiv:2104.03106  [pdf, other

    cs.CV

    V2F-Net: Explicit Decomposition of Occluded Pedestrian Detection

    Authors: Mingyang Shang, Dawei Xiang, Zhicheng Wang, Erjin Zhou

    Abstract: Occlusion is very challenging in pedestrian detection. In this paper, we propose a simple yet effective method named V2F-Net, which explicitly decomposes occluded pedestrian detection into visible region detection and full body estimation. V2F-Net consists of two sub-networks: Visible region Detection Network (VDN) and Full body Estimation Network (FEN). VDN tries to localize visible regions and F… ▽ More

    Submitted 7 April, 2021; originally announced April 2021.

    Comments: 11 pages, 4 figures

  18. arXiv:2009.10711  [pdf, other

    cs.CV cs.GR

    MonoClothCap: Towards Temporally Coherent Clothing Capture from Monocular RGB Video

    Authors: Donglai Xiang, Fabian Prada, Chenglei Wu, Jessica Hodgins

    Abstract: We present a method to capture temporally coherent dynamic clothing deformation from a monocular RGB video input. In contrast to the existing literature, our method does not require a pre-scanned personalized mesh template, and thus can be applied to in-the-wild videos. To constrain the output to a valid deformation space, we build statistical deformation models for three types of clothing: T-shir… ▽ More

    Submitted 23 November, 2020; v1 submitted 22 September, 2020; originally announced September 2020.

    Comments: 3DV 2020 Camera Ready

  19. arXiv:1911.00996  [pdf

    q-bio.QM cs.LG stat.ML

    A Study of Data Pre-processing Techniques for Imbalanced Biomedical Data Classification

    Authors: Shigang Liu, Jun Zhang, Yang Xiang, Wanlei Zhou, Dongxi Xiang

    Abstract: Biomedical data are widely accepted in developing prediction models for identifying a specific tumor, drug discovery and classification of human cancers. However, previous studies usually focused on different classifiers, and overlook the class imbalance problem in real-world biomedical datasets. There are a lack of studies on evaluation of data pre-processing techniques, such as resampling and fe… ▽ More

    Submitted 3 November, 2019; originally announced November 2019.

    Comments: This paper is scheduled for inclusion in V16 N3 2020, International Journal of Bioinformatics Research and Applications (IJBRA)

    Journal ref: V16 N3, International Journal of Bioinformatics Research and Applications (IJBRA), 2020

  20. arXiv:1910.00941  [pdf, ps, other

    cs.IT cs.CC

    A Self-contained Analysis of the Lempel-Ziv Compression Algorithm

    Authors: Madhu Sudan, David Xiang

    Abstract: This article gives a self-contained analysis of the performance of the Lempel-Ziv compression algorithm on (hidden) Markovian sources. Specifically we include a full proof of the assertion that the compression rate approaches the entropy rate of the chain being compressed.

    Submitted 2 October, 2019; originally announced October 2019.

  21. arXiv:1909.13423  [pdf, other

    cs.CV cs.LG

    Single-Network Whole-Body Pose Estimation

    Authors: Gines Hidalgo, Yaadhav Raaj, Haroon Idrees, Donglai Xiang, Hanbyul Joo, Tomas Simon, Yaser Sheikh

    Abstract: We present the first single-network approach for 2D~whole-body pose estimation, which entails simultaneous localization of body, face, hands, and feet keypoints. Due to the bottom-up formulation, our method maintains constant real-time performance regardless of the number of people in the image. The network is trained in a single stage using multi-task learning, through an improved architecture wh… ▽ More

    Submitted 29 September, 2019; originally announced September 2019.

    Comments: ICCV 2019

  22. arXiv:1908.04470  [pdf, ps, other

    stat.ML cs.LG

    Comparison theorems on large-margin learning

    Authors: Jun Fan, Dao-Hong Xiang

    Abstract: This paper studies binary classification problem associated with a family of loss functions called large-margin unified machines (LUM), which offers a natural bridge between distribution-based likelihood approaches and margin-based approaches. It also can overcome the so-called data piling issue of support vector machine in the high-dimension and low-sample size setting. In this paper we establish… ▽ More

    Submitted 12 August, 2019; originally announced August 2019.

  23. arXiv:1904.09882  [pdf, other

    cs.CV

    You2Me: Inferring Body Pose in Egocentric Video via First and Second Person Interactions

    Authors: Evonne Ng, Donglai Xiang, Hanbyul Joo, Kristen Grauman

    Abstract: The body pose of a person wearing a camera is of great interest for applications in augmented reality, healthcare, and robotics, yet much of the person's body is out of view for a typical wearable camera. We propose a learning-based approach to estimate the camera wearer's 3D body pose from egocentric video sequences. Our key insight is to leverage interactions with another person---whose body pos… ▽ More

    Submitted 27 March, 2020; v1 submitted 22 April, 2019; originally announced April 2019.

  24. arXiv:1901.09388  [pdf, other

    cs.SE

    Moving Deep Learning into Web Browser: How Far Can We Go?

    Authors: Yun Ma, Dongwei Xiang, Shuyu Zheng, Deyu Tian, Xuanzhe Liu

    Abstract: Recently, several JavaScript-based deep learning frameworks have emerged, making it possible to perform deep learning tasks directly in browsers. However, little is known on what and how well we can do with these frameworks for deep learning in browsers. To bridge the knowledge gap, in this paper, we conduct the first empirical study of deep learning in browsers. We survey 7 most popular JavaScrip… ▽ More

    Submitted 24 March, 2019; v1 submitted 27 January, 2019; originally announced January 2019.

  25. arXiv:1812.01598  [pdf, other

    cs.CV cs.GR

    Monocular Total Capture: Posing Face, Body, and Hands in the Wild

    Authors: Donglai Xiang, Hanbyul Joo, Yaser Sheikh

    Abstract: We present the first method to capture the 3D total motion of a target person from a monocular view input. Given an image or a monocular video, our method reconstructs the motion from body, face, and fingers represented by a 3D deformable mesh model. We use an efficient representation called 3D Part Orientation Fields (POFs), to encode the 3D orientations of all body parts in the common 2D image s… ▽ More

    Submitted 4 December, 2018; originally announced December 2018.

    Comments: 17 pages, 16 figures

  26. arXiv:1810.02815  [pdf, other

    cs.CE econ.GN math.OC

    A General Sensitivity Analysis Approach for Demand Response Optimizations

    Authors: Ding Xiang, Ermin Wei

    Abstract: It is well-known that demand response can improve the system efficiency as well as lower consumers' (prosumers') electricity bills. However, it is not clear how we can either qualitatively identify the prosumer with the most impact potential or quantitatively estimate each prosumer's contribution to the total social welfare improvement when additional resource capacity/flexibility is introduced to… ▽ More

    Submitted 7 October, 2018; originally announced October 2018.

    Comments: 17 pages

  27. arXiv:1709.07625  [pdf, ps, other

    stat.ML cs.LG

    Total stability of kernel methods

    Authors: Andreas Christmann, Daohong Xiang, Ding-Xuan Zhou

    Abstract: Regularized empirical risk minimization using kernels and their corresponding reproducing kernel Hilbert spaces (RKHSs) plays an important role in machine learning. However, the actually used kernel often depends on one or on a few hyperparameters or the kernel is even data dependent in a much more complicated manner. Examples are Gaussian RBF kernels, kernel learning, and hierarchical Gaussian ke… ▽ More

    Submitted 22 September, 2017; originally announced September 2017.

  28. arXiv:1704.02956  [pdf, other

    cs.CV

    Surface Normals in the Wild

    Authors: Weifeng Chen, Donglai Xiang, Jia Deng

    Abstract: We study the problem of single-image depth estimation for images in the wild. We collect human annotated surface normals and use them to train a neural network that directly predicts pixel-wise depth. We propose two novel loss functions for training with surface normal annotations. Experiments on NYU Depth and our own dataset demonstrate that our approach can significantly improve the quality of d… ▽ More

    Submitted 10 April, 2017; originally announced April 2017.

  29. arXiv:1604.04505  [pdf, ps, other

    stat.ML cs.LG

    A short note on extension theorems and their connection to universal consistency in machine learning

    Authors: Andreas Christmann, Florian Dumpert, Dao-Hong Xiang

    Abstract: Statistical machine learning plays an important role in modern statistics and computer science. One main goal of statistical machine learning is to provide universally consistent algorithms, i.e., the estimator converges in probability or in some stronger sense to the Bayes risk or to the Bayes decision function. Kernel methods based on minimizing the regularized risk over a reproducing kernel Hil… ▽ More

    Submitted 15 April, 2016; originally announced April 2016.

    Comments: 14 pages

  30. arXiv:1511.04510  [pdf, other

    cs.CV

    Semantic Object Parsing with Local-Global Long Short-Term Memory

    Authors: Xiaodan Liang, Xiaohui Shen, Donglai Xiang, Jiashi Feng, Liang Lin, Shuicheng Yan

    Abstract: Semantic object parsing is a fundamental task for understanding objects in detail in computer vision community, where incorporating multi-level contextual information is critical for achieving such fine-grained pixel-level recognition. Prior methods often leverage the contextual information through post-processing predicted confidence maps. In this work, we propose a novel deep Local-Global Long S… ▽ More

    Submitted 14 November, 2015; originally announced November 2015.

    Comments: 10 pages