Zum Hauptinhalt springen

Showing 1–23 of 23 results for author: Mou, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.05966  [pdf, other

    cs.LG math.NA math.OC math.PR

    On Bellman equations for continuous-time policy evaluation I: discretization and approximation

    Authors: Wenlong Mou, Yuhua Zhu

    Abstract: We study the problem of computing the value function from a discretely-observed trajectory of a continuous-time diffusion process. We develop a new class of algorithms based on easily implementable numerical schemes that are compatible with discrete-time reinforcement learning (RL) with function approximation. We establish high-order numerical accuracy as well as the approximation error guarantees… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: WM and YZ contributed equally to this work

  2. arXiv:2403.07877  [pdf, other

    cs.RO cs.CV

    Generating Future Observations to Estimate Grasp Success in Cluttered Environments

    Authors: Daniel Fernandes Gomes, Wenxuan Mou, Paolo Paoletti, Shan Luo

    Abstract: End-to-end self-supervised models have been proposed for estimating the success of future candidate grasps and video predictive models for generating future observations. However, none have yet studied these two strategies side-by-side for addressing the aforementioned grasping problem. We investigate and compare a model-free approach, to estimate the success of a candidate grasp, against a model-… ▽ More

    Submitted 18 December, 2023; originally announced March 2024.

    Journal ref: 5th UK Robot Manipulation Workshop 2024

  3. arXiv:2312.10074  [pdf

    cs.HC

    STAGER checklist: Standardized Testing and Assessment Guidelines for Evaluating Generative AI Reliability

    Authors: Jinghong Chen, Lingxuan Zhu, Weiming Mou, Zaoqu Liu, Quan Cheng, Anqi Lin, Jian Zhang, Peng Luo

    Abstract: Generative Artificial Intelligence (AI) holds immense potential in medical applications. Numerous studies have explored the efficacy of various generative AI models within healthcare contexts, but there is a lack of a comprehensive and systematic evaluation framework. Given that some studies evaluating the ability of generative AI for medical applications have deficiencies in their methodological… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 11 pages, 0 figure, 2 tables

  4. arXiv:2209.13075  [pdf, other

    math.ST cs.IT stat.ML

    Off-policy estimation of linear functionals: Non-asymptotic theory for semi-parametric efficiency

    Authors: Wenlong Mou, Martin J. Wainwright, Peter L. Bartlett

    Abstract: The problem of estimating a linear functional based on observational data is canonical in both the causal inference and bandit literatures. We analyze a broad class of two-stage procedures that first estimate the treatment effect function, and then use this quantity to estimate the linear functional. We prove non-asymptotic upper bounds on the mean-squared error of such procedures: these bounds re… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: 56 pages, 6 figures

  5. arXiv:2201.08518  [pdf, ps, other

    math.ST cs.LG math.OC stat.ML

    Optimal variance-reduced stochastic approximation in Banach spaces

    Authors: Wenlong Mou, Koulik Khamaru, Martin J. Wainwright, Peter L. Bartlett, Michael I. Jordan

    Abstract: We study the problem of estimating the fixed point of a contractive operator defined on a separable Banach space. Focusing on a stochastic query model that provides noisy evaluations of the operator, we analyze a variance-reduced stochastic approximation scheme, and establish non-asymptotic bounds for both the operator defect and the estimation error, measured in an arbitrary semi-norm. In contras… ▽ More

    Submitted 29 November, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

  6. arXiv:2112.12770  [pdf, ps, other

    math.OC cs.LG math.PR math.ST stat.ML

    Optimal and instance-dependent guarantees for Markovian linear stochastic approximation

    Authors: Wenlong Mou, Ashwin Pananjady, Martin J. Wainwright, Peter L. Bartlett

    Abstract: We study stochastic approximation procedures for approximately solving a $d$-dimensional linear fixed point equation based on observing a trajectory of length $n$ from an ergodic Markov chain. We first exhibit a non-asymptotic bound of the order $t_{\mathrm{mix}} \tfrac{d}{n}$ on the squared error of the last iterate of a standard scheme, where $t_{\mathrm{mix}}$ is a mixing time. We then prove a… ▽ More

    Submitted 11 May, 2024; v1 submitted 23 December, 2021; originally announced December 2021.

    Comments: Published at Mathematical Statistics and Learning

  7. arXiv:2101.10819  [pdf, other

    cs.RO

    When Would You Trust a Robot? A Study on Trust and Theory of Mind in Human-Robot Interactions

    Authors: Wenxuan Mou, Martina Ruocco, Debora Zanatto, Angelo Cangelosi

    Abstract: Trust is a critical issue in Human Robot Interactions as it is the core of human desire to accept and use a non human agent. Theory of Mind has been defined as the ability to understand the beliefs and intentions of others that may differ from one's own. Evidences in psychology and HRI suggest that trust and Theory of Mind are interconnected and interdependent concepts, as the decision to trust an… ▽ More

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: 7 pages, 4 figures, conference

  8. arXiv:2012.05299  [pdf, other

    cs.LG math.OC math.ST stat.ML

    Optimal oracle inequalities for solving projected fixed-point equations

    Authors: Wenlong Mou, Ashwin Pananjady, Martin J. Wainwright

    Abstract: Linear fixed point equations in Hilbert spaces arise in a variety of settings, including reinforcement learning, and computational methods for solving differential and integral equations. We study methods that use a collection of random observations to compute approximate solutions by searching over a known low-dimensional subspace of the Hilbert space. First, we prove an instance-dependent upper… ▽ More

    Submitted 9 December, 2020; originally announced December 2020.

  9. arXiv:2008.07353  [pdf, ps, other

    cs.LG cs.AI cs.DS stat.ML

    On the Sample Complexity of Reinforcement Learning with Policy Space Generalization

    Authors: Wenlong Mou, Zheng Wen, Xi Chen

    Abstract: We study the optimal sample complexity in large-scale Reinforcement Learning (RL) problems with policy space generalization, i.e. the agent has a prior knowledge that the optimal policy lies in a known policy space. Existing results show that without a generalization model, the sample complexity of an RL algorithm will inevitably depend on the cardinalities of state space and action space, which a… ▽ More

    Submitted 17 August, 2020; originally announced August 2020.

  10. arXiv:2004.04719  [pdf, ps, other

    stat.ML cs.LG math.OC math.ST

    On Linear Stochastic Approximation: Fine-grained Polyak-Ruppert and Non-Asymptotic Concentration

    Authors: Wenlong Mou, Chris Junchi Li, Martin J. Wainwright, Peter L. Bartlett, Michael I. Jordan

    Abstract: We undertake a precise study of the asymptotic and non-asymptotic properties of stochastic approximation procedures with Polyak-Ruppert averaging for solving a linear system $\bar{A} θ= \bar{b}$. When the matrix $\bar{A}$ is Hurwitz, we prove a central limit theorem (CLT) for the averaged iterates with fixed step size and number of iterations going to infinity. The CLT characterizes the exact asym… ▽ More

    Submitted 9 April, 2020; originally announced April 2020.

  11. arXiv:2004.02980  [pdf, other

    cs.CV cs.LG eess.IV

    LUVLi Face Alignment: Estimating Landmarks' Location, Uncertainty, and Visibility Likelihood

    Authors: Abhinav Kumar, Tim K. Marks, Wenxuan Mou, Ye Wang, Michael Jones, Anoop Cherian, Toshiaki Koike-Akino, Xiaoming Liu, Chen Feng

    Abstract: Modern face alignment methods have become quite accurate at predicting the locations of facial landmarks, but they do not typically estimate the uncertainty of their predicted locations nor predict whether landmarks are visible. In this paper, we present a novel framework for jointly predicting landmark locations, associated uncertainties of these predicted locations, and landmark visibilities. We… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

    Comments: Accepted to CVPR 2020

  12. arXiv:1912.05153  [pdf, other

    stat.ML cs.DS cs.LG math.PR stat.CO

    Sampling for Bayesian Mixture Models: MCMC with Polynomial-Time Mixing

    Authors: Wenlong Mou, Nhat Ho, Martin J. Wainwright, Peter L. Bartlett, Michael I. Jordan

    Abstract: We study the problem of sampling from the power posterior distribution in Bayesian Gaussian mixture models, a robust version of the classical posterior. This power posterior is known to be non-log-concave and multi-modal, which leads to exponential mixing times for some standard MCMC algorithms. We introduce and study the Reflected Metropolis-Hastings Random Walk (RMRW) algorithm for sampling. For… ▽ More

    Submitted 11 December, 2019; originally announced December 2019.

  13. arXiv:1910.00551  [pdf, ps, other

    stat.ML cs.DS cs.LG stat.CO

    An Efficient Sampling Algorithm for Non-smooth Composite Potentials

    Authors: Wenlong Mou, Nicolas Flammarion, Martin J. Wainwright, Peter L. Bartlett

    Abstract: We consider the problem of sampling from a density of the form $p(x) \propto \exp(-f(x)- g(x))$, where $f: \mathbb{R}^d \rightarrow \mathbb{R}$ is a smooth and strongly convex function and $g: \mathbb{R}^d \rightarrow \mathbb{R}$ is a convex and Lipschitz function. We propose a new algorithm based on the Metropolis-Hastings framework, and prove that it mixes to within TV distance $\varepsilon$ of… ▽ More

    Submitted 1 October, 2019; originally announced October 2019.

  14. arXiv:1908.10859  [pdf, ps, other

    stat.ML cs.DS cs.LG math.OC stat.CO

    High-Order Langevin Diffusion Yields an Accelerated MCMC Algorithm

    Authors: Wenlong Mou, Yi-An Ma, Martin J. Wainwright, Peter L. Bartlett, Michael I. Jordan

    Abstract: We propose a Markov chain Monte Carlo (MCMC) algorithm based on third-order Langevin dynamics for sampling from distributions with log-concave and smooth densities. The higher-order dynamics allow for more flexible discretization schemes, and we develop a specific method that combines splitting with more accurate integration. For a broad class of $d$-dimensional distributions arising from generali… ▽ More

    Submitted 26 May, 2020; v1 submitted 28 August, 2019; originally announced August 2019.

    Comments: Changes from v1: improved algorithm with $O (d^{1/4} / \varepsilon^{1/2})$ mixing time

  15. arXiv:1806.07507  [pdf, other

    cs.RO

    iCLAP: Shape Recognition by Combining Proprioception and Touch Sensing

    Authors: Shan Luo, Wenxuan Mou, Kaspar Althoefer, Hongbin Liu

    Abstract: For humans, both the proprioception and touch sensing are highly utilized when performing haptic perception. However, most approaches in robotics use only either proprioceptive data or touch data in haptic object recognition. In this paper, we present a novel method named Iterative Closest Labeled Point (iCLAP) to link the kinesthetic cues and tactile patterns fundamentally and also introduce its… ▽ More

    Submitted 19 June, 2018; originally announced June 2018.

    Comments: 10 pages, 12 figures, accepted to Autonomous Robots

  16. Localizing the Object Contact through Matching Tactile Features with Visual Map

    Authors: Shan Luo, Wenxuan Mou, Kaspar Althoefer, Hongbin Liu

    Abstract: This paper presents a novel framework for integration of vision and tactile sensing by localizing tactile readings in a visual object map. Intuitively, there are some correspondences, e.g., prominent features, between visual and tactile object identification. To apply it in robotics, we propose to localize tactile readings in visual images by sharing same sets of feature descriptors through two se… ▽ More

    Submitted 15 August, 2017; originally announced August 2017.

    Comments: 6 pages, 8 figures, ICRA 2015

    Journal ref: ICRA 2015

  17. Iterative Closest Labeled Point for Tactile Object Shape Recognition

    Authors: Shan Luo, Wenxuan Mou, Kaspar Althoefer, Hongbin Liu

    Abstract: Tactile data and kinesthetic cues are two important sensing sources in robot object recognition and are complementary to each other. In this paper, we propose a novel algorithm named Iterative Closest Labeled Point (iCLAP) to recognize objects using both tactile and kinesthetic information.The iCLAP first assigns different local tactile features with distinct label numbers. The label numbers of th… ▽ More

    Submitted 15 August, 2017; originally announced August 2017.

    Comments: 6 pages, 8 figures, IROS 2016

  18. arXiv:1707.05947  [pdf, other

    cs.LG math.OC stat.ML

    Generalization Bounds of SGLD for Non-convex Learning: Two Theoretical Viewpoints

    Authors: Wenlong Mou, Liwei Wang, Xiyu Zhai, Kai Zheng

    Abstract: Algorithm-dependent generalization error bounds are central to statistical learning theory. A learning algorithm may use a large hypothesis space, but the limited number of iterations controls its model capacity and generalization error. The impacts of stochastic gradient methods on generalization error for non-convex learning problems not only have important theoretical consequences, but are also… ▽ More

    Submitted 19 July, 2017; originally announced July 2017.

  19. arXiv:1706.03316  [pdf, ps, other

    cs.LG cs.DS

    Collect at Once, Use Effectively: Making Non-interactive Locally Private Learning Possible

    Authors: Kai Zheng, Wenlong Mou, Liwei Wang

    Abstract: Non-interactive Local Differential Privacy (LDP) requires data analysts to collect data from users through noisy channel at once. In this paper, we extend the frontiers of Non-interactive LDP learning and estimation from several aspects. For learning with smooth generalized linear losses, we propose an approximate stochastic gradient oracle estimated from non-interactive LDP channel, using Chebysh… ▽ More

    Submitted 11 June, 2017; originally announced June 2017.

  20. arXiv:1703.09947  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Efficient Private ERM for Smooth Objectives

    Authors: Jiaqi Zhang, Kai Zheng, Wenlong Mou, Liwei Wang

    Abstract: In this paper, we consider efficient differentially private empirical risk minimization from the viewpoint of optimization algorithms. For strongly convex and smooth objectives, we prove that gradient descent with output perturbation not only achieves nearly optimal utility, but also significantly improves the running time of previous state-of-the-art private optimization algorithms, for both $ε$-… ▽ More

    Submitted 24 May, 2017; v1 submitted 29 March, 2017; originally announced March 2017.

  21. arXiv:1612.04659  [pdf, other

    cs.NE cs.DS cs.LG

    Stable Memory Allocation in the Hippocampus: Fundamental Limits and Neural Realization

    Authors: Wenlong Mou, Zhi Wang, Liwei Wang

    Abstract: It is believed that hippocampus functions as a memory allocator in brain, the mechanism of which remains unrevealed. In Valiant's neuroidal model, the hippocampus was described as a randomly connected graph, the computation on which maps input to a set of activated neuroids with stable size. Valiant proposed three requirements for the hippocampal circuit to become a stable memory allocator (SMA):… ▽ More

    Submitted 14 December, 2016; originally announced December 2016.

  22. arXiv:1612.04571  [pdf, ps, other

    cs.DS cs.CG

    A Refined Analysis of LSH for Well-dispersed Data Points

    Authors: Wenlong Mou, Liwei Wang

    Abstract: Near neighbor problems are fundamental in algorithms for high-dimensional Euclidean spaces. While classical approaches suffer from the curse of dimensionality, locality sensitive hashing (LSH) can effectively solve a-approximate r-near neighbor problem, and has been proven to be optimal in the worst case. However, for real-world data sets, LSH can naturally benefit from well-dispersed data and low… ▽ More

    Submitted 14 December, 2016; originally announced December 2016.

    Comments: Paper accepted to SIAM Conference on Analytic Algorithmics and Combinatorics (ANALCO) 2017

  23. arXiv:1507.03148  [pdf, other

    cs.CV

    Face Alignment Assisted by Head Pose Estimation

    Authors: Heng Yang, Wenxuan Mou, Yichi Zhang, Ioannis Patras, Hatice Gunes, Peter Robinson

    Abstract: In this paper we propose a supervised initialization scheme for cascaded face alignment based on explicit head pose estimation. We first investigate the failure cases of most state of the art face alignment approaches and observe that these failures often share one common global property, i.e. the head pose variation is usually large. Inspired by this, we propose a deep convolutional network model… ▽ More

    Submitted 18 July, 2015; v1 submitted 11 July, 2015; originally announced July 2015.

    Comments: Accepted by BMVC2015