Zum Hauptinhalt springen

Showing 1–41 of 41 results for author: Jha, D K

.
  1. arXiv:2407.16976  [pdf, other

    cs.RO

    Simultaneous Trajectory Optimization and Contact Selection for Contact-rich Manipulation with High-Fidelity Geometry

    Authors: Mengchao Zhang, Devesh K. Jha, Arvind U. Raghunathan, Kris Hauser

    Abstract: Contact-implicit trajectory optimization (CITO) is an effective method to plan complex trajectories for various contact-rich systems including manipulation and locomotion. CITO formulates a mathematical program with complementarity constraints (MPCC) that enforces that contact forces must be zero when points are not in contact. However, MPCC solve times increase steeply with the number of allowabl… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2306.06465

  2. arXiv:2406.05331  [pdf, other

    cs.RO

    Autonomous Robotic Assembly: From Part Singulation to Precise Assembly

    Authors: Kei Ota, Devesh K. Jha, Siddarth Jain, Bill Yerazunis, Radu Corcodel, Yash Shukla, Antonia Bronars, Diego Romeres

    Abstract: Imagine a robot that can assemble a functional product from the individual parts presented in any configuration to the robot. Designing such a robotic system is a complex problem which presents several open challenges. To bypass these challenges, the current generation of assembly systems is built with a lot of system integration effort to provide the structure and precision necessary for assembly… ▽ More

    Submitted 11 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: Under submission

  3. arXiv:2401.02883  [pdf, other

    cs.RO eess.SY

    iPolicy: Incremental Policy Algorithms for Feedback Motion Planning

    Authors: Guoxiang Zhao, Devesh K. Jha, Yebin Wang, Minghui Zhu

    Abstract: This paper presents policy-based motion planning for robotic systems. The motion planning literature has been mostly focused on open-loop trajectory planning which is followed by tracking online. In contrast, we solve the problem of path planning and controller synthesis simultaneously by solving the related feedback control problem. We present a novel incremental policy (iPolicy) algorithm for mo… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  4. arXiv:2312.10571  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Multi-level Reasoning for Robotic Assembly: From Sequence Inference to Contact Selection

    Authors: Xinghao Zhu, Devesh K. Jha, Diego Romeres, Lingfeng Sun, Masayoshi Tomizuka, Anoop Cherian

    Abstract: Automating the assembly of objects from their parts is a complex problem with innumerable applications in manufacturing, maintenance, and recycling. Unlike existing research, which is limited to target segmentation, pose regression, or using fixed target blueprints, our work presents a holistic multi-level framework for part assembly planning consisting of part assembly sequence inference, part mo… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: Supplementary video is available at https://www.youtube.com/watch?v=XNYkWSHkAaU&ab_channel=MitsubishiElectricResearchLabs%28MERL%29

  5. arXiv:2312.06876  [pdf, other

    cs.RO cs.AI

    Interactive Planning Using Large Language Models for Partially Observable Robotics Tasks

    Authors: Lingfeng Sun, Devesh K. Jha, Chiori Hori, Siddarth Jain, Radu Corcodel, Xinghao Zhu, Masayoshi Tomizuka, Diego Romeres

    Abstract: Designing robotic agents to perform open vocabulary tasks has been the long-standing goal in robotics and AI. Recently, Large Language Models (LLMs) have achieved impressive results in creating robotic agents for performing open vocabulary tasks. However, planning for these tasks in the presence of uncertainties is challenging as it requires \enquote{chain-of-thought} reasoning, aggregating inform… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 22 pages, 4 figures

  6. arXiv:2309.14552  [pdf, other

    cs.RO cs.AI cs.LG

    Tactile Estimation of Extrinsic Contact Patch for Stable Placement

    Authors: Kei Ota, Devesh K. Jha, Krishna Murthy Jatavallabhula, Asako Kanezaki, Joshua B. Tenenbaum

    Abstract: Precise perception of contact interactions is essential for fine-grained manipulation skills for robots. In this paper, we present the design of feedback skills for robots that must learn to stack complex-shaped objects on top of each other (see Fig.1). To design such a system, a robot should be able to reason about the stability of placement from very gentle contact interactions. Our results demo… ▽ More

    Submitted 23 March, 2024; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted at ICRA2024

  7. arXiv:2306.06465  [pdf, other

    cs.RO

    Simultaneous Trajectory Optimization and Contact Selection for Multi-Modal Manipulation Planning

    Authors: Mengchao Zhang, Devesh K. Jha, Arvind U. Raghunathan, Kris Hauser

    Abstract: Complex dexterous manipulations require switching between prehensile and non-prehensile grasps, and sliding and pivoting the object against the environment. This paper presents a manipulation planner that is able to reason about diverse changes of contacts to discover such plans. It implements a hybrid approach that performs contact-implicit trajectory optimization for pivoting and sliding manipul… ▽ More

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: 10 pages, 9 figures, to be published in RSS 2023

  8. arXiv:2305.10960  [pdf, other

    cs.RO cs.AI

    A Virtual Reality Teleoperation Interface for Industrial Robot Manipulators

    Authors: Eric Rosen, Devesh K. Jha

    Abstract: We address the problem of teleoperating an industrial robot manipulator via a commercially available Virtual Reality (VR) interface. Previous works on VR teleoperation for robot manipulators focus primarily on collaborative or research robot platforms (whose dynamics and constraints differ from industrial robot arms), or only address tasks where the robot's dynamics are not as important (e.g: pick… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 7 pages, 6 figures

  9. Covariance Steering for Uncertain Contact-rich Systems

    Authors: Yuki Shirai, Devesh K. Jha, Arvind U. Raghunathan

    Abstract: Planning and control for uncertain contact systems is challenging as it is not clear how to propagate uncertainty for planning. Contact-rich tasks can be modeled efficiently using complementarity constraints among other techniques. In this paper, we present a stochastic optimization technique with chance constraints for systems with stochastic complementarity constraints. We use a particle filter-… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted to the 2023 International Conference on Robotics and Automation (ICRA2023)

  10. arXiv:2303.08965  [pdf, other

    cs.RO cs.AI eess.SY

    Robust Pivoting Manipulation using Contact Implicit Bilevel Optimization

    Authors: Yuki Shirai, Devesh K. Jha, Arvind U. Raghunathan

    Abstract: Generalizable manipulation requires that robots be able to interact with novel objects and environment. This requirement makes manipulation extremely challenging as a robot has to reason about complex frictional interactions with uncertainty in physical properties of the object and the environment. In this paper, we study robust optimization for planning of pivoting manipulation in the presence of… ▽ More

    Submitted 4 July, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: Accepted for IEEE Transactions on Robotics. arXiv admin note: text overlap with arXiv:2203.11412

  11. arXiv:2303.06034  [pdf, other

    cs.RO cs.AI cs.LG

    Tactile-Filter: Interactive Tactile Perception for Part Mating

    Authors: Kei Ota, Devesh K. Jha, Hsiao-Yu Tung, Joshua B. Tenenbaum

    Abstract: Humans rely on touch and tactile sensing for a lot of dexterous manipulation tasks. Our tactile sensing provides us with a lot of information regarding contact formations as well as geometric information about objects during any interaction. With this motivation, vision-based tactile sensors are being widely used for various robotic perception and control tasks. In this paper, we present a method… ▽ More

    Submitted 5 June, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted at RSS2023

  12. arXiv:2303.03385  [pdf, other

    cs.RO

    Simultaneous Tactile Estimation and Control of Extrinsic Contact

    Authors: Sangwoon Kim, Devesh K. Jha, Diego Romeres, Parag Patre, Alberto Rodriguez

    Abstract: We propose a method that simultaneously estimates and controls extrinsic contact with tactile feedback. The method enables challenging manipulation tasks that require controlling light forces and accurate motions in contact, such as balancing an unknown object on a thin rod standing upright. A factor graph-based framework fuses a sequence of tactile and kinematic measurements to estimate and contr… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: 2023 International Conference on Robotics and Automation (ICRA)

  13. Tactile Tool Manipulation

    Authors: Yuki Shirai, Devesh K. Jha, Arvind U. Raghunathan, Dennis Hong

    Abstract: Humans can effortlessly perform very complex, dexterous manipulation tasks by reacting to sensor observations. In contrast, robots can not perform reactive manipulation and they mostly operate in open-loop while interacting with their environment. Consequently, the current manipulation algorithms either are inefficient in performance or can only work in highly structured environments. In this pape… ▽ More

    Submitted 23 March, 2023; v1 submitted 16 January, 2023; originally announced January 2023.

    Comments: Accepted to ICRA2023. Video: https://youtu.be/VsClK04qDhk

  14. arXiv:2212.01434  [pdf, other

    cs.RO cs.AI

    Generalizable Human-Robot Collaborative Assembly Using Imitation Learning and Force Control

    Authors: Devesh K. Jha, Siddarth Jain, Diego Romeres, William Yerazunis, Daniel Nikovski

    Abstract: Robots have been steadily increasing their presence in our daily lives, where they can work along with humans to provide assistance in various tasks on industry floors, in offices, and in homes. Automated assembly is one of the key applications of robots, and the next generation assembly systems could become much more efficient by creating collaborative human-robot systems. However, although colla… ▽ More

    Submitted 2 December, 2022; originally announced December 2022.

  15. arXiv:2210.12806  [pdf, other

    cs.RO cs.LG

    Active Exploration for Robotic Manipulation

    Authors: Tim Schneider, Boris Belousov, Georgia Chalvatzaki, Diego Romeres, Devesh K. Jha, Jan Peters

    Abstract: Robotic manipulation stands as a largely unsolved problem despite significant advances in robotics and machine learning in recent years. One of the key challenges in manipulation is the exploration of the dynamics of the environment when there is continuous contact between the objects being manipulated. This paper proposes a model-based active exploration approach that enables efficient learning i… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: Published without appendix at "International Conference on Intelligent Robots and Systems (IROS)" 2022

  16. arXiv:2209.14461  [pdf, other

    cs.RO cs.AI

    Constrained Dynamic Movement Primitives for Safe Learning of Motor Skills

    Authors: Seiji Shaw, Devesh K. Jha, Arvind Raghunathan, Radu Corcodel, Diego Romeres, George Konidaris, Daniel Nikovski

    Abstract: Dynamic movement primitives are widely used for learning skills which can be demonstrated to a robot by a skilled human or controller. While their generalization capabilities and simple formulation make them very appealing to use, they possess no strong guarantees to satisfy operational safety constraints for a task. In this paper, we present constrained dynamic movement primitives (CDMP) which ca… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

  17. arXiv:2204.10447  [pdf, other

    cs.RO

    Design of Adaptive Compliance Controllers for Safe Robotic Assembly

    Authors: Devesh K. Jha, Diego Romeres, Siddarth Jain, William Yerazunis, Daniel Nikovski

    Abstract: Insertion operations are a critical element of most robotic assembly operation, and peg-in-hole (PiH) insertion is one of the most widely studied tasks in the industrial and academic manipulation communities. PiH insertion is in fact an entire class of problems, where the complexity of the problem can depend on the type of misalignment and contact formation during an insertion attempt. In this pap… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: 8 pages, 10 figures

  18. Robust Pivoting: Exploiting Frictional Stability Using Bilevel Optimization

    Authors: Yuki Shirai, Devesh K. Jha, Arvind Raghunathan, Diego Romeres

    Abstract: Generalizable manipulation requires that robots be able to interact with novel objects and environment. This requirement makes manipulation extremely challenging as a robot has to reason about complex frictional interaction with uncertainty in physical properties of the object. In this paper, we study robust optimization for control of pivoting manipulation in the presence of uncertainties. We pre… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: Accepted to the 2022 IEEE International Conference on Robotics and Automation (ICRA 2022)

  19. arXiv:2203.10013  [pdf, other

    cs.RO eess.SY

    PYROBOCOP: Python-based Robotic Control & Optimization Package for Manipulation

    Authors: Arvind Raghunathan, Devesh K. Jha, Diego Romeres

    Abstract: PYROBOCOP is a Python-based package for control, optimization and estimation of robotic systems described by nonlinear Differential Algebraic Equations (DAEs). In particular, the package can handle systems with contacts that are described by complementarity constraints and provides a general framework for specifying obstacle avoidance constraints. The package performs direct transcription of the D… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: 7 pages, ICRA22. arXiv admin note: substantial text overlap with arXiv:2106.03220

  20. arXiv:2203.02616  [pdf, ps, other

    cs.RO cs.AI

    Chance-Constrained Optimization in Contact-Rich Systems for Robust Manipulation

    Authors: Yuki Shirai, Devesh K. Jha, Arvind Raghunathan, Diego Romeres

    Abstract: This paper presents a chance-constrained formulation for robust trajectory optimization during manipulation. In particular, we present a chance-constrained optimization for Stochastic Discrete-time Linear Complementarity Systems (SDLCS). To solve the optimization problem, we formulate Mixed-Integer Quadratic Programming with Chance Constraints (MIQPCC). In our formulation, we explicitly consider j… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: 9 pages, 9 figures

    Journal ref: Under review at IROS 2022

  21. arXiv:2111.10488  [pdf, other

    cs.RO cs.AI

    Imitation and Supervised Learning of Compliance for Robotic Assembly

    Authors: Devesh K. Jha, Diego Romeres, William Yerazunis, Daniel Nikovski

    Abstract: We present the design of a learning-based compliance controller for assembly operations for industrial robots. We propose a solution within the general setting of learning from demonstration (LfD), where a nominal trajectory is provided through demonstration by an expert teacher. This can be used to learn a suitable representation of the skill that can be generalized to novel positions of one of t… ▽ More

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: 8 pages, 7 figures

  22. arXiv:2106.03220  [pdf, other

    cs.RO cs.AI

    PYROBOCOP : Python-based Robotic Control & Optimization Package for Manipulation and Collision Avoidance

    Authors: Arvind U. Raghunathan, Devesh K. Jha, Diego Romeres

    Abstract: PYROBOCOP is a lightweight Python-based package for control and optimization of robotic systems described by nonlinear Differential Algebraic Equations (DAEs). In particular, the package can handle systems with contacts that are described by complementarity constraints and provides a general framework for specifying obstacle avoidance constraints. The package performs direct transcription of the D… ▽ More

    Submitted 6 June, 2021; originally announced June 2021.

    Comments: Under review at IJRR

  23. arXiv:2106.02992  [pdf, ps, other

    cs.RO eess.SY

    Distributed Task Allocation in Homogeneous Swarms Using Language Measure Theory

    Authors: Devesh K. Jha

    Abstract: In this paper, we present algorithms for synthesizing controllers to distribute a group (possibly swarms) of homogeneous robots (agents) over heterogeneous tasks which are operated in parallel. We present algorithms as well as analysis for global and local-feedback-based controller for the swarms. Using ergodicity property of irreducible Markov chains, we design a controller for global swarm contr… ▽ More

    Submitted 24 June, 2021; v1 submitted 5 June, 2021; originally announced June 2021.

    Comments: Under review

  24. arXiv:2106.00898  [pdf, other

    cs.RO

    Trajectory Optimization for Manipulation of Deformable Objects: Assembly of Belt Drive Units

    Authors: Shiyu Jin, Diego Romeres, Arvind Ragunathan, Devesh K. Jha, Masayoshi Tomizuka

    Abstract: This paper presents a novel trajectory optimization formulation to solve the robotic assembly of the belt drive unit. Robotic manipulations involving contacts and deformable objects are challenging in both dynamic modeling and trajectory planning. For modeling, variations in the belt tension and contact forces between the belt and the pulley could dramatically change the system dynamics. For traje… ▽ More

    Submitted 20 June, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

  25. arXiv:2104.01167  [pdf, other

    cs.RO

    Tactile-RL for Insertion: Generalization to Objects of Unknown Geometry

    Authors: Siyuan Dong, Devesh K. Jha, Diego Romeres, Sangwoon Kim, Daniel Nikovski, Alberto Rodriguez

    Abstract: Object insertion is a classic contact-rich manipulation task. The task remains challenging, especially when considering general objects of unknown geometry, which significantly limits the ability to understand the contact configuration between the object and the environment. We study the problem of aligning the object and environment with a tactile-based feedback insertion policy. The insertion pr… ▽ More

    Submitted 2 April, 2021; originally announced April 2021.

  26. arXiv:2103.11238  [pdf, ps, other

    stat.ML cs.LG

    Markov Modeling of Time-Series Data using Symbolic Analysis

    Authors: Devesh K. Jha

    Abstract: Markov models are often used to capture the temporal patterns of sequential data for statistical learning applications. While the Hidden Markov modeling-based learning mechanisms are well studied in literature, we analyze a symbolic-dynamics inspired approach. Under this umbrella, Markov modeling of time-series data consists of two major steps -- discretization of continuous attributes followed by… ▽ More

    Submitted 23 March, 2021; v1 submitted 20 March, 2021; originally announced March 2021.

  27. arXiv:2102.07920  [pdf, other

    cs.LG cs.AI cs.RO

    Training Larger Networks for Deep Reinforcement Learning

    Authors: Kei Ota, Devesh K. Jha, Asako Kanezaki

    Abstract: The success of deep learning in the computer vision and natural language processing communities can be attributed to training of very deep neural networks with millions or billions of parameters which can then be trained with massive amounts of data. However, similar trend has largely eluded training of deep reinforcement learning (RL) algorithms where larger networks do not lead to performance im… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

    Comments: Under submission

  28. arXiv:2011.07193  [pdf, other

    cs.LG cs.AI cs.RO

    Data-Efficient Learning for Complex and Real-Time Physical Problem Solving using Augmented Simulation

    Authors: Kei Ota, Devesh K. Jha, Diego Romeres, Jeroen van Baar, Kevin A. Smith, Takayuki Semitsu, Tomoaki Oiki, Alan Sullivan, Daniel Nikovski, Joshua B. Tenenbaum

    Abstract: Humans quickly solve tasks in novel systems with complex dynamics, without requiring much interaction. While deep reinforcement learning algorithms have achieved tremendous success in many complex tasks, these algorithms need a large number of samples to learn meaningful policies. In this paper, we present a task for navigating a marble to the center of a circular maze. While this system is very i… ▽ More

    Submitted 15 February, 2021; v1 submitted 13 November, 2020; originally announced November 2020.

    Comments: Under submission

  29. arXiv:2011.00155  [pdf, other

    cs.RO cs.AI cs.LG

    Deep Reactive Planning in Dynamic Environments

    Authors: Kei Ota, Devesh K. Jha, Tadashi Onishi, Asako Kanezaki, Yusuke Yoshiyasu, Yoko Sasaki, Toshisada Mariyama, Daniel Nikovski

    Abstract: The main novelty of the proposed approach is that it allows a robot to learn an end-to-end policy which can adapt to changes in the environment during execution. While goal conditioning of policies has been studied in the RL literature, such approaches are not easily extended to cases where the robot's goal can change during execution. This is something that humans are naturally able to do. Howeve… ▽ More

    Submitted 5 November, 2020; v1 submitted 30 October, 2020; originally announced November 2020.

    Comments: 15 pages, 5 figures. Accepted at CoRL 2020

  30. arXiv:2007.11646  [pdf, other

    cs.RO cs.LG

    Understanding Multi-Modal Perception Using Behavioral Cloning for Peg-In-a-Hole Insertion Tasks

    Authors: Yifang Liu, Diego Romeres, Devesh K. Jha, Daniel Nikovski

    Abstract: One of the main challenges in peg-in-a-hole (PiH) insertion tasks is in handling the uncertainty in the location of the target hole. In order to address it, high-dimensional sensor inputs from sensor modalities such as vision, force/torque sensing, and proprioception can be combined to learn control policies that are robust to this uncertainty in the target pose. Whereas deep learning has shown su… ▽ More

    Submitted 22 July, 2020; originally announced July 2020.

    Comments: Published at a RSS20 workshop

  31. arXiv:2003.11696  [pdf, other

    cs.LG cs.RO stat.ML

    CAZSL: Zero-Shot Regression for Pushing Models by Generalizing Through Context

    Authors: Wenyu Zhang, Skyler Seto, Devesh K. Jha

    Abstract: Learning accurate models of the physical world is required for a lot of robotic manipulation tasks. However, during manipulation, robots are expected to interact with unknown workpieces so that building predictive models which can generalize over a number of these objects is highly desirable. In this paper, we study the problem of designing deep learning agents which can generalize their models of… ▽ More

    Submitted 1 November, 2020; v1 submitted 25 March, 2020; originally announced March 2020.

    Comments: Accepted at IROS 2020

  32. arXiv:2003.03747  [pdf, other

    physics.optics eess.SP physics.comp-ph

    Generative Deep Learning Model for a Multi-level Nano-Optic Broadband Power Splitter

    Authors: Yingheng Tang, Keisuke Kojima, Toshiaki Koike-Akino, Ye Wang, Pengxiang Wu, Mohammad Tahersima, Devesh K. Jha, Kieran Parsons, Minghao Qi

    Abstract: We propose a novel Conditional Variational Autoencoder (CVAE) model, enhanced with adversarial censoring and active learning, for the generation of 550 nm broad bandwidth (1250 nm to 1800 nm) power splitters with arbitrary splitting ratio. The device footprint is 2.25 x 2.25 μ m2 with a 20 x 20 etched hole combination. It is the first demonstration to apply the CVAE model and the adversarial censo… ▽ More

    Submitted 8 March, 2020; originally announced March 2020.

  33. arXiv:2003.01641  [pdf, other

    cs.LG cs.RO stat.ML

    Efficient Exploration in Constrained Environments with Goal-Oriented Reference Path

    Authors: Kei Ota, Yoko Sasaki, Devesh K. Jha, Yusuke Yoshiyasu, Asako Kanezaki

    Abstract: In this paper, we consider the problem of building learning agents that can efficiently learn to navigate in constrained environments. The main goal is to design agents that can efficiently learn to understand and generalize to different environments using high-dimensional inputs (a 2D map), while following feasible paths that avoid obstacles in obstacle-cluttered environment. To achieve this, we… ▽ More

    Submitted 3 March, 2020; originally announced March 2020.

    Comments: 8 pages, 10 figures

  34. arXiv:2003.01629  [pdf, other

    cs.LG cs.RO stat.ML

    Can Increasing Input Dimensionality Improve Deep Reinforcement Learning?

    Authors: Kei Ota, Tomoaki Oiki, Devesh K. Jha, Toshisada Mariyama, Daniel Nikovski

    Abstract: Deep reinforcement learning (RL) algorithms have recently achieved remarkable successes in various sequential decision making tasks, leveraging advances in methods for training large deep networks. However, these methods usually require large amounts of training data, which is often a big problem for real-world applications. One natural question to ask is whether learning good representations for… ▽ More

    Submitted 26 June, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: 11 pages, 10 figures. Accepted to ICML 2020

  35. arXiv:2002.10621  [pdf, other

    cs.LG cs.RO eess.SP eess.SY stat.ML

    Model-Based Reinforcement Learning for Physical Systems Without Velocity and Acceleration Measurements

    Authors: Alberto Dalla Libera, Diego Romeres, Devesh K. Jha, Bill Yerazunis, Daniel Nikovski

    Abstract: In this paper, we propose a derivative-free model learning framework for Reinforcement Learning (RL) algorithms based on Gaussian Process Regression (GPR). In many mechanical systems, only positions can be measured by the sensing instruments. Then, instead of representing the system state as suggested by the physics with a collection of positions, velocities, and accelerations, we define the state… ▽ More

    Submitted 24 February, 2020; originally announced February 2020.

    Comments: Accepted at RA-L

  36. arXiv:2001.10098  [pdf, other

    cs.LG eess.SP stat.ML

    Multi-label Prediction in Time Series Data using Deep Neural Networks

    Authors: Wenyu Zhang, Devesh K. Jha, Emil Laftchiev, Daniel Nikovski

    Abstract: This paper addresses a multi-label predictive fault classification problem for multidimensional time-series data. While fault (event) detection problems have been thoroughly studied in literature, most of the state-of-the-art techniques can't reliably predict faults (events) over a desired future horizon. In the most general setting of these types of problems, one or more samples of data across mu… ▽ More

    Submitted 27 January, 2020; originally announced January 2020.

    Comments: Accepted by IJPHM. Presented at PHM19

  37. arXiv:2001.08092  [pdf, other

    cs.LG cs.RO eess.SY stat.ML

    Local Policy Optimization for Trajectory-Centric Reinforcement Learning

    Authors: Patrik Kolaric, Devesh K. Jha, Arvind U. Raghunathan, Frank L. Lewis, Mouhacine Benosman, Diego Romeres, Daniel Nikovski

    Abstract: The goal of this paper is to present a method for simultaneous trajectory and local stabilizing policy optimization to generate local policies for trajectory-centric model-based reinforcement learning (MBRL). This is motivated by the fact that global policy optimization for non-linear systems could be a very challenging problem both algorithmically and numerically. However, a lot of robotic manipu… ▽ More

    Submitted 22 January, 2020; originally announced January 2020.

    Journal ref: ICRA 2020

  38. arXiv:1907.02151  [pdf, other

    eess.SY cs.LG math.DS math.OC

    Safe Approximate Dynamic Programming Via Kernelized Lipschitz Estimation

    Authors: Ankush Chakrabarty, Devesh K. Jha, Gregery T. Buzzard, Yebin Wang, Kyriakos Vamvoudakis

    Abstract: We develop a method for obtaining safe initial policies for reinforcement learning via approximate dynamic programming (ADP) techniques for uncertain systems evolving with discrete-time dynamics. We employ kernelized Lipschitz estimation and semidefinite programming for computing admissible initial control policies with provably high probability. Such admissible controllers enable safe initializat… ▽ More

    Submitted 3 July, 2019; originally announced July 2019.

  39. arXiv:1905.05927  [pdf, ps, other

    cs.LG cs.CV math.OC stat.ML

    Game Theoretic Optimization via Gradient-based Nikaido-Isoda Function

    Authors: Arvind U. Raghunathan, Anoop Cherian, Devesh K. Jha

    Abstract: Computing Nash equilibrium (NE) of multi-player games has witnessed renewed interest due to recent advances in generative adversarial networks. However, computing equilibrium efficiently is challenging. To this end, we introduce the Gradient-based Nikaido-Isoda (GNI) function which serves: (i) as a merit function, vanishing only at the first-order stationary points of each player's optimization pr… ▽ More

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: Accepted at International Conference on Machine Learning (ICML), 2019

  40. arXiv:1903.05751  [pdf, other

    stat.ML cs.LG cs.RO

    Trajectory Optimization for Unknown Constrained Systems using Reinforcement Learning

    Authors: Kei Ota, Devesh K. Jha, Tomoaki Oiki, Mamoru Miura, Takashi Nammoto, Daniel Nikovski, Toshisada Mariyama

    Abstract: In this paper, we propose a reinforcement learning-based algorithm for trajectory optimization for constrained dynamical systems. This problem is motivated by the fact that for most robotic systems, the dynamics may not always be known. Generating smooth, dynamically feasible trajectories could be difficult for such systems. Using sampling-based algorithms for motion planning may result in traject… ▽ More

    Submitted 3 March, 2020; v1 submitted 13 March, 2019; originally announced March 2019.

    Comments: 8 pages, 6 figures, Accepted to IROS 2019

  41. arXiv:1709.09274  [pdf, ps, other

    stat.ML

    Symbolic Analysis-based Reduced Order Markov Modeling of Time Series Data

    Authors: Devesh K Jha, Nurali Virani, Jan Reimann, Abhishek Srivastav, Asok Ray

    Abstract: This paper presents a technique for reduced-order Markov modeling for compact representation of time-series data. In this work, symbolic dynamics-based tools have been used to infer an approximate generative Markov model. The time-series data are first symbolized by partitioning the continuous measurement space of the signal and then, the discrete sequential data are modeled using symbolic dynamic… ▽ More

    Submitted 26 September, 2017; originally announced September 2017.

    Comments: 21 pages, 12 figures