Showing 1–18 of 18 results for author: Amrutur, B

  1. arXiv:2406.13208

    cs.CV

    Block-level Text Spotting with LLMs

    Authors: Ganesh Bannur, Bharadwaj Amrutur

    Abstract: Text spotting has seen tremendous progress in recent years yielding performant techniques which can extract text at the character, word or line level. However, extracting blocks of text from images (block-level text spotting) is relatively unexplored. Blocks contain more context than individual lines, words or characters and so block-level text spotting would enhance downstream applications, such…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 19 pages, 7 figures

  2. CORNET 2.0: A Co-Simulation Middleware for Robot Networks

    Authors: Srikrishna Acharya, Bharadwaj Amrutur, Mukunda Bharatheesha, Yogesh Simmhan

    Abstract: We present a networked co-simulation framework for multi-robot systems applications. We require a simulation framework that captures both physical interactions and communications aspects to effectively design such complex systems. This is necessary to co-design the multi-robots' autonomy logic and the communication protocols. The proposed framework extends existing tools to simulate the robot's au…

    Submitted 8 June, 2022; v1 submitted 14 September, 2021; originally announced September 2021.

    Journal ref: 2022 14th International Conference on COMmunication Systems & NETworkS (COMSNETS)

  3. arXiv:2101.01055

    cs.LG cs.RO

    Stochastic Action Prediction for Imitation Learning

    Authors: Sagar Gubbi Venkatesh, Nihesh Rathod, Shishir Kolathaya, Bharadwaj Amrutur

    Abstract: Imitation learning is a data-driven approach to acquiring skills that relies on expert demonstrations to learn a policy that maps observations to actions. When performing demonstrations, experts are not always consistent and might accomplish the same task in slightly different ways. In this paper, we demonstrate inherent stochasticity in demonstrations collected for tasks including line following…

    Submitted 26 December, 2020; originally announced January 2021.

  4. Scene Text Detection for Augmented Reality -- Character Bigram Approach to reduce False Positive Rate

    Authors: Sagar Gubbi, Bharadwaj Amrutur

    Abstract: Natural scene text detection is an important aspect of scene understanding and could be a useful tool in building engaging augmented reality applications. In this work, we address the problem of false positives in text spotting. We propose improving the performance of sliding window text spotters by looking for character pairs (bigrams) rather than single characters. An efficient convolutional neur…

    Submitted 26 December, 2020; originally announced January 2021.

  5. arXiv:2101.01053

    cs.RO cs.LG

    Multi-Instance Aware Localization for End-to-End Imitation Learning

    Authors: Sagar Gubbi Venkatesh, Raviteja Upadrashta, Shishir Kolathaya, Bharadwaj Amrutur

    Abstract: Existing architectures for imitation learning using image-to-action policy networks perform poorly when presented with an input image containing multiple instances of the object of interest, especially when the number of expert demonstrations available for training is limited. We show that end-to-end policy networks can be trained in a sample efficient manner by (a) appending the feature map outp…

    Submitted 26 December, 2020; originally announced January 2021.

    Comments: Accepted at IROS 2020

  6. Imitation Learning for High Precision Peg-in-Hole Tasks

    Authors: Sagar Gubbi, Shishir Kolathaya, Bharadwaj Amrutur

    Abstract: Industrial robot manipulators are not able to match the precision and speed with which humans are able to execute contact rich tasks even to this day. Therefore, as a means to overcome this gap, we demonstrate generative methods for imitating a peg-in-hole insertion task in a 6-DOF robot manipulator. In particular, generative adversarial imitation learning (GAIL) is used to successfully achieve this…

    Submitted 26 December, 2020; originally announced January 2021.

    Comments: Accepted at ICCAR 2020

  7. arXiv:2012.13695

    cs.RO cs.CL cs.LG

    Translating Natural Language Instructions to Computer Programs for Robot Manipulation

    Authors: Sagar Gubbi Venkatesh, Raviteja Upadrashta, Bharadwaj Amrutur

    Abstract: It is highly desirable for robots that work alongside humans to be able to understand instructions in natural language. Existing language conditioned imitation learning models directly predict the actuator commands from the image observation and the instruction text. Rather than directly predicting actuator commands, we propose translating the natural language instruction to a Python function whic…

    Submitted 20 March, 2021; v1 submitted 26 December, 2020; originally announced December 2020.

    Comments: Submitted to IROS 2021

  8. arXiv:2012.13693

    cs.RO cs.CL cs.LG

    Spatial Reasoning from Natural Language Instructions for Robot Manipulation

    Authors: Sagar Gubbi Venkatesh, Anirban Biswas, Raviteja Upadrashta, Vikram Srinivasan, Partha Talukdar, Bharadwaj Amrutur

    Abstract: Robots that can manipulate objects in unstructured environments and collaborate with humans can benefit immensely by understanding natural language. We propose a pipelined architecture of two stages to perform spatial reasoning on the text input. All the objects in the scene are first localized, and then the instruction for the robot in natural language and the localized co-ordinates are mapped to…

    Submitted 26 March, 2021; v1 submitted 26 December, 2020; originally announced December 2020.

    Comments: Accepted for ICRA 2021

  9. One-Shot Object Localization Using Learnt Visual Cues via Siamese Networks

    Authors: Sagar Gubbi Venkatesh, Bharadwaj Amrutur

    Abstract: A robot that can operate in novel and unstructured environments must be capable of recognizing new, previously unseen, objects. In this work, a visual cue is used to specify a novel object of interest which must be localized in new environments. An end-to-end neural network equipped with a Siamese network is used to learn the cue, infer the object of interest, and then to localize it in new enviro…

    Submitted 26 December, 2020; originally announced December 2020.

  10. Teaching Robots Novel Objects by Pointing at Them

    Authors: Sagar Gubbi Venkatesh, Raviteja Upadrashta, Shishir Kolathaya, Bharadwaj Amrutur

    Abstract: Robots that must operate in novel environments and collaborate with humans must be capable of acquiring new knowledge from human experts during operation. We propose teaching a robot novel objects it has not encountered before by pointing a hand at the new object of interest. An end-to-end neural network is used to attend to the novel object of interest indicated by the pointing hand and then to l…

    Submitted 25 December, 2020; originally announced December 2020.

  11. arXiv:2010.16342

    cs.RO cs.AI cs.LG eess.SY

    Robust Quadrupedal Locomotion on Sloped Terrains: A Linear Policy Approach

    Authors: Kartik Paigwar, Lokesh Krishna, Sashank Tirumala, Naman Khetan, Aditya Sagi, Ashish Joglekar, Shalabh Bhatnagar, Ashitava Ghosal, Bharadwaj Amrutur, Shishir Kolathaya

    Abstract: In this paper, with a view toward fast deployment of locomotion gaits in low-cost hardware, we use a linear policy for realizing end-foot trajectories in the quadruped robot, Stoch 2. In particular, the parameters of the end-foot trajectories are shaped via a linear feedback policy that takes the torso orientation and the terrain slope as inputs. The corresponding desired joint angles are obtain…

    Submitted 10 November, 2020; v1 submitted 30 October, 2020; originally announced October 2020.

    Comments: Accepted in 4th Conference on Robot Learning 2020, MIT, USA

  12. arXiv:2008.04248

    eess.SP cs.RO

    Robust and Scalable Techniques for TWR and TDoA based localization using Ultra Wide Band Radios

    Authors: Rakshit Ramesh, Aaron John-Sabu, Harshitha S, Siddarth Ramesh, Vishwas Navada B, Mukunth Arunachalam, Bharadwaj Amrutur

    Abstract: Current trends in autonomous vehicles and their applications indicate an increasing need for positioning at low battery and compute cost. Lidars provide accurate localization at the cost of high compute and power consumption which could be detrimental for drones. Modern requirements for autonomous drones such as No-Permit-No-Takeoff (NPNT) and applications restricting drones to a corridor require…

    Submitted 10 August, 2020; originally announced August 2020.

  13. arXiv:2007.14290

    cs.RO cs.AI cs.LG eess.SY

    Learning Stable Manoeuvres in Quadruped Robots from Expert Demonstrations

    Authors: Sashank Tirumala, Sagar Gubbi, Kartik Paigwar, Aditya Sagi, Ashish Joglekar, Shalabh Bhatnagar, Ashitava Ghosal, Bharadwaj Amrutur, Shishir Kolathaya

    Abstract: With research into the development of quadruped robots picking up pace, learning-based techniques are being explored for developing locomotion controllers for such robots. A key problem is to generate leg trajectories for continuously varying target linear and angular velocities, in a stable manner. In this paper, we propose a two-pronged approach to address this problem. First, multiple simpler p…

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: 6 pages, Robot and Human Interaction Conference Italy 2020

  14. arXiv:2003.08361

    cs.DC

    Vermillion: A High-Performance Scalable IoT Middleware for Smart Cities

    Authors: Poorna Chandra Tejasvi, Vasanth Rajaraman, Arun Babu Puthuparambil, Akhil Pankaj, Bharadwaj Amrutur

    Abstract: With the massive increase in the number of IoT devices being deployed in smart cities, it becomes paramount for middlewares to be able to handle very high loads and support demanding use-cases. In order to do so, middlewares must scale horizontally while providing a commensurate increase in availability and throughput. Currently, most open-source IoT middlewares do not provide out-of-the-box suppo…

    Submitted 14 March, 2020; originally announced March 2020.

  15. arXiv:1912.12907

    cs.RO

    Gait Library Synthesis for Quadruped Robots via Augmented Random Search

    Authors: Sashank Tirumala, Aditya Sagi, Kartik Paigwar, Ashish Joglekar, Shalabh Bhatnagar, Ashitava Ghosal, Bharadwaj Amrutur, Shishir Kolathaya

    Abstract: In this paper, with a view toward fast deployment of learned locomotion gaits in low-cost hardware, we generate a library of walking trajectories, namely, forward trot, backward trot, side-step, and turn in our custom-built quadruped robot, Stoch 2, using reinforcement learning. There are existing approaches that determine optimal policies for each time step, whereas we determine an optimal policy…

    Submitted 30 December, 2019; originally announced December 2019.

    Comments: 7 pages, 11 figures, 1 table

  16. arXiv:1905.06077

    cs.RO cs.LG

    Learning Active Spine Behaviors for Dynamic and Efficient Locomotion in Quadruped Robots

    Authors: Shounak Bhattacharya, Abhik Singla, Abhimanyu, Dhaivat Dholakiya, Shalabh Bhatnagar, Bharadwaj Amrutur, Ashitava Ghosal, Shishir Kolathaya

    Abstract: In this work, we provide a simulation framework to perform systematic studies on the effects of spinal joint compliance and actuation on bounding performance of a 16-DOF quadruped spined robot Stoch 2. Fast quadrupedal locomotion with active spine is an extremely hard problem, and involves a complex coordination between the various degrees of freedom. Therefore, past attempts at addressing this pr…

    Submitted 15 May, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

    Comments: Submitted to IEEE RO-MAN 2019. Supplementary video: https://youtu.be/INp4aa-8z2E

  17. arXiv:1901.00697

    cs.RO

    Design, Development and Experimental Realization of a Quadrupedal Research Platform: Stoch

    Authors: Dhaivat Dholakiya, Shounak Bhattacharya, Ajay Gunalan, Abhik Singla, Shalabh Bhatnagar, Bharadwaj Amrutur, Ashitava Ghosal, Shishir Kolathaya

    Abstract: In this paper, we present a complete description of the hardware design and control architecture of our custom built quadruped robot, called the 'Stoch'. Our goal is to realize a robust, modular, and a reliable quadrupedal platform, using which various locomotion behaviors are explored. This platform enables us to explore different research problems in legged locomotion, which use both traditional…

    Submitted 27 February, 2019; v1 submitted 3 January, 2019; originally announced January 2019.

    Comments: Accepted by International Conference on Control, Automation and Robotics (ICCAR) 2019. Supplementary Video: https://youtu.be/Wxx9pwwTIL4

  18. arXiv:1810.03842

    cs.RO cs.LG

    Realizing Learned Quadruped Locomotion Behaviors through Kinematic Motion Primitives

    Authors: Abhik Singla, Shounak Bhattacharya, Dhaivat Dholakiya, Shalabh Bhatnagar, Ashitava Ghosal, Bharadwaj Amrutur, Shishir Kolathaya

    Abstract: Humans and animals are believed to use a very minimal set of trajectories to perform a wide variety of tasks including walking. Our main objective in this paper is twofold: 1) obtain an effective tool to realize these basic motion patterns for quadrupedal walking, called the kinematic motion primitives (kMPs), via trajectories learned from deep reinforcement learning (D-RL) and 2) realize a set of…

    Submitted 26 February, 2019; v1 submitted 9 October, 2018; originally announced October 2018.

    Comments: Accepted by ICRA 2019. Supplementary Video: https://youtu.be/kiLKSqI4KhE