Showing 1–50 of 103 results for author: Krishna, K M

Searching in archive cs.
  1. arXiv:2407.14513  [pdf, ps, other]

    math.OA cs.IT math.FA

    Modular Deutsch Entropic Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Khosravi, Drnovšek and Moslehian [\textit{Filomat, 2012}] derived Buzano inequality for Hilbert C*-modules. Using this inequality we derive Deutsch entropic uncertainty principle for Hilbert C*-modules over commutative unital C*-algebras.

    Submitted 8 August, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: 4 Pages, 0 Figures

    MSC Class: 46L08; 42C15; 46L05

  2. arXiv:2405.08003  [pdf, ps, other]

    math.FA cs.IT math.OA math.QA

    Continuous Krishna-Parthasarathy Entropic Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: In 2002, Krishna and Parthasarathy [\textit{Sankhyā Ser. A}] derived discrete quantum version of Maassen-Uffink [\textit{Phys. Rev. Lett., 1988}] entropic uncertainty principle. In this paper, using the notion of continuous operator-valued frames, we derive an entropic uncertainty principle for arbitrary family of operators indexed by measure spaces having finite measure. We give an application to…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 7 pages, 0 Figures

    MSC Class: 81P15; 94A17; 42C15

    Journal ref: Special issue of Infinite Dimensional Analysis, Quantum Probability and Related Topics in honour of Prof. K. R. Parthasarathy, 18 March 2024

  3. Open-Set 3D Semantic Instance Maps for Vision Language Navigation -- O3D-SIM

    Authors: Laksh Nanwani, Kumaraditya Gupta, Aditya Mathur, Swayam Agrawal, A. H. Abdul Hafez, K. Madhava Krishna

    Abstract: Humans excel at forming mental maps of their surroundings, equipping them to understand object relationships and navigate based on language queries. Our previous work SI Maps [1] showed that having instance-level information and the semantic understanding of an environment helps significantly improve performance for language-guided tasks. We extend this instance-level approach to 3D while increasi…

    Submitted 27 April, 2024; originally announced April 2024.

    Journal ref: Advanced Robotics - Taylor and Francis - 2024

  4. arXiv:2404.04643  [pdf, other]

    cs.RO cs.CV

    Constrained 6-DoF Grasp Generation on Complex Shapes for Improved Dual-Arm Manipulation

    Authors: Gaurav Singh, Sanket Kalwar, Md Faizal Karim, Bipasha Sen, Nagamanikandan Govindan, Srinath Sridhar, K Madhava Krishna

    Abstract: Efficiently generating grasp poses tailored to specific regions of an object is vital for various robotic manipulation tasks, especially in a dual-arm setup. This scenario presents a significant challenge due to the complex geometries involved, requiring a deep understanding of the local geometry to generate grasps efficiently on the specified constrained regions. Existing methods only explore set…

    Submitted 15 July, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

    Comments: Project Page: https://constrained-grasp-diffusion.github.io/

  5. arXiv:2404.03307  [pdf, other]

    cs.RO eess.SY

    Bi-level Trajectory Optimization on Uneven Terrains with Differentiable Wheel-Terrain Interaction Model

    Authors: Amith Manoharan, Aditya Sharma, Himani Belsare, Kaustab Pal, K. Madhava Krishna, Arun Kumar Singh

    Abstract: Navigation of wheeled vehicles on uneven terrain necessitates going beyond the 2D approaches for trajectory planning. Specifically, it is essential to incorporate the full 6dof variation of vehicle pose and its associated stability cost in the planning process. To this end, most recent works aim to learn a neural network model to predict the vehicle evolution. However, such approaches are data-int…

    Submitted 11 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: 8 pages, 7 figures, submitted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)

  6. arXiv:2404.00910  [pdf, ps, other]

    math.FA cs.IT math-ph

    Unexpected Uncertainty Principle for Disc Banach Spaces

    Authors: K. Mahesh Krishna

    Abstract: Let $(\{f_n\}_{n=1}^\infty, \{τ_n\}_{n=1}^\infty)$ and $(\{g_n\}_{n=1}^\infty, \{ω_n\}_{n=1}^\infty)$ be unbounded continuous p-Schauder frames ($0<p<1$) for a disc Banach space $\mathcal{X}$. Then for every $x \in ( \mathcal{D}(θ_f) \cap\mathcal{D}(θ_g))\setminus\{0\}$, we show that \begin{align}\label{UB} (1) \quad \quad \quad \quad \|θ_f x\|_0\|θ_g x\|_0 \geq \frac{1}{\left(\displaystyle\sup_{n…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 6 Pages, 0 Figures

    MSC Class: 42C15

  7. arXiv:2403.20116  [pdf, other]

    cs.RO

    LeGo-Drive: Language-enhanced Goal-oriented Closed-Loop End-to-End Autonomous Driving

    Authors: Pranjal Paul, Anant Garg, Tushar Choudhary, Arun Kumar Singh, K. Madhava Krishna

    Abstract: Existing Vision-Language models (VLMs) estimate either long-term trajectory waypoints or a set of control actions as a reactive solution for closed-loop planning based on their rich scene comprehension. However, these estimations are coarse and are subjective to their "world understanding" which may generate sub-optimal decisions due to perception errors. In this paper, we introduce LeGo-Drive, wh…

    Submitted 29 March, 2024; originally announced March 2024.

  8. arXiv:2403.17946  [pdf, ps, other]

    math.FA cs.IT math-ph

    Nonlinear Heisenberg-Robertson-Schrodinger Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: We derive an uncertainty principle for Lipschitz maps acting on subsets of Banach spaces. We show that this nonlinear uncertainty principle reduces to the Heisenberg-Robertson-Schrodinger uncertainty principle for linear operators acting on Hilbert spaces.

    Submitted 8 August, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: 4 Pages, 0 Figures

    MSC Class: 26A16; 46B99

  9. arXiv:2402.08591  [pdf, ps, other]

    math.FA cs.IT math-ph

    Nonlinear Maccone-Pati Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: We show that one of the two important uncertainty principles derived by Maccone and Pati \textit{[Phys. Rev. Lett., 2014]} can be derived for arbitrary maps defined on subsets of $\mathcal{L}^p$ spaces for $1< p<\infty$. Our main tool is the Clarkson inequalities. We also derive a nonlinear uncertainty principle for weak parallelogram spaces and Type-p Banach spaces.

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 6 pages, 0 figures

    MSC Class: 46B20; 46E30

  10. arXiv:2402.04255  [pdf, ps, other]

    math.FA cs.IT

    Functional Kuppinger-Durisi-Bölcskei Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Let $\mathcal{X}$ be a Banach space. Let $\{τ_j\}_{j=1}^n, \{ω_k\}_{k=1}^m\subseteq \mathcal{X}$ and $\{f_j\}_{j=1}^n$, $\{g_k\}_{k=1}^m\subseteq \mathcal{X}^*$ satisfy $ |f_j(τ_j)|\geq 1$ for all $ 1\leq j \leq n$, $|g_k(ω_k)|\geq 1 $ for all $1\leq k \leq m$. If $x \in \mathcal{X}\setminus \{0\}$ is such that $x=θ_τθ_f x=θ_ωθ_g x$, then we show that \begin{align}\label{FKDB} (1) \quad\quad\quad\…

    Submitted 1 January, 2024; originally announced February 2024.

    Comments: 9 Pages, 0 Figures

    MSC Class: 46A45; 46B45; 42C15

  11. arXiv:2401.17399  [pdf, other]

    cs.RO

    ATPPNet: Attention based Temporal Point cloud Prediction Network

    Authors: Kaustab Pal, Aditya Sharma, Avinash Sharma, K. Madhava Krishna

    Abstract: Point cloud prediction is an important yet challenging task in the field of autonomous driving. The goal is to predict future point cloud sequences that maintain object structures while accurately representing their temporal motion. These predicted point clouds help in other subsequent tasks like object trajectory estimation for collision avoidance or estimating locations with the least odometry d…

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted for presentation at the 2024 IEEE International Conference on Robotics and Automation (ICRA)

  12. arXiv:2312.00366  [pdf, ps, other]

    math.FA cs.IT math-ph

    Unbounded Donoho-Stark-Elad-Bruckstein-Ricaud-Torrésani Uncertainty Principles

    Authors: K. Mahesh Krishna

    Abstract: Let $(Ω, μ)$, $(Δ, ν)$ be measure spaces and $p=1$ or $p=\infty$. Let $(\{f_α\}_{α\in Ω}, \{τ_α\}_{α\in Ω})$ and $(\{g_β\}_{β\in Δ}, \{ω_β\}_{β\in Δ})$ be unbounded continuous p-Schauder frames for a Banach space $\mathcal{X}$. Then for every $x \in ( \mathcal{D}(θ_f) \cap\mathcal{D}(θ_g))\setminus\{0\}$, we show that \begin{align}\label{UB} (1) \quad \quad \quad \quad μ(\operatorname{supp}(θ_f…

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 6 Pages, 0 Figures

    MSC Class: 42C15

  13. arXiv:2311.14635  [pdf]

    cs.CV cs.RO

    Automated Detection and Counting of Windows using UAV Imagery based Remote Sensing

    Authors: Dhruv Patel, Shivani Chepuri, Sarvesh Thakur, K. Harikumar, Ravi Kiran S., K. Madhava Krishna

    Abstract: Despite the technological advancements in the construction and surveying sector, the inspection of salient features like windows in an under-construction or existing building is predominantly a manual process. Moreover, the number of windows present in a building is directly related to the magnitude of deformation it suffers under earthquakes. In this research, a method to accurately detect and co…

    Submitted 24 November, 2023; originally announced November 2023.

  14. NeuroSMPC: A Neural Network guided Sampling Based MPC for On-Road Autonomous Driving

    Authors: Kaustab Pal, Aditya Sharma, Mohd Omama, Parth N. Shah, K. Madhava Krishna

    Abstract: In this paper we show an effective means of integrating data driven frameworks to sampling based optimal control to vastly reduce the compute time for easy adoption and adaptation to real time applications such as on-road autonomous driving in the presence of dynamic actors. Presented with training examples, a spatio-temporal CNN learns to predict the optimal mean control over a finite horizon tha…

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Published in 2023 IEEE 19th International Conference on Automation Science and Engineering (CASE)

  15. arXiv:2310.08270  [pdf, other]

    cs.RO

    Hilbert Space Embedding-based Trajectory Optimization for Multi-Modal Uncertain Obstacle Trajectory Prediction

    Authors: Basant Sharma, Aditya Sharma, K. Madhava Krishna, Arun Kumar Singh

    Abstract: Safe autonomous driving critically depends on how well the ego-vehicle can predict the trajectories of neighboring vehicles. To this end, several trajectory prediction algorithms have been presented in the existing literature. Many of these approaches output a multi-modal distribution of obstacle trajectories instead of a single deterministic prediction to account for the underlying uncertainty. H…

    Submitted 12 October, 2023; originally announced October 2023.

  16. arXiv:2310.04181  [pdf, other]

    cs.CV cs.RO

    DiffPrompter: Differentiable Implicit Visual Prompts for Semantic-Segmentation in Adverse Conditions

    Authors: Sanket Kalwar, Mihir Ungarala, Shruti Jain, Aaron Monis, Krishna Reddy Konda, Sourav Garg, K Madhava Krishna

    Abstract: Semantic segmentation in adverse weather scenarios is a critical task for autonomous driving systems. While foundation models have shown promise, the need for specialized adaptors becomes evident for handling more challenging scenarios. We introduce DiffPrompter, a novel differentiable visual and latent prompting mechanism aimed at expanding the learning capabilities of existing adaptors in founda…

    Submitted 26 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

  17. arXiv:2310.02251  [pdf, other]

    cs.CV cs.RO

    Talk2BEV: Language-enhanced Bird's-eye View Maps for Autonomous Driving

    Authors: Tushar Choudhary, Vikrant Dewangan, Shivam Chandhok, Shubham Priyadarshan, Anushka Jain, Arun K. Singh, Siddharth Srivastava, Krishna Murthy Jatavallabhula, K. Madhava Krishna

    Abstract: Talk2BEV is a large vision-language model (LVLM) interface for bird's-eye view (BEV) maps in autonomous driving contexts. While existing perception systems for autonomous driving scenarios have largely focused on a pre-defined (closed) set of object categories and driving scenarios, Talk2BEV blends recent advances in general-purpose language and vision models with BEV-structured map representation…

    Submitted 14 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Project page at https://llmbev.github.io/talk2bev/

  18. arXiv:2307.01215  [pdf, ps, other]

    math.FA cs.IT

    Functional Donoho-Stark Approximate Support Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Let $(\{f_j\}_{j=1}^n, \{τ_j\}_{j=1}^n)$ and $(\{g_k\}_{k=1}^n, \{ω_k\}_{k=1}^n)$ be two p-orthonormal bases for a finite dimensional Banach space $\mathcal{X}$. If $ x \in \mathcal{X}\setminus\{0\}$ is such that $θ_fx$ is $\varepsilon$-supported on $M\subseteq \{1,\dots, n\}$ w.r.t. p-norm and $θ_gx$ is $δ$-supported on $N\subseteq \{1,\dots, n\}$ w.r.t. p-norm, then we show that \begin{align}\la…

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: 7 Pages, 0 Figures

    MSC Class: 42C15; 46B03; 46B04

  19. arXiv:2306.06093  [pdf, other]

    cs.CV

    HyP-NeRF: Learning Improved NeRF Priors using a HyperNetwork

    Authors: Bipasha Sen, Gaurav Singh, Aditya Agarwal, Rohith Agaram, K Madhava Krishna, Srinath Sridhar

    Abstract: Neural Radiance Fields (NeRF) have become an increasingly popular representation to capture high-quality appearance and shape of scenes and objects. However, learning generalizable NeRF priors over categories of scenes or objects has been challenging due to the high dimensionality of network weight space. To address the limitations of existing work on generalization, multi-view consistency and to…

    Submitted 23 December, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Project Page: https://hyp-nerf.github.io

  20. arXiv:2306.04939  [pdf, other]

    cs.RO

    UAP-BEV: Uncertainty Aware Planning using Bird's Eye View generated from Surround Monocular Images

    Authors: Vikrant Dewangan, Basant Sharma, Tushar Choudhary, Sarthak Sharma, Aakash Aanegola, Arun K. Singh, K. Madhava Krishna

    Abstract: Autonomous driving requires accurate reasoning of the location of objects from raw sensor data. Recent end-to-end learning methods go from raw sensor data to a trajectory output via Bird's Eye View (BEV) segmentation as an interpretable intermediate representation. Motion planning over cost maps generated via Bird's Eye View (BEV) segmentation has emerged as a prominent approach in autonomous drivin…

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Accepted to CASE 2023. Project video available at https://vikr-182.github.io/UAP-BEV

  21. arXiv:2306.01014  [pdf, ps, other]

    math.FA cs.IT

    Functional Ghobber-Jaming Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Let $(\{f_j\}_{j=1}^n, \{τ_j\}_{j=1}^n)$ and $(\{g_k\}_{k=1}^n, \{ω_k\}_{k=1}^n)$ be two p-orthonormal bases for a finite dimensional Banach space $\mathcal{X}$. Let $M,N\subseteq \{1, \dots, n\}$ be such that \begin{align*} o(M)^\frac{1}{q}o(N)^\frac{1}{p}< \frac{1}{\displaystyle \max_{1\leq j,k\leq n}|g_k(τ_j) |}, \end{align*} where $q$ is the conjugate index of $p$. Then for all…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 7 Pages, 0 Figures

    MSC Class: 42C15; 46B03; 46B04

  22. Instance-Level Semantic Maps for Vision Language Navigation

    Authors: Laksh Nanwani, Anmol Agarwal, Kanishk Jain, Raghav Prabhakar, Aaron Monis, Aditya Mathur, Krishna Murthy, Abdul Hafez, Vineet Gandhi, K. Madhava Krishna

    Abstract: Humans have a natural ability to perform semantic associations with the surrounding objects in the environment. This allows them to create a mental map of the environment, allowing them to navigate on-demand when given linguistic instructions. A natural goal in Vision Language Navigation (VLN) research is to impart autonomous agents with similar capabilities. Recent works take a step towards this…

    Submitted 1 July, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Journal ref: IEEE RO-MAN 2023

  23. arXiv:2304.03324  [pdf, ps, other]

    math.FA cs.IT

    Functional Donoho-Stark-Elad-Bruckstein-Ricaud-Torrésani Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Let $(\{f_j\}_{j=1}^n, \{τ_j\}_{j=1}^n)$ and $(\{g_k\}_{k=1}^m, \{ω_k\}_{k=1}^m)$ be p-Schauder frames for a finite dimensional Banach space $\mathcal{X}$. Then for every $x \in \mathcal{X}\setminus\{0\}$, we show that \begin{align} (1) \quad \|θ_f x\|_0^\frac{1}{p}\|θ_g x\|_0^\frac{1}{q} \geq \frac{1}{\displaystyle\max_{1\leq j\leq n, 1\leq k\leq m}|f_j(ω_k)|}\quad \text{and} \quad \|θ_g x\|_0^\f…

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: 5 Pages, 0 Figures

    MSC Class: 42C15

  24. arXiv:2304.01074  [pdf, other]

    cs.RO

    FinderNet: A Data Augmentation Free Canonicalization aided Loop Detection and Closure technique for Point clouds in 6-DOF separation

    Authors: Sudarshan S Harithas, Gurkirat Singh, Aneesh Chavan, Sarthak Sharma, Suraj Patni, Chetan Arora, K. Madhava Krishna

    Abstract: We focus on the problem of LiDAR point cloud based loop detection (or Finding) and closure (LDC) in a multi-agent setting. State-of-the-art (SOTA) techniques directly generate learned embeddings of a given point cloud, require large data transfers, and are not robust to wide variations in 6 Degrees-of-Freedom (DOF) viewpoint. Moreover, absence of strong priors in an unstructured point cloud leads…

    Submitted 3 April, 2023; originally announced April 2023.

  25. arXiv:2211.16882  [pdf, other]

    cs.CV cs.RO

    MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves

    Authors: Pranjali Pathre, Anurag Sahu, Ashwin Rao, Avinash Prabhu, Meher Shashwat Nigam, Tanvi Karandikar, Harit Pandya, K. Madhava Krishna

    Abstract: In this paper, we propose and showcase, for the first time, monocular multi-view layout estimation for warehouse racks and shelves. Unlike typical layout estimation methods, MVRackLay estimates multi-layered layouts, wherein each layer corresponds to the layout of a shelf within a rack. Given a sequence of images of a warehouse scene, a dual-headed Convolutional-LSTM architecture outputs segmented…

    Submitted 30 November, 2022; originally announced November 2022.

    Journal ref: IEEE International Conference on Robotics and Biomimetics (ROBIO) 2022

  26. arXiv:2210.07062  [pdf, ps, other]

    cs.IT math.FA math.NT

    Non-Archimedean Welch Bounds and Non-Archimedean Zauner Conjecture

    Authors: K. Mahesh Krishna

    Abstract: Let $\mathbb{K}$ be a non-Archimedean (complete) valued field satisfying \begin{align*} \left|\sum_{j=1}^{n}λ_j^2\right|=\max_{1\leq j \leq n}|λ_j|^2, \quad \forall λ_j \in \mathbb{K}, 1\leq j \leq n, \forall n \in \mathbb{N}. \end{align*} For $d\in \mathbb{N}$, let $\mathbb{K}^d$ be the standard $d$-dimensional non-Archimedean Hilbert space. Let $m \in \mathbb{N}$ and…

    Submitted 28 August, 2022; originally announced October 2022.

    Comments: 9 Pages, 0 Figures

    MSC Class: 12J25; 46S10; 47S10

  27. arXiv:2209.14922  [pdf, other]

    cs.CV cs.RO

    GDIP: Gated Differentiable Image Processing for Object-Detection in Adverse Conditions

    Authors: Sanket Kalwar, Dhruv Patel, Aakash Aanegola, Krishna Reddy Konda, Sourav Garg, K Madhava Krishna

    Abstract: Detecting objects under adverse weather and lighting conditions is crucial for the safe and continuous operation of an autonomous vehicle, and remains an unsolved problem. We present a Gated Differentiable Image Processing (GDIP) block, a domain-agnostic network architecture, which can be plugged into existing object detection networks (e.g., Yolo) and trained end-to-end with adverse condition ima…

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: Submitted to ICRA2023. More information at https://gatedip.github.io

  28. arXiv:2209.13418  [pdf, other]

    cs.CV cs.RO

    UAV-based Visual Remote Sensing for Automated Building Inspection

    Authors: Kushagra Srivastava, Dhruv Patel, Aditya Kumar Jha, Mohhit Kumar Jha, Jaskirat Singh, Ravi Kiran Sarvadevabhatla, Pradeep Kumar Ramancharla, Harikumar Kandath, K. Madhava Krishna

    Abstract: Unmanned Aerial Vehicle (UAV) based remote sensing system incorporated with computer vision has demonstrated potential for assisting building construction and in disaster management like damage assessment during earthquakes. The vulnerability of a building to earthquake can be assessed through inspection that takes into account the expected damage progression of the associated component and the co…

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: Paper accepted at CVCIE Workshop at ECCV, 2022 and the project page is https://uvrsabi.github.io/

  29. arXiv:2209.11972  [pdf, other]

    cs.CV

    Ground then Navigate: Language-guided Navigation in Dynamic Scenes

    Authors: Kanishk Jain, Varun Chhangani, Amogh Tiwari, K. Madhava Krishna, Vineet Gandhi

    Abstract: We investigate the Vision-and-Language Navigation (VLN) problem in the context of autonomous driving in outdoor settings. We solve the problem by explicitly grounding the navigable regions corresponding to the textual command. At each timestamp, the model predicts a segmentation mask corresponding to the intermediate or the final navigable region. Our work contrasts with existing efforts in VLN, w…

    Submitted 24 September, 2022; originally announced September 2022.

  30. arXiv:2209.04805  [pdf, other]

    cs.RO

    Real-Time Heuristic Framework for Safe Landing of UAVs in Dynamic Scenarios

    Authors: Jaskirat Singh, Neel Adwani, Harikumar Kandath, K. Madhava Krishna

    Abstract: The world we live in is full of technology and with each passing day the advancement and usage of UAVs increases efficiently. As a result of the many application scenarios, there are some missions where the UAVs are vulnerable to external disruptions, such as a ground station's loss of connectivity, security missions, safety concerns, and delivery-related missions. Therefore, depending on the scen…

    Submitted 11 September, 2022; originally announced September 2022.

    Comments: 8 pages, 6 figures, 36 references

  31. arXiv:2208.03038  [pdf, other]

    cs.RO math.OC

    Leveraging Distributional Bias for Reactive Collision Avoidance under Uncertainty: A Kernel Embedding Approach

    Authors: Anish Gupta, Arun Kumar Singh, K. Madhava Krishna

    Abstract: Many commodity sensors that measure the robot and dynamic obstacle's state have non-Gaussian noise characteristics. Yet, many current approaches treat the underlying-uncertainty in motion and perception as Gaussian, primarily to ensure computational tractability. On the other hand, existing planners working with non-Gaussian uncertainty do not shed light on leveraging distributional characteristic…

    Submitted 22 September, 2022; v1 submitted 5 August, 2022; originally announced August 2022.

  32. arXiv:2207.03557  [pdf, other]

    cs.RO

    Flow Synthesis Based Visual Servoing Frameworks for Monocular Obstacle Avoidance Amidst High-Rises

    Authors: Harshit K. Sankhla, M. Nomaan Qureshi, Shankara Narayanan V., Vedansh Mittal, Gunjan Gupta, Harit Pandya, K. Madhava Krishna

    Abstract: We propose a novel flow synthesis based visual servoing framework enabling long-range obstacle avoidance for Micro Air Vehicles (MAV) flying amongst tall skyscrapers. Recent deep learning based frameworks use optical flow to do high-precision visual servoing. In this paper, we explore the question: can we design a surrogate flow for these high-precision visual-servoing methods, which leads to obst…

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: Accepted to IEEE International Conference on Automation Science and Engineering (CASE), 2022

  33. arXiv:2205.04090  [pdf, other]

    cs.RO

    Approaches and Challenges in Robotic Perception for Table-top Rearrangement and Planning

    Authors: Aditya Agarwal, Bipasha Sen, Shankara Narayanan V, Vishal Reddy Mandadi, Brojeshwar Bhowmick, K Madhava Krishna

    Abstract: Table-top Rearrangement and Planning is a challenging problem that relies heavily on an excellent perception stack. The perception stack involves observing and registering the 3D scene on the table, detecting what objects are on the table, and how to manipulate them. Consequently, it greatly influences the system's task-planning and motion-planning stacks that follow. We present a comprehensive ov…

    Submitted 3 June, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: 5 pages including references, 3 figures

  34. arXiv:2204.00865  [pdf, other]

    cs.RO

    UrbanFly: Uncertainty-Aware Planning for Navigation Amongst High-Rises with Monocular Visual-Inertial SLAM Maps

    Authors: Sudarshan S Harithas, Ayyappa Swamy Thatavarthy, Gurkirat Singh, Arun K Singh, K Madhava Krishna

    Abstract: We present UrbanFly: an uncertainty-aware real-time planning framework for quadrotor navigation in urban high-rise environments. A core aspect of UrbanFly is its ability to robustly plan directly on the sparse point clouds generated by a Monocular Visual Inertial SLAM (VINS) backend. It achieves this by using the sparse point clouds to build an uncertainty-integrated cuboid representation of the e…

    Submitted 3 October, 2022; v1 submitted 2 April, 2022; originally announced April 2022.

    Comments: Submitted to ACC 2023, Code available at https://github.com/sudarshan-s-harithas/UrbanFly

  35. arXiv:2203.06897  [pdf, other]

    cs.RO

    Drift Reduced Navigation with Deep Explainable Features

    Authors: Mohd Omama, Sundar Sripada Venugopalaswamy Sriraman, Sandeep Chinchali, Arun Kumar Singh, K. Madhava Krishna

    Abstract: Modern autonomous vehicles (AVs) often rely on vision, LIDAR, and even radar-based simultaneous localization and mapping (SLAM) frameworks for precise localization and navigation. However, modern SLAM frameworks often lead to unacceptably high levels of drift (i.e., localization error) when AVs observe few visually distinct features or encounter occlusions due to dynamic obstacles. This paper argu…

    Submitted 25 November, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: Accepted in IROS 2022

  36. arXiv:2203.05206  [pdf, other]

    cs.CV cs.RO

    ReF -- Rotation Equivariant Features for Local Feature Matching

    Authors: Abhishek Peri, Kinal Mehta, Avneesh Mishra, Michael Milford, Sourav Garg, K. Madhava Krishna

    Abstract: Sparse local feature matching is pivotal for many computer vision and robotics tasks. To improve their invariance to challenging appearance conditions and viewing angles, and hence their usefulness, existing learning-based methods have primarily focused on data augmentation-based training. In this work, we propose an alternative, complementary approach that centers on inducing bias in the model ar…

    Submitted 10 March, 2022; originally announced March 2022.

  37. arXiv:2201.04421  [pdf, ps, other]

    math.FA cs.IT math.DS math.NT math.OA

    The $(abc,pqr)$-problem for Approximate Schauder Frames for Banach Spaces

    Authors: K. Mahesh Krishna

    Abstract: Motivated from the complete solution of important $abc$-problem for Gabor system for the Hilbert space $\mathcal{L}^2(\mathbb{R})$ by Dai and Sun [\textit{Memoirs of Amer. Math. Soc., 2016}] and from the existential result of approximate Schauder frames for $\mathcal{L}^p(\mathbb{R})$ using translation operators on $\mathcal{L}^p(\mathbb{R})$ by Freeman, Odell, Schlumprecht, and Zsak [\textit{Isra…

    Submitted 12 January, 2022; originally announced January 2022.

    Comments: 6 Pages, 0 Figures

    MSC Class: 42C15; 46E30

  38. arXiv:2112.13034  [pdf, other]

    cs.RO

    Non Holonomic Collision Avoidance of Dynamic Obstacles under Non-Parametric Uncertainty: A Hilbert Space Approach

    Authors: Unni Krishnan R Nair, Anish Gupta, D. A. Sasi Kiran, Ajay Shrihari, Vanshil Shah, Arun Kumar Singh, K. Madhava Krishna

    Abstract: We consider the problem of an agent/robot with non-holonomic kinematics avoiding many dynamic obstacles. State and velocity noise of both the robot and obstacles as well as the robot's control noise are modelled as non-parametric distributions as often the Gaussian assumptions of noise models are violated in real-world scenarios. Under these assumptions, we formulate a robust MPC that samples robo…

    Submitted 1 January, 2022; v1 submitted 24 December, 2021; originally announced December 2021.

  39. Grounding Linguistic Commands to Navigable Regions

    Authors: Nivedita Rufus, Kanishk Jain, Unni Krishnan R Nair, Vineet Gandhi, K Madhava Krishna

    Abstract: Humans have a natural ability to effortlessly comprehend linguistic commands such as "park next to the yellow sedan" and instinctively know which region of the road the vehicle should navigate. Extending this ability to autonomous vehicles is the next step towards creating fully autonomous agents that respond and act according to human commands. To this end, we propose the novel task of Referring…

    Submitted 24 December, 2021; originally announced December 2021.

    Journal ref: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021, pp. 8593-8600

  40. arXiv:2112.11254  [pdf, other]

    cs.RO

    Design And Analysis Of Three-Output Open Differential with 3-DOF

    Authors: Rama Vadapalli, Nagamanikandan Govindan, K Madhava Krishna

    Abstract: This paper presents a novel passive three-output differential with three degrees of freedom (3DOF), that translates motion and torque from a single input to three outputs. The proposed Three-Output Open Differential is designed such that its functioning is analogous to the functioning of a traditional two-output open differential. That is, the differential translates equal motion and torque to all…

    Submitted 21 December, 2021; originally announced December 2021.

    Comments: 8 pages, 9 figures, to be published in proceedings of the ASME 2021 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference IDETC/CIE 2021

  41. arXiv:2112.02996  [pdf, other]

    cs.RO

    Modular Pipe Climber III with Three-Output Open Differential

    Authors: Rama Vadapalli, Saharsh Agarwal, Vishnu Kumar, Kartik Suryavanshi, Nagamanikandan, K Madhava Krishna

    Abstract: The paper introduces the novel Modular Pipe Climber III with a Three-Output Open Differential (3-OOD) mechanism to eliminate slipping of the tracks due to the changing cross-sections of the pipe. This will be achieved in any orientation of the robot. Previous pipe climbers use three-wheel/track modules, each with an individual driving mechanism to achieve stable traversing. Slipping of tracks is p…

    Submitted 8 January, 2022; v1 submitted 1 November, 2021; originally announced December 2021.

  42. arXiv:2110.14928  [pdf, other]

    cs.RO

    Learning Actions for Drift-Free Navigation in Highly Dynamic Scenes

    Authors: Mohd Omama, Sundar Sripada V. S., Sandeep Chinchali, K. Madhava Krishna

    Abstract: We embark on a hitherto unreported problem of an autonomous robot (self-driving car) navigating in dynamic scenes in a manner that reduces its localization error and eventual cumulative drift or Absolute Trajectory Error, which is pronounced in such dynamic scenes. With the hugely popular Velodyne-16 3D LIDAR as the main sensing modality, and the accurate LIDAR-based Localization and Mapping algor…

    Submitted 31 March, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: Accepted at the American Control Conference 2022

  43. arXiv:2110.02904  [pdf, other]

    cs.RO

    CCO-VOXEL: Chance Constrained Optimization over Uncertain Voxel-Grid Representation for Safe Trajectory Planning

    Authors: Sudarshan S Harithas, Rishabh Dev Yadav, Deepak Singh, Arun Kumar Singh, K Madhava Krishna

    Abstract: We present CCO-VOXEL: the very first chance-constrained optimization (CCO) algorithm that can compute trajectory plans with probabilistic safety guarantees in real-time directly on the voxel-grid representation of the world. CCO-VOXEL maps the distribution over the distance to the closest obstacle to a distribution over collision-constraint violation and computes an optimal trajectory that minimiz…

    Submitted 5 April, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: Accepted at ICRA 2022. Code available at https://github.com/sudarshan-s-harithas/CCO-VOXEL

  44. arXiv:2109.10392  [pdf, other]

    cs.RO

    Multi-Modal Model Predictive Control through Batch Non-Holonomic Trajectory Optimization: Application to Highway Driving

    Authors: Vivek K. Adajania, Aditya Sharma, Anish Gupta, Houman Masnavi, K Madhava Krishna, Arun K. Singh

    Abstract: Standard Model Predictive Control (MPC) or trajectory optimization approaches perform only a local search to solve a complex non-convex optimization problem. As a result, they cannot capture the multi-modal characteristic of human driving. A global optimizer can be a potential solution but is computationally intractable in a real-time setting. In this paper, we present a real-time MPC capable of s…

    Submitted 14 March, 2022; v1 submitted 21 September, 2021; originally announced September 2021.

    Comments: Published in IEEE Robotics and Automation Letters (RA-L)

  45. AutoLay: Benchmarking amodal layout estimation for autonomous driving

    Authors: Kaustubh Mani, N. Sai Shankar, Krishna Murthy Jatavallabhula, K. Madhava Krishna

    Abstract: Given an image or a video captured from a monocular camera, amodal layout estimation is the task of predicting semantics and occupancy in bird's eye view. The term amodal implies we also reason about entities in the scene that are occluded or truncated in image space. While several recent efforts have tackled this problem, there is a lack of standardization in task specification, datasets, and eva…

    Submitted 20 August, 2021; originally announced August 2021.

    Comments: Published in 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  46. arXiv:2103.10400  [pdf, other]

    cs.RO cs.CV

    RP-VIO: Robust Plane-based Visual-Inertial Odometry for Dynamic Environments

    Authors: Karnik Ram, Chaitanya Kharyal, Sudarshan S. Harithas, K. Madhava Krishna

    Abstract: Modern visual-inertial navigation systems (VINS) are faced with a critical challenge in real-world deployment: they need to operate reliably and robustly in highly dynamic environments. Current best solutions merely filter dynamic objects as outliers based on the semantics of the object category. Such an approach does not scale as it requires semantic classifiers to encompass all possibly-moving o…

    Submitted 5 December, 2021; v1 submitted 18 March, 2021; originally announced March 2021.

    Comments: Presented at IROS 2021, code and dataset available at https://karnikram.info/rp-vio

  47. Monocular Multi-Layer Layout Estimation for Warehouse Racks

    Authors: Meher Shashwat Nigam, Avinash Prabhu, Anurag Sahu, Puru Gupta, Tanvi Karandikar, N. Sai Shankar, Ravi Kiran Sarvadevabhatla, K. Madhava Krishna

    Abstract: Given a monocular colour image of a warehouse rack, we aim to predict the bird's-eye view layout for each shelf in the rack, which we term multi-layer layout prediction. To this end, we present RackLay, a deep neural network for real-time shelf layout estimation from a single image. Unlike previous layout estimation methods, which provide a single layout for the dominant ground plane alone, Rac…

    Submitted 28 October, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: Visit our project repository at https://github.com/Avinash2468/RackLay

  48. arXiv:2103.08573  [pdf, other]

    cs.CV cs.RO

    RoRD: Rotation-Robust Descriptors and Orthographic Views for Local Feature Matching

    Authors: Udit Singh Parihar, Aniket Gujarathi, Kinal Mehta, Satyajit Tourani, Sourav Garg, Michael Milford, K. Madhava Krishna

    Abstract: The use of local detectors and descriptors in typical computer vision pipelines works well until variations in viewpoint and appearance change become extreme. Past research in this area has typically focused on one of two approaches to this challenge: the use of projections into spaces more suitable for feature matching under extreme viewpoint changes, and attempting to learn features that are inhe…

    Submitted 24 March, 2022; v1 submitted 15 March, 2021; originally announced March 2021.

    Comments: Accepted to IROS 2021. Project Page: https://uditsinghparihar.github.io/RoRD/

  49. arXiv:2011.12912  [pdf, other]

    cs.CV cs.AI cs.RO

    DRACO: Weakly Supervised Dense Reconstruction And Canonicalization of Objects

    Authors: Rahul Sajnani, AadilMehdi Sanchawala, Krishna Murthy Jatavallabhula, Srinath Sridhar, K. Madhava Krishna

    Abstract: We present DRACO, a method for Dense Reconstruction And Canonicalization of Object shape from one or more RGB images. Canonical shape reconstruction, estimating 3D object shape in a coordinate space canonicalized for scale, rotation, and translation parameters, is an emerging paradigm that holds promise for a multitude of robotic applications. Prior approaches either rely on painstakingly gathered…

    Submitted 25 November, 2020; originally announced November 2020.

    Comments: Preprint. For project page and code, see https://aadilmehdis.github.io/DRACO-Project-Page/

  50. arXiv:2011.07613  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    BirdSLAM: Monocular Multibody SLAM in Bird's-Eye View

    Authors: Swapnil Daga, Gokul B. Nair, Anirudha Ramesh, Rahul Sajnani, Junaid Ahmed Ansari, K. Madhava Krishna

    Abstract: In this paper, we present BirdSLAM, a novel simultaneous localization and mapping (SLAM) system for the challenging scenario of autonomous driving platforms equipped with only a monocular camera. BirdSLAM tackles challenges faced by other monocular SLAM systems (such as scale ambiguity in monocular reconstruction, dynamic object localization, and uncertainty in feature representation) by using an…

    Submitted 15 November, 2020; originally announced November 2020.

    Comments: Accepted in VISIGRAPP (VISAPP) 2021