Skip to main content

Showing 1–50 of 134 results for author: Krishna, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.08003  [pdf, ps, other

    math.FA cs.IT math.OA math.QA

    Continuous Krishna-Parthasarathy Entropic Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: In 2002, Krishna and Parthasarathy [\textit{Sankhyā Ser. A}] derived discrete quantum version of Maassen-Uffink [\textit{Phys. Rev. Lett., 1988}] entropic uncertainty principle. In this paper, using the notion of continuous operator-valued frames, we derive an entropic uncertainty principle for arbitrary family of operators indexed by measure spaces having finite measure. We give an application to… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 7 pages, 0 Figures

    MSC Class: 81P15; 94A17; 42C15

    Journal ref: Special issue of Infinite Dimensional Analysis, Quantum Probability and Related Topics in honour of Prof. K. R. Parthasarathy, 18 March 2024

  2. arXiv:2404.17922  [pdf, other

    cs.CV cs.RO

    Open-Set 3D Semantic Instance Maps for Vision Language Navigation -- O3D-SIM

    Authors: Laksh Nanwani, Kumaraditya Gupta, Aditya Mathur, Swayam Agrawal, A. H. Abdul Hafez, K. Madhava Krishna

    Abstract: Humans excel at forming mental maps of their surroundings, equipping them to understand object relationships and navigate based on language queries. Our previous work SI Maps [1] showed that having instance-level information and the semantic understanding of an environment helps significantly improve performance for language-guided tasks. We extend this instance-level approach to 3D while increasi… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

  3. arXiv:2404.17842  [pdf, other

    cs.SE cs.AI

    Using LLMs in Software Requirements Specifications: An Empirical Evaluation

    Authors: Madhava Krishna, Bhagesh Gaur, Arsh Verma, Pankaj Jalote

    Abstract: The creation of a Software Requirements Specification (SRS) document is important for any software development project. Given the recent prowess of Large Language Models (LLMs) in answering natural language queries and generating sophisticated textual outputs, our study explores their capability to produce accurate, coherent, and structured drafts of these documents to accelerate the software deve… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: Accepted to RE@Next! at the IEEE International Requirements Engineering Conference 2024 at Reykjavik, Iceland

  4. arXiv:2404.06442  [pdf, other

    cs.CV cs.RO

    QueSTMaps: Queryable Semantic Topological Maps for 3D Scene Understanding

    Authors: Yash Mehan, Kumaraditya Gupta, Rohit Jayanti, Anirudh Govil, Sourav Garg, Madhava Krishna

    Abstract: Understanding the structural organisation of 3D indoor scenes in terms of rooms is often accomplished via floorplan extraction. Robotic tasks such as planning and navigation require a semantic understanding of the scene as well. This is typically achieved via object-level semantic segmentation. However, such methods struggle to segment out topological regions like "kitchen" in the scene. In this w… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  5. arXiv:2404.04643  [pdf, other

    cs.RO cs.CV

    Constrained 6-DoF Grasp Generation on Complex Shapes for Improved Dual-Arm Manipulation

    Authors: Gaurav Singh, Sanket Kalwar, Md Faizal Karim, Bipasha Sen, Nagamanikandan Govindan, Srinath Sridhar, K Madhava Krishna

    Abstract: Efficiently generating grasp poses tailored to specific regions of an object is vital for various robotic manipulation tasks, especially in a dual-arm setup. This scenario presents a significant challenge due to the complex geometries involved, requiring a deep understanding of the local geometry to generate grasps efficiently on the specified constrained regions. Existing methods only explore set… ▽ More

    Submitted 15 July, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

    Comments: Project Page: https://constrained-grasp-diffusion.github.io/

  6. arXiv:2404.03587  [pdf, other

    cs.RO cs.AI

    Anticipate & Collab: Data-driven Task Anticipation and Knowledge-driven Planning for Human-robot Collaboration

    Authors: Shivam Singh, Karthik Swaminathan, Raghav Arora, Ramandeep Singh, Ahana Datta, Dipanjan Das, Snehasis Banerjee, Mohan Sridharan, Madhava Krishna

    Abstract: An agent assisting humans in daily living activities can collaborate more effectively by anticipating upcoming tasks. Data-driven methods represent the state of the art in task anticipation, planning, and related problems, but these methods are resource-hungry and opaque. Our prior work introduced a proof of concept framework that used an LLM to anticipate 3 high-level tasks that served as goals f… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  7. arXiv:2404.03307  [pdf, other

    cs.RO eess.SY

    Bi-level Trajectory Optimization on Uneven Terrains with Differentiable Wheel-Terrain Interaction Model

    Authors: Amith Manoharan, Aditya Sharma, Himani Belsare, Kaustab Pal, K. Madhava Krishna, Arun Kumar Singh

    Abstract: Navigation of wheeled vehicles on uneven terrain necessitates going beyond the 2D approaches for trajectory planning. Specifically, it is essential to incorporate the full 6dof variation of vehicle pose and its associated stability cost in the planning process. To this end, most recent works aim to learn a neural network model to predict the vehicle evolution. However, such approaches are data-int… ▽ More

    Submitted 11 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: 8 pages, 7 figures, submitted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2024)

  8. arXiv:2404.00910  [pdf, ps, other

    math.FA cs.IT math-ph

    Unexpected Uncertainty Principle for Disc Banach Spaces

    Authors: K. Mahesh Krishna

    Abstract: Let $(\{f_n\}_{n=1}^\infty, \{τ_n\}_{n=1}^\infty)$ and $(\{g_n\}_{n=1}^\infty, \{ω_n\}_{n=1}^\infty)$ be unbounded continuous p-Schauder frames ($0<p<1$) for a disc Banach space $\mathcal{X}$. Then for every $x \in ( \mathcal{D}(θ_f) \cap\mathcal{D}(θ_g))\setminus\{0\}$, we show that \begin{align}\label{UB} (1) \quad \quad \quad \quad \|θ_f x\|_0\|θ_g x\|_0 \geq \frac{1}{\left(\displaystyle\sup_{n… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 6 Pages, 0 Figures

    MSC Class: 42C15

  9. arXiv:2403.20116  [pdf, other

    cs.RO

    LeGo-Drive: Language-enhanced Goal-oriented Closed-Loop End-to-End Autonomous Driving

    Authors: Pranjal Paul, Anant Garg, Tushar Choudhary, Arun Kumar Singh, K. Madhava Krishna

    Abstract: Existing Vision-Language models (VLMs) estimate either long-term trajectory waypoints or a set of control actions as a reactive solution for closed-loop planning based on their rich scene comprehension. However, these estimations are coarse and are subjective to their "world understanding" which may generate sub-optimal decisions due to perception errors. In this paper, we introduce LeGo-Drive, wh… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

  10. arXiv:2403.17946  [pdf, ps, other

    math.FA cs.IT math-ph

    Nonlinear Heisenberg-Robertson-Schrodinger Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: We derive an uncertainty principle for Lipschitz maps acting on subsets of Banach spaces. We show that this nonlinear uncertainty principle reduces to the Heisenberg-Robertson-Schrodinger uncertainty principle for linear operators acting on Hilbert spaces.

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 4 Pages, 0 Figures

    MSC Class: 26A16; 46B99

  11. arXiv:2402.08591  [pdf, ps, other

    math.FA cs.IT math-ph

    Nonlinear Maccone-Pati Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: We show that one of the two important uncertainty principles derived by Maccone and Pati \textit{[Phys. Rev. Lett., 2014]} can be derived for arbitrary maps defined on subsets of $\mathcal{L}^p$ spaces for $1< p<\infty$. Our main tool is the Clarkson inequalities. We also derive a nonlinear uncertainty principle for weak parallelogram spaces and Type-p Banach spaces.

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 6 pages, 0 figures

    MSC Class: 46B20; 46E30

  12. arXiv:2402.04255  [pdf, ps, other

    math.FA cs.IT

    Functional Kuppinger-Durisi-Bölcskei Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Let $\mathcal{X}$ be a Banach space. Let $\{τ_j\}_{j=1}^n, \{ω_k\}_{k=1}^m\subseteq \mathcal{X}$ and $\{f_j\}_{j=1}^n$, $\{g_k\}_{k=1}^m\subseteq \mathcal{X}^*$ satisfy $ |f_j(τ_j)|\geq 1$ for all $ 1\leq j \leq n$, $|g_k(ω_k)|\geq 1 $ for all $1\leq k \leq m$. If $x \in \mathcal{X}\setminus \{0\}$ is such that $x=θ_τθ_f x=θ_ωθ_g x$, then we show that \begin{align}\label{FKDB} (1) \quad\quad\quad\… ▽ More

    Submitted 1 January, 2024; originally announced February 2024.

    Comments: 9 Pages, 0 Figures

    MSC Class: 46A45; 46B45; 42C15

  13. arXiv:2401.17399  [pdf, other

    cs.RO

    ATPPNet: Attention based Temporal Point cloud Prediction Network

    Authors: Kaustab Pal, Aditya Sharma, Avinash Sharma, K. Madhava Krishna

    Abstract: Point cloud prediction is an important yet challenging task in the field of autonomous driving. The goal is to predict future point cloud sequences that maintain object structures while accurately representing their temporal motion. These predicted point clouds help in other subsequent tasks like object trajectory estimation for collision avoidance or estimating locations with the least odometry d… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted for presentation at the 2024 IEEE International Conference on Robotics and Automation (ICRA)

  14. arXiv:2401.02472  [pdf, ps, other

    cs.DC

    Code Generation for a Variety of Accelerators for a Graph DSL

    Authors: Ashwina Kumar, M. Venkata Krishna, Prasanna Bartakke, Rahul Kumar, Rajesh Pandian M, Nibedita Behera, Rupesh Nasre

    Abstract: Sparse graphs are ubiquitous in real and virtual worlds. With the phenomenal growth in semi-structured and unstructured data, sizes of the underlying graphs have witnessed a rapid growth over the years. Analyzing such large structures necessitates parallel processing, which is challenged by the intrinsic irregularity of sparse computation, memory access, and communication. It would be ideal if pro… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: arXiv admin note: text overlap with arXiv:2305.03317

  15. arXiv:2312.16648  [pdf, other

    cs.RO cs.CV

    LIP-Loc: LiDAR Image Pretraining for Cross-Modal Localization

    Authors: Sai Shubodh Puligilla, Mohammad Omama, Husain Zaidi, Udit Singh Parihar, Madhava Krishna

    Abstract: Global visual localization in LiDAR-maps, crucial for autonomous driving applications, remains largely unexplored due to the challenging issue of bridging the cross-modal heterogeneity gap. Popular multi-modal learning approach Contrastive Language-Image Pre-Training (CLIP) has popularized contrastive symmetric loss using batch construction technique by applying it to multi-modal domains of text a… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: To be presented at WACV-W 2024. Project page: https://shubodhs.ai/liploc

  16. arXiv:2312.00366  [pdf, ps, other

    math.FA cs.IT math-ph

    Unbounded Donoho-Stark-Elad-Bruckstein-Ricaud-Torrésani Uncertainty Principles

    Authors: K. Mahesh Krishna

    Abstract: Let $(Ω, μ)$, $(Δ, ν)$ be measure spaces and $p=1$ or $p=\infty$. Let $(\{f_α\}_{α\in Ω}, \{τ_α\}_{α\in Ω})$ and $(\{g_β\}_{β\in Δ}, \{ω_β\}_{β\in Δ})$ be unbounded continuous p-Schauder frames for a Banach space $\mathcal{X}$. Then for every $x \in ( \mathcal{D}(θ_f) \cap\mathcal{D}(θ_g))\setminus\{0\}$, we show that \begin{align}\label{UB} (1) \quad \quad \quad \quad μ(\operatorname{supp}(θ_f… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 6 Figures, 0 Figures

    MSC Class: 42C15

  17. arXiv:2311.14635  [pdf

    cs.CV cs.RO

    Automated Detection and Counting of Windows using UAV Imagery based Remote Sensing

    Authors: Dhruv Patel, Shivani Chepuri, Sarvesh Thakur, K. Harikumar, Ravi Kiran S., K. Madhava Krishna

    Abstract: Despite the technological advancements in the construction and surveying sector, the inspection of salient features like windows in an under-construction or existing building is predominantly a manual process. Moreover, the number of windows present in a building is directly related to the magnitude of deformation it suffers under earthquakes. In this research, a method to accurately detect and co… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

  18. NeuroSMPC: A Neural Network guided Sampling Based MPC for On-Road Autonomous Driving

    Authors: Kaustab Pal, Aditya Sharma, Mohd Omama, Parth N. Shah, K. Madhava Krishna

    Abstract: In this paper we show an effective means of integrating data driven frameworks to sampling based optimal control to vastly reduce the compute time for easy adoption and adaptation to real time applications such as on-road autonomous driving in the presence of dynamic actors. Presented with training examples, a spatio-temporal CNN learns to predict the optimal mean control over a finite horizon tha… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Published in 2023 IEEE 19th International Conference on Automation Science and Engineering (CASE)

  19. arXiv:2310.08270  [pdf, other

    cs.RO

    Hilbert Space Embedding-based Trajectory Optimization for Multi-Modal Uncertain Obstacle Trajectory Prediction

    Authors: Basant Sharma, Aditya Sharma, K. Madhava Krishna, Arun Kumar Singh

    Abstract: Safe autonomous driving critically depends on how well the ego-vehicle can predict the trajectories of neighboring vehicles. To this end, several trajectory prediction algorithms have been presented in the existing literature. Many of these approaches output a multi-modal distribution of obstacle trajectories instead of a single deterministic prediction to account for the underlying uncertainty. H… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

  20. arXiv:2310.04802  [pdf, other

    cs.RO

    Hierarchical Unsupervised Topological SLAM

    Authors: Ayush Sharma, Yash Mehan, Pradyumna Dasu, Sourav Garg, Madhava Krishna

    Abstract: In this paper we present a novel framework for unsupervised topological clustering resulting in improved loop. In this paper we present a novel framework for unsupervised topological clustering resulting in improved loop detection and closure for SLAM. A navigating mobile robot clusters its traversal into visually similar topologies where each cluster (topology) contains a set of similar looking i… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

    Comments: Accepted to IEEE ITSC 2023

  21. arXiv:2310.04181  [pdf, other

    cs.CV cs.RO

    DiffPrompter: Differentiable Implicit Visual Prompts for Semantic-Segmentation in Adverse Conditions

    Authors: Sanket Kalwar, Mihir Ungarala, Shruti Jain, Aaron Monis, Krishna Reddy Konda, Sourav Garg, K Madhava Krishna

    Abstract: Semantic segmentation in adverse weather scenarios is a critical task for autonomous driving systems. While foundation models have shown promise, the need for specialized adaptors becomes evident for handling more challenging scenarios. We introduce DiffPrompter, a novel differentiable visual and latent prompting mechanism aimed at expanding the learning capabilities of existing adaptors in founda… ▽ More

    Submitted 26 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

  22. arXiv:2310.02324  [pdf, other

    cs.RO

    ALT-Pilot: Autonomous navigation with Language augmented Topometric maps

    Authors: Mohammad Omama, Pranav Inani, Pranjal Paul, Sarat Chandra Yellapragada, Krishna Murthy Jatavallabhula, Sandeep Chinchali, Madhava Krishna

    Abstract: We present an autonomous navigation system that operates without assuming HD LiDAR maps of the environment. Our system, ALT-Pilot, relies only on publicly available road network information and a sparse (and noisy) set of crowdsourced language landmarks. With the help of onboard sensors and a language-augmented topometric map, ALT-Pilot autonomously pilots the vehicle to any destination on the roa… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  23. arXiv:2310.02251  [pdf, other

    cs.CV cs.RO

    Talk2BEV: Language-enhanced Bird's-eye View Maps for Autonomous Driving

    Authors: Tushar Choudhary, Vikrant Dewangan, Shivam Chandhok, Shubham Priyadarshan, Anushka Jain, Arun K. Singh, Siddharth Srivastava, Krishna Murthy Jatavallabhula, K. Madhava Krishna

    Abstract: Talk2BEV is a large vision-language model (LVLM) interface for bird's-eye view (BEV) maps in autonomous driving contexts. While existing perception systems for autonomous driving scenarios have largely focused on a pre-defined (closed) set of object categories and driving scenarios, Talk2BEV blends recent advances in general-purpose language and vision models with BEV-structured map representation… ▽ More

    Submitted 14 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Project page at https://llmbev.github.io/talk2bev/

  24. arXiv:2309.11414  [pdf, other

    cs.RO cs.AI cs.LG

    EDMP: Ensemble-of-costs-guided Diffusion for Motion Planning

    Authors: Kallol Saha, Vishal Mandadi, Jayaram Reddy, Ajit Srikanth, Aditya Agarwal, Bipasha Sen, Arun Singh, Madhava Krishna

    Abstract: Classical motion planning for robotic manipulation includes a set of general algorithms that aim to minimize a scene-specific cost of executing a given plan. This approach offers remarkable adaptability, as they can be directly used off-the-shelf for any new scene without needing specific training datasets. However, without a prior understanding of what diverse valid trajectories are and without s… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: 8 pages, 8 figures, submitted to ICRA 2024 (International Conference on Robotics and Automation)

  25. arXiv:2308.00688  [pdf, other

    cs.CV cs.AI cs.RO

    AnyLoc: Towards Universal Visual Place Recognition

    Authors: Nikhil Keetha, Avneesh Mishra, Jay Karhade, Krishna Murthy Jatavallabhula, Sebastian Scherer, Madhava Krishna, Sourav Garg

    Abstract: Visual Place Recognition (VPR) is vital for robot localization. To date, the most performant VPR approaches are environment- and task-specific: while they exhibit strong performance in structured environments (predominantly urban driving), their performance degrades severely in unstructured environments, rendering most approaches brittle to robust real-world deployment. In this work, we develop a… ▽ More

    Submitted 28 November, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

    Comments: IEEE RA-L 2023 (Presented at ICRA 2024)

  26. arXiv:2307.01215  [pdf, ps, other

    math.FA cs.IT

    Functional Donoho-Stark Approximate Support Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Let $(\{f_j\}_{j=1}^n, \{τ_j\}_{j=1}^n)$ and $(\{g_k\}_{k=1}^n, \{ω_k\}_{k=1}^n)$ be two p-orthonormal bases for a finite dimensional Banach space $\mathcal{X}$. If $ x \in \mathcal{X}\setminus\{0\}$ is such that $θ_fx$ is $\varepsilon$-supported on $M\subseteq \{1,\dots, n\}$ w.r.t. p-norm and $θ_gx$ is $δ$-supported on $N\subseteq \{1,\dots, n\}$ w.r.t. p-norm, then we show that \begin{align}\la… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: 7 Pages, 0 Figures

    MSC Class: 42C15; 46B03; 46B04

  27. arXiv:2306.06093  [pdf, other

    cs.CV

    HyP-NeRF: Learning Improved NeRF Priors using a HyperNetwork

    Authors: Bipasha Sen, Gaurav Singh, Aditya Agarwal, Rohith Agaram, K Madhava Krishna, Srinath Sridhar

    Abstract: Neural Radiance Fields (NeRF) have become an increasingly popular representation to capture high-quality appearance and shape of scenes and objects. However, learning generalizable NeRF priors over categories of scenes or objects has been challenging due to the high dimensionality of network weight space. To address the limitations of existing work on generalization, multi-view consistency and to… ▽ More

    Submitted 23 December, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Project Page: https://hyp-nerf.github.io

  28. arXiv:2306.04939  [pdf, other

    cs.RO

    UAP-BEV: Uncertainty Aware Planning using Bird's Eye View generated from Surround Monocular Images

    Authors: Vikrant Dewangan, Basant Sharma, Tushar Choudhary, Sarthak Sharma, Aakash Aanegola, Arun K. Singh, K. Madhava Krishna

    Abstract: Autonomous driving requires accurate reasoning of the location of objects from raw sensor data. Recent end-to-end learning methods go from raw sensor data to a trajectory output via Bird's Eye View(BEV) segmentation as an interpretable intermediate representation. Motion planning over cost maps generated via Birds Eye View (BEV) segmentation has emerged as a prominent approach in autonomous drivin… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Accepted to CASE 2023. Project video available at https://vikr-182.github.io/UAP-BEV

  29. arXiv:2306.01540  [pdf, other

    cs.RO

    CLIPGraphs: Multimodal Graph Networks to Infer Object-Room Affinities

    Authors: Ayush Agrawal, Raghav Arora, Ahana Datta, Snehasis Banerjee, Brojeshwar Bhowmick, Krishna Murthy Jatavallabhula, Mohan Sridharan, Madhava Krishna

    Abstract: This paper introduces a novel method for determining the best room to place an object in, for embodied scene rearrangement. While state-of-the-art approaches rely on large language models (LLMs) or reinforcement learned (RL) policies for this task, our approach, CLIPGraphs, efficiently combines commonsense domain knowledge, data-driven methods, and recent advances in multimodal learning. Specifica… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Journal ref: RO-MAN 2023 Conference

  30. arXiv:2306.01014  [pdf, ps, other

    math.FA cs.IT

    Functional Ghobber-Jaming Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Let $(\{f_j\}_{j=1}^n, \{τ_j\}_{j=1}^n)$ and $(\{g_k\}_{k=1}^n, \{ω_k\}_{k=1}^n)$ be two p-orthonormal bases for a finite dimensional Banach space $\mathcal{X}$. Let $M,N\subseteq \{1, \dots, n\}$ be such that \begin{align*} o(M)^\frac{1}{q}o(N)^\frac{1}{p}< \frac{1}{\displaystyle \max_{1\leq j,k\leq n}|g_k(τ_j) |}, \end{align*} where $q$ is the conjugate index of $p$. Then for all… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 7 Pages, 0 Figures

    MSC Class: 42C15; 46B03; 46B04

  31. Instance-Level Semantic Maps for Vision Language Navigation

    Authors: Laksh Nanwani, Anmol Agarwal, Kanishk Jain, Raghav Prabhakar, Aaron Monis, Aditya Mathur, Krishna Murthy, Abdul Hafez, Vineet Gandhi, K. Madhava Krishna

    Abstract: Humans have a natural ability to perform semantic associations with the surrounding objects in the environment. This allows them to create a mental map of the environment, allowing them to navigate on-demand when given linguistic instructions. A natural goal in Vision Language Navigation (VLN) research is to impart autonomous agents with similar capabilities. Recent works take a step towards this… ▽ More

    Submitted 1 July, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Journal ref: IEEE RO-MAN 2023

  32. arXiv:2305.06178  [pdf

    cs.RO cs.AI cs.LG

    Sequence-Agnostic Multi-Object Navigation

    Authors: Nandiraju Gireesh, Ayush Agrawal, Ahana Datta, Snehasis Banerjee, Mohan Sridharan, Brojeshwar Bhowmick, Madhava Krishna

    Abstract: The Multi-Object Navigation (MultiON) task requires a robot to localize an instance (each) of multiple object classes. It is a fundamental task for an assistive robot in a home or a factory. Existing methods for MultiON have viewed this as a direct extension of Object Navigation (ON), the task of localising an instance of one object class, and are pre-sequenced, i.e., the sequence in which the obj… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Journal ref: ICRA 2023 conference

  33. arXiv:2304.03324  [pdf, ps, other

    math.FA cs.IT

    Functional Donoho-Stark-Elad-Bruckstein-Ricaud-Torrésani Uncertainty Principle

    Authors: K. Mahesh Krishna

    Abstract: Let $(\{f_j\}_{j=1}^n, \{τ_j\}_{j=1}^n)$ and $(\{g_k\}_{k=1}^m, \{ω_k\}_{k=1}^m)$ be p-Schauder frames for a finite dimensional Banach space $\mathcal{X}$. Then for every $x \in \mathcal{X}\setminus\{0\}$, we show that \begin{align} (1) \quad \|θ_f x\|_0^\frac{1}{p}\|θ_g x\|_0^\frac{1}{q} \geq \frac{1}{\displaystyle\max_{1\leq j\leq n, 1\leq k\leq m}|f_j(ω_k)|}\quad \text{and} \quad \|θ_g x\|_0^\f… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: 5 Pages, 0 Figures

    MSC Class: 42C15

  34. arXiv:2304.01074  [pdf, other

    cs.RO

    FinderNet: A Data Augmentation Free Canonicalization aided Loop Detection and Closure technique for Point clouds in 6-DOF separation

    Authors: Sudarshan S Harithas, Gurkirat Singh, Aneesh Chavan, Sarthak Sharma, Suraj Patni, Chetan Arora, K. Madhava Krishna

    Abstract: We focus on the problem of LiDAR point cloud based loop detection (or Finding) and closure (LDC) in a multi-agent setting. State-of-the-art (SOTA) techniques directly generate learned embeddings of a given point cloud, require large data transfers, and are not robust to wide variations in 6 Degrees-of-Freedom (DOF) viewpoint. Moreover, absence of strong priors in an unstructured point cloud leads… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  35. arXiv:2302.07241  [pdf, other

    cs.CV cs.AI cs.RO

    ConceptFusion: Open-set Multimodal 3D Mapping

    Authors: Krishna Murthy Jatavallabhula, Alihusein Kuwajerwala, Qiao Gu, Mohd Omama, Tao Chen, Alaa Maalouf, Shuang Li, Ganesh Iyer, Soroush Saryazdi, Nikhil Keetha, Ayush Tewari, Joshua B. Tenenbaum, Celso Miguel de Melo, Madhava Krishna, Liam Paull, Florian Shkurti, Antonio Torralba

    Abstract: Building 3D maps of the environment is central to robot navigation, planning, and interaction with objects in a scene. Most existing approaches that integrate semantic concepts with 3D maps largely remain confined to the closed-set setting: they can only reason about a finite set of concepts, pre-defined at training time. Further, these maps can only be queried using class labels, or in recent wor… ▽ More

    Submitted 23 October, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: RSS 2023. Project page: https://concept-fusion.github.io Explainer video: https://www.youtube.com/watch?v=rkXgws8fiDs Code: https://github.com/concept-fusion/concept-fusion

  36. arXiv:2301.07213  [pdf, other

    cs.CV cs.RO

    SCARP: 3D Shape Completion in ARbitrary Poses for Improved Grasping

    Authors: Bipasha Sen, Aditya Agarwal, Gaurav Singh, Brojeshwar B., Srinath Sridhar, Madhava Krishna

    Abstract: Recovering full 3D shapes from partial observations is a challenging task that has been extensively addressed in the computer vision community. Many deep learning methods tackle this problem by training 3D shape generation networks to learn a prior over the full 3D shapes. In this training regime, the methods expect the inputs to be in a fixed canonical form, without which they fail to learn a val… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: Accepted at ICRA 2023

  37. arXiv:2212.02493  [pdf, other

    cs.CV

    Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural Fields

    Authors: Rohith Agaram, Shaurya Dewan, Rahul Sajnani, Adrien Poulenard, Madhava Krishna, Srinath Sridhar

    Abstract: Coordinate-based implicit neural networks, or neural fields, have emerged as useful representations of shape and appearance in 3D computer vision. Despite advances, however, it remains challenging to build neural fields for categories of objects without datasets like ShapeNet that provide "canonicalized" object instances that are consistently aligned for their 3D position and orientation (pose). W… ▽ More

    Submitted 17 May, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

  38. arXiv:2211.16882  [pdf, other

    cs.CV cs.RO

    MVRackLay: Monocular Multi-View Layout Estimation for Warehouse Racks and Shelves

    Authors: Pranjali Pathre, Anurag Sahu, Ashwin Rao, Avinash Prabhu, Meher Shashwat Nigam, Tanvi Karandikar, Harit Pandya, K. Madhava Krishna

    Abstract: In this paper, we propose and showcase, for the first time, monocular multi-view layout estimation for warehouse racks and shelves. Unlike typical layout estimation methods, MVRackLay estimates multi-layered layouts, wherein each layer corresponds to the layout of a shelf within a rack. Given a sequence of images of a warehouse scene, a dual-headed Convolutional-LSTM architecture outputs segmented… ▽ More

    Submitted 30 November, 2022; originally announced November 2022.

    Journal ref: IEEE International Conference on Robotics and Biomimetics (ROBIO) 2022

  39. arXiv:2210.10005  [pdf, other

    cs.CV

    Otsu based Differential Evolution Method for Image Segmentation

    Authors: Afreen Shaikh, Sharmila Botcha, Murali Krishna

    Abstract: This paper proposes an OTSU based differential evolution method for satellite image segmentation and compares it with four other methods such as Modified Artificial Bee Colony Optimizer (MABC), Artificial Bee Colony (ABC), Genetic Algorithm (GA), and Particle Swarm Optimization (PSO) using the objective function proposed by Otsu for optimal multilevel thresholding. The experiments conducted and th… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    ACM Class: I.2.10; I.4.6

  40. arXiv:2210.07062  [pdf, ps, other

    cs.IT math.FA math.NT

    Non-Archimedean Welch Bounds and Non-Archimedean Zauner Conjecture

    Authors: K. Mahesh Krishna

    Abstract: Let $\mathbb{K}$ be a non-Archimedean (complete) valued field satisfying \begin{align*} \left|\sum_{j=1}^{n}λ_j^2\right|=\max_{1\leq j \leq n}|λ_j|^2, \quad \forall λ_j \in \mathbb{K}, 1\leq j \leq n, \forall n \in \mathbb{N}. \end{align*} For $d\in \mathbb{N}$, let $\mathbb{K}^d$ be the standard $d$-dimensional non-Archimedean Hilbert space. Let $m \in \mathbb{N}$ and… ▽ More

    Submitted 28 August, 2022; originally announced October 2022.

    Comments: 9 Pages, 0 Figures

    MSC Class: 12J25; 46S10; 47S10

  41. arXiv:2209.14922  [pdf, other

    cs.CV cs.RO

    GDIP: Gated Differentiable Image Processing for Object-Detection in Adverse Conditions

    Authors: Sanket Kalwar, Dhruv Patel, Aakash Aanegola, Krishna Reddy Konda, Sourav Garg, K Madhava Krishna

    Abstract: Detecting objects under adverse weather and lighting conditions is crucial for the safe and continuous operation of an autonomous vehicle, and remains an unsolved problem. We present a Gated Differentiable Image Processing (GDIP) block, a domain-agnostic network architecture, which can be plugged into existing object detection networks (e.g., Yolo) and trained end-to-end with adverse condition ima… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: Submitted to ICRA2023. More information at https://gatedip.github.io

  42. arXiv:2209.13418  [pdf, other

    cs.CV cs.RO

    UAV-based Visual Remote Sensing for Automated Building Inspection

    Authors: Kushagra Srivastava, Dhruv Patel, Aditya Kumar Jha, Mohhit Kumar Jha, Jaskirat Singh, Ravi Kiran Sarvadevabhatla, Pradeep Kumar Ramancharla, Harikumar Kandath, K. Madhava Krishna

    Abstract: Unmanned Aerial Vehicle (UAV) based remote sensing system incorporated with computer vision has demonstrated potential for assisting building construction and in disaster management like damage assessment during earthquakes. The vulnerability of a building to earthquake can be assessed through inspection that takes into account the expected damage progression of the associated component and the co… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: Paper accepted at CVCIE Workshop at ECCV, 2022 and the project page is https://uvrsabi.github.io/

  43. arXiv:2209.11972  [pdf, other

    cs.CV

    Ground then Navigate: Language-guided Navigation in Dynamic Scenes

    Authors: Kanishk Jain, Varun Chhangani, Amogh Tiwari, K. Madhava Krishna, Vineet Gandhi

    Abstract: We investigate the Vision-and-Language Navigation (VLN) problem in the context of autonomous driving in outdoor settings. We solve the problem by explicitly grounding the navigable regions corresponding to the textual command. At each timestamp, the model predicts a segmentation mask corresponding to the intermediate or the final navigable region. Our work contrasts with existing efforts in VLN, w… ▽ More

    Submitted 24 September, 2022; originally announced September 2022.

  44. arXiv:2209.04805  [pdf, other

    cs.RO

    Real-Time Heuristic Framework for Safe Landing of UAVs in Dynamic Scenarios

    Authors: Jaskirat Singh, Neel Adwani, Harikumar Kandath, K. Madhava Krishna

    Abstract: The world we live in is full of technology and with each passing day the advancement and usage of UAVs increases efficiently. As a result of the many application scenarios, there are some missions where the UAVs are vulnerable to external disruptions, such as a ground station's loss of connectivity, security missions, safety concerns, and delivery-related missions. Therefore, depending on the scen… ▽ More

    Submitted 11 September, 2022; originally announced September 2022.

    Comments: 8 pages, 6 figures, 36 references

  45. arXiv:2208.13031  [pdf, other

    cs.RO cs.AI

    Spatial Relation Graph and Graph Convolutional Network for Object Goal Navigation

    Authors: D. A. Sasi Kiran, Kritika Anand, Chaitanya Kharyal, Gulshan Kumar, Nandiraju Gireesh, Snehasis Banerjee, Ruddra dev Roychoudhury, Mohan Sridharan, Brojeshwar Bhowmick, Madhava Krishna

    Abstract: This paper describes a framework for the object-goal navigation task, which requires a robot to find and move to the closest instance of a target object class from a random starting position. The framework uses a history of robot trajectories to learn a Spatial Relational Graph (SRG) and Graph Convolutional Network (GCN)-based embeddings for the likelihood of proximity of different semantically-la… ▽ More

    Submitted 27 August, 2022; originally announced August 2022.

    Comments: CASE 2022 paper

  46. arXiv:2208.13009  [pdf, other

    cs.RO cs.AI

    Object Goal Navigation using Data Regularized Q-Learning

    Authors: Nandiraju Gireesh, D. A. Sasi Kiran, Snehasis Banerjee, Mohan Sridharan, Brojeshwar Bhowmick, Madhava Krishna

    Abstract: Object Goal Navigation requires a robot to find and navigate to an instance of a target object class in a previously unseen environment. Our framework incrementally builds a semantic map of the environment over time, and then repeatedly selects a long-term goal ('where to go') based on the semantic map to locate the target object instance. Long-term goal selection is formulated as a vision-based d… ▽ More

    Submitted 27 August, 2022; originally announced August 2022.

    Comments: CASE 2022 paper

  47. arXiv:2208.03038  [pdf, other

    cs.RO math.OC

    Leveraging Distributional Bias for Reactive Collision Avoidance under Uncertainty: A Kernel Embedding Approach

    Authors: Anish Gupta, Arun Kumar Singh, K. Madhava Krishna

    Abstract: Many commodity sensors that measure the robot and dynamic obstacle's state have non-Gaussian noise characteristics. Yet, many current approaches treat the underlying-uncertainty in motion and perception as Gaussian, primarily to ensure computational tractability. On the other hand, existing planners working with non-Gaussian uncertainty do not shed light on leveraging distributional characteristic… ▽ More

    Submitted 22 September, 2022; v1 submitted 5 August, 2022; originally announced August 2022.

  48. arXiv:2207.03557  [pdf, other

    cs.RO

    Flow Synthesis Based Visual Servoing Frameworks for Monocular Obstacle Avoidance Amidst High-Rises

    Authors: Harshit K. Sankhla, M. Nomaan Qureshi, Shankara Narayanan V., Vedansh Mittal, Gunjan Gupta, Harit Pandya, K. Madhava Krishna

    Abstract: We propose a novel flow synthesis based visual servoing framework enabling long-range obstacle avoidance for Micro Air Vehicles (MAV) flying amongst tall skyscrapers. Recent deep learning based frameworks use optical flow to do high-precision visual servoing. In this paper, we explore the question: can we design a surrogate flow for these high-precision visual-servoing methods, which leads to obst… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

    Comments: Accepted to IEEE International Conference on Automation Science and Engineering (CASE), 2022

  49. arXiv:2205.04090  [pdf, other

    cs.RO

    Approaches and Challenges in Robotic Perception for Table-top Rearrangement and Planning

    Authors: Aditya Agarwal, Bipasha Sen, Shankara Narayanan V, Vishal Reddy Mandadi, Brojeshwar Bhowmick, K Madhava Krishna

    Abstract: Table-top Rearrangement and Planning is a challenging problem that relies heavily on an excellent perception stack. The perception stack involves observing and registering the 3D scene on the table, detecting what objects are on the table, and how to manipulate them. Consequently, it greatly influences the system's task-planning and motion-planning stacks that follow. We present a comprehensive ov… ▽ More

    Submitted 3 June, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: 5 pages including references, 3 figures

  50. arXiv:2204.00865  [pdf, other

    cs.RO

    UrbanFly: Uncertainty-Aware Planning for Navigation Amongst High-Rises with Monocular Visual-Inertial SLAM Maps

    Authors: Sudarshan S Harithas, Ayyappa Swamy Thatavarthy, Gurkirat Singh, Arun K Singh, K Madhava Krishna

    Abstract: We present UrbanFly: an uncertainty-aware real-time planning framework for quadrotor navigation in urban high-rise environments. A core aspect of UrbanFly is its ability to robustly plan directly on the sparse point clouds generated by a Monocular Visual Inertial SLAM (VINS) backend. It achieves this by using the sparse point clouds to build an uncertainty-integrated cuboid representation of the e… ▽ More

    Submitted 3 October, 2022; v1 submitted 2 April, 2022; originally announced April 2022.

    Comments: Submitted to ACC 2023, Code available at https://github.com/sudarshan-s-harithas/UrbanFly