Zum Hauptinhalt springen

Showing 1–32 of 32 results for author: Vora, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00863  [pdf, other

    cs.CV

    Dynamically Modulating Visual Place Recognition Sequence Length For Minimum Acceptable Performance Scenarios

    Authors: Connor Malone, Ankit Vora, Thierry Peynot, Michael Milford

    Abstract: Mobile robots and autonomous vehicles are often required to function in environments where critical position estimates from sensors such as GPS become uncertain or unreliable. Single image visual place recognition (VPR) provides an alternative for localization but often requires techniques such as sequence matching to improve robustness, which incurs additional computation and latency costs. Even… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: DOI TBC

  2. arXiv:2407.00101  [pdf, other

    cs.LG cs.AI cs.CC cs.DC cs.NE

    Hybrid Approach to Parallel Stochastic Gradient Descent

    Authors: Aakash Sudhirbhai Vora, Dhrumil Chetankumar Joshi, Aksh Kantibhai Patel

    Abstract: Stochastic Gradient Descent is used for large datasets to train models to reduce the training time. On top of that data parallelism is widely used as a method to efficiently train neural networks using multiple worker nodes in parallel. Synchronous and asynchronous approach to data parallelism is used by most systems to train the model in parallel. However, both of them have their drawbacks. We pr… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.

  3. arXiv:2405.19338  [pdf, other

    eess.SP cs.AI cs.CV

    Accurate Patient Alignment without Unnecessary Imaging Dose via Synthesizing Patient-specific 3D CT Images from 2D kV Images

    Authors: Yuzhen Ding, Jason M. Holmes, Hongying Feng, Baoxin Li, Lisa A. McGee, Jean-Claude M. Rwigema, Sujay A. Vora, Daniel J. Ma, Robert L. Foote, Samir H. Patel, Wei Liu

    Abstract: In radiotherapy, 2D orthogonally projected kV images are used for patient alignment when 3D-on-board imaging(OBI) unavailable. But tumor visibility is constrained due to the projection of patient's anatomy onto a 2D plane, potentially leading to substantial setup errors. In treatment room with 3D-OBI such as cone beam CT(CBCT), the field of view(FOV) of CBCT is limited with unnecessarily high imag… ▽ More

    Submitted 1 April, 2024; originally announced May 2024.

    Comments: 17 pages, 8 figures and tables

  4. arXiv:2404.09169  [pdf, other

    cs.RO

    Increasing SLAM Pose Accuracy by Ground-to-Satellite Image Registration

    Authors: Yanhao Zhang, Yujiao Shi, Shan Wang, Ankit Vora, Akhil Perincherry, Yongbo Chen, Hongdong Li

    Abstract: Vision-based localization for autonomous driving has been of great interest among researchers. When a pre-built 3D map is not available, the techniques of visual simultaneous localization and mapping (SLAM) are typically adopted. Due to error accumulation, visual SLAM (vSLAM) usually suffers from long-term drift. This paper proposes a framework to increase the localization accuracy by fusing the v… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 7 pages, 6 figures, to be published in 2024 International Conference on Robotics and Automation (ICRA)

  5. arXiv:2403.05513  [pdf, other

    cs.RO

    A Detection and Filtering Framework for Collaborative Localization

    Authors: Thirumalaesh Ashokkumar, Katherine A Skinner, Siddarth Agarwal, Ankit Vora, Ashutosh Bhown

    Abstract: Increasingly, autonomous vehicles (AVs) are becoming a reality, such as the Advanced Driver Assistance Systems (ADAS) in vehicles that assist drivers in driving and parking functions with vehicles today. The localization problem for AVs relies primarily on multiple sensors, including cameras, LiDARs, and radars. Manufacturing, installing, calibrating, and maintaining these sensors can be very expe… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  6. arXiv:2312.00975  [pdf

    physics.med-ph cs.LG

    Noisy probing dose facilitated dose prediction for pencil beam scanning proton therapy: physics enhances generalizability

    Authors: Lian Zhang, Jason M. Holmes, Zhengliang Liu, Hongying Feng, Terence T. Sio, Carlos E. Vargas, Sameer R. Keole, Kristin Stützer, Sheng Li, Tianming Liu, Jiajian Shen, William W. Wong, Sujay A. Vora, Wei Liu

    Abstract: Purpose: Prior AI-based dose prediction studies in photon and proton therapy often neglect underlying physics, limiting their generalizability to handle outlier clinical cases, especially for pencil beam scanning proton therapy (PBSPT). Our aim is to design a physics-aware and generalizable AI-based PBSPT dose prediction method that has the underlying physics considered to achieve high generalizab… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  7. arXiv:2310.03874  [pdf, other

    physics.med-ph cs.CL

    Benchmarking a foundation LLM on its ability to re-label structure names in accordance with the AAPM TG-263 report

    Authors: Jason Holmes, Lian Zhang, Yuzhen Ding, Hongying Feng, Zhengliang Liu, Tianming Liu, William W. Wong, Sujay A. Vora, Jonathan B. Ashman, Wei Liu

    Abstract: Purpose: To introduce the concept of using large language models (LLMs) to re-label structure names in accordance with the American Association of Physicists in Medicine (AAPM) Task Group (TG)-263 standard, and to establish a benchmark for future studies to reference. Methods and Materials: The Generative Pre-trained Transformer (GPT)-4 application programming interface (API) was implemented as… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: 20 pages, 5 figures, 1 table

  8. arXiv:2308.08110  [pdf, other

    cs.CV

    View Consistent Purification for Accurate Cross-View Localization

    Authors: Shan Wang, Yanhao Zhang, Akhil Perincherry, Ankit Vora, Hongdong Li

    Abstract: This paper proposes a fine-grained self-localization method for outdoor robotics that utilizes a flexible number of onboard cameras and readily accessible satellite images. The proposed method addresses limitations in existing cross-view localization methods that struggle to handle noise sources such as moving objects and seasonal variations. It is the first sparse visual-only method that enhances… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: Accepted for ICCV 2023

  9. arXiv:2308.01125  [pdf, other

    cs.CV cs.RO

    Stereo Visual Odometry with Deep Learning-Based Point and Line Feature Matching using an Attention Graph Neural Network

    Authors: Shenbagaraj Kannapiran, Nalin Bendapudi, Ming-Yuan Yu, Devarth Parikh, Spring Berman, Ankit Vora, Gaurav Pandey

    Abstract: Robust feature matching forms the backbone for most Visual Simultaneous Localization and Mapping (vSLAM), visual odometry, 3D reconstruction, and Structure from Motion (SfM) algorithms. However, recovering feature matches from texture-poor scenes is a major challenge and still remains an open area of research. In this paper, we present a Stereo Visual Odometry (StereoVO) technique based on point a… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

  10. arXiv:2307.08015  [pdf, other

    cs.CV

    Boosting 3-DoF Ground-to-Satellite Camera Localization Accuracy via Geometry-Guided Cross-View Transformer

    Authors: Yujiao Shi, Fei Wu, Akhil Perincherry, Ankit Vora, Hongdong Li

    Abstract: Image retrieval-based cross-view localization methods often lead to very coarse camera pose estimation, due to the limited sampling density of the database satellite images. In this paper, we propose a method to increase the accuracy of a ground camera's location and orientation by estimating the relative rotation and translation between the ground-level image and its matched/retrieved satellite i… ▽ More

    Submitted 19 July, 2023; v1 submitted 16 July, 2023; originally announced July 2023.

    Comments: Accepted to ICCV 2023

  11. arXiv:2307.02644  [pdf, ps, other

    cs.IT

    Achievable Rates for Information Extraction from a Strategic Sender

    Authors: Anuj S. Vora, Ankur A. Kulkarni

    Abstract: We consider a setting of non-cooperative communication where a receiver wants to recover randomly generated sequences of symbols that are observed by a strategic sender. The sender aims to maximize an average utility that may not align with the recovery criterion of the receiver, whereby the received signals may not be truthful. We pose this problem as a sequential game between the sender and the… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: Submitted to IEEE Transactions on Information Theory

  12. arXiv:2306.17536  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    DisPlacing Objects: Improving Dynamic Vehicle Detection via Visual Place Recognition under Adverse Conditions

    Authors: Stephen Hausler, Sourav Garg, Punarjay Chakravarty, Shubham Shrivastava, Ankit Vora, Michael Milford

    Abstract: Can knowing where you are assist in perceiving objects in your surroundings, especially under adverse weather and lighting conditions? In this work we investigate whether a prior map can be leveraged to aid in the detection of dynamic objects in a scene without the need for a 3D map or pixel-level map-query correspondences. We contribute an algorithm which refines an initial set of candidate objec… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: Accepted to IROS 2023

  13. arXiv:2306.17529  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Locking On: Leveraging Dynamic Vehicle-Imposed Motion Constraints to Improve Visual Localization

    Authors: Stephen Hausler, Sourav Garg, Punarjay Chakravarty, Shubham Shrivastava, Ankit Vora, Michael Milford

    Abstract: Most 6-DoF localization and SLAM systems use static landmarks but ignore dynamic objects because they cannot be usefully incorporated into a typical pipeline. Where dynamic objects have been incorporated, typical approaches have attempted relatively sophisticated identification and localization of these objects, limiting their robustness or general utility. In this research, we propose a middle gr… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: Accepted to IROS 2023

  14. arXiv:2306.04699  [pdf, other

    cs.CV

    DiViNeT: 3D Reconstruction from Disparate Views via Neural Template Regularization

    Authors: Aditya Vora, Akshay Gadi Patil, Hao Zhang

    Abstract: We present a volume rendering-based neural surface reconstruction method that takes as few as three disparate RGB images as input. Our key idea is to regularize the reconstruction, which is severely ill-posed and leaving significant gaps between the sparse views, by learning a set of neural templates to act as surface priors. Our method, coined DiViNet, operates in two stages. It first learns the… ▽ More

    Submitted 1 November, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: To be presented at NeurIPS, 2023

  15. arXiv:2207.13506  [pdf, other

    cs.CV

    Satellite Image Based Cross-view Localization for Autonomous Vehicle

    Authors: Shan Wang, Yanhao Zhang, Ankit Vora, Akhil Perincherry, Hongdong Li

    Abstract: Existing spatial localization techniques for autonomous vehicles mostly use a pre-built 3D-HD map, often constructed using a survey-grade 3D mapping vehicle, which is not only expensive but also laborious. This paper shows that by using an off-the-shelf high-definition satellite image as a ready-to-use map, we are able to achieve cross-view vehicle localization up to a satisfactory accuracy, provi… ▽ More

    Submitted 20 April, 2023; v1 submitted 27 July, 2022; originally announced July 2022.

    Comments: Accepted by ICRA2023

  16. arXiv:2206.13883  [pdf, other

    cs.RO cs.AI cs.CV

    Improving Worst Case Visual Localization Coverage via Place-specific Sub-selection in Multi-camera Systems

    Authors: Stephen Hausler, Ming Xu, Sourav Garg, Punarjay Chakravarty, Shubham Shrivastava, Ankit Vora, Michael Milford

    Abstract: 6-DoF visual localization systems utilize principled approaches rooted in 3D geometry to perform accurate camera pose estimation of images to a map. Current techniques use hierarchical pipelines and learned 2D feature extractors to improve scalability and increase performance. However, despite gains in typical [email protected] type metrics, these systems still have limited utility for real-world appli… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: 8 pages, 5 figures, To be published in RA-L 2022

  17. arXiv:2109.14065  [pdf, other

    cs.RO

    Localization of a Smart Infrastructure Fisheye Camera in a Prior Map for Autonomous Vehicles

    Authors: Subodh Mishra, Armin Parchami, Enrique Corona, Punarjay Chakravarty, Ankit Vora, Devarth Parikh, Gaurav Pandey

    Abstract: This work presents a technique for localization of a smart infrastructure node, consisting of a fisheye camera, in a prior map. These cameras can detect objects that are outside the line of sight of the autonomous vehicles (AV) and send that information to AVs using V2X technology. However, in order for this information to be of any use to the AV, the detected objects should be provided in the ref… ▽ More

    Submitted 28 September, 2021; originally announced September 2021.

    Comments: Submitted to ICRA 2022

  18. arXiv:2109.10457  [pdf, other

    cs.RO

    Infrastructure Node-based Vehicle Localization for Autonomous Driving

    Authors: Elijah S. Lee, Ankit Vora, Armin Parchami, Punarjay Chakravarty, Gaurav Pandey, Vijay Kumar

    Abstract: Vehicle localization is essential for autonomous vehicle (AV) navigation and Advanced Driver Assistance Systems (ADAS). Accurate vehicle localization is often achieved via expensive inertial navigation systems or by employing compute-intensive vision processing (LiDAR/camera) to augment the low-cost and noisy inertial sensors. Here we have developed a framework for fusing the information obtained… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: 7 pages, 8 figures

  19. arXiv:2101.09569  [pdf, other

    cs.CV cs.LG cs.RO

    S-BEV: Semantic Birds-Eye View Representation for Weather and Lighting Invariant 3-DoF Localization

    Authors: Mokshith Voodarla, Shubham Shrivastava, Sagar Manglani, Ankit Vora, Siddharth Agarwal, Punarjay Chakravarty

    Abstract: We describe a light-weight, weather and lighting invariant, Semantic Bird's Eye View (S-BEV) signature for vision-based vehicle re-localization. A topological map of S-BEV signatures is created during the first traversal of the route, which are used for coarse localization in subsequent route traversal. A fine-grained localizer is then trained to output the global 3-DoF pose of the vehicle using i… ▽ More

    Submitted 23 January, 2021; originally announced January 2021.

    Comments: 7 pages, 8 figures

  20. arXiv:2010.15008  [pdf, ps, other

    cs.IR cs.GT math.OC

    Optimal Questionnaires for Screening of Strategic Agents

    Authors: Anuj S. Vora, Ankur A. Kulkarni

    Abstract: During the COVID-$19$ pandemic the health authorities at airports and train stations try to screen and identify the travellers possibly exposed to the virus. However, many individuals avoid getting tested and hence may misreport their travel history. This is a challenge for the health authorities who wish to ascertain the truly susceptible cases in spite of this strategic misreporting. We investig… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: Longer version of our paper submitted to ICASSP 2021

    MSC Class: 91A28; 94D99

  21. Ensembling Low Precision Models for Binary Biomedical Image Segmentation

    Authors: Tianyu Ma, Hang Zhang, Hanley Ong, Amar Vora, Thanh D. Nguyen, Ajay Gupta, Yi Wang, Mert Sabuncu

    Abstract: Segmentation of anatomical regions of interest such as vessels or small lesions in medical images is still a difficult problem that is often tackled with manual input by an expert. One of the major challenges for this task is that the appearance of foreground (positive) regions can be similar to background (negative) regions. As a result, many automatic segmentation algorithms tend to exhibit asym… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: 10 pages, 4 figures

  22. arXiv:2006.10641  [pdf, ps, other

    cs.IT cs.GT eess.SY

    Shannon meets Myerson: Information Extraction from a Strategic Sender

    Authors: Anuj S. Vora, Ankur A. Kulkarni

    Abstract: We study a setting where a receiver must design a questionnaire to recover a sequence of symbols known to strategic sender, whose utility may not be incentive compatible. We allow the receiver the possibility of selecting the alternatives presented in the questionnaire, and thereby linking decisions across the components of the sequence. We show that, despite the strategic sender and the noise in… ▽ More

    Submitted 15 September, 2022; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: Submitted to Games and Economic Behaviour

  23. arXiv:2003.11192  [pdf, other

    cs.RO cs.CV

    Aerial Imagery based LIDAR Localization for Autonomous Vehicles

    Authors: Ankit Vora, Siddharth Agarwal, Gaurav Pandey, James McBride

    Abstract: This paper presents a localization technique using aerial imagery maps and LIDAR based ground reflectivity for autonomous vehicles in urban environments. Traditional localization techniques using LIDAR reflectivity rely on high definition reflectivity maps generated from a mapping vehicle. The cost and effort required to maintain such prior maps are generally very high because it requires a fleet… ▽ More

    Submitted 24 March, 2020; originally announced March 2020.

    Comments: 6 pages, 7 figures, Submitted to International Conference on Intelligent Robots and Systems (IROS-2020), For the video, see https://www.youtube.com/watch?v=vcY74Z9bOLk

  24. arXiv:2003.07969  [pdf, other

    cs.RO cs.CV cs.MA

    Ford Multi-AV Seasonal Dataset

    Authors: Siddharth Agarwal, Ankit Vora, Gaurav Pandey, Wayne Williams, Helen Kourous, James McBride

    Abstract: This paper presents a challenging multi-agent seasonal dataset collected by a fleet of Ford autonomous vehicles at different days and times during 2017-18. The vehicles traversed an average route of 66 km in Michigan that included a mix of driving scenarios such as the Detroit Airport, freeways, city-centers, university campus and suburban neighbourhoods, etc. Each vehicle used in this data collec… ▽ More

    Submitted 17 March, 2020; originally announced March 2020.

    Comments: 7 pages, 7 figures, Submitted to International Journal of Robotics Research (IJRR), Visit website at https://avdata.ford.com

    Journal ref: IJRR, Volume: 39 issue: 12 (2020), page(s): 1367-1376

  25. arXiv:1907.05324  [pdf, ps, other

    cs.IT cs.GT math.OC

    Minimax Theorems for Finite Blocklength Lossy Joint Source-Channel Coding over an AVC

    Authors: Anuj S. Vora, Ankur A. Kulkarni

    Abstract: Motivated by applications in the security of cyber-physical systems, we pose the finite blocklength communication problem in the presence of a jammer as a zero-sum game between the encoder-decoder team and the jammer, by allowing the communicating team as well as the jammer only locally randomized strategies. The communicating team's problem is non-convex under locally randomized codes, and hence,… ▽ More

    Submitted 11 July, 2019; originally announced July 2019.

    Comments: Under review with Problems of Information Transmission

    MSC Class: 94A15; 91A99

  26. arXiv:1906.01061  [pdf, other

    cs.RO eess.SP eess.SY

    Localization Requirements for Autonomous Vehicles

    Authors: Tyler G. R. Reid, Sarah E. Houts, Robert Cammarata, Graham Mills, Siddharth Agarwal, Ankit Vora, Gaurav Pandey

    Abstract: Autonomous vehicles require precise knowledge of their position and orientation in all weather and traffic conditions for path planning, perception, control, and general safe operation. Here we derive these requirements for autonomous vehicles based on first principles. We begin with the safety integrity level, defining the allowable probability of failure per hour of operation based on desired im… ▽ More

    Submitted 3 June, 2019; originally announced June 2019.

    Comments: Under review with the SAE Journal of Connected and Automated Vehicles

    Journal ref: SAE Intl. J CAV 2(3):2019

  27. arXiv:1809.08766  [pdf, other

    cs.CV

    FCHD: Fast and accurate head detection in crowded scenes

    Authors: Aditya Vora, Vinay Chilaka

    Abstract: In this paper, we propose FCHD-Fully Convolutional Head Detector, an end-to-end trainable head detection model. Our proposed architecture is a single fully convolutional network which is responsible for both bounding box prediction and classification. This makes our model lightweight with low inference time and memory requirements. Along with run-time, our model has better overall average precisio… ▽ More

    Submitted 5 May, 2019; v1 submitted 24 September, 2018; originally announced September 2018.

    Comments: 5 pages, 4 figures, accepted for publication at International Conference on Image Processing, 2019

  28. arXiv:1806.00428  [pdf, ps, other

    cs.CV cs.LG stat.ML

    A Classification approach towards Unsupervised Learning of Visual Representations

    Authors: Aditya Vora

    Abstract: In this paper, we present a technique for unsupervised learning of visual representations. Specifically, we train a model for foreground and background classification task, in the process of which it learns visual representations. Foreground and background patches for training come af- ter mining for such patches from hundreds and thousands of unlabelled videos available on the web which we ex- tr… ▽ More

    Submitted 1 June, 2018; originally announced June 2018.

  29. arXiv:1706.09719  [pdf, other

    cs.CV

    Iterative Spectral Clustering for Unsupervised Object Localization

    Authors: Aditya Vora, Shanmuganathan Raman

    Abstract: This paper addresses the problem of unsupervised object localization in an image. Unlike previous supervised and weakly supervised algorithms that require bounding box or image level annotations for training classifiers in order to learn features representing the object, we propose a simple yet effective technique for localization using iterative spectral clustering. This iterative spectral cluste… ▽ More

    Submitted 29 June, 2017; originally announced June 2017.

  30. arXiv:1706.09544  [pdf, other

    cs.CV

    Flow-free Video Object Segmentation

    Authors: Aditya Vora, Shanmuganathan Raman

    Abstract: Segmenting foreground object from a video is a challenging task because of the large deformations of the objects, occlusions, and background clutter. In this paper, we propose a frame-by-frame but computationally efficient approach for video object segmentation by clustering visually similar generic object segments throughout the video. Our algorithm segments various object instances appearing in… ▽ More

    Submitted 28 June, 2017; originally announced June 2017.

  31. arXiv:0805.0087  [pdf, ps, other

    cs.DC cs.CR cs.NI

    Universe Detectors for Sybil Defense in Ad Hoc Wireless Networks

    Authors: Adnan Vora, Mikhail Nesterenko, Sébastien Tixeuil, Sylvie Delaët

    Abstract: The Sybil attack in unknown port networks such as wireless is not considered tractable. A wireless node is not capable of independently differentiating the universe of real nodes from the universe of arbitrary non-existent fictitious nodes created by the attacker. Similar to failure detectors, we propose to use universe detectors to help nodes determine which universe is real. In this paper, we… ▽ More

    Submitted 13 May, 2008; v1 submitted 1 May, 2008; originally announced May 2008.

    Report number: RR-6529

  32. arXiv:0803.3632  [pdf, ps, other

    cs.OS cs.DC cs.DS

    Void Traversal for Guaranteed Delivery in Geometric Routing

    Authors: Mikhail Nesterenko, Adnan Vora

    Abstract: Geometric routing algorithms like GFG (GPSR) are lightweight, scalable algorithms that can be used to route in resource-constrained ad hoc wireless networks. However, such algorithms run on planar graphs only. To efficiently construct a planar graph, they require a unit-disk graph. To make the topology unit-disk, the maximum link length in the network has to be selected conservatively. In practi… ▽ More

    Submitted 25 March, 2008; originally announced March 2008.

    ACM Class: C.2.2; C.2.1; F.2.2

    Journal ref: The 2nd IEEE International Conference on Mobile Ad-hoc and Sensor Systems (MASS 2005), Washington, DC, November, 2005