Zum Hauptinhalt springen

Showing 1–20 of 20 results for author: Joshi, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.05674  [pdf, other

    cs.AI cs.CL cs.PL

    LLM-Based Open-Domain Integrated Task and Knowledge Assistants with Programmable Policies

    Authors: Harshit Joshi, Shicheng Liu, James Chen, Robert Weigle, Monica S. Lam

    Abstract: Programming LLM-based knowledge and task assistants that faithfully conform to developer-provided policies is challenging. These agents must retrieve and provide consistent, accurate, and relevant information to address user's queries and needs. Yet such agents generate unfounded responses ("hallucinate"). Traditional dialogue trees can only handle a limited number of conversation flows, making th… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: preprint

  2. arXiv:2311.06212  [pdf, other

    stat.ML cs.LG stat.AP

    Differentiable VQ-VAE's for Robust White Matter Streamline Encodings

    Authors: Andrew Lizarraga, Brandon Taraku, Edouardo Honig, Ying Nian Wu, Shantanu H. Joshi

    Abstract: Given the complex geometry of white matter streamlines, Autoencoders have been proposed as a dimension-reduction tool to simplify the analysis streamlines in a low-dimensional latent spaces. However, despite these recent successes, the majority of encoder architectures only perform dimension reduction on single streamlines as opposed to a full bundle of streamlines. This is a severe limitation of… ▽ More

    Submitted 18 November, 2023; v1 submitted 10 November, 2023; originally announced November 2023.

    Comments: 5 pages, 4 figures, 1 table

  3. arXiv:2301.13779  [pdf, other

    cs.PL cs.AI cs.SE

    FLAME: A small language model for spreadsheet formulas

    Authors: Harshit Joshi, Abishai Ebenezer, José Cambronero, Sumit Gulwani, Aditya Kanade, Vu Le, Ivan Radiček, Gust Verbruggen

    Abstract: Spreadsheets are a vital tool for end-user data management. Using large language models for formula authoring assistance in these environments can be difficult, as these models are expensive to train and challenging to deploy due to their size (up to billions of parameters). We present FLAME, a transformer-based model trained exclusively on Excel formulas that leverages domain insights to achieve… ▽ More

    Submitted 19 December, 2023; v1 submitted 31 January, 2023; originally announced January 2023.

    Comments: Accepted to AAAI 2024

  4. arXiv:2209.01498  [pdf, other

    q-bio.QM cs.LG eess.IV

    StreamNet: A WAE for White Matter Streamline Analysis

    Authors: Andrew Lizarraga, Katherine L. Narr, Kirsten A. Donald, Shantanu H. Joshi

    Abstract: We present StreamNet, an autoencoder architecture for the analysis of the highly heterogeneous geometry of large collections of white matter streamlines. This proposed framework takes advantage of geometry-preserving properties of the Wasserstein-1 metric in order to achieve direct encoding and reconstruction of entire bundles of streamlines. We show that the model not only accurately captures the… ▽ More

    Submitted 19 October, 2022; v1 submitted 3 September, 2022; originally announced September 2022.

  5. arXiv:2208.11640  [pdf, other

    cs.SE cs.AI cs.PL

    Repair Is Nearly Generation: Multilingual Program Repair with LLMs

    Authors: Harshit Joshi, José Cambronero, Sumit Gulwani, Vu Le, Ivan Radicek, Gust Verbruggen

    Abstract: Most programmers make mistakes when writing code. Some of these mistakes are small and require few edits to the original program -- a class of errors recently termed last mile mistakes. These errors break the flow for experienced developers and can stump novice programmers. Existing automated repair techniques targeting this class of errors are language-specific and do not easily carry over to new… ▽ More

    Submitted 5 December, 2022; v1 submitted 24 August, 2022; originally announced August 2022.

    Comments: 13 pages, Accepted at AAAI 2023

  6. arXiv:2207.11765  [pdf, other

    cs.SE cs.AI

    Neurosymbolic Repair for Low-Code Formula Languages

    Authors: Rohan Bavishi, Harshit Joshi, José Pablo Cambronero Sánchez, Anna Fariha, Sumit Gulwani, Vu Le, Ivan Radicek, Ashish Tiwari

    Abstract: Most users of low-code platforms, such as Excel and PowerApps, write programs in domain-specific formula languages to carry out nontrivial tasks. Often users can write most of the program they want, but introduce small mistakes that yield broken formulas. These mistakes, which can be both syntactic and semantic, are hard for low-code users to identify and fix, even though they can be resolved with… ▽ More

    Submitted 24 July, 2022; originally announced July 2022.

  7. arXiv:2204.00054  [pdf, ps, other

    cs.NI cs.RO

    Distributed Robust Geocast Multicast Routing for Inter-Vehicle Communication

    Authors: Harshvardhan P. Joshi, Mihail L. Sichitiu, Maria Kihl

    Abstract: Numerous protocols for geocast have been proposed in literature. It has been shown that explicit route setup approaches perform poorly with VANETs due to limited route lifetime and frequent network fragmentation. The broadcast based approaches have considerable redundancy and add significantly to the overhead of the protocol. A completely distributed and robust geocast approach is presented in thi… ▽ More

    Submitted 31 March, 2022; originally announced April 2022.

    Comments: 12 pages

    Journal ref: Proceedings of WEIRD workshop on WiMax, Wireless and Mobility, 2007

  8. A Reinforcement Approach for Detecting P2P Botnet Communities in Dynamic Communication Graphs

    Authors: Harshvardhan P. Joshi, Rudra Dutta

    Abstract: Peer-to-peer (P2P) botnets use decentralized command and control networks that make them resilient to disruptions. The P2P botnet overlay networks manifest structures in mutual-contact graphs, also called communication graphs, formed using network traffic information. It has been shown that these structures can be detected using community detection techniques from graph theory. These previous work… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

  9. arXiv:2108.03697  [pdf, other

    cs.CV cs.CG math.DG stat.AP stat.CO

    Alignment of Tractography Streamlines using Deformation Transfer via Parallel Transport

    Authors: Andrew Lizarraga, David Lee, Antoni Kubicki, Ashish Sahib, Elvis Nunez, Katherine Narr, Shantanu H. Joshi

    Abstract: We present a geometric framework for aligning white matter fiber tracts. By registering fiber tracts between brains, one expects to see overlap of anatomical structures that often provide meaningful comparisons across subjects. However, the geometry of white matter tracts is highly heterogeneous, and finding direct tract-correspondence across multiple individuals remains a challenging problem. We… ▽ More

    Submitted 8 August, 2021; originally announced August 2021.

  10. arXiv:2104.13449  [pdf, other

    cs.CV cs.LG math.DG

    SrvfNet: A Generative Network for Unsupervised Multiple Diffeomorphic Shape Alignment

    Authors: Elvis Nunez, Andrew Lizarraga, Shantanu H. Joshi

    Abstract: We present SrvfNet, a generative deep learning framework for the joint multiple alignment of large collections of functional data comprising square-root velocity functions (SRVF) to their templates. Our proposed framework is fully unsupervised and is capable of aligning to a predefined template as well as jointly predicting an optimal template from data while simultaneously achieving alignment. Ou… ▽ More

    Submitted 27 April, 2021; originally announced April 2021.

  11. arXiv:2101.04427  [pdf, other

    quant-ph cs.CR cs.NI

    Quantum Internet- Applications, Functionalities, Enabling Technologies, Challenges, and Research Directions

    Authors: Amoldeep Singh, Kapal Dev, Harun Siljak, Hem Dutt Joshi, Maurizio Magarini

    Abstract: The advanced notebooks, mobile phones, and internet applications in today's world that we use are all entrenched in classical communication bits of zeros and ones. Classical internet has laid its foundation originating from the amalgamation of mathematics and Claude Shannon's theory of information. But today's internet technology is a playground for eavesdroppers. This poses a serious challenge to… ▽ More

    Submitted 1 June, 2021; v1 submitted 12 January, 2021; originally announced January 2021.

    Comments: This survey paper is submitted in IEEE Communications Surveys and Tutorials and revised on 27th May 2021. It includes 31 pages, 14 figures, and 5 tables

  12. arXiv:2011.15103  [pdf, other

    cs.CV

    Automating Artifact Detection in Video Games

    Authors: Parmida Davarmanesh, Kuanhao Jiang, Tingting Ou, Artem Vysogorets, Stanislav Ivashkevich, Max Kiehn, Shantanu H. Joshi, Nicholas Malaya

    Abstract: In spite of advances in gaming hardware and software, gameplay is often tainted with graphics errors, glitches, and screen artifacts. This proof of concept study presents a machine learning approach for automated detection of graphics corruptions in video games. Based on a sample of representative screen corruption examples, the model was able to identify 10 of the most commonly occurring screen a… ▽ More

    Submitted 30 November, 2020; originally announced November 2020.

  13. arXiv:2005.08011  [pdf, ps, other

    eess.SP cs.IT

    Decision Fusion in Space-Time Spreading aided Distributed MIMO WSNs

    Authors: I. Dey, H. Joshi, N. Marchetti

    Abstract: In this letter, we propose space-time spreading (STS) of local sensor decisions before reporting them over a wireless multiple access channel (MAC), in order to achieve flexible balance between diversity and multiplexing gain as well as eliminate any chance of intrinsic interference inherent in MAC scenarios. Spreading of the sensor decisions using dispersion vectors exploits the benefits of multi… ▽ More

    Submitted 16 May, 2020; originally announced May 2020.

    Comments: 5 pages, 5 figures

  14. arXiv:2002.01792  [pdf

    cs.IR cs.DL

    Experiments with Different Indexing Techniques for Text Retrieval tasks on Gujarati Language using Bag of Words Approach

    Authors: Dr. Jyoti Pareek, Hardik Joshi, Krunal Chauhan, Rushikesh Patel

    Abstract: This paper presents results of various experiments carried out to improve text retrieval of gujarati text documents. Text retrieval involves searching and ranking of text documents for a given set of query terms. We have tested various retrieval models that uses bag-of-words approach. Bag-of-words approach is a traditional approach that is being used till date where the text document is represente… ▽ More

    Submitted 5 February, 2020; originally announced February 2020.

  15. arXiv:2001.08085  [pdf

    cs.IR

    Experiments on Manual Thesaurus based Query Expansion for Ad-hoc Monolingual Gujarati Information Retrieval Tasks

    Authors: Hardik Joshi, Jyoti Pareek

    Abstract: In this paper, we present the experimental work done on Query Expansion (QE) for retrieval tasks of Gujarati text documents. In information retrieval, it is very difficult to estimate the exact user need, query expansion adds terms to the original query, which provides more information about the user need. There are various approaches to query expansion. In our work, manual thesaurus based query e… ▽ More

    Submitted 18 January, 2020; originally announced January 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1209.0126

  16. arXiv:1912.05255  [pdf, other

    eess.SP cs.LG

    Novel Deep Learning Framework for Wideband Spectrum Characterization at Sub-Nyquist Rate

    Authors: Shivam Chandhok, Himani Joshi, A V Subramanyam, Sumit J. Darak

    Abstract: Introduction of spectrum-sharing in 5G and subsequent generation networks demand base-station(s) with the capability to characterize the wideband spectrum spanned over licensed, shared and unlicensed non-contiguous frequency bands. Spectrum characterization involves the identification of vacant bands along with center frequency and parameters (energy, modulation, etc.) of occupied bands. Such char… ▽ More

    Submitted 7 May, 2020; v1 submitted 11 December, 2019; originally announced December 2019.

  17. arXiv:1512.05726  [pdf, other

    cs.CL cs.NE

    Semi-supervised Question Retrieval with Gated Convolutions

    Authors: Tao Lei, Hrishikesh Joshi, Regina Barzilay, Tommi Jaakkola, Katerina Tymoshenko, Alessandro Moschitti, Lluis Marquez

    Abstract: Question answering forums are rapidly growing in size with no effective automated ability to refer to and reuse answers already available for previous posted questions. In this paper, we develop a methodology for finding semantically related questions. The task is difficult since 1) key pieces of information are often buried in extraneous details in the question body and 2) available annotations o… ▽ More

    Submitted 3 April, 2016; v1 submitted 17 December, 2015; originally announced December 2015.

    Comments: NAACL 2016

  18. arXiv:1406.6840  [pdf

    cs.IR cs.DL

    From Citation count to Argumentation count: a new metric to indicate the usefulness of an article

    Authors: Hardik Joshi

    Abstract: Citation count is a quantifiable measure to indicate the number of times an article is cited by other articles. It is believed that if an article is cited often then it must be an important or influential article; however, there is no guarantee that the most cited articles are good in quality. In this paper, the author suggests argumentation count, a new metric for citation analysis. The proposed… ▽ More

    Submitted 26 June, 2014; originally announced June 2014.

    Comments: Technical Conference cum Workshop on Digital Library Using DSpace hosted by Gujarat National Law University on 21-23 March, 2013

  19. arXiv:1301.4337  [pdf

    cs.MM cs.CR

    A Novel Digital Watermarking Algorithm using Random Matrix Image

    Authors: Mahimn Pandya, Hiren Joshi, Ashish Jani

    Abstract: The availability of bandwidth for internet access is sufficient enough to communicate digital assets. These digital assets are subjected to various types of threats. [19] As a result of this, protection mechanism required for the protection of digital assets is of priority in research. The threat of current focus is unauthorized copying of digital assets which give boost to piracy. This under the… ▽ More

    Submitted 22 January, 2013; v1 submitted 18 January, 2013; originally announced January 2013.

    Comments: 4 pages, 8 figures

    Journal ref: International Journal of Computer Applications, Volume 61, Number 2, pp. 18-12, 2013

  20. arXiv:1209.0126  [pdf

    cs.IR

    Evaluation of some Information Retrieval models for Gujarati Ad hoc Monolingual Tasks

    Authors: Hardik J. Joshi, Pareek Jyoti

    Abstract: This paper describes the work towards Gujarati Ad hoc Monolingual Retrieval task for widely used Information Retrieval (IR) models. We present an indexing baseline for the Gujarati Language represented by Mean Average Precision (MAP) values. Our objective is to obtain a relative picture of a better IR model for Gujarati Language. Results show that Classical IR models like Term Frequency Inverse Do… ▽ More

    Submitted 1 September, 2012; originally announced September 2012.

    Comments: 6 pages, Some text in Gujarati Language

    Journal ref: VNSGU Journal of Science and Technology,3,2,176-181,2012