
Showing 1–19 of 19 results for author: Nath, A

Searching in archive cs.
  1. arXiv:2408.14678  [pdf, other]

    cs.IR cs.AI cs.LG

    Bridging the Gap: Unpacking the Hidden Challenges in Knowledge Distillation for Online Ranking Systems

    Authors: Nikhil Khani, Shuo Yang, Aniruddh Nath, Yang Liu, Pendo Abbo, Li Wei, Shawn Andrews, Maciej Kula, Jarrod Kahn, Zhe Zhao, Lichan Hong, Ed Chi

    Abstract: Knowledge Distillation (KD) is a powerful approach for compressing a large model into a smaller, more efficient model, particularly beneficial for latency-sensitive applications like recommender systems. However, current KD research predominantly focuses on Computer Vision (CV) and NLP tasks, overlooking unique data characteristics and challenges inherent to recommender systems. This paper address…

    Submitted 26 August, 2024; originally announced August 2024.

  2. arXiv:2406.11897  [pdf, other]

    cs.AI cs.LG math.OC

    A Benchmark for Maximum Cut: Towards Standardization of the Evaluation of Learned Heuristics for Combinatorial Optimization

    Authors: Ankur Nath, Alan Kuhnle

    Abstract: Recently, there has been much work on the design of general heuristics for graph-based, combinatorial optimization problems via the incorporation of Graph Neural Networks (GNNs) to learn distribution-specific solution structures. However, there is a lack of consistency in the evaluation of these heuristics, in terms of the baselines and instances chosen, which makes it difficult to assess the relat…

    Submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2405.05202  [pdf, other]

    cs.DS cs.DM cs.LG

    Discretely Beyond $1/e$: Guided Combinatorial Algorithms for Submodular Maximization

    Authors: Yixin Chen, Ankur Nath, Chunli Peng, Alan Kuhnle

    Abstract: For constrained, not necessarily monotone submodular maximization, all known approximation algorithms with ratio greater than $1/e$ require continuous ideas, such as queries to the multilinear extension of a submodular function and its gradient, which are typically expensive to simulate with the original set function. For combinatorial algorithms, the best known approximation ratios for both size…

    Submitted 22 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

  4. arXiv:2404.08949  [pdf, other]

    cs.CL

    Multimodal Cross-Document Event Coreference Resolution Using Linear Semantic Transfer and Mixed-Modality Ensembles

    Authors: Abhijnan Nath, Huma Jamil, Shafiuddin Rehan Ahmed, George Baker, Rahul Ghosh, James H. Martin, Nathaniel Blanchard, Nikhil Krishnaswamy

    Abstract: Event coreference resolution (ECR) is the task of determining whether distinct mentions of events within a multi-document corpus are actually linked to the same underlying occurrence. Images of the events can help facilitate resolution when language is ambiguous. Here, we propose a multimodal cross-document event coreference resolution method that integrates visual and textual cues with a simple l…

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: To appear at LREC-COLING 2024

  5. arXiv:2404.04299  [pdf, other]

    q-bio.QM cs.AI

    GENEVIC: GENetic data Exploration and Visualization via Intelligent interactive Console

    Authors: Anindita Nath, Savannah Mwesigwa, Yulin Dai, Xiaoqian Jiang, Zhongming Zhao

    Abstract: Summary: The vast generation of genetic data poses a significant challenge in efficiently uncovering valuable knowledge. Introducing GENEVIC, an AI-driven chat framework that tackles this challenge by bridging the gap between genetic data generation and biomedical knowledge discovery. Leveraging generative AI, notably ChatGPT, it serves as a biologist's 'copilot'. It automates the analysis, retrie…

    Submitted 4 April, 2024; originally announced April 2024.

  6. arXiv:2404.03196  [pdf, other]

    cs.CL

    Okay, Let's Do This! Modeling Event Coreference with Generated Rationales and Knowledge Distillation

    Authors: Abhijnan Nath, Shadi Manafi, Avyakta Chelle, Nikhil Krishnaswamy

    Abstract: In NLP, Event Coreference Resolution (ECR) is the task of connecting event clusters that refer to the same underlying real-life event, usually via neural systems. In this work, we investigate using abductive free-text rationales (FTRs) generated by modern autoregressive LLMs as distant supervision of smaller student models for cross-document coreference (CDCR) of events. We implement novel rationa…

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: To be published in NAACL 2024 Main

  7. arXiv:2310.19990  [pdf, other]

    cs.AI cs.LG

    Unveiling the Limits of Learned Local Search Heuristics: Are You the Mightiest of the Meek?

    Authors: Ankur Nath, Alan Kuhnle

    Abstract: In recent years, combining neural networks with local search heuristics has become popular in the field of combinatorial optimization. Despite its considerable computational demands, this approach has exhibited promising outcomes with minimal manual engineering. However, we have identified three critical limitations in the empirical evaluation of these integration attempts. Firstly, instances with…

    Submitted 30 October, 2023; originally announced October 2023.

  8. arXiv:2306.05434  [pdf, other]

    cs.CL

    How Good is the Model in Model-in-the-loop Event Coreference Resolution Annotation?

    Authors: Shafiuddin Rehan Ahmed, Abhijnan Nath, Michael Regan, Adam Pollins, Nikhil Krishnaswamy, James H. Martin

    Abstract: Annotating cross-document event coreference links is a time-consuming and cognitively demanding task that can compromise annotation quality and efficiency. To address this, we propose a model-in-the-loop annotation approach for event coreference resolution, where a machine learning model suggests likely coreferring event pairs only. We evaluate the effectiveness of this approach by first simulating…

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: The 17th Linguistic Annotation Workshop, 2023 (LAW-XVII) short paper. 10 pages, 6 figures, 1 table

  9. arXiv:2305.13641  [pdf, other]

    cs.CL

    AxomiyaBERTa: A Phonologically-aware Transformer Model for Assamese

    Authors: Abhijnan Nath, Sheikh Mannan, Nikhil Krishnaswamy

    Abstract: Despite their successes in NLP, Transformer-based language models still require extensive computing resources and suffer in low-resource or low-compute settings. In this paper, we present AxomiyaBERTa, a novel BERT model for Assamese, a morphologically-rich low-resource language (LRL) of Eastern India. AxomiyaBERTa is trained only on the masked language modeling (MLM) task, without the typical add…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 16 pages, 6 figures, 8 tables, appearing in Findings of the ACL: ACL 2023. This version compiled using pdfLaTeX-compatible Assamese script font. Assamese text may appear differently here than in official ACL 2023 proceedings

  10. arXiv:2305.05672  [pdf, other]

    cs.CL

    $2 * n$ is better than $n^2$: Decomposing Event Coreference Resolution into Two Tractable Problems

    Authors: Shafiuddin Rehan Ahmed, Abhijnan Nath, James H. Martin, Nikhil Krishnaswamy

    Abstract: Event Coreference Resolution (ECR) is the task of linking mentions of the same event either within or across documents. Most mention pairs are not coreferent, yet many that are coreferent can be identified through simple techniques such as lemma matching of the event triggers or the sentences in which they appear. Existing methods for training coreference systems sample from a largely skewed distr…

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: Findings of the Association of Computational Linguistics, ACL 2023. 13 pages, 7 figures, 6 tables

  11. arXiv:2107.06426  [pdf, other]

    cs.CL cs.AI

    TSCAN : Dialog Structure discovery using SCAN

    Authors: Apurba Nath, Aayush Kubba

    Abstract: Can we discover dialog structure by dividing utterances into labelled clusters? Can these labels be generated from the data? Typically for dialogs we need an ontology and use that to discover structure; however, by using unsupervised classification and self-labelling we are able to intuit this structure without any labels or ontology. In this paper we apply SCAN (Semantic Clustering using Nearest N…

    Submitted 18 July, 2021; v1 submitted 13 July, 2021; originally announced July 2021.

  12. arXiv:2107.04681  [pdf]

    cs.HC

    A Survey on Personal Image Retrieval Systems

    Authors: Amit Kumar Nath, Andy Wang

    Abstract: The number of photographs taken worldwide is growing rapidly and steadily. While a small subset of these images is annotated and shared by users through social media platforms, due to the sheer number of images in personal photo repositories (shared or not shared), finding specific images remains challenging. This survey explores existing image retrieval techniques as well as photo-organizer appli…

    Submitted 9 July, 2021; originally announced July 2021.

  13. arXiv:2104.12141  [pdf, ps, other]

    cs.CG

    Coresets for $k$-median clustering under Fréchet and Hausdorff distances

    Authors: Abhinandan Nath

    Abstract: We give algorithms for computing coresets for $(1+\varepsilon)$-approximate $k$-median clustering of polygonal curves (under the discrete and continuous Fréchet distance) and point sets (under the Hausdorff distance), when the cluster centers are restricted to be of low complexity. Ours is the first such result, where the size of the coreset is independent of the number of input curves/point sets…

    Submitted 25 April, 2021; originally announced April 2021.

  14. arXiv:2012.11532  [pdf, other]

    cs.LG eess.SP

    Dual-CyCon Net: A Cycle Consistent Dual-Domain Convolutional Neural Network Framework for Detection of Partial Discharge

    Authors: Mohammad Zunaed, Ankur Nath, Md. Saifur Rahman

    Abstract: In the last decade, researchers have been investigating the severity of insulation breakdown caused by partial discharge (PD) in overhead transmission lines with covered conductors or electrical equipment such as generators and motors used in various industries. Developing an effective partial discharge detection system can lead to significant savings on maintenance and prevent power disruptions.…

    Submitted 19 October, 2021; v1 submitted 21 December, 2020; originally announced December 2020.

  15. arXiv:2004.00722  [pdf, ps, other]

    cs.CG

    k-Median clustering under discrete Fréchet and Hausdorff distances

    Authors: Abhinandan Nath, Erin Taylor

    Abstract: We give the first near-linear time $(1+\eps)$-approximation algorithm for $k$-median clustering of polygonal trajectories under the discrete Fréchet distance, and the first polynomial time $(1+\eps)$-approximation algorithm for $k$-median clustering of finite point sets under the Hausdorff distance, provided the cluster centers, ambient dimension, and $k$ are bounded by a constant. The main techni…

    Submitted 1 April, 2020; originally announced April 2020.

    Comments: A shorter version to appear in SoCG 2020

  16. arXiv:1901.07715  [pdf, ps, other]

    cs.DC

    Enhancing MapReduce Fault Recovery Through Binocular Speculation

    Authors: Huansong Fu, Yue Zhu, Amit Kumar Nath, Md. Muhib Khan, Weikuan Yu

    Abstract: MapReduce speculation plays an important role in finding potential task stragglers and failures. But a tacit dichotomy exists in MapReduce due to its inherent two-phase (map and reduce) management scheme in which map tasks and reduce tasks have distinctly different execution behaviors, yet reduce tasks are dependent on the results of map tasks. We reveal that speculation policies for fault handlin…

    Submitted 22 January, 2019; originally announced January 2019.

    Comments: 10 pages, 9 figures

  17. arXiv:1808.05827  [pdf]

    cs.CR

    Confidential Encrypted Data Hiding and Retrieval Using QR Authentication System

    Authors: Somdip Dey, Asoke Nath, Shalabh Agarwal

    Abstract: Now, security and authenticity of data is a big challenge. To solve this problem, we propose an innovative method to authenticate digital documents. In this paper, we propose a new method, where the marks obtained by a candidate will also be encoded in QR Code™ in encrypted form, so that if an intruder tries to change the marks in the mark sheet then he cannot do that in the QR Code™, becau…

    Submitted 17 August, 2018; originally announced August 2018.

    Journal ref: 2013 International Conference on Communication Systems and Network Technologies

  18. arXiv:1509.05751  [pdf, ps, other]

    cs.CG

    Computing the Gromov-Hausdorff Distance for Metric Trees

    Authors: Pankaj K. Agarwal, Kyle Fox, Abhinandan Nath, Anastasios Sidiropoulos, Yusu Wang

    Abstract: The Gromov-Hausdorff (GH) distance is a natural way to measure distance between two metric spaces. We prove that it is $\mathrm{NP}$-hard to approximate the Gromov-Hausdorff distance better than a factor of $3$ for geodesic metrics on a pair of trees. We complement this result by providing a polynomial time $O(\min\{n, \sqrt{rn}\})$-approximation algorithm for computing the GH distance between a p…

    Submitted 13 June, 2017; v1 submitted 18 September, 2015; originally announced September 2015.

    Comments: Appeared in Proceedings of the 26th International Symposium on Algorithms and Computation

  19. arXiv:1507.01698  [pdf, other]

    cs.SE cs.LG

    Learning Tractable Probabilistic Models for Fault Localization

    Authors: Aniruddh Nath, Pedro Domingos

    Abstract: In recent years, several probabilistic techniques have been applied to various debugging problems. However, most existing probabilistic debugging systems use relatively simple statistical models, and fail to generalize across multiple programs. In this work, we propose Tractable Fault Localization Models (TFLMs) that can be learned from data, and probabilistically infer the location of the bug. Wh…

    Submitted 7 July, 2015; originally announced July 2015.

    Comments: Fifth International Workshop on Statistical Relational AI (StaR-AI 2015)