Showing 1–37 of 37 results for author: Jain, H

Searching in archive cs.
  1. arXiv:2406.17968  [pdf, other]

    cs.IR cs.AI cs.LG stat.ML

    Efficient Document Ranking with Learnable Late Interactions

    Authors: Ziwei Ji, Himanshu Jain, Andreas Veit, Sashank J. Reddi, Sadeep Jayasumana, Ankit Singh Rawat, Aditya Krishna Menon, Felix Yu, Sanjiv Kumar

    Abstract: Cross-Encoder (CE) and Dual-Encoder (DE) models are two fundamental approaches for query-document relevance in information retrieval. To predict relevance, CE models use joint query-document embeddings, while DE models maintain factorized query and document embeddings; usually, the former has higher quality while the latter benefits from lower latency. Recently, late-interaction models have been p…

    Submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2404.12063  [pdf, other]

    cs.LG cs.CE cs.NE math.NA

    FastVPINNs: Tensor-Driven Acceleration of VPINNs for Complex Geometries

    Authors: Thivin Anandh, Divij Ghose, Himanshu Jain, Sashikumaar Ganesan

    Abstract: Variational Physics-Informed Neural Networks (VPINNs) utilize a variational loss function to solve partial differential equations, mirroring Finite Element Analysis techniques. Traditional hp-VPINNs, while effective for high-frequency problems, are computationally intensive and scale poorly with increasing element counts, limiting their use in complex geometries. This work introduces FastVPINNs, a…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 31 pages, 19 figures, 4 algorithms

  3. arXiv:2402.14889  [pdf]

    cs.CL cs.AI

    COBIAS: Contextual Reliability in Bias Assessment

    Authors: Priyanshul Govil, Hemang Jain, Vamshi Krishna Bonagiri, Aman Chadha, Ponnurangam Kumaraguru, Manas Gaur, Sanorita Dey

    Abstract: Large Language Models (LLMs) are trained on extensive web corpora, which enable them to understand and generate human-like text. However, this training process also results in inherent biases within the models. These biases arise from web data's diverse and often uncurated nature, containing various stereotypes and prejudices. Previous works on debiasing models rely on benchmark datasets to measur…

    Submitted 17 June, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  4. arXiv:2310.15141  [pdf, other]

    cs.LG cs.CL cs.DS cs.IT

    SpecTr: Fast Speculative Decoding via Optimal Transport

    Authors: Ziteng Sun, Ananda Theertha Suresh, Jae Hun Ro, Ahmad Beirami, Himanshu Jain, Felix Yu

    Abstract: Autoregressive sampling from large language models has led to state-of-the-art results in several natural language tasks. However, autoregressive sampling generates tokens one at a time making it slow, and even prohibitive in certain tasks. One way to speed up sampling is $\textit{speculative decoding}$: use a small model to sample a $\textit{draft}$ (block or sequence of tokens), and then score a…

    Submitted 17 January, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  5. arXiv:2302.04149  [pdf, other]

    cs.CV

    Domain Adaptation of Synthetic Driving Datasets for Real-World Autonomous Driving

    Authors: Koustav Mullick, Harshil Jain, Sanchit Gupta, Amit Arvind Kale

    Abstract: While developing perception based deep learning models, the benefit of synthetic data is enormous. However, the performance of networks trained with synthetic data for certain computer vision tasks degrades significantly when tested on real world data due to the domain gap between them. One of the popular solutions in bridging this gap between synthetic and actual world data is to frame it as a domain…

    Submitted 8 February, 2023; originally announced February 2023.

  6. arXiv:2211.04367  [pdf, other]

    cs.LG cs.CV

    Much Easier Said Than Done: Falsifying the Causal Relevance of Linear Decoding Methods

    Authors: Lucas Hayne, Abhijit Suresh, Hunar Jain, Rahul Kumar, R. McKell Carter

    Abstract: Linear classifier probes are frequently utilized to better understand how neural networks function. Researchers have approached the problem of determining unit importance in neural networks by probing their learned, internal representations. Linear classifier probes identify highly selective units as the most important for network function. Whether or not a network actually relies on high selectiv…

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: 6 pages, 3 figures, to be published in I Can't Believe It's Not Better Workshop at NeurIPS 2022

  7. arXiv:2208.09015  [pdf, other]

    cs.CL cs.LG

    Treeformer: Dense Gradient Trees for Efficient Attention Computation

    Authors: Lovish Madaan, Srinadh Bhojanapalli, Himanshu Jain, Prateek Jain

    Abstract: Standard inference and training with transformer based architectures scale quadratically with input sequence length. This is prohibitively large for a variety of applications especially in web-page translation, query-answering etc. Consequently, several approaches have been developed recently to speedup attention computation by enforcing different attention structures such as sparsity, low-rank, a…

    Submitted 17 March, 2023; v1 submitted 18 August, 2022; originally announced August 2022.

    Comments: ICLR 2023

  8. arXiv:2208.06825  [pdf, other]

    cs.LG

    Teacher Guided Training: An Efficient Framework for Knowledge Transfer

    Authors: Manzil Zaheer, Ankit Singh Rawat, Seungyeon Kim, Chong You, Himanshu Jain, Andreas Veit, Rob Fergus, Sanjiv Kumar

    Abstract: The remarkable performance gains realized by large pretrained models, e.g., GPT-3, hinge on the massive amounts of data they are exposed to during training. Analogously, distilling such large models to compact models for efficient deployment also necessitates a large amount of (labeled or unlabeled) training data. In this paper, we propose the teacher-guided training (TGT) framework for training a…

    Submitted 14 August, 2022; originally announced August 2022.

  9. arXiv:2201.07612  [pdf, other]

    cs.LG

    ReGNL: Rapid Prediction of GDP during Disruptive Events using Nightlights

    Authors: Rushabh Musthyala, Rudrajit Kargupta, Hritish Jain, Dipanjan Chakraborty

    Abstract: Policy makers often make decisions based on parameters such as GDP, unemployment rate, industrial output, etc. The primary methods to obtain or even estimate such information are resource intensive and time consuming. In order to make timely and well-informed decisions, it is imperative to be able to come up with proxies for these parameters which can be sampled quickly and efficiently, especially…

    Submitted 19 January, 2022; originally announced January 2022.

  10. arXiv:2112.03252  [pdf, other]

    cs.CV

    CSG0: Continual Urban Scene Generation with Zero Forgetting

    Authors: Himalaya Jain, Tuan-Hung Vu, Patrick Pérez, Matthieu Cord

    Abstract: With the rapid advances in generative adversarial networks (GANs), the visual quality of synthesised scenes keeps improving, including for complex urban scenes with applications to automated driving. We address in this work a continual scene generation setup in which GANs are trained on a stream of distinct domains; ideally, the learned models should eventually be able to generate new scenes in al…

    Submitted 2 May, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: Published at the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022 Workshop on Continual Learning

  11. DeepXML: A Deep Extreme Multi-Label Learning Framework Applied to Short Text Documents

    Authors: Kunal Dahiya, Deepak Saini, Anshul Mittal, Ankush Shaw, Kushal Dave, Akshay Soni, Himanshu Jain, Sumeet Agarwal, Manik Varma

    Abstract: Scalability and accuracy are well recognized challenges in deep extreme multi-label learning where the objective is to train architectures for automatically annotating a data point with the most relevant subset of labels from an extremely large label set. This paper develops the DeepXML framework that addresses these challenges by decomposing the deep extreme multi-label task into four simpler sub…

    Submitted 12 November, 2021; originally announced November 2021.

    ACM Class: F.2.2; I.2.7

    Journal ref: Web Search and Data Mining 2021

  12. arXiv:2110.06821  [pdf, other]

    cs.LG cs.CL cs.CV

    Leveraging redundancy in attention with Reuse Transformers

    Authors: Srinadh Bhojanapalli, Ayan Chakrabarti, Andreas Veit, Michal Lukasik, Himanshu Jain, Frederick Liu, Yin-Wen Chang, Sanjiv Kumar

    Abstract: Pairwise dot product-based attention allows Transformers to exchange information between tokens in an input-dependent way, and is key to their success across diverse applications in language and vision. However, a typical Transformer model computes such pairwise attention scores repeatedly for the same sequence, in multiple heads in multiple layers. We systematically analyze the empirical similari…

    Submitted 13 October, 2021; originally announced October 2021.

  13. arXiv:2106.08823  [pdf, other]

    cs.LG

    Eigen Analysis of Self-Attention and its Reconstruction from Partial Computation

    Authors: Srinadh Bhojanapalli, Ayan Chakrabarti, Himanshu Jain, Sanjiv Kumar, Michal Lukasik, Andreas Veit

    Abstract: State-of-the-art transformer models use pairwise dot-product based self-attention, which comes at a computational cost quadratic in the input sequence length. In this paper, we investigate the global structure of attention scores computed using this dot product mechanism on a typical distribution of inputs, and study the principal components of their variation. Through eigen analysis of full atten…

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: 14 pages

  14. arXiv:2106.01629  [pdf, other]

    cs.CV

    Semantic Palette: Guiding Scene Generation with Class Proportions

    Authors: Guillaume Le Moing, Tuan-Hung Vu, Himalaya Jain, Patrick Pérez, Matthieu Cord

    Abstract: Despite the recent progress of generative adversarial networks (GANs) at synthesizing photo-realistic images, producing complex urban scenes remains a challenging problem. Previous works break down scene generation into two consecutive phases: unconditional semantic layout synthesis and image synthesis conditioned on layouts. In this work, we propose to condition layout generation as well for high…

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: Accepted to IEEE CVPR 2021

  15. arXiv:2011.03705  [pdf, other]

    cs.CV

    Blind Motion Deblurring through SinGAN Architecture

    Authors: Harshil Jain, Rohit Patil, Indra Deep Mastan, Shanmuganathan Raman

    Abstract: Blind motion deblurring involves reconstructing a sharp image from an observation that is blurry. It is a problem that is ill-posed and lies in the categories of image restoration problems. The training data-based methods for image deblurring mostly involve training models that take a lot of time. These models are data-hungry i.e., they require a lot of training data to generate satisfactory resul…

    Submitted 7 November, 2020; originally announced November 2020.

    Comments: Deep Internal Learning: Training with no prior examples. ECCV'2020 Workshop

  16. A Sui Generis QA Approach using RoBERTa for Adverse Drug Event Identification

    Authors: Harshit Jain, Nishant Raj, Suyash Mishra

    Abstract: Extraction of adverse drug events from biomedical literature and other textual data is an important component to monitor drug-safety and this has attracted attention of many researchers in healthcare. Existing works are more pivoted around entity-relation extraction using bidirectional long short term memory networks (Bi-LSTM) which does not attain the best feature representations. In this paper,…

    Submitted 30 October, 2020; originally announced November 2020.

    Journal ref: BMC Bioinformatics 22, 330 (2021)

  17. arXiv:2010.07447  [pdf, ps, other]

    cs.CL cs.LG

    Semantic Label Smoothing for Sequence to Sequence Problems

    Authors: Michal Lukasik, Himanshu Jain, Aditya Krishna Menon, Seungyeon Kim, Srinadh Bhojanapalli, Felix Yu, Sanjiv Kumar

    Abstract: Label smoothing has been shown to be an effective regularization strategy in classification, that prevents overfitting and helps in label de-noising. However, extending such methods directly to seq2seq settings, such as Machine Translation, is challenging: the large target output space of such problems makes it intractable to apply label smoothing over all possible outputs. Most existing approache…

    Submitted 14 October, 2020; originally announced October 2020.

  18. arXiv:2010.05223  [pdf, other]

    cs.LG cs.CL

    End to End Binarized Neural Networks for Text Classification

    Authors: Harshil Jain, Akshat Agarwal, Kumar Shridhar, Denis Kleyko

    Abstract: Deep neural networks have demonstrated their superior performance in almost every Natural Language Processing task, however, their increasing complexity raises concerns. In particular, these networks require high expenses on computational hardware, and training budget is a concern for many. Even for a trained network, the inference phase can be too demanding for resource-constrained devices, thus…

    Submitted 11 October, 2020; originally announced October 2020.

    Comments: 14 pages. Accepted at the SustaiNLP Workshop on Simple and Efficient Natural Language Processing at EMNLP 2020

  19. arXiv:2007.07314  [pdf, other]

    cs.LG stat.ML

    Long-tail learning via logit adjustment

    Authors: Aditya Krishna Menon, Sadeep Jayasumana, Ankit Singh Rawat, Himanshu Jain, Andreas Veit, Sanjiv Kumar

    Abstract: Real-world classification problems typically exhibit an imbalanced or long-tailed label distribution, wherein many labels are associated with only a few samples. This poses a challenge for generalisation on such labels, and also makes naïve learning biased towards dominant labels. In this paper, we present two simple modifications of standard softmax cross-entropy training to cope with these chall…

    Submitted 9 July, 2021; v1 submitted 14 July, 2020; originally announced July 2020.

    Comments: Published as a conference paper in ICLR 2021

  20. arXiv:2007.06555  [pdf, other]

    cs.LG cs.DS stat.ML

    Adversarial robustness via robust low rank representations

    Authors: Pranjal Awasthi, Himanshu Jain, Ankit Singh Rawat, Aravindan Vijayaraghavan

    Abstract: Adversarial robustness measures the susceptibility of a classifier to imperceptible perturbations made to the inputs at test time. In this work we highlight the benefits of natural low rank representations that often exist for real data such as images, for training neural networks with certified robustness guarantees. Our first contribution is for certified robustness to perturbations measured i…

    Submitted 1 August, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: fixed a bug in the proof of Proposition B.2

  21. arXiv:2002.12096  [pdf, other]

    cs.CV

    Action Quality Assessment using Siamese Network-Based Deep Metric Learning

    Authors: Hiteshi Jain, Gaurav Harit, Avinash Sharma

    Abstract: Automated vision-based score estimation models can be used as an alternate opinion to avoid judgment bias. In the past works the score estimation models were learned by regressing the video representations to the ground truth score provided by the judges. However such regression-based solutions lack interpretability in terms of giving reasons for the awarded score. One solution to make the scores…

    Submitted 27 February, 2020; originally announced February 2020.

    Comments: 12 pages, 5 Figures, 8 tables

  22. arXiv:2001.09599  [pdf, other]

    cs.AR

    Achieving Multi-Port Memory Performance on Single-Port Memory with Coding Techniques

    Authors: Hardik Jain, Matthew Edwards, Ethan Elenberg, Ankit Singh Rawat, Sriram Vishwanath

    Abstract: Many performance critical systems today must rely on performance enhancements, such as multi-port memories, to keep up with the increasing demand of memory-access capacity. However, the large area footprints and complexity of existing multi-port memory designs limit their applicability. This paper explores a coding theoretic framework to address this problem. In particular, this paper introduces a…

    Submitted 27 January, 2020; originally announced January 2020.

    Comments: 10 pages, 20 figures, ICICT 2020 conference

  23. GraphGen: A Scalable Approach to Domain-agnostic Labeled Graph Generation

    Authors: Nikhil Goyal, Harsh Vardhan Jain, Sayan Ranu

    Abstract: Graph generative models have been extensively studied in the data mining literature. While traditional techniques are based on generating structures that adhere to a pre-decided distribution, recent techniques have shifted towards learning this distribution directly from the data. While learning-based approaches have imparted significant improvement in quality, some limitations remain to be addres…

    Submitted 8 April, 2020; v1 submitted 22 January, 2020; originally announced January 2020.

    Comments: Fixed typo in Table 1; The Web Conference (WWW) 2020

  24. arXiv:1912.01540  [pdf, other]

    cs.CV cs.LG

    QUEST: Quantized embedding space for transferring knowledge

    Authors: Himalaya Jain, Spyros Gidaris, Nikos Komodakis, Patrick Pérez, Matthieu Cord

    Abstract: Knowledge distillation refers to the process of training a compact student network to achieve better accuracy by learning from a high capacity teacher network. Most of the existing knowledge distillation methods direct the student to follow the teacher by matching the teacher's output, feature maps or their distribution. In this work, we propose a novel way to achieve this goal: by distilling the…

    Submitted 17 July, 2020; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: Accepted at ECCV 2020

  25. arXiv:1911.08271  [pdf]

    cs.CY

    Python vs. R: A Text Mining Approach for analyzing the Research Trends in Scopus Database

    Authors: Neeraj Bhanot, Harwinder Singh, Divyansu Sharma, Harshit Jain, Shreyansh Jain

    Abstract: In the contemporary world, with the incubation of advanced technologies and tremendous outbursts of research works, analyzing big data to incorporate research strategies becomes more helpful using the tools and techniques presented in the current research scenario. This paper indeed tries to tackle the most prominent challenges relating to big data analysis by utilizing a text mining approach to a…

    Submitted 10 November, 2019; originally announced November 2019.

    Comments: This study aims to help researchers by developing a Python-based algorithm to analyse research trends in the Scopus database across a large amount of information in different domains, giving beginners a fair idea of the research being carried out in their fields of interest. A comparison with R is also made to determine which platform provides more relevant results

  26. arXiv:1911.05161  [pdf, other]

    cs.IR

    All It Takes is 20 Questions!: A Knowledge Graph Based Approach

    Authors: Alvin Dey, Harsh Kumar Jain, Vikash Kumar Pandey, Tanmoy Chakraborty

    Abstract: 20 Questions (20Q) is a two-player game. One player is the answerer, and the other is a questioner. The answerer chooses an entity from a specified domain and does not reveal this to the other player. The questioner can ask at most 20 questions to the answerer to guess the entity. The answerer can reply to the questions asked by saying yes/no/maybe. In this paper, we propose a novel approach based…

    Submitted 12 November, 2019; originally announced November 2019.

  27. arXiv:1911.02888  [pdf, other]

    cs.CV cs.LG eess.IV

    This dataset does not exist: training models from generated images

    Authors: Victor Besnier, Himalaya Jain, Andrei Bursuc, Matthieu Cord, Patrick Pérez

    Abstract: Current generative networks are increasingly proficient in generating high-resolution realistic images. These generative networks, especially the conditional ones, can potentially become a great tool for providing new image datasets. This naturally brings the question: Can we train a classifier only on the generated data? This potential availability of nearly unlimited amounts of training data cha…

    Submitted 7 November, 2019; originally announced November 2019.

  28. arXiv:1904.01886  [pdf, other]

    cs.CV

    DADA: Depth-aware Domain Adaptation in Semantic Segmentation

    Authors: Tuan-Hung Vu, Himalaya Jain, Maxime Bucher, Matthieu Cord, Patrick Pérez

    Abstract: Unsupervised domain adaptation (UDA) is important for applications where large scale annotation of representative data is challenging. For semantic segmentation in particular, it helps deploy on real "target domain" data models that are trained on annotated images from a different "source domain", notably a virtual environment. To this end, most previous works consider semantic segmentation as the…

    Submitted 19 August, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

    Comments: Accepted in ICCV'19

  29. arXiv:1811.12833  [pdf, other]

    cs.CV

    ADVENT: Adversarial Entropy Minimization for Domain Adaptation in Semantic Segmentation

    Authors: Tuan-Hung Vu, Himalaya Jain, Maxime Bucher, Matthieu Cord, Patrick Pérez

    Abstract: Semantic segmentation is a key problem for many computer vision tasks. While approaches based on convolutional neural networks constantly break new records on different benchmarks, generalizing well to diverse testing environments remains a major challenge. In numerous real world applications, there is indeed a large gap between data distributions in train and test domains, which results in severe…

    Submitted 17 April, 2019; v1 submitted 30 November, 2018; originally announced November 2018.

    Comments: Accepted in CVPR'19. Code is available at https://github.com/valeoai/ADVENT

  30. arXiv:1712.04480  [pdf, other]

    cs.CV

    Learning a Complete Image Indexing Pipeline

    Authors: Himalaya Jain, Joaquin Zepeda, Patrick Pérez, Rémi Gribonval

    Abstract: To work at scale, a complete image indexing system comprises two components: An inverted file index to restrict the actual search to only a subset that should contain most of the items relevant to the query; An approximate distance computation mechanism to rapidly scan these lists. While supervised deep learning has recently enabled improvements to the latter, the former continues to be based on u…

    Submitted 12 December, 2017; originally announced December 2017.

  31. arXiv:1711.10283  [pdf, other]

    cs.MA

    Data Backup Network Formation with Heterogeneous Agents

    Authors: Harshit Jain, Guduru Sai Teja, Pramod Mane, Kapil Ahuja, Nagarajan Krishnamurthy

    Abstract: Social storage systems are becoming increasingly popular compared to the existing data backup systems like local, centralized and P2P systems. An endogenously built symmetric social storage model and its aspects like the utility of each agent, bilateral stability, contentment, and efficiency have been extensively discussed in Mane et al. (2017). We include heterogeneity in this model by using the…

    Submitted 28 November, 2017; originally announced November 2017.

    Comments: 3 Pages, double columns, 1 figure, extended abstract

    MSC Class: 91

  32. arXiv:1710.03027  [pdf]

    cs.CV

    A Bottom Up Procedure for Text Line Segmentation of Latin Script

    Authors: Himanshu Jain, Archana Praveen Kumar

    Abstract: In this paper we present a bottom up procedure for segmentation of text lines written or printed in the Latin script. The proposed method uses a combination of image morphology, feature extraction and Gaussian mixture model to perform this task. The experimental results show the validity of the procedure.

    Submitted 9 October, 2017; originally announced October 2017.

    Comments: Accepted and presented at the IEEE conference "International Conference on Advances in Computing, Communications and Informatics (ICACCI) 2017"

    MSC Class: 68T45

  33. arXiv:1710.03025  [pdf]

    cs.CV

    A Sequential Thinning Algorithm For Multi-Dimensional Binary Patterns

    Authors: Himanshu Jain, Archana Praveen Kumar

    Abstract: Thinning is the removal of contour pixels/points of connected components in an image to produce their skeleton with retained connectivity and structural properties. The output requirements of a thinning procedure often vary with application. This paper proposes a sequential algorithm that is very easy to understand and modify based on application to perform the thinning of multi-dimensional binary…

    Submitted 16 November, 2017; v1 submitted 9 October, 2017; originally announced October 2017.

    MSC Class: 68T10

  34. arXiv:1708.02932  [pdf, other]

    cs.CV

    SUBIC: A supervised, structured binary code for image search

    Authors: Himalaya Jain, Joaquin Zepeda, Patrick Pérez, Rémi Gribonval

    Abstract: For large-scale visual search, highly compressed yet meaningful representations of images are essential. Structured vector quantizers based on product quantization and its variants are usually employed to achieve such compression while minimizing the loss of accuracy. Yet, unlike binary hashing schemes, these unsupervised methods have not yet benefited from the supervision, end-to-end learning and…

    Submitted 9 August, 2017; originally announced August 2017.

    Comments: Accepted at ICCV 2017 (Spotlight)

  35. arXiv:1706.06651  [pdf, other]

    cs.MM cs.CV

    Passive Classification of Source Printer using Text-line-level Geometric Distortion Signatures from Scanned Images of Printed Documents

    Authors: Hardik Jain, Gaurav Gupta, Sharad Joshi, Nitin Khanna

    Abstract: In this digital era, one thing that still holds the convention is a printed archive. Printed documents find their use in many critical domains such as contract papers, legal tenders and proof of identity documents. As more advanced printing, scanning and image editing techniques are becoming available, forgeries on these legal tenders pose a serious threat. Ability to easily and reliably identify…

    Submitted 20 June, 2017; originally announced June 2017.

    Comments: 20 pages

  36. arXiv:1608.03308  [pdf, other]

    cs.CV

    Approximate search with quantized sparse representations

    Authors: Himalaya Jain, Patrick Pérez, Rémi Gribonval, Joaquin Zepeda, Hervé Jégou

    Abstract: This paper tackles the task of storing a large collection of vectors, such as visual descriptors, and of searching in it. To this end, we propose to approximate database vectors by constrained sparse coding, where possible atom weights are restricted to belong to a finite subset. This formulation encompasses, as particular cases, previous state-of-the-art methods such as product or residual quanti…

    Submitted 10 August, 2016; originally announced August 2016.

    Comments: ECCV 2016

  37. arXiv:1507.02743  [pdf, ps, other]

    cs.LG cs.IR math.OC stat.ML

    Locally Non-linear Embeddings for Extreme Multi-label Learning

    Authors: Kush Bhatia, Himanshu Jain, Purushottam Kar, Prateek Jain, Manik Varma

    Abstract: The objective in extreme multi-label learning is to train a classifier that can automatically tag a novel data point with the most relevant subset of labels from an extremely large label set. Embedding based approaches make training and prediction tractable by assuming that the training label matrix is low-rank and hence the effective number of labels can be reduced by projecting the high dimensio…

    Submitted 9 July, 2015; originally announced July 2015.