Zum Hauptinhalt springen

Showing 1–45 of 45 results for author: Chandrasekhar, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2101.04859  [pdf

    cs.LG eess.SP

    A*HAR: A New Benchmark towards Semi-supervised learning for Class-imbalanced Human Activity Recognition

    Authors: Govind Narasimman, Kangkang Lu, Arun Raja, Chuan Sheng Foo, Mohamed Sabry Aly, Jie Lin, Vijay Chandrasekhar

    Abstract: Despite the vast literature on Human Activity Recognition (HAR) with wearable inertial sensor data, it is perhaps surprising that there are few studies investigating semisupervised learning for HAR, particularly in a challenging scenario with class imbalance problem. In this work, we present a new benchmark, called A*HAR, towards semisupervised learning for class-imbalanced HAR. We evaluate state-… ▽ More

    Submitted 12 January, 2021; originally announced January 2021.

    Comments: 5 pages, 3 figures

  2. arXiv:2007.04756  [pdf, other

    cs.AI cs.CV cs.LG cs.NE

    Learning to Prune Deep Neural Networks via Reinforcement Learning

    Authors: Manas Gupta, Siddharth Aravindan, Aleksandra Kalisz, Vijay Chandrasekhar, Lin Jie

    Abstract: This paper proposes PuRL - a deep reinforcement learning (RL) based algorithm for pruning neural networks. Unlike current RL based model compression approaches where feedback is given only at the end of each episode to the agent, PuRL provides rewards at every pruning step. This enables PuRL to achieve sparsity and accuracy comparable to current state-of-the-art methods, while having a much shorte… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

    Comments: Accepted at the ICML 2020 Workshop on Automated Machine Learning (AutoML 2020)

  3. arXiv:2006.14265  [pdf, other

    cs.LG cs.CV stat.ML

    Empirical Analysis of Overfitting and Mode Drop in GAN Training

    Authors: Yasin Yazici, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Vijay Chandrasekhar

    Abstract: We examine two key questions in GAN training, namely overfitting and mode drop, from an empirical perspective. We show that when stochasticity is removed from the training procedure, GANs can overfit and exhibit almost no mode drop. Our results shed light on important characteristics of the GAN training procedure. They also provide evidence against prevailing intuitions that GANs do not memorize t… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: To appear in ICIP2020

  4. Classify and Generate: Using Classification Latent Space Representations for Image Generations

    Authors: Saisubramaniam Gopalakrishnan, Pranshu Ranjan Singh, Yasin Yazici, Chuan-Sheng Foo, Vijay Chandrasekhar, ArulMurugan Ambikapathi

    Abstract: Utilization of classification latent space information for downstream reconstruction and generation is an intriguing and a relatively unexplored area. In general, discriminative representations are rich in class-specific features but are too sparse for reconstruction, whereas, in autoencoders the representations are dense but have limited indistinguishable class-specific features, making them less… ▽ More

    Submitted 14 December, 2021; v1 submitted 16 April, 2020; originally announced April 2020.

    Journal ref: Saisubramaniam Gopalakrishnan, Pranshu Ranjan Singh et. al., Classify and generate: Using classification latent space representations for image generations, Neurocomputing, Volume 471, 2022, Pages 296-334, ISSN 0925-2312

  5. arXiv:1912.04219  [pdf, other

    cs.CV

    FaultNet: Faulty Rail-Valves Detection using Deep Learning and Computer Vision

    Authors: Ramanpreet Singh Pahwa, Jin Chao, Jestine Paul, Yiqun Li, Ma Tin Lay Nwe, Shudong Xie, Ashish James, Arulmurugan Ambikapathi, Zeng Zeng, Vijay Ramaseshan Chandrasekhar

    Abstract: Regular inspection of rail valves and engines is an important task to ensure the safety and efficiency of railway networks around the globe. Over the past decade, computer vision and pattern recognition based techniques have gained traction for such inspection and defect detection tasks. An automated end-to-end trained system can potentially provide a low-cost, high throughput, and cheap alternati… ▽ More

    Submitted 8 November, 2019; originally announced December 2019.

    Comments: 8 pages, 8 figures, ITSC 2019

    Journal ref: IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE - ITSC 2019

  6. arXiv:1909.07541  [pdf, other

    cs.CV cs.RO

    A*3D Dataset: Towards Autonomous Driving in Challenging Environments

    Authors: Quang-Hieu Pham, Pierre Sevestre, Ramanpreet Singh Pahwa, Huijing Zhan, Chun Ho Pang, Yuda Chen, Armin Mustafa, Vijay Chandrasekhar, Jie Lin

    Abstract: With the increasing global popularity of self-driving cars, there is an immediate need for challenging real-world datasets for benchmarking and training various computer vision tasks such as 3D object detection. Existing datasets either represent simple scenarios or provide only day-time data. In this paper, we introduce a new challenging A*3D dataset which consists of RGB images and LiDAR data wi… ▽ More

    Submitted 16 September, 2019; originally announced September 2019.

    Comments: A new 3D dataset by I2R, A*STAR for autonomous driving

  7. arXiv:1907.07862  [pdf, other

    cs.IT eess.SP

    Artificial Intelligence-Enabled Cellular Networks: A Critical Path to Beyond-5G and 6G

    Authors: Rubayet Shafin, Lingjia Liu, Vikram Chandrasekhar, Hao Chen, Jeffrey Reed, Jianzhong, Zhang

    Abstract: Mobile Network Operators (MNOs) are in process of overlaying their conventional macro cellular networks with shorter range cells such as outdoor pico cells. The resultant increase in network complexity creates substantial overhead in terms of operating expenses, time, and labor for their planning and management. Artificial intelligence (AI) offers the potential for MNOs to operate their networks i… ▽ More

    Submitted 17 July, 2019; originally announced July 2019.

    Comments: 7 pages, 3 figures, 1 table

  8. arXiv:1902.03444  [pdf, other

    cs.LG stat.ML

    Venn GAN: Discovering Commonalities and Particularities of Multiple Distributions

    Authors: Yasin Yazıcı, Bruno Lecouat, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Georgios Piliouras, Vijay Chandrasekhar

    Abstract: We propose a GAN design which models multiple distributions effectively and discovers their commonalities and particularities. Each data distribution is modeled with a mixture of $K$ generator distributions. As the generators are partially shared between the modeling of different true data distributions, shared ones captures the commonality of the distributions, while non-shared ones capture uniqu… ▽ More

    Submitted 9 February, 2019; originally announced February 2019.

  9. arXiv:1901.10074  [pdf, other

    cs.CR

    CaRENets: Compact and Resource-Efficient CNN for Homomorphic Inference on Encrypted Medical Images

    Authors: Jin Chao, Ahmad Al Badawi, Balagopal Unnikrishnan, Jie Lin, Chan Fook Mun, James M. Brown, J. Peter Campbell, Michael Chiang, Jayashree Kalpathy-Cramer, Vijay Ramaseshan Chandrasekhar, Pavitra Krishnaswamy, Khin Mi Mi Aung

    Abstract: Convolutional neural networks (CNNs) have enabled significant performance leaps in medical image classification tasks. However, translating neural network models for clinical applications remains challenging due to data privacy issues. Fully Homomorphic Encryption (FHE) has the potential to address this challenge as it enables the use of CNNs on encrypted images. However, current HE technology pos… ▽ More

    Submitted 28 January, 2019; originally announced January 2019.

  10. arXiv:1901.02064  [pdf, other

    cs.LG stat.ML

    Dataflow-based Joint Quantization of Weights and Activations for Deep Neural Networks

    Authors: Xue Geng, Jie Fu, Bin Zhao, Jie Lin, Mohamed M. Sabry Aly, Christopher Pal, Vijay Chandrasekhar

    Abstract: This paper addresses a challenging problem - how to reduce energy consumption without incurring performance drop when deploying deep neural networks (DNNs) at the inference stage. In order to alleviate the computation and storage burdens, we propose a novel dataflow-based joint quantization approach with the hypothesis that a fewer number of quantization operations would incur less information los… ▽ More

    Submitted 4 January, 2019; originally announced January 2019.

    Journal ref: Data Compression Conference 2019

  11. arXiv:1812.07832  [pdf, other

    cs.CV

    Semi-Supervised Deep Learning for Abnormality Classification in Retinal Images

    Authors: Bruno Lecouat, Ken Chang, Chuan-Sheng Foo, Balagopal Unnikrishnan, James M. Brown, Houssam Zenati, Andrew Beers, Vijay Chandrasekhar, Jayashree Kalpathy-Cramer, Pavitra Krishnaswamy

    Abstract: Supervised deep learning algorithms have enabled significant performance gains in medical image classification tasks. But these methods rely on large labeled datasets that require resource-intensive expert annotation. Semi-supervised generative adversarial network (GAN) approaches offer a means to learn from limited labeled data alongside larger unlabeled datasets, but have not been applied to dis… ▽ More

    Submitted 19 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/227

  12. arXiv:1812.02288  [pdf, other

    cs.LG stat.ML

    Adversarially Learned Anomaly Detection

    Authors: Houssam Zenati, Manon Romain, Chuan Sheng Foo, Bruno Lecouat, Vijay Ramaseshan Chandrasekhar

    Abstract: Anomaly detection is a significant and hence well-studied problem. However, developing effective anomaly detection methods for complex and high-dimensional data remains a challenge. As Generative Adversarial Networks (GANs) are able to model the complex high-dimensional distributions of real-world data, they offer a promising approach to address this challenge. In this work, we propose an anomaly… ▽ More

    Submitted 5 December, 2018; originally announced December 2018.

    Comments: In the Proceedings of the 20th IEEE International Conference on Data Mining (ICDM), 2018

  13. arXiv:1811.12065  [pdf, other

    cs.NE cs.LG

    TEA-DNN: the Quest for Time-Energy-Accuracy Co-optimized Deep Neural Networks

    Authors: Lile Cai, Anne-Maelle Barneche, Arthur Herbout, Chuan Sheng Foo, Jie Lin, Vijay Ramaseshan Chandrasekhar, Mohamed M. Sabry

    Abstract: Embedded deep learning platforms have witnessed two simultaneous improvements. First, the accuracy of convolutional neural networks (CNNs) has been significantly improved through the use of automated neural-architecture search (NAS) algorithms to determine CNN structure. Second, there has been increasing interest in developing hardware accelerators for CNNs that provide improved inference performa… ▽ More

    Submitted 21 October, 2019; v1 submitted 29 November, 2018; originally announced November 2018.

    Comments: Accepted by ISLPED2019

  14. arXiv:1811.06231  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Graph Convolutional Neural Networks for Polymers Property Prediction

    Authors: Minggang Zeng, Jatin Nitin Kumar, Zeng Zeng, Ramasamy Savitha, Vijay Ramaseshan Chandrasekhar, Kedar Hippalgaonkar

    Abstract: A fast and accurate predictive tool for polymer properties is demanding and will pave the way to iterative inverse design. In this work, we apply graph convolutional neural networks (GCNN) to predict the dielectric constant and energy bandgap of polymers. Using density functional theory (DFT) calculated properties as the ground truth, GCNN can achieve remarkable agreement with DFT results. Moreove… ▽ More

    Submitted 15 November, 2018; originally announced November 2018.

    Comments: Accepted for NIPS 2018 Workshop on Machine Learning for Molecules and Materials

  15. arXiv:1811.06219  [pdf, other

    physics.comp-ph cond-mat.mtrl-sci cs.LG

    Predicting thermoelectric properties from crystal graphs and material descriptors - first application for functional materials

    Authors: Leo Laugier, Daniil Bash, Jose Recatala, Hong Kuan Ng, Savitha Ramasamy, Chuan-Sheng Foo, Vijay R Chandrasekhar, Kedar Hippalgaonkar

    Abstract: We introduce the use of Crystal Graph Convolutional Neural Networks (CGCNN), Fully Connected Neural Networks (FCNN) and XGBoost to predict thermoelectric properties. The dataset for the CGCNN is independent of Density Functional Theory (DFT) and only relies on the crystal and atomic information, while that for the FCNN is based on a rich attribute list mined from Materialsproject.org. The results… ▽ More

    Submitted 15 November, 2018; originally announced November 2018.

  16. arXiv:1811.04595  [pdf, other

    cs.CV

    Holistic Multi-modal Memory Network for Movie Question Answering

    Authors: Anran Wang, Anh Tuan Luu, Chuan-Sheng Foo, Hongyuan Zhu, Yi Tay, Vijay Chandrasekhar

    Abstract: Answering questions according to multi-modal context is a challenging problem as it requires a deep integration of different data sources. Existing approaches only employ partial interactions among data sources in one attention hop. In this paper, we present the Holistic Multi-modal Memory Network (HMMN) framework which fully considers the interactions between different input sources (multi-modal… ▽ More

    Submitted 12 November, 2018; originally announced November 2018.

  17. arXiv:1811.00778  [pdf, other

    cs.CR cs.LG

    Towards the AlexNet Moment for Homomorphic Encryption: HCNN, theFirst Homomorphic CNN on Encrypted Data with GPUs

    Authors: Ahmad Al Badawi, Jin Chao, Jie Lin, Chan Fook Mun, Jun Jie Sim, Benjamin Hong Meng Tan, Xiao Nan, Khin Mi Mi Aung, Vijay Ramaseshan Chandrasekhar

    Abstract: Deep Learning as a Service (DLaaS) stands as a promising solution for cloud-based inference applications. In this setting, the cloud has a pre-learned model whereas the user has samples on which she wants to run the model. The biggest concern with DLaaS is user privacy if the input samples are sensitive data. We provide here an efficient privacy-preserving system by employing high-end technologies… ▽ More

    Submitted 18 August, 2020; v1 submitted 2 November, 2018; originally announced November 2018.

  18. arXiv:1808.07272  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    Deep Adaptive Temporal Pooling for Activity Recognition

    Authors: Sibo Song, Ngai-Man Cheung, Vijay Chandrasekhar, Bappaditya Mandal

    Abstract: Deep neural networks have recently achieved competitive accuracy for human activity recognition. However, there is room for improvement, especially in modeling long-term temporal importance and determining the activity relevance of different temporal segments in a video. To address this problem, we propose a learnable and differentiable module: Deep Adaptive Temporal Pooling (DATP). DATP applies a… ▽ More

    Submitted 22 August, 2018; originally announced August 2018.

    Comments: Accepted by ACM Multimedia 2018

  19. arXiv:1807.04307  [pdf, other

    cs.LG stat.ML

    Manifold regularization with GANs for semi-supervised learning

    Authors: Bruno Lecouat, Chuan-Sheng Foo, Houssam Zenati, Vijay Chandrasekhar

    Abstract: Generative Adversarial Networks are powerful generative models that are able to model the manifold of natural images. We leverage this property to perform manifold regularization by approximating a variant of the Laplacian norm using a Monte Carlo approximation that is easily computed with the GAN. When incorporated into the semi-supervised feature-matching GAN we achieve state-of-the-art results… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

  20. arXiv:1807.02629  [pdf, other

    cs.LG cs.GT math.OC stat.ML

    Optimistic mirror descent in saddle-point problems: Going the extra (gradient) mile

    Authors: Panayotis Mertikopoulos, Bruno Lecouat, Houssam Zenati, Chuan-Sheng Foo, Vijay Chandrasekhar, Georgios Piliouras

    Abstract: Owing to their connection with generative adversarial networks (GANs), saddle-point problems have recently attracted considerable interest in machine learning and beyond. By necessity, most theoretical guarantees revolve around convex-concave (or even linear) problems; however, making theoretical inroads towards efficient GAN training depends crucially on moving beyond this classic framework. To m… ▽ More

    Submitted 1 October, 2018; v1 submitted 7 July, 2018; originally announced July 2018.

    Comments: 26 pages, 14 figures

  21. arXiv:1806.04498  [pdf, other

    stat.ML cs.CV cs.LG

    The Unusual Effectiveness of Averaging in GAN Training

    Authors: Yasin Yazıcı, Chuan-Sheng Foo, Stefan Winkler, Kim-Hui Yap, Georgios Piliouras, Vijay Chandrasekhar

    Abstract: We examine two different techniques for parameter averaging in GAN training. Moving Average (MA) computes the time-average of parameters, whereas Exponential Moving Average (EMA) computes an exponentially discounted sum. Whilst MA is known to lead to convergence in bilinear settings, we provide the -- to our knowledge -- first theoretical arguments in support of EMA. We show that EMA converges to… ▽ More

    Submitted 26 February, 2019; v1 submitted 12 June, 2018; originally announced June 2018.

    Comments: Published as a conference paper at ICLR 2019

  22. arXiv:1805.08957  [pdf, other

    cs.LG stat.ML

    Semi-Supervised Learning with GANs: Revisiting Manifold Regularization

    Authors: Bruno Lecouat, Chuan-Sheng Foo, Houssam Zenati, Vijay R. Chandrasekhar

    Abstract: GANS are powerful generative models that are able to model the manifold of natural images. We leverage this property to perform manifold regularization by approximating the Laplacian norm using a Monte Carlo approximation that is easily computed with the GAN. When incorporated into the feature-matching GAN of Improved GAN, we achieve state-of-the-art results for GAN-based semi-supervised learning… ▽ More

    Submitted 23 May, 2018; originally announced May 2018.

    Comments: Accepted paper

    Journal ref: Workshop track - ICLR 2018

  23. arXiv:1803.11246  [pdf

    cs.CY cond-mat.mtrl-sci

    Accelerating Materials Development via Automation, Machine Learning, and High-Performance Computing

    Authors: Juan Pablo Correa-Baena, Kedar Hippalgaonkar, Jeroen van Duren, Shaffiq Jaffer, Vijay R. Chandrasekhar, Vladan Stevanovic, Cyrus Wadia, Supratik Guha, Tonio Buonassisi

    Abstract: Successful materials innovations can transform society. However, materials research often involves long timelines and low success probabilities, dissuading investors who have expectations of shorter times from bench to business. A combination of emergent technologies could accelerate the pace of novel materials development by 10x or more, aligning the timelines of stakeholders (investors and resea… ▽ More

    Submitted 20 March, 2018; originally announced March 2018.

    Comments: 22 pages, 3 figures

    Journal ref: Joule 2 (2018) 1410-1420

  24. arXiv:1803.02043  [pdf, other

    cs.NE cs.LG stat.ML

    Online Deep Learning: Growing RBM on the fly

    Authors: Savitha Ramasamy, Kanagasabai Rajaraman, Pavitra Krishnaswamy, Vijay Chandrasekhar

    Abstract: We propose a novel online learning algorithm for Restricted Boltzmann Machines (RBM), namely, the Online Generative Discriminative Restricted Boltzmann Machine (OGD-RBM), that provides the ability to build and adapt the network architecture of RBM according to the statistics of streaming data. The OGD-RBM is trained in two phases: (1) an online generative phase for unsupervised feature representat… ▽ More

    Submitted 6 March, 2018; originally announced March 2018.

    Comments: 14 pages, 4 figures, 2 tables

  25. arXiv:1802.06222  [pdf, ps, other

    cs.LG stat.ML

    Efficient GAN-Based Anomaly Detection

    Authors: Houssam Zenati, Chuan Sheng Foo, Bruno Lecouat, Gaurav Manek, Vijay Ramaseshan Chandrasekhar

    Abstract: Generative adversarial networks (GANs) are able to model the complex highdimensional distributions of real-world data, which suggests they could be effective for anomaly detection. However, few works have explored the use of GANs for the anomaly detection task. We leverage recently developed GAN models for anomaly detection, and achieve state-of-the-art performance on image and network intrusion d… ▽ More

    Submitted 1 May, 2019; v1 submitted 17 February, 2018; originally announced February 2018.

    Comments: Updated version of this work is published at ICDM 2018, see arXiv:1812.02288 . Submitted to the ICLR Workshop 2018

  26. arXiv:1711.01714  [pdf, other

    cs.CV

    End-to-End Video Classification with Knowledge Graphs

    Authors: Fang Yuan, Zhe Wang, Jie Lin, Luis Fernando D'Haro, Kim Jung Jae, Zeng Zeng, Vijay Chandrasekhar

    Abstract: Video understanding has attracted much research attention especially since the recent availability of large-scale video benchmarks. In this paper, we address the problem of multi-label video classification. We first observe that there exists a significant knowledge gap between how machines and humans learn. That is, while current machine learning approaches including deep neural networks largely f… ▽ More

    Submitted 5 November, 2017; originally announced November 2017.

    Comments: 9 pages, 5 figures

  27. arXiv:1707.05455  [pdf, ps, other

    cs.CV

    Pruning Convolutional Neural Networks for Image Instance Retrieval

    Authors: Gaurav Manek, Jie Lin, Vijay Chandrasekhar, Lingyu Duan, Sateesh Giduthuri, Xiaoli Li, Tomaso Poggio

    Abstract: In this work, we focus on the problem of image instance retrieval with deep descriptors extracted from pruned Convolutional Neural Networks (CNN). The objective is to heavily prune convolutional edges while maintaining retrieval performance. To this end, we introduce both data-independent and data-dependent heuristics to prune convolutional edges, and evaluate their performance across various comp… ▽ More

    Submitted 17 July, 2017; originally announced July 2017.

    Comments: 5 pages

  28. arXiv:1706.05461  [pdf, other

    cs.CV

    Truly Multi-modal YouTube-8M Video Classification with Video, Audio, and Text

    Authors: Zhe Wang, Kingsley Kuan, Mathieu Ravaut, Gaurav Manek, Sibo Song, Yuan Fang, Seokhwan Kim, Nancy Chen, Luis Fernando D'Haro, Luu Anh Tuan, Hongyuan Zhu, Zeng Zeng, Ngai Man Cheung, Georgios Piliouras, Jie Lin, Vijay Chandrasekhar

    Abstract: The YouTube-8M video classification challenge requires teams to classify 0.7 million videos into one or more of 4,716 classes. In this Kaggle competition, we placed in the top 3% out of 650 participants using released video and audio features. Beyond that, we extend the original competition by including text information in the classification, making this a truly multi-modal approach with vision, a… ▽ More

    Submitted 9 July, 2017; v1 submitted 16 June, 2017; originally announced June 2017.

    Comments: 8 pages, Accepted to CVPR'17 Workshop on YouTube-8M Large-Scale Video Understanding

  29. arXiv:1705.09435  [pdf, other

    cs.CV

    Deep Learning for Lung Cancer Detection: Tackling the Kaggle Data Science Bowl 2017 Challenge

    Authors: Kingsley Kuan, Mathieu Ravaut, Gaurav Manek, Huiling Chen, Jie Lin, Babar Nazir, Cen Chen, Tse Chiang Howe, Zeng Zeng, Vijay Chandrasekhar

    Abstract: We present a deep learning framework for computer-aided lung cancer diagnosis. Our multi-stage framework detects nodules in 3D lung CAT scans, determines if each nodule is malignant, and finally assigns a cancer probability based on these results. We discuss the challenges and advantages of our framework. In the Kaggle Data Science Bowl 2017, our framework ranked 41st out of 1972 teams.

    Submitted 26 May, 2017; originally announced May 2017.

  30. arXiv:1704.08141  [pdf, other

    cs.CV

    Compact Descriptors for Video Analysis: the Emerging MPEG Standard

    Authors: Ling-Yu Duan, Vijay Chandrasekhar, Shiqi Wang, Yihang Lou, Jie Lin, Yan Bai, Tiejun Huang, Alex Chichung Kot, Wen Gao

    Abstract: This paper provides an overview of the on-going compact descriptors for video analysis standard (CDVA) from the ISO/IEC moving pictures experts group (MPEG). MPEG-CDVA targets at defining a standardized bitstream syntax to enable interoperability in the context of video analysis applications. During the developments of MPEGCDVA, a series of techniques aiming to reduce the descriptor size and impro… ▽ More

    Submitted 26 April, 2017; originally announced April 2017.

    Comments: 4 figures, 4 tables

  31. arXiv:1701.04923  [pdf, other

    cs.CV

    Compression of Deep Neural Networks for Image Instance Retrieval

    Authors: Vijay Chandrasekhar, Jie Lin, Qianli Liao, Olivier Morère, Antoine Veillard, Lingyu Duan, Tomaso Poggio

    Abstract: Image instance retrieval is the problem of retrieving images from a database which contain the same object. Convolutional Neural Network (CNN) based descriptors are becoming the dominant approach for generating {\it global image descriptors} for the instance retrieval problem. One major drawback of CNN-based {\it global descriptors} is that uncompressed deep neural network models require hundreds… ▽ More

    Submitted 17 January, 2017; originally announced January 2017.

    Comments: 10 pages, accepted by DCC 2017

  32. arXiv:1603.04595  [pdf, other

    cs.CV cs.IR

    Nested Invariance Pooling and RBM Hashing for Image Instance Retrieval

    Authors: Olivier Morère, Jie Lin, Antoine Veillard, Vijay Chandrasekhar, Tomaso Poggio

    Abstract: The goal of this work is the computation of very compact binary hashes for image instance retrieval. Our approach has two novel contributions. The first one is Nested Invariance Pooling (NIP), a method inspired from i-theory, a mathematical theory for computing group invariant transformations with feed-forward neural networks. NIP is able to produce compact and well-performing descriptors with vis… ▽ More

    Submitted 14 April, 2016; v1 submitted 15 March, 2016; originally announced March 2016.

    Comments: Image Instance Retrieval, CNN, Invariant Representation, Hashing, Unsupervised Learning, Regularization. arXiv admin note: text overlap with arXiv:1601.02093

  33. arXiv:1601.06603  [pdf, other

    cs.MM cs.CV

    Egocentric Activity Recognition with Multimodal Fisher Vector

    Authors: Sibo Song, Ngai-Man Cheung, Vijay Chandrasekhar, Bappaditya Mandal, Jie Lin

    Abstract: With the increasing availability of wearable devices, research on egocentric activity recognition has received much attention recently. In this paper, we build a Multimodal Egocentric Activity dataset which includes egocentric videos and sensor data of 20 fine-grained and diverse activity categories. We present a novel strategy to extract temporal trajectory-like features from sensor data. We prop… ▽ More

    Submitted 25 January, 2016; originally announced January 2016.

    Comments: 5 pages, 4 figures, ICASSP 2016 accepted

  34. arXiv:1601.02093  [pdf, other

    cs.CV cs.IR

    Group Invariant Deep Representations for Image Instance Retrieval

    Authors: Olivier Morère, Antoine Veillard, Jie Lin, Julie Petta, Vijay Chandrasekhar, Tomaso Poggio

    Abstract: Most image instance retrieval pipelines are based on comparison of vectors known as global image descriptors between a query image and the database images. Due to their success in large scale image classification, representations extracted from Convolutional Neural Networks (CNN) are quickly gaining ground on Fisher Vectors (FVs) as state-of-the-art global descriptors for image instance retrieval.… ▽ More

    Submitted 13 January, 2016; v1 submitted 9 January, 2016; originally announced January 2016.

  35. arXiv:1511.03055  [pdf, other

    cs.IR cs.CV cs.LG

    Tiny Descriptors for Image Retrieval with Unsupervised Triplet Hashing

    Authors: Jie Lin, Olivier Morère, Julie Petta, Vijay Chandrasekhar, Antoine Veillard

    Abstract: A typical image retrieval pipeline starts with the comparison of global descriptors from a large database to find a short list of candidate matches. A good image descriptor is key to the retrieval pipeline and should reconcile two contradictory requirements: providing recall rates as high as possible and being as compact as possible for fast matching. Following the recent successes of Deep Convolu… ▽ More

    Submitted 10 November, 2015; originally announced November 2015.

    MSC Class: 68P20 ACM Class: H.3.3; I.2.6

  36. arXiv:1508.02496  [pdf, other

    cs.CV cs.IR

    A Practical Guide to CNNs and Fisher Vectors for Image Instance Retrieval

    Authors: Vijay Chandrasekhar, Jie Lin, Olivier Morère, Hanlin Goh, Antoine Veillard

    Abstract: With deep learning becoming the dominant approach in computer vision, the use of representations extracted from Convolutional Neural Nets (CNNs) is quickly gaining ground on Fisher Vectors (FVs) as favoured state-of-the-art global image descriptors for image instance retrieval. While the good performance of CNNs for image classification are unambiguously recognised, which of the two has the upper… ▽ More

    Submitted 25 August, 2015; v1 submitted 11 August, 2015; originally announced August 2015.

    Comments: Deep Convolutional Neural Networks for instance retrieval, Fisher Vectors, instance retrieval

  37. arXiv:1501.07738  [pdf, other

    cs.CV

    Co-Regularized Deep Representations for Video Summarization

    Authors: Olivier Morère, Hanlin Goh, Antoine Veillard, Vijay Chandrasekhar, Jie Lin

    Abstract: Compact keyframe-based video summaries are a popular way of generating viewership on video sharing platforms. Yet, creating relevant and compelling summaries for arbitrarily long videos with a small number of keyframes is a challenging task. We propose a comprehensive keyframe-based summarization framework combining deep convolutional neural networks and restricted Boltzmann machines. An original… ▽ More

    Submitted 30 January, 2015; originally announced January 2015.

    Comments: Video summarization, deep convolutional neural networks, co-regularized restricted Boltzmann machines

  38. arXiv:1501.04711  [pdf, other

    cs.CV cs.IR

    DeepHash: Getting Regularization, Depth and Fine-Tuning Right

    Authors: Jie Lin, Olivier Morere, Vijay Chandrasekhar, Antoine Veillard, Hanlin Goh

    Abstract: This work focuses on representing very high-dimensional global image descriptors using very compact 64-1024 bit binary hashes for instance retrieval. We propose DeepHash: a hashing scheme based on deep networks. Key to making DeepHash work at extremely low bitrates are three important considerations -- regularization, depth and fine-tuning -- each requiring solutions specific to the hashing proble… ▽ More

    Submitted 19 January, 2015; originally announced January 2015.

  39. arXiv:1112.1344  [pdf

    cs.IT

    Enhanced Inter-cell Interference Coordination for Heterogeneous Networks in LTE-Advanced: A Survey

    Authors: Lars Lindbom, Robert Love, Sandeep Krishnamurthy, Chunhai Yao, Nobuhiko Miki, Vikram Chandrasekhar

    Abstract: Heterogeneous networks (het-nets) - comprising of conventional macrocell base stations overlaid with femtocells, picocells and wireless relays - offer cellular operators burgeoning traffic demands through cell-splitting gains obtained by bringing users closer to their access points. However, the often random and unplanned location of these access points can cause severe near-far problems, typicall… ▽ More

    Submitted 7 December, 2011; v1 submitted 6 December, 2011; originally announced December 2011.

    Comments: This is a working document describing the Enhanced Inter-cell Interference Coordination (E-ICIC) introduced in LTE-Advanced

  40. Open vs Closed Access Femtocells in the Uplink

    Authors: Ping Xia, Vikram Chandrasekhar, Jeffrey G. Andrews

    Abstract: Femtocells are assuming an increasingly important role in the coverage and capacity of cellular networks. In contrast to existing cellular systems, femtocells are end-user deployed and controlled, randomly located, and rely on third party backhaul (e.g. DSL or cable modem). Femtocells can be configured to be either open access or closed access. Open access allows an arbitrary nearby cellular use… ▽ More

    Submitted 15 February, 2010; originally announced February 2010.

    Comments: 21 pages, 8 figures, 2 tables, submitted to IEEE Trans. on Wireless Communications

  41. Coverage in Multi-Antenna Two-Tier Networks

    Authors: Vikram Chandrasekhar, Marios Kountouris, Jeffrey G. Andrews

    Abstract: In two-tier networks -- comprising a conventional cellular network overlaid with shorter range hotspots (e.g. femtocells, distributed antennas, or wired relays) -- with universal frequency reuse, the near-far effect from cross-tier interference creates dead spots where reliable coverage cannot be guaranteed to users in either tier. Equipping the macrocell and femtocells with multiple antennas en… ▽ More

    Submitted 4 May, 2009; v1 submitted 18 February, 2009; originally announced February 2009.

    Comments: 30 Pages, 11 figures, Revised and Resubmitted to IEEE Transactions on Wireless Communications

  42. Power Control in Two-Tier Femtocell Networks

    Authors: Vikram Chandrasekhar, Jeffrey G. Andrews, Tarik Muharemovic, Zukang Shen, Alan Gatherer

    Abstract: In a two tier cellular network -- comprised of a central macrocell underlaid with shorter range femtocell hotspots -- cross-tier interference limits overall capacity with universal frequency reuse. To quantify near-far effects with universal frequency reuse, this paper derives a fundamental relation providing the largest feasible cellular Signal-to-Interference-Plus-Noise Ratio (SINR), given any… ▽ More

    Submitted 13 May, 2009; v1 submitted 21 October, 2008; originally announced October 2008.

    Comments: 29 pages, 10 figures, Revised and resubmitted to the IEEE Transactions on Wireless Communications

  43. arXiv:0805.1226  [pdf, ps, other

    cs.NI

    Spectrum Allocation in Two-Tier Networks

    Authors: Vikram Chandrasekhar, Jeffrey G. Andrews

    Abstract: Two-tier networks, comprising a conventional cellular network overlaid with shorter range hotspots (e.g. femtocells, distributed antennas, or wired relays), offer an economically viable way to improve cellular system capacity. The capacity-limiting factor in such networks is interference. The cross-tier interference between macrocells and femtocells can suffocate the capacity due to the near-far… ▽ More

    Submitted 24 November, 2008; v1 submitted 8 May, 2008; originally announced May 2008.

    Comments: 25 pages, Revised and submitted to IEEE Transactions on Communications

  44. Femtocell Networks: A Survey

    Authors: Vikram Chandrasekhar, Jeffrey Andrews, Alan Gatherer

    Abstract: The surest way to increase the system capacity of a wireless link is by getting the transmitter and receiver closer to each other, which creates the dual benefits of higher quality links and more spatial reuse. In a network with nomadic users, this inevitably involves deploying more infrastructure, typically in the form of microcells, hotspots, distributed antennas, or relays. A less expensive a… ▽ More

    Submitted 20 September, 2008; v1 submitted 6 March, 2008; originally announced March 2008.

    Comments: IEEE Communications Magazine, vol. 46, no.9, pp. 59-67, Sept. 2008

  45. arXiv:cs/0702132  [pdf, ps, other

    cs.NI cs.IT

    Uplink Capacity and Interference Avoidance for Two-Tier Femtocell Networks

    Authors: Vikram Chandrasekhar, Jeffrey G. Andrews

    Abstract: Two-tier femtocell networks-- comprising a conventional macrocellular network plus embedded femtocell hotspots-- offer an economically viable solution to achieving high cellular user capacity and improved coverage. With universal frequency reuse and DS-CDMA transmission however, the ensuing cross-tier cochannel interference (CCI) causes unacceptable outage probability. This paper develops an upl… ▽ More

    Submitted 5 February, 2009; v1 submitted 22 February, 2007; originally announced February 2007.

    Comments: To be published in the IEEE Transactions on Wireless Communications