Search | arXiv e-print repository

Parameter-Efficient Instruction Tuning of Large Language Models For Extreme Financial Numeral Labelling

Authors: Subhendu Khatuya, Rajdeep Mukherjee, Akash Ghosh, Manjunath Hegde, Koustuv Dasgupta, Niloy Ganguly, Saptarshi Ghosh, Pawan Goyal

Abstract: We study the problem of automatically annotating relevant numerals (GAAP metrics) occurring in the financial documents with their corresponding XBRL tags. Different from prior works, we investigate the feasibility of solving this extreme classification problem using a generative paradigm through instruction tuning of Large Language Models (LLMs). To this end, we leverage metric metadata information to frame our target outputs while proposing a parameter efficient solution for the task using LoRA. We perform experiments on two recently released financial numeric labeling datasets. Our proposed model, FLAN-FinXC, achieves new state-of-the-art performances on both the datasets, outperforming several strong baselines. We explain the better scores of our proposed model by demonstrating its capability for zero-shot as well as the least frequently occurring tags. Also, even when we fail to predict the XBRL tags correctly, our generated output has substantial overlap with the ground-truth in majority of the cases.

Submitted 15 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

Comments: This work has been accepted to appear at North American Chapter of the Association for Computational Linguistics (NAACL), 2024

arXiv:2401.02765 [pdf, ps, other]

An improved upper bound for the domination number of a graph

Authors: Subramanian Arumugam, Suresh Manjanath Hegde, Shashanka Kulamarva

Abstract: Let $G$ be a graph of order $n$. A classical upper bound for the domination number of a graph $G$ having no isolated vertices is $\lfloor\frac{n}{2}\rfloor$. However, for several families of graphs, we have $γ(G) \le \lfloor\sqrt{n}\rfloor$ which gives a substantially improved upper bound. In this paper, we give a condition necessary for a graph $G$ to have $γ(G) \le \lfloor\sqrt{n}\rfloor$, and some conditions sufficient for a graph $G$ to have $γ(G) \le \lfloor\sqrt{n}\rfloor$. We also present a characterization of all connected graphs $G$ of order $n$ with $γ(G) = \lfloor\sqrt{n}\rfloor$. Further, we prove that for a graph $G$ not satisfying $rad(G)=diam(G)=rad(\overline{G})=diam(\overline{G})=2$, deciding whether $γ(G) \le \lfloor\sqrt{n}\rfloor$ or $γ(\overline{G}) \le \lfloor\sqrt{n}\rfloor$ can be done in polynomial time. We conjecture that this decision problem can be solved in polynomial time for any graph $G$.

Submitted 16 January, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

MSC Class: 05C69
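
The two bounds quoted in the abstract above can be checked on small graphs by exhaustive search. The following is a minimal sketch and not the authors' method: `domination_number` is a hypothetical helper that tries every vertex subset in increasing size, so it runs in exponential time and is only meant for toy examples. Graphs are assumed to be given as a dictionary mapping each vertex to the set of its neighbours.

```python
from itertools import combinations
from math import isqrt


def domination_number(adj):
    """Return the size of a smallest dominating set (brute force, exponential time).

    adj maps each vertex to the set of its neighbours (hypothetical input format).
    """
    vertices = list(adj)
    for k in range(1, len(vertices) + 1):
        for subset in combinations(vertices, k):
            # A set dominates the graph if its closed neighbourhood covers every vertex.
            dominated = set(subset)
            for v in subset:
                dominated.update(adj[v])
            if len(dominated) == len(vertices):
                return k
    return 0  # only reached for the empty graph


# Cycle on 9 vertices: every vertex is adjacent to its two cyclic neighbours.
cycle9 = {i: {(i - 1) % 9, (i + 1) % 9} for i in range(9)}
gamma = domination_number(cycle9)
n = len(cycle9)
print(gamma, n // 2, isqrt(n))  # domination number, floor(n/2), floor(sqrt(n))
```

For the 9-cycle the search returns 3, which meets the $\lfloor\sqrt{n}\rfloor$ bound with equality and is below the classical $\lfloor\frac{n}{2}\rfloor = 4$.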

arXiv:2308.13868 [pdf, ps, other]

A Graph-Theoretic Model for a Generic Three Jug Puzzle

Authors: Suresh Manjanath Hegde, Shashanka Kulamarva

Abstract: In a classic three jug puzzle we have three jugs $A$, $B$, and $C$ with some fixed capacities. The jug $A$ is fully filled with wine to its capacity. The goal of the puzzle is to divide the wine into two equal halves by pouring it from one jug to another without using any other measuring devices. However, we consider a generic version of the three jug puzzle and present an independent graph theoretic model to determine whether the puzzle has a solution at first place. If it has a solution, then the same can be determined using this model. We also present the sketch of an algorithm to determine the solution of the puzzle.

Submitted 30 September, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

Comments: 13 pages, 2 figures

MSC Class: 05C20; 05C90
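
For readers unfamiliar with the puzzle described in the abstract above, the sketch below treats each distribution of wine as a vertex of a directed graph and each pour as an arc, then runs a plain breadth-first search. This is a generic state-space search written for illustration, not the graph-theoretic model proposed in the paper; the function name `solve_three_jugs` and the tuple-based state encoding are assumptions.

```python
from collections import deque


def solve_three_jugs(capacities, start):
    """Return a shortest sequence of states that halves the wine, or None.

    capacities and start are 3-tuples of non-negative integers (hypothetical encoding).
    """
    total = sum(start)
    if total % 2:
        return None  # an odd amount cannot be split into two equal integer halves
    half = total // 2
    parent = {start: None}
    queue = deque([start])
    while queue:
        state = queue.popleft()
        # Goal: two jugs hold exactly half each, so the third is empty.
        if sorted(state) == [0, half, half]:
            path = []
            while state is not None:
                path.append(state)
                state = parent[state]
            return path[::-1]
        for src in range(3):
            for dst in range(3):
                if src == dst:
                    continue
                # Pour until the source is empty or the destination is full.
                amount = min(state[src], capacities[dst] - state[dst])
                if amount == 0:
                    continue
                nxt = list(state)
                nxt[src] -= amount
                nxt[dst] += amount
                nxt = tuple(nxt)
                if nxt not in parent:
                    parent[nxt] = state
                    queue.append(nxt)
    return None


# Classic instance: jugs of capacity 8, 5 and 3, with the largest one full.
path = solve_three_jugs((8, 5, 3), (8, 0, 0))
print(path)
```

On the classic 8-5-3 instance the search ends at a state with 4 units in each of the two larger jugs after seven pours. Returning `None` corresponds to the case where the puzzle has no solution.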

arXiv:2306.03723 [pdf, other]

Financial Numeric Extreme Labelling: A Dataset and Benchmarking for XBRL Tagging

Authors: Soumya Sharma, Subhendu Khatuya, Manjunath Hegde, Afreen Shaikh, Koustuv Dasgupta, Pawan Goyal, Niloy Ganguly

Abstract: The U.S. Securities and Exchange Commission (SEC) mandates all public companies to file periodic financial statements that should contain numerals annotated with a particular label from a taxonomy. In this paper, we formulate the task of automating the assignment of a label to a particular numeral span in a sentence from an extremely large label set. Towards this task, we release a dataset, Financial Numeric Extreme Labelling (FNXL), annotated with 2,794 labels. We benchmark the performance of the FNXL dataset by formulating the task as (a) a sequence labelling problem and (b) a pipeline with span extraction followed by Extreme Classification. Although the two approaches perform comparably, the pipeline solution provides a slight edge for the least frequent labels.

Submitted 6 June, 2023; originally announced June 2023.

Comments: Accepted to ACL'23 Findings Paper

arXiv:2305.01948 [pdf, ps, other]

doi 10.1016/j.disc.2024.113898

Upper Bounds on the Acyclic Chromatic Index of Degenerate Graphs

Authors: Nevil Anto, Manu Basavaraju, Suresh Manjanath Hegde, Shashanka Kulamarva

Abstract: An acyclic edge coloring of a graph is a proper edge coloring without any bichromatic cycles. The acyclic chromatic index of a graph $G$ denoted by $a'(G)$, is the minimum $k$ such that $G$ has an acyclic edge coloring with $k$ colors. Fiamčík conjectured that $a'(G) \le Δ+2$ for any graph $G$ with maximum degree $Δ$. A graph $G$ is said to be $k$-degenerate if every subgraph of $G$ has a vertex of degree at most $k$. Basavaraju and Chandran proved that the conjecture is true for $2$-degenerate graphs. We prove that for a $3$-degenerate graph $G$, $a'(G) \le Δ+5$, thereby bringing the upper bound closer to the conjectured bound. We also consider $k$-degenerate graphs with $k \ge 4$ and give an upper bound for the acyclic chromatic index of the same.

Submitted 3 May, 2023; originally announced May 2023.

Journal ref: Discrete Mathematics, 347(4), (2024), 113898
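
The definition of an acyclic edge coloring in the abstract above can be turned into a small verifier. The sketch below is an illustration of the definition only and is unrelated to the proof techniques of the paper: it checks properness, then uses union-find to confirm that the edges of every pair of color classes form a forest, which is equivalent to having no bichromatic cycle. The name `is_acyclic_edge_coloring` and the edge-to-color dictionary format are assumptions.

```python
from itertools import combinations


def _find(parent, x):
    """Union-find root lookup with path halving."""
    parent.setdefault(x, x)
    while parent[x] != x:
        parent[x] = parent[parent[x]]
        x = parent[x]
    return x


def is_acyclic_edge_coloring(coloring):
    """coloring maps each edge (u, v) to a color (hypothetical input format)."""
    # Properness: no two edges meeting at a vertex may share a color.
    seen = set()
    for (u, v), color in coloring.items():
        for endpoint in (u, v):
            if (endpoint, color) in seen:
                return False
            seen.add((endpoint, color))
    # Acyclicity: the union of any two color classes must contain no cycle.
    for c1, c2 in combinations(set(coloring.values()), 2):
        parent = {}
        for (u, v), color in coloring.items():
            if color in (c1, c2):
                ru, rv = _find(parent, u), _find(parent, v)
                if ru == rv:
                    return False  # this edge closes a bichromatic cycle
                parent[ru] = rv
    return True


# A 4-cycle colored alternately is proper but bichromatic; a third color repairs it.
two_colors = {(0, 1): 1, (1, 2): 2, (2, 3): 1, (3, 0): 2}
three_colors = {(0, 1): 1, (1, 2): 2, (2, 3): 1, (3, 0): 3}
print(is_acyclic_edge_coloring(two_colors), is_acyclic_edge_coloring(three_colors))
```

The 4-cycle has $Δ = 2$, so the three colors used here equal $Δ+1$, within the conjectured bound of $Δ+2$.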

arXiv:2302.01638 [pdf, ps, other]

doi 10.1016/j.disc.2023.113434

Acyclic Chromatic Index of Chordless Graphs

Authors: Manu Basavaraju, Suresh Manjanath Hegde, Shashanka Kulamarva

Abstract: An acyclic edge coloring of a graph is a proper edge coloring in which there are no bichromatic cycles. The acyclic chromatic index of a graph $G$ denoted by $a'(G)$, is the minimum positive integer $k$ such that $G$ has an acyclic edge coloring with $k$ colors. It has been conjectured by Fiamčík that $a'(G) \le Δ+2$ for any graph $G$ with maximum degree $Δ$. Linear arboricity of a graph $G$, denoted by $la(G)$, is the minimum number of linear forests into which the edges of $G$ can be partitioned. A graph is said to be chordless if no cycle in the graph contains a chord. Every $2$-connected chordless graph is a minimally $2$-connected graph. It was shown by Basavaraju and Chandran that if $G$ is $2$-degenerate, then $a'(G) \le Δ+1$. Since chordless graphs are also $2$-degenerate, we have $a'(G) \le Δ+1$ for any chordless graph $G$. Machado, de Figueiredo and Trotignon proved that the chromatic index of a chordless graph is $Δ$ when $Δ\ge 3$. They also obtained a polynomial time algorithm to color a chordless graph optimally. We improve this result by proving that the acyclic chromatic index of a chordless graph is $Δ$, except when $Δ=2$ and the graph has a cycle, in which case it is $Δ+1$. We also provide the sketch of a polynomial time algorithm for an optimal acyclic edge coloring of a chordless graph. As a byproduct, we also prove that $la(G) = \lceil \frac{Δ}{2} \rceil$, unless $G$ has a cycle with $Δ=2$, in which case $la(G) = \lceil \frac{Δ+1}{2} \rceil = 2$.
To obtain the result on acyclic chromatic index, we prove a structural result on chordless graphs which is a refinement of the structure given by Machado, de Figueiredo and Trotignon for this class of graphs. This might be of independent interest.

Submitted 3 February, 2023; originally announced February 2023.

Journal ref: Discrete Mathematics, 346(8), (2023), 113434

arXiv:2210.12467 [pdf, other]

ECTSum: A New Benchmark Dataset For Bullet Point Summarization of Long Earnings Call Transcripts

Authors: Rajdeep Mukherjee, Abhinav Bohra, Akash Banerjee, Soumya Sharma, Manjunath Hegde, Afreen Shaikh, Shivani Shrivastava, Koustuv Dasgupta, Niloy Ganguly, Saptarshi Ghosh, Pawan Goyal

Abstract: Despite tremendous progress in automatic summarization, state-of-the-art methods are predominantly trained to excel in summarizing short newswire articles, or documents with strong layout biases such as scientific articles or government reports. Efficient techniques to summarize financial documents, including facts and figures, have largely been unexplored, majorly due to the unavailability of suitable datasets. In this work, we present ECTSum, a new dataset with transcripts of earnings calls (ECTs), hosted by publicly traded companies, as documents, and short experts-written telegram-style bullet point summaries derived from corresponding Reuters articles. ECTs are long unstructured documents without any prescribed length limit or format. We benchmark our dataset with state-of-the-art summarizers across various metrics evaluating the content quality and factual consistency of the generated summaries. Finally, we present a simple-yet-effective approach, ECT-BPS, to generate a set of bullet points that precisely capture the important facts discussed in the calls.

Submitted 26 October, 2022; v1 submitted 22 October, 2022; originally announced October 2022.

Comments: 14 pages; Accepted as a Long Paper in EMNLP 2022 (Main Conference); Codes: https://github.com/rajdeep345/ECTSum

ACM Class: I.2.7

arXiv:2206.02050 [pdf, other]

Learning Speaker-specific Lip-to-Speech Generation

Authors: Munender Varshney, Ravindra Yadav, Vinay P. Namboodiri, Rajesh M Hegde

Abstract: Understanding the lip movement and inferring the speech from it is notoriously difficult for the common person. The task of accurate lip-reading gets help from various cues of the speaker and its contextual or environmental setting. Every speaker has a different accent and speaking style, which can be inferred from their visual and speech features. This work aims to understand the correlation/mapping between speech and the sequence of lip movement of individual speakers in an unconstrained and large vocabulary. We model the frame sequence as a prior to the transformer in an auto-encoder setting and learned a joint embedding that exploits temporal properties of both audio and video. We learn temporal synchronization using deep metric learning, which guides the decoder to generate speech in sync with input lip movements. The predictive posterior thus gives us the generated speech in speaker speaking style. We have trained our model on the Grid and Lip2Wav Chemistry lecture dataset to evaluate single speaker natural speech generation tasks from lip movement in an unconstrained natural setting. Extensive evaluation using various qualitative and quantitative metrics with human evaluation also shows that our method outperforms the Lip2Wav Chemistry dataset(large vocabulary in an unconstrained setting) by a good margin across almost all evaluation metrics and marginally outperforms the state-of-the-art on GRID dataset.

Submitted 20 August, 2022; v1 submitted 4 June, 2022; originally announced June 2022.

Comments: Accepted at ICPR 2022

arXiv:2104.10891 [pdf]

Computer Vision-based Social Distancing Surveillance Solution with Optional Automated Camera Calibration for Large Scale Deployment

Authors: Sreetama Das, Anirban Nag, Dhruba Adhikary, Ramswaroop Jeevan Ram, Aravind BR, Sujit Kumar Ojha, Guruprasad M Hegde

Abstract: Social distancing has been suggested as one of the most effective measures to break the chain of viral transmission in the current COVID-19 pandemic. We herein describe a computer vision-based AI-assisted solution to aid compliance with social distancing norms. The solution consists of modules to detect and track people and to identify distance violations. It provides the flexibility to choose between a tool-based mode or an automated mode of camera calibration, making the latter suitable for large-scale deployments. In this paper, we discuss different metrics to assess the risk associated with social distancing violations and how we can differentiate between transient or persistent violations. Our proposed solution performs satisfactorily under different test scenarios, processes video feed at real-time speed as well as addresses data privacy regulations by blurring faces of detected people, making it ideal for deployments.

Submitted 22 April, 2021; originally announced April 2021.

Comments: 8 pages, 5 figures, 3 tables

arXiv:2011.10727 [pdf, other]

Stochastic Talking Face Generation Using Latent Distribution Matching

Authors: Ravindra Yadav, Ashish Sardana, Vinay P Namboodiri, Rajesh M Hegde

Abstract: The ability to envisage the visual of a talking face based just on hearing a voice is a unique human capability. There have been a number of works that have solved for this ability recently. We differ from these approaches by enabling a variety of talking face generations based on single audio input. Indeed, just having the ability to generate a single talking face would make a system almost robotic in nature. In contrast, our unsupervised stochastic audio-to-video generation model allows for diverse generations from a single audio input. Particularly, we present an unsupervised stochastic audio-to-video generation model that can capture multiple modes of the video distribution. We ensure that all the diverse generations are plausible. We do so through a principled multi-modal variational autoencoder framework. We demonstrate its efficacy on the challenging LRW and GRID datasets and demonstrate performance better than the baseline, while having the ability to generate multiple diverse lip synchronized videos.

Submitted 21 November, 2020; originally announced November 2020.

Comments: InterSpeech 2020

arXiv:2011.07340 [pdf, other]

Speech Prediction in Silent Videos using Variational Autoencoders

Authors: Ravindra Yadav, Ashish Sardana, Vinay P Namboodiri, Rajesh M Hegde

Abstract: Understanding the relationship between the auditory and visual signals is crucial for many different applications ranging from computer-generated imagery (CGI) and video editing automation to assisting people with hearing or visual impairments. However, this is challenging since the distribution of both audio and visual modality is inherently multimodal. Therefore, most of the existing methods ignore the multimodal aspect and assume that there only exists a deterministic one-to-one mapping between the two modalities. It can lead to low-quality predictions as the model collapses to optimizing the average behavior rather than learning the full data distributions. In this paper, we present a stochastic model for generating speech in a silent video. The proposed model combines recurrent neural networks and variational deep generative models to learn the auditory signal's conditional distribution given the visual signal. We demonstrate the performance of our model on the GRID dataset based on standard benchmarks.

Submitted 14 November, 2020; originally announced November 2020.

arXiv:2001.01555 [pdf, other]

A Generalized Framework for Autonomous Calibration of Wheeled Mobile Robots

Authors: Mohan Krishna Nutalapati, Lavish Arora, Anway Bose, Ketan Rajawat, Rajesh M Hegde

Abstract: Robotic calibration allows for the fusion of data from multiple sensors such as odometers, cameras, etc., by providing appropriate transformational relationships between the corresponding reference frames. For wheeled robots equipped with exteroceptive sensors, calibration entails learning the motion model of the sensor or the robot in terms of the odometric data, and must generally be performed prior to performing tasks such as simultaneous localization and mapping (SLAM). Within this context, the current trend is to carry out simultaneous calibration of odometry and sensor without the use of any additional hardware. Building upon the existing simultaneous calibration algorithms, we put forth a generalized calibration framework that can not only handle robots operating in 2D with arbitrary or unknown motion models but also handle outliers in an automated manner. We first propose an algorithm based on the alternating minimization framework applicable to two-wheel differential drive. Subsequently, for arbitrary but known drive configurations we put forth an iteratively re-weighted least squares methodology leveraging an intelligent weighing scheme. Different from the existing works, these proposed algorithms require no manual intervention and seamlessly handle outliers that arise due to both systematic and non-systematic errors. Finally, we put forward a novel Gaussian Process-based non-parametric approach for calibrating wheeled robots with arbitrary or unknown drive configurations.
Detailed experiments are performed to demonstrate the accuracy, usefulness, and flexibility of the proposed algorithms.

Submitted 6 January, 2020; originally announced January 2020.

Comments: This manuscript has been submitted to 'Elsevier Journal of Robotics and Autonomous Systems' and is under review for possible publication. Based on IROS 2019 conference submission [arXiv:1910.11917]

arXiv:1910.13676 [pdf, other]

Multi Modal Semantic Segmentation using Synthetic Data

Authors: Kartik Srivastava, Akash Kumar Singh, Guruprasad M. Hegde

Abstract: Semantic understanding of scenes in three-dimensional space (3D) is a quintessential part of robotics oriented applications such as autonomous driving as it provides geometric cues such as size, orientation and true distance of separation to objects which are crucial for taking mission critical decisions. As a first step, in this work we investigate the possibility of semantically classifying different parts of a given scene in 3D by learning the underlying geometric context in addition to the texture cues BUT in the absence of labelled real-world datasets. To this end we generate a large number of synthetic scenes, their pixel-wise labels and corresponding 3D representations using CARLA software framework. We then build a deep neural network that learns underlying category specific 3D representation and texture cues from color information of the rendered synthetic scenes. Further on we apply the learned model on different real world datasets to evaluate its performance. Our preliminary investigation of results show that the neural network is able to learn the geometric context from synthetic scenes and effectively apply this knowledge to classify each point of a 3D representation of a scene in real-world.

Submitted 30 October, 2019; originally announced October 2019.

Comments: Accepted in 3rd Edition of Deep Learning for Automated Driving (DLAD) workshop, IEEE International Conference on Intelligent Transportation Systems (ITSC'19) [see https://sites.google.com/view/dlad-bp-itsc2019/schedule?authuser=0#h.p_gI84BCoB0_bJ]

ACM Class: I.4

arXiv:1910.11917 [pdf, other]

Model Free Calibration of Wheeled Robots Using Gaussian Process

Authors: Mohan Krishna Nutalapati, Lavish Arora, Anway Bose, Ketan Rajawat, Rajesh M Hegde

Abstract: Robotic calibration allows for the fusion of data from multiple sensors such as odometers, cameras, etc., by providing appropriate relationships between the corresponding reference frames. For wheeled robots equipped with camera/lidar along with wheel encoders, calibration entails learning the motion model of the sensor or the robot in terms of the data from the encoders and generally carried out before performing tasks such as simultaneous localization and mapping (SLAM). This work puts forward a novel Gaussian Process-based non-parametric approach for calibrating wheeled robots with arbitrary or unknown drive configurations. The procedure is more general as it learns the entire sensor/robot motion model in terms of odometry measurements. Different from existing non-parametric approaches, our method relies on measurements from the onboard sensors and hence does not require the ground truth information from external motion capture systems. Alternatively, we propose a computationally efficient approach that relies on the linear approximation of the sensor motion model. Finally, we perform experiments to calibrate robots with un-modelled effects to demonstrate the accuracy, usefulness, and flexibility of the proposed approach.

Submitted 25 October, 2019; originally announced October 2019.

Comments: To be published in International Conference on Intelligent Robots and Systems (IROS), 2019

arXiv:1901.04987 [pdf, other]

Tango: A Deep Neural Network Benchmark Suite for Various Accelerators

Authors: Aajna Karki, Chethan Palangotu Keshava, Spoorthi Mysore Shivakumar, Joshua Skow, Goutam Madhukeshwar Hegde, Hyeran Jeon

Abstract: Deep neural networks (DNNs) have been proving the effectiveness in various computing fields. To provide more efficient computing platforms for DNN applications, it is essential to have evaluation environments that include assorted benchmark workloads. Though a few DNN benchmark suites have been recently released, most of them require to install proprietary DNN libraries or resource-intensive DNN frameworks, which are hard to run on resource-limited mobile platforms or architecture simulators. To provide a more scalable evaluation environment, we propose a new DNN benchmark suite that can run on any platform that supports CUDA and OpenCL. The proposed benchmark suite includes the most widely used five convolution neural networks and two recurrent neural networks. We provide in-depth architectural statistics of these networks while running them on an architecture simulator, a server- and a mobile-GPU, and a mobile FPGA.

Submitted 14 January, 2019; originally announced January 2019.

arXiv:1811.07847 [pdf, other]

Toward SATVAM: An IoT Network for Air Quality Monitoring

Authors: Rashmi Ballamajalu, Srijith Nair, Shayal Chhabra, Sumit K Monga, Anand SVR, Malati Hegde, Yogesh Simmhan, Anamika Sharma, Chandan M Choudhary, Ronak Sutaria, Rajesh Zele, Sachchida N. Tripathi

Abstract: Air pollution is ranked as the second most serious risk for public health in India after malnutrition. The lack of spatially and temporally distributed air quality information prevents a scientific study on its impact on human health and on the national economy. In this paper, we present our initial efforts toward SATVAM, Streaming Analytics over Temporal Variables for Air quality Monitoring, that aims to address this gap. We introduce the multi-disciplinary, multi-institutional project and some of the key IoT technologies used. These cut across hardware integration of gas sensors with a wireless mote packaging, design of the wireless sensor network using 6LoWPAN and RPL, and integration with a cloud backend for data acquisition and analysis. The outcome of our initial deployment will inform an improved design that will enable affordable and manageable monitoring at the city scale. This should lead to data-driven policies for urban air quality management.

Submitted 19 November, 2018; originally announced November 2018.

arXiv:1803.02500 [pdf, other]

doi 10.1002/spe.2580

Towards a Data-driven IoT Software Architecture for Smart City Utilities

Authors: Yogesh Simmhan, Pushkara Ravindra, Shilpa Chaturvedi, Malati Hegde, Rashmi Ballamajalu

Abstract: The Internet of Things (IoT) is emerging as the next big wave of digital presence for billions of devices on the Internet. Smart Cities are practical manifestation of IoT, with the goal of efficient, reliable and safe delivery of city utilities like water, power and transport to residents, through their intelligent management. A data-driven IoT Software Platform is essential for realizing manageable and sustainable Smart Utilities, and for novel applications to be developed upon them. Here, we propose such a service-oriented software architecture to address two key operational activities in a Smart Utility -- the IoT fabric for resource management, and the data and application platform for decision making. Our design uses open web standards and evolving network protocols, Cloud and edge resources, and streaming Big Data platforms. We motivate our design requirements using the smart water management domain; some of these requirements are unique to developing nations. We also validate the architecture within a campus-scale IoT testbed at the Indian Institute of Science (IISc), Bangalore, and present our experiences. Our architecture is scalable to a township or city, while also generalizable to other Smart Utility domains. Our experiences serves as a template for other similar efforts, particularly in emerging markets, and highlights the gaps and opportunities for a data-driven IoT Software architecture for smart cities.

Submitted 6 March, 2018; originally announced March 2018.

Comments: Pre-print of article to appear in Software: Practice and Experience, Wiley, 2018

Journal ref: Software: Practice and Experience, Volume 48, Issue 7, July 2018, Pages 1390-1416

arXiv:1711.01872 [pdf, other]

Minimum-Phase HRTF Modeling of Pinna Spectral Notches using Group Delay Decomposition

Authors: Sandeep Reddy C, Rajesh M Hegde

Abstract: Accurate reconstruction of HRTFs is important in the development of high quality binaural sound synthesis systems. Conventionally, minimum phase HRTF model development for reconstruction of HRTFs has been limited to minimum phase-pure delay models which ignore the all pass component of the HRTF. In this paper, a novel method for minimum phase HRTF modelling of Pinna Spectral Notches (PSNs) using group delay decomposition is proposed. The proposed model captures the PSNs contributed by both the minimum phase and all pass component of HRTF thus facilitating an accurate reconstruction of HRTFs. The purely minimum phase HRTF components and their corresponding spatial angles are first identified using Fourier Bessel Series method that ensures a continuous evolution of the PSNs. The minimum phase-pure delay model is used to reconstruct HRTF for these spatial angles. Subsequently, the spatial angles which require both the minimum phase and all pass components are modelled using an all-pass filter cascaded with minimum-phase pure-delay model. Performance of the proposed model is evaluated by conducting experiments on PSN extraction, cross coherence analysis, and binaural synthesis. Both objective and subjective evaluation results are used to indicate the significance of the proposed model in binaural sound synthesis.

Submitted 3 April, 2018; v1 submitted 6 November, 2017; originally announced November 2017.

Comments: 11 pages; This paper is a preprint of a paper submitted to IET Signal Processing Journal. If accepted, the copy of record will be available at the IET Digital Library

arXiv:1701.02080 [pdf]

A Review of Localization and Tracking Algorithms in Wireless Sensor Networks

Authors: Sudhir Kumar, Rajesh M. Hegde

Abstract: In this paper, a comprehensive survey of the pioneering as well as the state-of-the-art localization and tracking methods in wireless sensor networks is presented. Localization is mostly applicable to static sensor nodes, whereas tracking applies to mobile sensor nodes. The localization algorithms are broadly classified as range-based and range-free methods. The estimated range (distance) between an anchor and an unknown node is highly erroneous in an indoor scenario. This limitation can be handled to a large extent by employing a large number of existing access points (APs) in the range-free localization method. Recent works emphasize the use of multi-sensor data like magnetic, inertial, compass, gyroscope, ultrasound, infrared, visual and/or odometer to improve the localization accuracy further. Additionally, the tracking method predicts the future location based on the past location history. A smooth trajectory is noted even if some of the received measurements are erroneous. Real experimental set-ups such as National Instruments (NI) wireless sensor nodes, Crossbow motes and hand-held devices for carrying out the localization and tracking are also highlighted herein.

Submitted 9 January, 2017; originally announced January 2017.

arXiv:1610.05948 [pdf, ps, other]

A Bayesian Approach to Estimation of Speaker Normalization Parameters

Authors: Dhananjay Ram, Debasis Kundu, Rajesh M. Hegde

Abstract: In this work, a Bayesian approach to speaker normalization is proposed to compensate for the degradation in performance of a speaker independent speech recognition system. The speaker normalization method proposed herein uses the technique of vocal tract length normalization (VTLN). The VTLN parameters are estimated using a novel Bayesian approach which utilizes the Gibbs sampler, a special type of Markov Chain Monte Carlo method. Additionally, the hyperparameters are estimated using a maximum likelihood approach. This model is used assuming that the human vocal tract can be modeled as a tube of uniform cross section. It captures the variation in length of the vocal tract of different speakers more effectively than the linear model used in the literature. The work has also investigated different methods like minimization of Mean Square Error (MSE) and Mean Absolute Error (MAE) for the estimation of VTLN parameters. Both single pass and two pass approaches are then used to build a VTLN based speech recognizer. Experimental results on recognition of vowels and Hindi phrases from a medium vocabulary indicate that the Bayesian method improves the performance by a considerable margin.

Submitted 19 October, 2016; originally announced October 2016.

Comments: 23 Pages, 9 Figures

arXiv:1609.04197 [pdf, other]

ADWISERv2: A Plug-and-play Controller for Managing TCP Transfers in IEEE 802.11 Infrastructure WLANs with Multiple Access Points

Authors: Albert Sunny, Sumankumar Panchal, Nikhil Vidhani, Subhashini Krishnasamy, S. V. R. Anand, Malati Hegde, Joy Kuri, Anurag Kumar

Abstract: In this paper, we present a generic plug-and-play controller that ensures fair and efficient operation of IEEE 802.11 infrastructure wireless local area networks with multiple co-channel access points, without any change to hardware/firmware of the network devices. Our controller addresses performance issues of TCP transfers in multi-AP WLANs, by overlaying a coarse time-slicing scheduler on top of a cascaded fair queuing scheduler. The time slices and queue weights, used in our controller, are obtained from the solution of a constrained utility optimization formulation. A study of the impact of coarse time-slicing on TCP is also presented in this paper. We present an improved algorithm for adaptation of the service rate of the fair queuing scheduler and provide experimental results to illustrate its efficacy. We also present the changes that need to be incorporated into the proposed approach, to handle short-lived and interactive TCP flows. Finally, we report the results of experiments performed on a real testbed, demonstrating the efficacy of our controller.

Submitted 14 September, 2016; originally announced September 2016.

arXiv:1508.02834 [pdf, ps, other]

Second Order Cone Programming for Sensor Node Localization in Mixed LOS/NLOS Conditions

Authors: Sudhir Kumar, Rishabh Dixit, Rajesh M. Hegde

Abstract: In this paper, a novel method for sensor node localization under mixed line-of-sight/non-line-of-sight (LOS/NLOS) conditions based on second order cone programming (SOCP) is presented. SOCP methods have, hitherto, not been utilized in node localization under mixed LOS/NLOS conditions. Unlike the semidefinite programming (SDP) formulation, SOCP is computationally efficient for resource constrained ad-hoc sensor networks. The proposed method can work seamlessly in mixed LOS/NLOS conditions. The robustness of the method is due to the fair utilization of all measurements obtained under LOS and NLOS conditions. The computational complexity of this method is quadratic in the number of nearest neighbours of the unknown node. Extensive simulations and real field deployments are used to evaluate the performance of the proposed method. The experimental results of the proposed method are reasonably better than those of similar methods in the literature.

Submitted 12 August, 2015; originally announced August 2015.

arXiv:1411.6741 [pdf, other]

A Complex Matrix Factorization approach to Joint Modeling of Magnitude and Phase for Source Separation

Authors: Chaitanya Ahuja, Karan Nathwani, Rajesh M. Hegde

Abstract: Conventional NMF methods for source separation factorize the matrix of spectral magnitudes. Spectral phase is not included in the decomposition process of these methods. However, the phase of the speech mixture is generally used in reconstructing the target speech signal. This results in undesired traces of interfering sources in the target signal. In this paper the spectral phase is incorporated in the decomposition process itself. Additionally, the complex matrix factorization problem is reduced to an NMF problem using simple transformations. This results in effective separation of speech mixtures since both magnitude and phase are utilized jointly in the separation process. Improvements in source separation results are demonstrated using objective quality evaluations on the GRID corpus.

Submitted 25 November, 2014; originally announced November 2014.

Comments: 5 pages, 3 figures

arXiv:1308.3874 [pdf, other]

Alert-BDI: BDI Model with Adaptive Alertness through Situational Awareness

Authors: Manu S Hegde, Sanjay Singh

Abstract: In this paper, we address the problems faced by a group of agents that possess situational awareness, but lack a security mechanism, by the introduction of an adaptive risk management system. The Belief-Desire-Intention (BDI) architecture lacks a framework that would facilitate an adaptive risk management system that uses the situational awareness of the agents. We extend the BDI architecture with the concept of adaptive alertness. Agents can modify their level of alertness by monitoring the risks faced by them and by their peers. Alert-BDI enables the agents to detect and assess the risks faced by them in an efficient manner, thereby increasing operational efficiency and resistance against attacks.

Submitted 18 August, 2013; originally announced August 2013.

Comments: 14 pages, 3 figures. Submitted to ICACCI 2013, Mysore, India

arXiv:1107.1945

Region-based Approach for Determining the Optimal Path Using PSO

Authors: Dr. T. R. Gopalakrishnan Nair, Ms. Kavitha Sooda, Ms. Deepthi D Shetty, Ms. Prapthi Hegde, Ms. Anusha Hegde

Abstract: Many research works have been carried out recently to find the optimal path in network routing. Among these, evolutionary algorithms are an area where work is carried out extensively. In this paper, we use PSO to find the optimal path, and the concept of a region-based network is introduced along with the use of indirect encoding. A comparative study of genetic algorithm (GA) and particle swarm optimization (PSO) is carried out, and it was found that PSO performed better than GA.

Submitted 2 June, 2012; v1 submitted 11 July, 2011; originally announced July 2011.

Comments: This paper has been withdrawn as the authors were unable to present the paper for the conference

arXiv:1103.0133 [pdf, ps, other]

Neighbor Oblivious and Finite-State Algorithms for Circumventing Local Minima in Geographic Forwarding

Authors: Santosh Ramachandran, Chandramani Singh, S. V. R. Anand, Malati Hegde, Anurag Kumar, Rajesh Sundaresan

Abstract: We propose distributed link reversal algorithms to circumvent communication voids in geographic routing. We also solve the attendant problem of integer overflow in these algorithms. These are achieved in two steps. First, we derive partial and full link reversal algorithms that do not require one-hop neighbor information, and convert a destination-disoriented directed acyclic graph (DAG) to a destination-oriented DAG. We embed these algorithms in the framework of Gafni and Bertsekas ("Distributed algorithms for generating loop-free routes in networks with frequently changing topology", 1981) in order to establish their termination properties. We also analyze certain key properties exhibited by our neighbor oblivious link reversal algorithms, e.g., for any two neighbors, their t-states are always consecutive integers, and for any node, its t-state size is upper bounded by log(N). In the second step, we resolve the integer overflow problem by analytically deriving one-bit full link reversal and two-bit partial link reversal versions of our neighbor oblivious link reversal algorithms.

Submitted 4 May, 2012; v1 submitted 1 March, 2011; originally announced March 2011.

Comments: 9 pages; "Neighbor oblivious link reversal over duty-cycled WSNs"

Journal ref: National Conference on Communications (NCC) 2010, Chennai, India, Jan. 29-31, 2010, pages 1 - 5

arXiv:1011.3482 [pdf, ps, other]

Distributed Construction of the Critical Geometric Graph in Dense Wireless Sensor Networks

Authors: Srivathsa Acharya, Anurag Kumar, Vijay Dewangan, Navneet Sankara, Malati Hegde, S. V. R. Anand

Abstract: Wireless sensor networks are often modeled in terms of a dense deployment of smart sensor nodes in a two-dimensional region. Given a node deployment, the critical geometric graph (CGG) over these locations (i.e., the connected geometric graph (GG) with the smallest radius) is a useful structure since it provides the most accurate proportionality between hop-count and Euclidean distance. Hence, it can be used for GPS-free node localisation as well as minimum distance packet forwarding. It is also known to be asymptotically optimal for network transport capacity and power efficiency. In this context, we propose DISCRIT, a distributed and asynchronous algorithm for obtaining an approximation of the CGG on the node locations. The algorithm does not require the knowledge of node locations or internode distances, nor does it require pair-wise RSSI (Received Signal Strength Indication) measurements to be made. Instead, the algorithm makes use of successful Hello receipt counts (obtained during a Hello-protocol-based neighbour discovery process) as edge weights, along with a simple distributed min-max computation algorithm. In this paper, we first provide the theory for justifying the use of the above edge weights. Then we provide extensive simulation results to demonstrate the efficacy of DISCRIT in obtaining an approximation of the CGG. Finally, we show how the CGG obtained from DISCRIT performs when used in certain network self-organisation algorithms.

Submitted 15 November, 2010; originally announced November 2010.

Comments: 20 pages, 11 figures

Showing 1–27 of 27 results for author: Hegde, M