-
SALSA: Speedy ASR-LLM Synchronous Aggregation
Authors:
Ashish Mittal,
Darshan Prabhu,
Sunita Sarawagi,
Preethi Jyothi
Abstract:
Harnessing pre-trained LLMs to improve ASR systems, particularly for low-resource languages, is now an emerging area of research. Existing methods range from using LLMs for ASR error correction to tightly coupled systems that replace the ASR decoder with the LLM. These approaches either increase decoding time or require expensive training of the cross-attention layers. We propose SALSA, which coup…
▽ More
Harnessing pre-trained LLMs to improve ASR systems, particularly for low-resource languages, is now an emerging area of research. Existing methods range from using LLMs for ASR error correction to tightly coupled systems that replace the ASR decoder with the LLM. These approaches either increase decoding time or require expensive training of the cross-attention layers. We propose SALSA, which couples the decoder layers of the ASR to the LLM decoder, while synchronously advancing both decoders. Such coupling is performed with a simple projection of the last decoder state, and is thus significantly more training efficient than earlier approaches. A challenge of our proposed coupling is handling the mismatch between the tokenizers of the LLM and ASR systems. We handle this mismatch using cascading tokenization with respect to the LLM and ASR vocabularies. We evaluate SALSA on 8 low-resource languages in the FLEURS benchmark, yielding substantial WER reductions of up to 38%.
△ Less
Submitted 29 August, 2024;
originally announced August 2024.
-
Improving Self-supervised Pre-training using Accent-Specific Codebooks
Authors:
Darshan Prabhu,
Abhishek Gupta,
Omkar Nitsure,
Preethi Jyothi,
Sriram Ganapathy
Abstract:
Speech accents present a serious challenge to the performance of state-of-the-art end-to-end Automatic Speech Recognition (ASR) systems. Even with self-supervised learning and pre-training of ASR models, accent invariance is seldom achieved. In this work, we propose an accent-aware adaptation technique for self-supervised learning that introduces a trainable set of accent-specific codebooks to the…
▽ More
Speech accents present a serious challenge to the performance of state-of-the-art end-to-end Automatic Speech Recognition (ASR) systems. Even with self-supervised learning and pre-training of ASR models, accent invariance is seldom achieved. In this work, we propose an accent-aware adaptation technique for self-supervised learning that introduces a trainable set of accent-specific codebooks to the self-supervised architecture. These learnable codebooks enable the model to capture accent specific information during pre-training, that is further refined during ASR finetuning. On the Mozilla Common Voice dataset, our proposed approach outperforms all other accent-adaptation approaches on both seen and unseen English accents, with up to 9% relative reduction in word error rate (WER).
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Multi-Convformer: Extending Conformer with Multiple Convolution Kernels
Authors:
Darshan Prabhu,
Yifan Peng,
Preethi Jyothi,
Shinji Watanabe
Abstract:
Convolutions have become essential in state-of-the-art end-to-end Automatic Speech Recognition~(ASR) systems due to their efficient modelling of local context. Notably, its use in Conformers has led to superior performance compared to vanilla Transformer-based ASR systems. While components other than the convolution module in the Conformer have been reexamined, altering the convolution module itse…
▽ More
Convolutions have become essential in state-of-the-art end-to-end Automatic Speech Recognition~(ASR) systems due to their efficient modelling of local context. Notably, its use in Conformers has led to superior performance compared to vanilla Transformer-based ASR systems. While components other than the convolution module in the Conformer have been reexamined, altering the convolution module itself has been far less explored. Towards this, we introduce Multi-Convformer that uses multiple convolution kernels within the convolution module of the Conformer in conjunction with gating. This helps in improved modeling of local dependencies at varying granularities. Our model rivals existing Conformer variants such as CgMLP and E-Branchformer in performance, while being more parameter efficient. We empirically compare our approach with Conformer and its variants across four different datasets and three different modelling paradigms and show up to 8% relative word error rate~(WER) improvements.
△ Less
Submitted 23 July, 2024; v1 submitted 4 July, 2024;
originally announced July 2024.
-
DEXTER: A Benchmark for open-domain Complex Question Answering using LLMs
Authors:
Venktesh V. Deepali Prabhu,
Avishek Anand
Abstract:
Open-domain complex Question Answering (QA) is a difficult task with challenges in evidence retrieval and reasoning. The complexity of such questions could stem from questions being compositional, hybrid evidence, or ambiguity in questions. While retrieval performance for classical QA tasks is well explored, their capabilities for heterogeneous complex retrieval tasks, especially in an open-domain…
▽ More
Open-domain complex Question Answering (QA) is a difficult task with challenges in evidence retrieval and reasoning. The complexity of such questions could stem from questions being compositional, hybrid evidence, or ambiguity in questions. While retrieval performance for classical QA tasks is well explored, their capabilities for heterogeneous complex retrieval tasks, especially in an open-domain setting, and the impact on downstream QA performance, are relatively unexplored. To address this, in this work, we propose a benchmark composing diverse complex QA tasks and provide a toolkit to evaluate state-of-the-art pre-trained dense and sparse retrieval models in an open-domain setting. We observe that late interaction models and surprisingly lexical models like BM25 perform well compared to other pre-trained dense retrieval models. In addition, since context-based reasoning is critical for solving complex QA tasks, we also evaluate the reasoning capabilities of LLMs and the impact of retrieval performance on their reasoning capabilities. Through experiments, we observe that much progress is to be made in retrieval for complex QA to improve downstream QA performance. Our software and related data can be accessed at https://github.com/VenkteshV/DEXTER
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Efficient infusion of self-supervised representations in Automatic Speech Recognition
Authors:
Darshan Prabhu,
Sai Ganesh Mirishkar,
Pankaj Wasnik
Abstract:
Self-supervised learned (SSL) models such as Wav2vec and HuBERT yield state-of-the-art results on speech-related tasks. Given the effectiveness of such models, it is advantageous to use them in conventional ASR systems. While some approaches suggest incorporating these models as a trainable encoder or a learnable frontend, training such systems is extremely slow and requires a lot of computation c…
▽ More
Self-supervised learned (SSL) models such as Wav2vec and HuBERT yield state-of-the-art results on speech-related tasks. Given the effectiveness of such models, it is advantageous to use them in conventional ASR systems. While some approaches suggest incorporating these models as a trainable encoder or a learnable frontend, training such systems is extremely slow and requires a lot of computation cycles. In this work, we propose two simple approaches that use (1) framewise addition and (2) cross-attention mechanisms to efficiently incorporate the representations from the SSL model(s) into the ASR architecture, resulting in models that are comparable in size with standard encoder-decoder conformer systems while also avoiding the usage of SSL models during training. Our approach results in faster training and yields significant performance gains on the Librispeech and Tedlium datasets compared to baselines. We further provide detailed analysis and ablation studies that demonstrate the effectiveness of our approach.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
Accented Speech Recognition With Accent-specific Codebooks
Authors:
Darshan Prabhu,
Preethi Jyothi,
Sriram Ganapathy,
Vinit Unni
Abstract:
Speech accents pose a significant challenge to state-of-the-art automatic speech recognition (ASR) systems. Degradation in performance across underrepresented accents is a severe deterrent to the inclusive adoption of ASR. In this work, we propose a novel accent adaptation approach for end-to-end ASR systems using cross-attention with a trainable set of codebooks. These learnable codebooks capture…
▽ More
Speech accents pose a significant challenge to state-of-the-art automatic speech recognition (ASR) systems. Degradation in performance across underrepresented accents is a severe deterrent to the inclusive adoption of ASR. In this work, we propose a novel accent adaptation approach for end-to-end ASR systems using cross-attention with a trainable set of codebooks. These learnable codebooks capture accent-specific information and are integrated within the ASR encoder layers. The model is trained on accented English speech, while the test data also contained accents which were not seen during training. On the Mozilla Common Voice multi-accented dataset, we show that our proposed approach yields significant performance gains not only on the seen English accents (up to $37\%$ relative improvement in word error rate) but also on the unseen accents (up to $5\%$ relative improvement in WER). Further, we illustrate benefits for a zero-shot transfer setup on the L2Artic dataset. We also compare the performance with other approaches based on accent adversarial training.
△ Less
Submitted 26 October, 2023; v1 submitted 24 October, 2023;
originally announced October 2023.
-
Globular Cluster UVIT legacy Survey (GlobUleS) III. Omega Centauri in Far-Ultraviolet
Authors:
Deepthi S. Prabhu,
Annapurni Subramaniam,
Snehalata Sahu,
Chul Chung,
Nathan W. C. Leigh,
Emanuele Dalessandro,
Sourav Chatterjee,
N. Kameswara Rao,
Michael Shara,
Patrick Cote,
Samyaday Choudhury,
Gajendra Pandey,
Aldo A. R. Valcarce,
Gaurav Singh,
Joesph E. Postma,
Sharmila Rani,
Avrajit Bandyopadhyay,
Aaron M. Geller,
John Hutchings,
Thomas Puzia,
Mirko Simunovic,
Young-Jong Sohn,
Sivarani Thirupathi,
Ramakant Singh Yadav
Abstract:
We present the first comprehensive study of the most massive globular cluster Omega Centauri in the far-ultraviolet (FUV) extending from the center to ~ 28% of the tidal radius using the Ultraviolet Imaging Telescope aboard AstroSat. A comparison of the FUV-optical color-magnitude diagrams with available canonical models reveals that the horizontal branch (HB) stars bluer than the knee (hHBs) and…
▽ More
We present the first comprehensive study of the most massive globular cluster Omega Centauri in the far-ultraviolet (FUV) extending from the center to ~ 28% of the tidal radius using the Ultraviolet Imaging Telescope aboard AstroSat. A comparison of the FUV-optical color-magnitude diagrams with available canonical models reveals that the horizontal branch (HB) stars bluer than the knee (hHBs) and the white dwarfs (WDs) are fainter in the FUV by ~ 0.5 mag than model predictions. They are also fainter than their counterparts in M13, another massive cluster. We simulated HB with at least five subpopulations, including three He-rich populations with a substantial He enrichment of Y up to 0.43 dex, to reproduce the observed FUV distribution. We find the He-rich younger subpopulations to be radially more segregated than the He-normal older ones, suggesting an in-situ enrichment from older generations. The Omega Cen hHBs span the same effective temperature range as their M13 counterparts, but some have smaller radii and lower luminosities. This may suggest that a fraction of Omega Cen hHBs are less massive than those of M13, similar to the result derived from earlier spectroscopic studies of outer extreme HB stars. The WDs in Omega Cen and M13 have similar luminosity-radius-effective temperature parameters, and 0.44 - 0.46 M$_\odot$ He-core WD model tracks evolving from progenitors with Y = 0.4 dex are found to fit the majority of these. This study provides constraints on the formation models of Omega Cen based on the estimated range in age, [Fe/H] and Y (in particular), for the HB stars.
△ Less
Submitted 11 October, 2022;
originally announced October 2022.
-
Globular Clusters UVIT Legacy Survey (GlobULeS) I. FUV-optical Color-Magnitude Diagrams for Eight Globular Clusters
Authors:
Snehalata Sahu,
Annapurni Subramaniam,
Gaurav Singh,
Ramakant Yadav,
Aldo R. Valcarce,
Samyaday Choudhury,
Sharmila Rani,
Deepthi S. Prabhu,
Chul Chung,
Patrick Côté,
Nathan Leigh,
Aaron M. Geller,
Sourav Chatterjee,
N. Kameswara Rao,
Avrajit Bandyopadhyay,
Michael Shara,
Emanuele Dalessandro,
Gajendra Pandey,
Joesph E. Postma,
John Hutchings,
Mirko Simunovic,
Peter B. Stetson,
Sivarani Thirupathi,
Thomas Puzia,
Young-Jong Sohn
Abstract:
We present the first results of eight Globular Clusters (GCs) from the AstroSat/UVIT Legacy Survey program GlobULeS based on the observations carried out in two FUV filters (F148W and F169M). The FUV-optical and FUV-FUV color-magnitude diagrams (CMDs) of GCs with the proper motion membership were constructed by combining the UVIT data with HST UV Globular Cluster Survey (HUGS) data for inner regio…
▽ More
We present the first results of eight Globular Clusters (GCs) from the AstroSat/UVIT Legacy Survey program GlobULeS based on the observations carried out in two FUV filters (F148W and F169M). The FUV-optical and FUV-FUV color-magnitude diagrams (CMDs) of GCs with the proper motion membership were constructed by combining the UVIT data with HST UV Globular Cluster Survey (HUGS) data for inner regions and Gaia Early Data Release (EDR3) for regions outside the HST's field. We detect sources as faint as F148W $\sim$ 23.5~mag which are classified based on their locations in CMDs by overlaying stellar evolutionary models. The CMDs of 8 GCs are combined with the previous UVIT studies of 3 GCs to create stacked FUV-optical CMDs to highlight the features/peculiarities found in the different evolutionary sequences. The FUV (F148W) detected stellar populations of 11 GCs comprises 2,816 Horizontal Branch (HB) stars (190 Extreme HB candidates), 46 post-HB (pHB), 221 Blue Straggler Stars (BSS), and 107 White Dwarf (WD) candidates. We note that the blue HB color extension obtained from F148W$-$G color and the number of FUV detected EHB candidates are strongly correlated with the maximum internal Helium (He) variation within each GC, suggesting that the FUV-optical plane is the most sensitive to He abundance variations in the HB. We discuss the potential science cases that will be addressed using these catalogues including HB morphologies, BSSs, pHB, and, WD stars.
△ Less
Submitted 27 April, 2022;
originally announced April 2022.
-
Prospects of measuring a metallicity trend and spread in globular clusters from low-resolution spectroscopy
Authors:
Martina Baratella,
Deepthi S. Prabhu,
Luiz A. Silva-Lima,
Philippe Prugniel
Abstract:
The metallicity spread, or the metallicity trend along the evolutionary sequence of a globular cluster, is a rich source of information to help understand the cluster physics (e.g. multiple populations) and stellar physics (e.g. atomic diffusion). Low-resolution integral-field-unit spectroscopy in the optical with the MUSE is an attractive prospect if it can provide these diagnostics because it al…
▽ More
The metallicity spread, or the metallicity trend along the evolutionary sequence of a globular cluster, is a rich source of information to help understand the cluster physics (e.g. multiple populations) and stellar physics (e.g. atomic diffusion). Low-resolution integral-field-unit spectroscopy in the optical with the MUSE is an attractive prospect if it can provide these diagnostics because it allows us to extract spectra of a large fraction of the cluster stars. We investigate the possibilities of full-spectrum fitting to derive stellar parameters and chemical abundances at low spectral resolution (R~2000). We reanalysed 1584 MUSE spectra of 1061 stars above the turn-off of NGC 6397 using FERRE and employing two different synthetic libraries. We derive the equivalent iron abundance \fehe for fixed values of \afe. We find that (i) the interpolation schema and grid mesh are not critical for the precision, metallicity spread, and trend; (ii) with the two grids, \fehe increases by ~0.2 dex along the sub-giant branch, starting from the turn-off of the main sequence; (iii) restricting the wavelength range to the optical decreases the precision significantly; and (iv) the precision obtained with the synthetic libraries is lower than the precision obtained previously with empirical libraries. Full-spectrum fitting provides reproducible results that are robust to the choice of the reference grid of synthetic spectra and to the details of the analysis. The \fehe increase along the sub-giant branch is in stark contrast with the nearly constant iron abundance previously found with empirical libraries. The precision of the measurements (0.05 dex on \fehe) is currently not sufficient to assess the intrinsic chemical abundance spreads, but this may change with deeper observations. Improvements of the synthetic spectra are still needed to deliver the full possibilities of full-spectrum fitting.
△ Less
Submitted 23 March, 2022;
originally announced March 2022.
-
On-Device Spatial Attention based Sequence Learning Approach for Scene Text Script Identification
Authors:
Rutika Moharir,
Arun D Prabhu,
Sukumar Moharana,
Gopi Ramena,
Rachit S Munjal
Abstract:
Automatic identification of script is an essential component of a multilingual OCR engine. In this paper, we present an efficient, lightweight, real-time and on-device spatial attention based CNN-LSTM network for scene text script identification, feasible for deployment on resource constrained mobile devices. Our network consists of a CNN, equipped with a spatial attention module which helps reduc…
▽ More
Automatic identification of script is an essential component of a multilingual OCR engine. In this paper, we present an efficient, lightweight, real-time and on-device spatial attention based CNN-LSTM network for scene text script identification, feasible for deployment on resource constrained mobile devices. Our network consists of a CNN, equipped with a spatial attention module which helps reduce the spatial distortions present in natural images. This allows the feature extractor to generate rich image representations while ignoring the deformities and thereby, enhancing the performance of this fine grained classification task. The network also employs residue convolutional blocks to build a deep network to focus on the discriminative features of a script. The CNN learns the text feature representation by identifying each character as belonging to a particular script and the long term spatial dependencies within the text are captured using the sequence learning capabilities of the LSTM layers. Combining the spatial attention mechanism with the residue convolutional blocks, we are able to enhance the performance of the baseline CNN to build an end-to-end trainable network for script identification. The experimental results on several standard benchmarks demonstrate the effectiveness of our method. The network achieves competitive accuracy with state-of-the-art methods and is superior in terms of network size, with a total of just 1.1 million parameters and inference time of 2.7 milliseconds.
△ Less
Submitted 1 December, 2021;
originally announced December 2021.
-
Overcoming limited battery data challenges: A coupled neural network approach
Authors:
Aniruddh Herle,
Janamejaya Channegowda,
Dinakar Prabhu
Abstract:
The Electric Vehicle (EV) Industry has seen extraordinary growth in the last few years. This is primarily due to an ever increasing awareness of the detrimental environmental effects of fossil fuel powered vehicles and availability of inexpensive Lithium-ion batteries (LIBs). In order to safely deploy these LIBs in Electric Vehicles, certain battery states need to be constantly monitored to ensure…
▽ More
The Electric Vehicle (EV) Industry has seen extraordinary growth in the last few years. This is primarily due to an ever increasing awareness of the detrimental environmental effects of fossil fuel powered vehicles and availability of inexpensive Lithium-ion batteries (LIBs). In order to safely deploy these LIBs in Electric Vehicles, certain battery states need to be constantly monitored to ensure safe and healthy operation. The use of Machine Learning to estimate battery states such as State-of-Charge and State-of-Health have become an extremely active area of research. However, limited availability of open-source diverse datasets has stifled the growth of this field, and is a problem largely ignored in literature. In this work, we propose a novel method of time-series battery data augmentation using deep neural networks. We introduce and analyze the method of using two neural networks working together to alternatively produce synthetic charging and discharging battery profiles. One model produces battery charging profiles, and another produces battery discharging profiles. The proposed approach is evaluated using few public battery datasets to illustrate its effectiveness, and our results show the efficacy of this approach to solve the challenges of limited battery data. We also test this approach on dynamic Electric Vehicle drive cycles as well.
△ Less
Submitted 5 October, 2021;
originally announced November 2021.
-
STRIDE : Scene Text Recognition In-Device
Authors:
Rachit S Munjal,
Arun D Prabhu,
Nikhil Arora,
Sukumar Moharana,
Gopi Ramena
Abstract:
Optical Character Recognition (OCR) systems have been widely used in various applications for extracting semantic information from images. To give the user more control over their privacy, an on-device solution is needed. The current state-of-the-art models are too heavy and complex to be deployed on-device. We develop an efficient lightweight scene text recognition (STR) system, which has only 0.…
▽ More
Optical Character Recognition (OCR) systems have been widely used in various applications for extracting semantic information from images. To give the user more control over their privacy, an on-device solution is needed. The current state-of-the-art models are too heavy and complex to be deployed on-device. We develop an efficient lightweight scene text recognition (STR) system, which has only 0.88M parameters and performs real-time text recognition. Attention modules tend to boost the accuracy of STR networks but are generally slow and not optimized for device inference. So, we propose the use of convolution attention modules to the text recognition networks, which aims to provide channel and spatial attention information to the LSTM module by adding very minimal computational cost. It boosts our word accuracy on ICDAR 13 dataset by almost 2\%. We also introduce a novel orientation classifier module, to support the simultaneous recognition of both horizontal and vertical text. The proposed model surpasses on-device metrics of inference time and memory footprint and achieves comparable accuracy when compared to the leading commercial and other open-source OCR engines. We deploy the system on-device with an inference speed of 2.44 ms per word on the Exynos 990 chipset device and achieve an accuracy of 88.4\% on ICDAR-13 dataset.
△ Less
Submitted 17 May, 2021;
originally announced May 2021.
-
The Sariçiçek howardite fall in Turkey: Source crater of HED meteorites on Vesta and impact risk of Vestoids
Authors:
Ozan Unsalan,
Peter Jenniskens,
Qing-Zhu Yin,
Ersin Kaygisiz,
Jim Albers,
David L. Clark,
Mikael Granvik,
Iskender Demirkol,
Ibrahim Y. Erdogan,
Aydin S. Bengu,
Mehmet E. Özel,
Zahide Terzioglu,
Nayeob GI,
Peter Brown,
Esref Yalcinkaya,
Tuğba Temel,
Dinesh K. Prabhu,
Darrel K. Robertson,
Mark Boslough,
Daniel R. Ostrowski,
Jamie Kimberley,
Selman ER,
Douglas J. Rowland,
Kathryn L. Bryson,
Cisem Altunayar-Unsalan
, et al. (54 additional authors not shown)
Abstract:
The Sariçiçek howardite meteorite shower consisting of 343 documented stones occurred on 2 September 2015 in Turkey and is the first documented howardite fall. Cosmogenic isotopes show that Sariçiçek experienced a complex cosmic ray exposure history, exposed during ~12-14 Ma in a regolith near the surface of a parent asteroid, and that an ca.1 m sized meteoroid was launched by an impact 22 +/- 2 M…
▽ More
The Sariçiçek howardite meteorite shower consisting of 343 documented stones occurred on 2 September 2015 in Turkey and is the first documented howardite fall. Cosmogenic isotopes show that Sariçiçek experienced a complex cosmic ray exposure history, exposed during ~12-14 Ma in a regolith near the surface of a parent asteroid, and that an ca.1 m sized meteoroid was launched by an impact 22 +/- 2 Ma ago to Earth (as did one third of all HED meteorites). SIMS dating of zircon and baddeleyite yielded 4550.4 +/- 2.5 Ma and 4553 +/- 8.8 Ma crystallization ages for the basaltic magma clasts. The apatite U-Pb age of 4525 +/- 17 Ma, K-Ar age of ~3.9 Ga, and the U,Th-He ages of 1.8 +/- 0.7 and 2.6 +/- 0.3 Ga are interpreted to represent thermal metamorphic and impact-related resetting ages, respectively. Petrographic, geochemical and O-, Cr- and Ti- isotopic studies confirm that Sariçiçek belongs to the normal clan of HED meteorites. Petrographic observations and analysis of organic material indicate a small portion of carbonaceous chondrite material in the Sariçiçek regolith and organic contamination of the meteorite after a few days on soil. Video observations of the fall show an atmospheric entry at 17.3 +/- 0.8 kms-1 from NW, fragmentations at 37, 33, 31 and 27 km altitude, and provide a pre-atmospheric orbit that is the first dynamical link between the normal HED meteorite clan and the inner Main Belt. Spectral data indicate the similarity of Sariçiçek with the Vesta asteroid family spectra, a group of asteroids stretching to delivery resonances, which includes (4) Vesta. Dynamical modeling of meteoroid delivery to Earth shows that the disruption of a ca.1 km sized Vesta family asteroid or a ~10 km sized impact crater on Vesta is required to provide sufficient meteoroids <4 m in size to account for the influx of meteorites from this HED clan.
△ Less
Submitted 7 February, 2021;
originally announced February 2021.
-
UV photometry of spotted stars in the horizontal branch of the globular cluster NGC 2808 using AstroSat
Authors:
Deepthi S. Prabhu,
Annapurni Subramaniam,
Snehalata Sahu
Abstract:
A recent study of hot (20,000 to 30,000 K) extreme horizontal branch (EHB) stars in globular clusters (GCs) has led to the discovery of their variability. It is suggested that this variability is driven by the projected rotation of magnetic spots on the stellar surfaces and is expected to have higher amplitudes at shorter wavelengths. Here, we present the analysis of such hot stars in the massive…
▽ More
A recent study of hot (20,000 to 30,000 K) extreme horizontal branch (EHB) stars in globular clusters (GCs) has led to the discovery of their variability. It is suggested that this variability is driven by the projected rotation of magnetic spots on the stellar surfaces and is expected to have higher amplitudes at shorter wavelengths. Here, we present the analysis of such hot stars in the massive GC NGC 2808 using the Ultraviolet Imaging Telescope (UVIT), aboard AstroSat. We use the UVIT data in combination with the Hubble Space Telescope UV Globular Cluster Survey (HUGS) data for the central region (within $\sim$ $2.7'$ $\times$ $2.7'$) and ground-based optical photometry for the outer parts of the cluster. We generate the far-UV (FUV) - optical colour-magnitude diagrams (CMDs) and in these we find a population of hot EHB stars fainter than the zero-age horizontal branch (ZAHB) model. A comparison of our FUV magnitudes of the already reported variable EHB stars (vEHBs) shows that the longest period vEHBs are the faintest, along with a tentative correlation between rotation period and UV magnitude of spotted stars. In order to firmly establish any correlation, further study is essential.
△ Less
Submitted 11 December, 2020;
originally announced December 2020.
-
The First Extensive Exploration of UV-bright Stars in the Globular Cluster NGC 2808
Authors:
Deepthi S. Prabhu,
Annapurni Subramaniam,
Snehalata Sahu
Abstract:
In this study, we identified and characterized the hot and luminous UV-bright stars in the globular cluster NGC 2808. We combined data from the Ultra Violet Imaging Telescope (UVIT) on-board the Indian space satellite, AstroSat, with the Hubble Space Telescope UV Globular Cluster Survey (HUGS) data for the central region (within $\sim$…
▽ More
In this study, we identified and characterized the hot and luminous UV-bright stars in the globular cluster NGC 2808. We combined data from the Ultra Violet Imaging Telescope (UVIT) on-board the Indian space satellite, AstroSat, with the Hubble Space Telescope UV Globular Cluster Survey (HUGS) data for the central region (within $\sim$ $\ang[angle-symbol-over-decimal]{;2.7;} \times \ang[angle-symbol-over-decimal]{;2.7;}$) and Gaia and ground-based optical photometry for the outer parts of the cluster. We constructed the UV and UV-optical color-magnitude diagrams, compared the horizontal branch (HB) members with the theoretical zero-age HB and terminal-age HB models and identified 34 UV-bright stars. The spectral energy distributions of the UV-bright stars were fitted with theoretical models to estimate their effective temperatures (12500 K - 100,000 K), radii (0.13 to 2.2 $R_{\odot}$), and luminosities ($\sim 40$ to $3000$ $L_{\odot}$) for the first time. These stars were then placed on the H-R diagram, along with theoretical post-HB (pHB) evolutionary tracks to assess their evolutionary status. The models suggest that most of these stars are in the AGB-manqué phase and all, except three, have evolutionary masses $<$ 0.53 $M_{\odot}$. We also calculated the theoretically expected number of hot post-(early)-AGB (p(e)AGB) stars in this cluster and found the range to match our observations. Seven UV-bright stars located in the outer region of the cluster, identified from the AstroSat/UVIT images, are ideal candidates for detailed follow-up spectroscopic studies.
△ Less
Submitted 10 December, 2020;
originally announced December 2020.
-
Codeswitched Sentence Creation using Dependency Parsing
Authors:
Dhruval Jain,
Arun D Prabhu,
Shubham Vatsal,
Gopi Ramena,
Naresh Purre
Abstract:
Codeswitching has become one of the most common occurrences across multilingual speakers of the world, especially in countries like India which encompasses around 23 official languages with the number of bilingual speakers being around 300 million. The scarcity of Codeswitched data becomes a bottleneck in the exploration of this domain with respect to various Natural Language Processing (NLP) task…
▽ More
Codeswitching has become one of the most common occurrences across multilingual speakers of the world, especially in countries like India which encompasses around 23 official languages with the number of bilingual speakers being around 300 million. The scarcity of Codeswitched data becomes a bottleneck in the exploration of this domain with respect to various Natural Language Processing (NLP) tasks. We thus present a novel algorithm which harnesses the syntactic structure of English grammar to develop grammatically sensible Codeswitched versions of English-Hindi, English-Marathi and English-Kannada data. Apart from maintaining the grammatical sanity to a great extent, our methodology also guarantees abundant generation of data from a minuscule snapshot of given data. We use multiple datasets to showcase the capabilities of our algorithm while at the same time we assess the quality of generated Codeswitched data using some qualitative metrics along with providing baseline results for couple of NLP tasks.
△ Less
Submitted 5 December, 2020;
originally announced December 2020.
-
On-Device Sentence Similarity for SMS Dataset
Authors:
Arun D Prabhu,
Nikhil Arora,
Shubham Vatsal,
Gopi Ramena,
Sukumar Moharana,
Naresh Purre
Abstract:
Determining the sentence similarity between Short Message Service (SMS) texts/sentences plays a significant role in mobile device industry. Gauging the similarity between SMS data is thus necessary for various applications like enhanced searching and navigation, clubbing together SMS of similar type when given a custom label or tag is provided by user irrespective of their sender etc. The problem…
▽ More
Determining the sentence similarity between Short Message Service (SMS) texts/sentences plays a significant role in mobile device industry. Gauging the similarity between SMS data is thus necessary for various applications like enhanced searching and navigation, clubbing together SMS of similar type when given a custom label or tag is provided by user irrespective of their sender etc. The problem faced with SMS data is its incomplete structure and grammatical inconsistencies. In this paper, we propose a unique pipeline for evaluating the text similarity between SMS texts. We use Part of Speech (POS) model for keyword extraction by taking advantage of the partial structure embedded in SMS texts and similarity comparisons are carried out using statistical methods. The proposed pipeline deals with major semantic variations across SMS data as well as makes it effective for its application on-device (mobile phone). To showcase the capabilities of our work, our pipeline has been designed with an inclination towards one of the possible applications of SMS text similarity discussed in one of the following sections but nonetheless guarantees scalability for other applications as well.
△ Less
Submitted 4 December, 2020;
originally announced December 2020.
-
On-Device Text Image Super Resolution
Authors:
Dhruval Jain,
Arun D Prabhu,
Gopi Ramena,
Manoj Goyal,
Debi Prasanna Mohanty,
Sukumar Moharana,
Naresh Purre
Abstract:
Recent research on super-resolution (SR) has witnessed major developments with the advancements of deep convolutional neural networks. There is a need for information extraction from scenic text images or even document images on device, most of which are low-resolution (LR) images. Therefore, SR becomes an essential pre-processing step as Bicubic Upsampling, which is conventionally present in smar…
▽ More
Recent research on super-resolution (SR) has witnessed major developments with the advancements of deep convolutional neural networks. There is a need for information extraction from scenic text images or even document images on device, most of which are low-resolution (LR) images. Therefore, SR becomes an essential pre-processing step as Bicubic Upsampling, which is conventionally present in smartphones, performs poorly on LR images. To give the user more control over his privacy, and to reduce the carbon footprint by reducing the overhead of cloud computing and hours of GPU usage, executing SR models on the edge is a necessity in the recent times. There are various challenges in running and optimizing a model on resource-constrained platforms like smartphones. In this paper, we present a novel deep neural network that reconstructs sharper character edges and thus boosts OCR confidence. The proposed architecture not only achieves significant improvement in PSNR over bicubic upsampling on various benchmark datasets but also runs with an average inference time of 11.7 ms per image. We have outperformed state-of-the-art on the Text330 dataset. We also achieve an OCR accuracy of 75.89% on the ICDAR 2015 TextSR dataset, where ground truth has an accuracy of 78.10%.
△ Less
Submitted 20 November, 2020;
originally announced November 2020.
-
A Temporal Convolution Network Approach to State-of-Charge Estimation in Li-ion Batteries
Authors:
Aniruddh Herle,
Janamejaya Channegowda,
Dinakar Prabhu
Abstract:
Electric Vehicle (EV) fleets have dramatically expanded over the past several years. There has been significant increase in interest to electrify all modes of transportation. EVs are primarily powered by Energy Storage Systems such as Lithium-ion Battery packs. Total battery pack capacity translates to the available range in an EV. State of Charge (SOC) is the ratio of available battery capacity t…
▽ More
Electric Vehicle (EV) fleets have dramatically expanded over the past several years. There has been significant increase in interest to electrify all modes of transportation. EVs are primarily powered by Energy Storage Systems such as Lithium-ion Battery packs. Total battery pack capacity translates to the available range in an EV. State of Charge (SOC) is the ratio of available battery capacity to total capacity and is expressed in percentages. It is crucial to accurately estimate SOC to determine the available range in an EV while it is in use. In this paper, a Temporal Convolution Network (TCN) approach is taken to estimate SOC. This is the first implementation of TCNs for the SOC estimation task. Estimation is carried out on various drive cycles such as HWFET, LA92, UDDS and US06 drive cycles at 1 C and 25 °Celsius. It was found that TCN architecture achieved an accuracy of 99.1%.
△ Less
Submitted 19 November, 2020;
originally announced November 2020.
-
Quasar Detection using Linear Support Vector Machine with Learning From Mistakes Methodology
Authors:
Aniruddh Herle,
Janamejaya Channegowda,
Dinakar Prabhu
Abstract:
The field of Astronomy requires the collection and assimilation of vast volumes of data. The data handling and processing problem has become severe as the sheer volume of data produced by scientific instruments each night grows exponentially. This problem becomes extensive for conventional methods of processing the data, which was mostly manual, but is the perfect setting for the use of Machine Le…
▽ More
The field of Astronomy requires the collection and assimilation of vast volumes of data. The data handling and processing problem has become severe as the sheer volume of data produced by scientific instruments each night grows exponentially. This problem becomes extensive for conventional methods of processing the data, which was mostly manual, but is the perfect setting for the use of Machine Learning approaches. While building classifiers for Astronomy, the cost of losing a rare object like supernovae or quasars to detection losses is far more severe than having many false positives, given the rarity and scientific value of these objects. In this paper, a Linear Support Vector Machine (LSVM) is explored to detect Quasars, which are extremely bright objects in which a supermassive black hole is surrounded by a luminous accretion disk. In Astronomy, it is vital to correctly identify quasars, as they are very rare in nature. Their rarity creates a class-imbalance problem that needs to be taken into consideration. The class-imbalance problem and high cost of misclassification are taken into account while designing the classifier. To achieve this detection, a novel classifier is explored, and its performance is evaluated. It was observed that LSVM along with Ensemble Bagged Trees (EBT) achieved a 10x reduction in the False Negative Rate, using the Learning from Mistakes methodology.
△ Less
Submitted 2 October, 2020; v1 submitted 1 October, 2020;
originally announced October 2020.
-
Mn2V0.5Co0.5Z (Z= Ga, Al) Heusler alloys: Fully compensated ferrimagnets with high Tc and compensation temperature
Authors:
P V Midhunlal,
J Arout Chelvane,
D Prabhu,
Raghavan Gopalan,
Harish Kumar N
Abstract:
High TC fully compensated ferrimagnets are potential candidates for spin transfer torque based spintronic devices. We report the structural and magnetic properties of high TC fully compensated ferrimagnets Mn2V0.5Co0.5Z where Z is Ga, Al, in the melt spun ribbon and arc melted bulk form. While the parent alloys Mn2YZ where Y is V, Co and Z is Ga, Al exhibits a magnetic moment value around 2 muB pe…
▽ More
High TC fully compensated ferrimagnets are potential candidates for spin transfer torque based spintronic devices. We report the structural and magnetic properties of high TC fully compensated ferrimagnets Mn2V0.5Co0.5Z where Z is Ga, Al, in the melt spun ribbon and arc melted bulk form. While the parent alloys Mn2YZ where Y is V, Co and Z is Ga, Al exhibits a magnetic moment value around 2 muB per f.u, the Mn2V0.5Co0.5Ga alloy exhibits room temperature nearly fully compensated moment value of 0.09 and 0.13 muB per f.u. in the bulk and ribbon form respectively. For Mn2V0.5Co0.5Al this turned out to be 0.04 and 0.08 muB per f.u. In Contrast to the bulk sample's Neel P type ferrimagnetic behaviour, ribbon samples exhibit Neel N type ferrimagnetic characteristic with a high compensation temperature of 420 K for Ga alloy and 275 K for Al alloy. The observed TC values are more than 640 K for all samples. The differences in the magnetic properties of arc melted and melt spun alloys indicates that even a slight variation in stoichiometry and sample preparation method can influence the physical properties of a compensated system.
△ Less
Submitted 3 December, 2018;
originally announced December 2018.
-
Bi-serial DNA Encryption Algorithm(BDEA)
Authors:
D. Prabhu,
M. Adimoolam
Abstract:
The vast parallelism, exceptional energy efficiency and extraordinary information inherent in DNA molecules are being explored for computing, data storage and cryptography. DNA cryptography is a emerging field of cryptography. In this paper a novel encryption algorithm is devised based on number conversion, DNA digital coding, PCR amplification, which can effectively prevent attack. Data treatment…
▽ More
The vast parallelism, exceptional energy efficiency and extraordinary information inherent in DNA molecules are being explored for computing, data storage and cryptography. DNA cryptography is a emerging field of cryptography. In this paper a novel encryption algorithm is devised based on number conversion, DNA digital coding, PCR amplification, which can effectively prevent attack. Data treatment is used to transform the plain text into cipher text which provides excellent security
△ Less
Submitted 13 January, 2011;
originally announced January 2011.