
Showing 1–16 of 16 results for author: Shankar, B

Searching in archive cs.
  1. arXiv:2406.10993  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving

    Authors: Bhavani Shankar, Preethi Jyothi, Pushpak Bhattacharyya

    Abstract: Code-switching is a widely prevalent linguistic phenomenon in multilingual societies like India. Building speech-to-text models for code-switched speech is challenging due to limited availability of datasets. In this work, we focus on the problem of spoken translation (ST) of code-switched speech in Indian languages to English text. We present a new end-to-end model architecture COSTA that scaffol…

    Submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2406.10512  [pdf, other]

    eess.AS cs.SD

    SOA: Reducing Domain Mismatch in SSL Pipeline by Speech Only Adaptation for Low Resource ASR

    Authors: Natarajan Balaji Shankar, Ruchao Fan, Abeer Alwan

    Abstract: Recently, speech foundation models have gained popularity due to their superiority in finetuning downstream ASR tasks. However, models finetuned on certain domains, such as LibriSpeech (adult read speech), behave poorly on other domains (child or noisy speech). One solution could be collecting as much labeled and diverse data as possible for joint finetuning on various domains. However, collecting…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: Accepted to ICASSP 2024 SASB Workshop

  3. arXiv:2406.10507  [pdf, other]

    eess.AS cs.CL cs.SD

    Benchmarking Children's ASR with Supervised and Self-supervised Speech Foundation Models

    Authors: Ruchao Fan, Natarajan Balaji Shankar, Abeer Alwan

    Abstract: Speech foundation models (SFMs) have achieved state-of-the-art results for various speech tasks in supervised (e.g. Whisper) or self-supervised systems (e.g. WavLM). However, the performance of SFMs for child ASR has not been systematically studied. In addition, there is no benchmark for child ASR with standard evaluations, making the comparisons of novel ideas difficult. In this paper, we initiat…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: To appear in Interspeech 2024

  4. arXiv:2405.14209  [pdf, other]

    cs.PF cs.AR

    Exploring and Evaluating Real-world CXL: Use Cases and System Adoption

    Authors: Jie Liu, Xi Wang, Jianbo Wu, Shuangyan Yang, Jie Ren, Bhanu Shankar, Dong Li

    Abstract: Compute eXpress Link (CXL) is emerging as a promising memory interface technology. Because of the common unavailability of CXL devices, the performance of the CXL memory is largely unknown. What are the use cases for the CXL memory? What are the impacts of the CXL memory on application performance? How to use the CXL memory in combination with existing memory components? In this work, we study th…

    Submitted 23 May, 2024; originally announced May 2024.

  5. arXiv:2404.18934  [pdf]

    cs.CV cs.HC

    The Visual Experience Dataset: Over 200 Recorded Hours of Integrated Eye Movement, Odometry, and Egocentric Video

    Authors: Michelle R. Greene, Benjamin J. Balas, Mark D. Lescroart, Paul R. MacNeilage, Jennifer A. Hart, Kamran Binaee, Peter A. Hausamann, Ronald Mezile, Bharath Shankar, Christian B. Sinnott, Kaylie Capurro, Savannah Halow, Hunter Howe, Mariam Josyula, Annie Li, Abraham Mieses, Amina Mohamed, Ilya Nudnou, Ezra Parkhill, Peter Riley, Brett Schmidt, Matthew W. Shinkle, Wentao Si, Brian Szekely, Joaquin M. Torres, et al. (1 additional author not shown)

    Abstract: We introduce the Visual Experience Dataset (VEDB), a compilation of over 240 hours of egocentric video combined with gaze- and head-tracking data that offers an unprecedented view of the visual world as experienced by human observers. The dataset consists of 717 sessions, recorded by 58 observers ranging from 6-49 years old. This paper outlines the data collection, processing, and labeling protoco…

    Submitted 13 August, 2024; v1 submitted 15 February, 2024; originally announced April 2024.

    Comments: 40 pages, 1 table, 9 figures

  6. arXiv:2212.12264  [pdf, other]

    eess.IV cs.CV

    Collective Intelligent Strategy for Improved Segmentation of COVID-19 from CT

    Authors: Surochita Pal Das, Sushmita Mitra, B. Uma Shankar

    Abstract: The devastation caused by the coronavirus pandemic makes it imperative to design automated techniques for a fast and accurate detection. We propose a novel non-invasive tool, using deep learning and imaging, for delineating COVID-19 infection in lungs. The Ensembling Attention-based Multi-scaled Convolution network (EAMC), employing Leave-One-Patient-Out (LOPO) training, exhibits high sensitivity…

    Submitted 23 December, 2022; originally announced December 2022.

  7. arXiv:2207.02157  [pdf, other]

    cs.IT eess.SP

    Multi-IRS-Aided Doppler-Tolerant Wideband DFRC System

    Authors: Tong Wei, Linlong Wu, Kumar Vijay Mishra, M. R. Bhavani Shankar

    Abstract: Intelligent reflecting surface (IRS) is recognized as an enabler of future dual-function radar-communications (DFRC) by improving spectral efficiency, coverage, parameter estimation, and interference suppression. Prior studies on IRS-aided DFRC focus either on narrowband processing, single-IRS deployment, static targets, non-clutter scenario, or on the under-utilized line-of-sight (LoS) and non-li…

    Submitted 10 August, 2023; v1 submitted 5 July, 2022; originally announced July 2022.

    Comments: 16 pages, 8 figures, 2 tables

  8. The Rise of Intelligent Reflecting Surfaces in Integrated Sensing and Communications Paradigms

    Authors: Ahmet M. Elbir, Kumar Vijay Mishra, M. R. Bhavani Shankar, Symeon Chatzinotas

    Abstract: The intelligent reflecting surface (IRS) alters the behavior of wireless media and, consequently, has potential to improve the performance and reliability of wireless systems such as communications and radar remote sensing. Recently, integrated sensing and communications (ISAC) has been widely studied as a means to efficiently utilize spectrum and thereby save cost and power. This article investig…

    Submitted 20 December, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted paper in IEEE Network Magazine

    Journal ref: IEEE Network, 2023

  9. arXiv:2201.01971  [pdf, other]

    cs.CV cs.AI cs.LG

    Multi-Label Classification on Remote-Sensing Images

    Authors: Aditya Kumar Singh, B. Uma Shankar

    Abstract: Acquiring information on large areas on the earth's surface through satellite cameras allows us to see much more than we can see while standing on the ground. This assists us in detecting and monitoring the physical characteristics of an area like land-use patterns, atmospheric conditions, forest cover, and many unlisted aspects. The obtained images not only keep track of continuous natural phenom…

    Submitted 6 January, 2022; originally announced January 2022.

    Comments: The report consists of 95 Pages, 45 Figures, 31 Tables, 85 References

  10. arXiv:2007.15108  [pdf, other]

    eess.SP cs.IR math.OC

    Localization with One-Bit Passive Radars in Narrowband Internet-of-Things using Multivariate Polynomial Optimization

    Authors: Saeid Sedighi, Kumar Vijay Mishra, M. R. Bhavani Shankar, Björn Ottersten

    Abstract: Several Internet-of-Things (IoT) applications provide location-based services, wherein it is critical to obtain accurate position estimates by aggregating information from individual sensors. In the recently proposed narrowband IoT (NB-IoT) standard, which trades off bandwidth to gain wide coverage, the location estimation is compounded by the low sampling rate receivers and limited-capacity links…

    Submitted 9 April, 2021; v1 submitted 29 July, 2020; originally announced July 2020.

    Comments: 16 pages, 11 figures

  11. arXiv:1912.10036  [pdf, other]

    eess.SP cs.IT cs.LG

    A Family of Deep Learning Architectures for Channel Estimation and Hybrid Beamforming in Multi-Carrier mm-Wave Massive MIMO

    Authors: Ahmet M. Elbir, Kumar Vijay Mishra, M. R. Bhavani Shankar, Björn Ottersten

    Abstract: Hybrid analog and digital beamforming transceivers are instrumental in addressing the challenge of expensive hardware and high training overheads in the next generation millimeter-wave (mm-Wave) massive MIMO (multiple-input multiple-output) systems. However, lack of fully digital beamforming in hybrid architectures and short coherence times at mm-Wave impose additional constraints on the channel e…

    Submitted 3 January, 2022; v1 submitted 20 December, 2019; originally announced December 2019.

    Comments: Accepted Paper in IEEE Transactions on Cognitive Communications and Networking. arXiv admin note: text overlap with arXiv:1910.14240

  12. arXiv:1909.00798  [pdf, other]

    cs.CV cs.LG

    Dynamic Approach for Lane Detection using Google Street View and CNN

    Authors: Rama Sai Mamidala, Uday Uthkota, Mahamkali Bhavani Shankar, A. Joseph Antony, A. V. Narasimhadhan

    Abstract: Lane detection algorithms have been the key enablers for fully assistive and autonomous navigation systems. In this paper, a novel and pragmatic approach for lane detection is proposed using a convolutional neural network (CNN) model based on SegNet encoder-decoder architecture. The encoder block renders low-resolution feature maps of the input and the decoder block provides pixel-wise classific…

    Submitted 2 September, 2019; originally announced September 2019.

    Comments: Preprint: To be published in the proceedings of IEEE TENCON 2019

  13. arXiv:1811.02629  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

    Authors: Spyridon Bakas, Mauricio Reyes, Andras Jakab, Stefan Bauer, Markus Rempfler, Alessandro Crimi, Russell Takeshi Shinohara, Christoph Berger, Sung Min Ha, Martin Rozycki, Marcel Prastawa, Esther Alberts, Jana Lipkova, John Freymann, Justin Kirby, Michel Bilello, Hassan Fathallah-Shaykh, Roland Wiest, Jan Kirschke, Benedikt Wiestler, Rivka Colen, Aikaterini Kotrotsou, Pamela Lamontagne, Daniel Marcus, Mikhail Milchenko, et al. (402 additional authors not shown)

    Abstract: Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles dissem…

    Submitted 23 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The International Multimodal Brain Tumor Segmentation (BraTS) Challenge

  14. arXiv:1806.07589  [pdf, other]

    cs.CV

    A CADe System for Gliomas in Brain MRI using Convolutional Neural Networks

    Authors: Subhasis Banerjee, Sushmita Mitra, Anmol Sharma, B. Uma Shankar

    Abstract: Inspired by the success of Convolutional Neural Networks (CNN), we develop a novel Computer Aided Detection (CADe) system using CNN for Glioblastoma Multiforme (GBM) detection and segmentation from multi channel MRI data. A two-stage approach first identifies the presence of GBM. This is followed by a GBM localization in each "abnormal" MR slice. As part of the CADe system, two CNN architectures v…

    Submitted 20 June, 2018; originally announced June 2018.

    Comments: The paper consists of 11 Pages, 6 Figures, 7 Tables, 56 References

  15. Pushing for higher rates and efficiency in Satcom: the different perspectives within SatNExIV

    Authors: Miguel Ángel Vázquez, Ana Pérez-Neira, Carlos Mosquera, Bhavanni Shankar, Pol Henarejos, Athanasios D. Panagopoulos, Giovanni Giambere, Vasilios Siris, George Polyzos, Nader Alagha

    Abstract: SatNEx IV project aims at studying medium and long term directions of satellite telecommunication systems for any of the commercial or institutional applications that can be considered appealing by key players although still not mature enough for attracting industry or initiating dedicated ESA R&D activities. This paper summarizes the first year activities identified as very promising techniques f…

    Submitted 20 March, 2018; originally announced March 2018.

  16. Signal Processing for High Throughput Satellite Systems: Challenges in New Interference-Limited Scenarios

    Authors: Ana I. Perez-Neira, Miguel Angel Vazquez, Sina Maleki, M. R. Bhavani Shankar, Symeon Chatzinotas

    Abstract: The field of satellite communications is enjoying a renewed interest in the global telecom market, and very high throughput satellites (V/HTS), with their multiple spot-beams, are key for delivering the future rate demands. In this article, the state-of-the-art and open research challenges of signal processing techniques for V/HTS systems are presented for the first time, with focus on novel appro…

    Submitted 12 February, 2018; originally announced February 2018.