
Showing 1–16 of 16 results for author: Samarasinghe, P

  1. arXiv:2403.12630  [pdf, other]

    eess.AS cs.SD

    Reproducing the Acoustic Velocity Vectors in a Circular Listening Area

    Authors: Jiarui Wang, Thushara Abhayapala, Jihui Aimee Zhang, Prasanga Samarasinghe

    Abstract: Acoustic velocity vectors are important for human's localization of sound at low frequencies. This paper proposes a sound field reproduction algorithm, which matches the acoustic velocity vectors in a circular listening area. In previous work, acoustic velocity vectors are matched either at sweet spots or on the boundary of the listening area. Sweet spots restrict listener's movement, whereas meas…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: Submitted to EUSIPCO 2024

  2. arXiv:2309.13819  [pdf, other]

    eess.AS cs.SD

    A Two-Step Approach for Narrowband Source Localization in Reverberant Rooms

    Authors: Wei-Ting Lai, Lachlan Birnie, Thushara Abhayapala, Amy Bastine, Shaoheng Xu, Prasanga Samarasinghe

    Abstract: This paper presents a two-step approach for narrowband source localization within reverberant rooms. The first step involves dereverberation by modeling the homogeneous component of the sound field by an equivalent decomposition of planewaves using Iteratively Reweighted Least Squares (IRLS), while the second step focuses on source localization by modeling the dereverberated component as a sparse…

    Submitted 24 September, 2023; originally announced September 2023.

  3. arXiv:2309.10605  [pdf, other]

    eess.AS cs.SD

    An Active Noise Control System Based on Soundfield Interpolation Using a Physics-informed Neural Network

    Authors: Yile Angela Zhang, Fei Ma, Thushara Abhayapala, Prasanga Samarasinghe, Amy Bastine

    Abstract: Conventional multiple-point active noise control (ANC) systems require placing error microphones within the region of interest (ROI), inconveniencing users. This paper designs a feasible monitoring microphone arrangement placed outside the ROI, providing a user with more freedom of movement. The soundfield within the ROI is interpolated from the microphone signals using a physics-informed neural…

    Submitted 19 September, 2023; originally announced September 2023.

  4. arXiv:2309.08290  [pdf, other]

    eess.AS cs.SD

    Head-Related Transfer Function Interpolation with a Spherical CNN

    Authors: Xingyu Chen, Fei Ma, Yile Zhang, Amy Bastine, Prasanga N. Samarasinghe

    Abstract: Head-related transfer functions (HRTFs) are crucial for spatial soundfield reproduction in virtual reality applications. However, obtaining personalized, high-resolution HRTFs is a time-consuming and costly task. Recently, deep learning-based methods showed promise in interpolating high-resolution HRTFs from sparse measurements. Some of these methods treat HRTF interpolation as an image super-reso…

    Submitted 15 September, 2023; originally announced September 2023.

  5. arXiv:2306.09135  [pdf, other]

    eess.AS cs.SD

    Time-Domain Wideband Image Source Method for Spherical Microphone Arrays

    Authors: Jiarui Wang, Prasanga Samarasinghe, Thushara Abhayapala, Jihui Aimee Zhang

    Abstract: This paper presents the time-domain wideband spherical microphone array impulse response generator (TDW-SMIR generator), which is a time-domain wideband image source method (ISM) for generating the room impulse responses captured by an open spherical microphone array. To incorporate loudspeaker directivity, the TDW-SMIR generator considers a source that emits a sequence of spherical wave fronts wh…

    Submitted 9 August, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Accepted for publication in the IEEE 25th International Workshop on Multimedia Signal Processing (IEEE MMSP 2023)

  6. Comparative Study of Parameter Selection for Enhanced Edge Inference for a Multi-Output Regression model for Head Pose Estimation

    Authors: Asiri Lindamulage, Nuwan Kodagoda, Shyam Reyal, Pradeepa Samarasinghe, Pratheepan Yogarajah

    Abstract: Magnitude-based pruning is a technique used to optimise deep learning models for edge inference. We have achieved over 75% model size reduction with a higher accuracy than the original multi-output regression model for head-pose estimation.

    Submitted 28 December, 2022; originally announced February 2023.

    Comments: In TENCON 2022 - 2022 IEEE Region 10 Conference (TENCON)

    Journal ref: TENCON 2022 - 2022 IEEE Region 10 Conference (TENCON), Nov. 2022

  7. arXiv:2212.09027  [pdf, other]

    cs.CV cs.LG

    2D Pose Estimation based Child Action Recognition

    Authors: Sanka Mohottala, Sandun Abeygunawardana, Pradeepa Samarasinghe, Dharshana Kasthurirathna, Charith Abhayaratne

    Abstract: We present a graph convolutional network with 2D pose estimation for the first time on child action recognition task achieving on par results with an RGB modality based model on a novel benchmark dataset containing unconstrained environment based videos.

    Submitted 18 December, 2022; originally announced December 2022.

    Comments: Paper Accepted for the IEEE TENCON Conference (2022). 7 pages, 5 figures

  8. arXiv:2212.09013  [pdf, other]

    cs.CV cs.LG

    Graph Neural Network based Child Activity Recognition

    Authors: Sanka Mohottala, Pradeepa Samarasinghe, Dharshana Kasthurirathna, Charith Abhayaratne

    Abstract: This paper presents an implementation on child activity recognition (CAR) with a graph convolution network (GCN) based deep learning model since prior implementations in this domain have been dominated by CNN, LSTM and other methods despite the superior performance of GCN. To the best of our knowledge, we are the first to use a GCN model in child activity recognition domain. In overcoming the chal…

    Submitted 18 December, 2022; originally announced December 2022.

    Comments: Accepted to 23rd IEEE ICIT Conference (2022), 8 pages, 4 figures

  9. arXiv:2206.09298  [pdf, ps, other]

    cs.SD cs.RO eess.AS

    GMM based multi-stage Wiener filtering for low SNR speech enhancement

    Authors: Wageesha Manamperi, Prasanga N. Samarasinghe, Thushara D. Abhayapala, Jihui Zhang

    Abstract: This paper proposes a single-channel speech enhancement method to reduce the noise and enhance speech at low signal-to-noise ratio (SNR) levels and non-stationary noise conditions. Specifically, we focus on modeling the noise using a Gaussian mixture model (GMM) based on a multi-stage process with a parametric Wiener filter. The proposed noise model estimates a more accurate noise power spectral d…

    Submitted 14 July, 2022; v1 submitted 18 June, 2022; originally announced June 2022.

    Comments: 5 pages, 3 figures, submitted to a conference

  10. A Novel Method for Obtaining Diffuse Field Measurements for Microphone Calibration

    Authors: Noman Akbar, Glenn Dickins, Mark R. P. Thomas, Prasanga Samarasinghe, Thushara Abhayapala

    Abstract: We propose a straightforward and cost-effective method to perform diffuse soundfield measurements for calibrating the magnitude response of a microphone array. Typically, such calibration is performed in a diffuse soundfield created in reverberation chambers, an expensive and time-consuming process. A method is proposed for obtaining diffuse field measurements in untreated environments. First, a c…

    Submitted 8 August, 2020; originally announced August 2020.

    Comments: Accepted to appear in IEEE ICASSP 2020

    Journal ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020

  11. arXiv:2007.11795  [pdf, other]

    eess.AS cs.SD

    Sound Field Translation and Mixed Source Model for Virtual Applications with Perceptual Validation

    Authors: Lachlan Birnie, Thushara Abhayapala, Vladimir Tourbabin, Prasanga Samarasinghe

    Abstract: Non-interactive and linear experiences like cinema film offer high quality surround sound audio to enhance immersion, however the listener's experience is usually fixed to a single acoustic perspective. With the rise of virtual reality, there is a demand for recording and recreating real-world experiences in a way that allows for the user to interact and move within the reproduction. Conventional…

    Submitted 23 July, 2020; originally announced July 2020.

    Comments: 12 pages, 11 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible.

  12. Multi-Source DOA Estimation through Pattern Recognition of the Modal Coherence of a Reverberant Soundfield

    Authors: A. Fahim, P. N. Samarasinghe, T. D. Abhayapala

    Abstract: We propose a novel multi-source direction of arrival (DOA) estimation technique using a convolutional neural network algorithm which learns the modal coherence patterns of an incident soundfield through measured spherical harmonic coefficients. We train our model for individual time-frequency bins in the short-time Fourier transform spectrum by analyzing the unique snapshot of modal coherence for…

    Submitted 18 March, 2020; originally announced March 2020.

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing 28 (2019) 605 - 618

  13. PSD Estimation and Source Separation in a Noisy Reverberant Environment using a Spherical Microphone Array

    Authors: Abdullah Fahim, Prasanga N. Samarasinghe, Thushara D. Abhayapala

    Abstract: In this paper, we propose an efficient technique for estimating individual power spectral density (PSD) components, i.e., PSD of each desired sound source as well as of noise and reverberation, in a multi-source reverberant sound scene with coherent background noise. We formulate the problem in the spherical harmonics domain to take the advantage of the inherent orthogonality of the spherical harm…

    Submitted 16 May, 2018; originally announced May 2018.

  14. PSD Estimation of Multiple Sound Sources in a Reverberant Room Using a Spherical Microphone Array

    Authors: Abdullah Fahim, Prasanga N. Samarasinghe, Thushara D. Abhayapala

    Abstract: We propose an efficient method to estimate source power spectral densities (PSDs) in a multi-source reverberant environment using a spherical microphone array. The proposed method utilizes the spatial correlation between the spherical harmonics (SH) coefficients of a sound field to estimate source PSDs. The use of the spatial cross-correlation of the SH coefficients allows us to employ the method…

    Submitted 5 September, 2017; originally announced September 2017.

    Comments: Accepted for WASPAA 2017

  15. arXiv:1510.08950  [pdf, other]

    cs.SD

    Estimation of the direct-to-reverberant Energy Ratio using a spherical microphone array

    Authors: Hanchi Chen, Prasanga N. Samarasinghe, Thushara D. Abhayapala, Wen Zhang

    Abstract: This paper proposes a practical approach to estimate the direct-to-reverberant energy ratio (DRR) using a spherical microphone array without having knowledge of the source signal. We base our estimation on a theoretical relationship between the DRR and the coherence estimation function between coincident pressure and particle velocity. We discuss the proposed method's ability to estimate the DRR i…

    Submitted 29 October, 2015; originally announced October 2015.

    Comments: In Proceedings of the ACE Challenge Workshop - a satellite event of IEEE-WASPAA 2015 (arXiv:1510.00383)

    Report number: ACEChallenge/2015/01

  16. arXiv:1505.04385  [pdf, ps, other]

    cs.SD

    An Efficient Parameterization of the Room Transfer Function

    Authors: Prasanga Samarasinghe, Thushara Abhayapala, Mark Poletti, Terence Betlehem

    Abstract: This paper proposes an efficient parameterization of the Room Transfer Function (RTF). Typically, the RTF rapidly varies with varying source and receiver positions, hence requires an impractical number of point to point measurements to characterize a given room. Therefore, we derive a novel RTF parameterization that is robust to both receiver and source variations with the following salient featur…

    Submitted 17 May, 2015; originally announced May 2015.

    Comments: 11 pages, 6 figures