Zum Hauptinhalt springen

Showing 1–5 of 5 results for author: McElwain, N L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17190  [pdf, other

    cs.SD cs.LG eess.AS

    Sound Tagging in Infant-centric Home Soundscapes

    Authors: Mohammad Nur Hossain Khan, Jialu Li, Nancy L. McElwain, Mark Hasegawa-Johnson, Bashima Islam

    Abstract: Certain environmental noises have been associated with negative developmental outcomes for infants and young children. Though classifying or tagging sound events in a domestic environment is an active research area, previous studies focused on data collected from a non-stationary microphone placed in the environment or from the perspective of adults. Further, many of these works ignore infants or… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Accepted in IEEE/ACM CHASE 2024

  2. arXiv:2402.06888  [pdf, other

    eess.AS cs.SD

    Analysis of Self-Supervised Speech Models on Children's Speech and Infant Vocalizations

    Authors: Jialu Li, Mark Hasegawa-Johnson, Nancy L. McElwain

    Abstract: To understand why self-supervised learning (SSL) models have empirically achieved strong performances on several speech-processing downstream tasks, numerous studies have focused on analyzing the encoded information of the SSL layer representations in adult speech. Limited work has investigated how pre-training and fine-tuning affect SSL models encoding children's speech and vocalizations. In this… ▽ More

    Submitted 6 June, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: Accepted to 2024 ICASSP Workshop of Self-supervision in Audio, Speech and Beyond (SASB)

  3. arXiv:2306.15808  [pdf, other

    cs.MM cs.SD eess.AS eess.SP

    Classification of Infant Sleep/Wake States: Cross-Attention among Large Scale Pretrained Transformer Networks using Audio, ECG, and IMU Data

    Authors: Kai Chieh Chang, Mark Hasegawa-Johnson, Nancy L. McElwain, Bashima Islam

    Abstract: Infant sleep is critical to brain and behavioral development. Prior studies on infant sleep/wake classification have been largely limited to reliance on expensive and burdensome polysomnography (PSG) tests in the laboratory or wearable devices that collect single-modality data. To facilitate data collection and accuracy of detection, we aimed to advance this field of study by using a multi-modal w… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: Preprint for APSIPA2023

  4. Towards Robust Family-Infant Audio Analysis Based on Unsupervised Pretraining of Wav2vec 2.0 on Large-Scale Unlabeled Family Audio

    Authors: Jialu Li, Mark Hasegawa-Johnson, Nancy L. McElwain

    Abstract: To perform automatic family audio analysis, past studies have collected recordings using phone, video, or audio-only recording devices like LENA, investigated supervised learning methods, and used or fine-tuned general-purpose embeddings learned from large pretrained models. In this study, we advance the audio component of a new infant wearable multi-modal device called LittleBeats (LB) by learnin… ▽ More

    Submitted 8 December, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: Proceedings of Interspeech 2023; v4 version updates: correction of W2V2-base pretrained on 960-hour of LibriSpeech and number of families participated for LENA home recordings

  5. arXiv:2203.15183  [pdf, other

    eess.AS cs.CL cs.SD

    Visualizations of Complex Sequences of Family-Infant Vocalizations Using Bag-of-Audio-Words Approach Based on Wav2vec 2.0 Features

    Authors: Jialu Li, Mark Hasegawa-Johnson, Nancy L. McElwain

    Abstract: In the U.S., approximately 15-17% of children 2-8 years of age are estimated to have at least one diagnosed mental, behavioral or developmental disorder. However, such disorders often go undiagnosed, and the ability to evaluate and treat disorders in the first years of life is limited. To analyze infant developmental changes, previous studies have shown advanced ML models excel at classifying infa… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Submitted to Interspeech 2022