Showing 1–1 of 1 results for author: Mahapatra, T

  1. arXiv:2310.06702

    cs.CL cs.LG cs.SD eess.AS

    Temporally Aligning Long Audio Interviews with Questions: A Case Study in Multimodal Data Integration

    Authors: Piyush Singh Pasi, Karthikeya Battepati, Preethi Jyothi, Ganesh Ramakrishnan, Tanmay Mahapatra, Manoj Singh

    Abstract: The problem of audio-to-text alignment has seen a significant amount of research using complete supervision during training. However, this is typically not in the context of long audio recordings wherein the text being queried does not appear verbatim within the audio file. This work is a collaboration with a non-governmental organization called CARE India that collects long audio health surveys fro…

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: Work accepted in the IJCAI-23 AI and Social Good Track