Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Misiak, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.12028  [pdf, other

    cs.CL

    TreeSeg: Hierarchical Topic Segmentation of Large Transcripts

    Authors: Dimitrios C. Gklezakos, Timothy Misiak, Diamond Bishop

    Abstract: From organizing recorded videos and meetings into chapters, to breaking down large inputs in order to fit them into the context window of commoditized Large Language Models (LLMs), topic segmentation of large transcripts emerges as a task of increasing significance. Still, accurate segmentation presents many challenges, including (a) the noisy nature of the Automatic Speech Recognition (ASR) softw… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.