
Showing 1–15 of 15 results for author: Karpov, N

Searching in archive cs.
  1. arXiv:2406.01446  [pdf]

    cs.CL cs.LG eess.AS eess.SP

    Enabling ASR for Low-Resource Languages: A Comprehensive Dataset Creation Approach

    Authors: Ara Yeroyan, Nikolay Karpov

    Abstract: In recent years, automatic speech recognition (ASR) systems have significantly improved, especially in languages with a vast amount of transcribed speech data. However, ASR systems tend to perform poorly for low-resource languages with fewer resources, such as minority and regional languages. This study introduces a novel pipeline designed to generate ASR training datasets from audiobooks, which t…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 13 pages, 10 figures (including ablation studies), to be published in 2024 IEEE Spoken Language Technology Workshop. Additionally, the associated software package can be accessed at (https://pypi.org/project/vac-aligner/) for practical applications and further development

  2. arXiv:2310.12378  [pdf, other]

    eess.AS cs.SD

    The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System

    Authors: Tae Jin Park, He Huang, Ante Jukic, Kunal Dhawan, Krishna C. Puvvada, Nithin Koluguri, Nikolay Karpov, Aleksandr Laptev, Jagadeesh Balam, Boris Ginsburg

    Abstract: We present the NVIDIA NeMo team's multi-channel speech recognition system for the 7th CHiME Challenge Distant Automatic Speech Recognition (DASR) Task, focusing on the development of a multi-channel, multi-speaker speech recognition system tailored to transcribe speech from distributed microphones and microphone arrays. The system predominantly comprises of the following integral modules: the Spea…

    Submitted 18 October, 2023; originally announced October 2023.

    Journal ref: CHiME-7 Workshop 2023

  3. arXiv:2310.02943  [pdf, other]

    cs.CL

    LibriSpeech-PC: Benchmark for Evaluation of Punctuation and Capitalization Capabilities of end-to-end ASR Models

    Authors: Aleksandr Meister, Matvei Novikov, Nikolay Karpov, Evelina Bakhturina, Vitaly Lavrukhin, Boris Ginsburg

    Abstract: Traditional automatic speech recognition (ASR) models output lower-cased words without punctuation marks, which reduces readability and necessitates a subsequent text processing model to convert ASR transcripts into a proper format. Simultaneously, the development of end-to-end ASR models capable of predicting punctuation and capitalization presents several challenges, primarily due to limited dat…

    Submitted 4 October, 2023; originally announced October 2023.

  4. arXiv:2301.11442  [pdf, other]

    cs.LG

    Communication-Efficient Collaborative Regret Minimization in Multi-Armed Bandits

    Authors: Nikolai Karpov, Qin Zhang

    Abstract: In this paper, we study the collaborative learning model, which concerns the tradeoff between parallelism and communication overhead in multi-agent multi-armed bandits. For regret minimization in multi-armed bandits, we present the first set of tradeoffs between the number of rounds of communication among the agents and the regret of the collaborative learning process.

    Submitted 20 December, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: 13 pages, 1 figure

  5. arXiv:2208.09029  [pdf, other]

    cs.LG

    Communication-Efficient Collaborative Best Arm Identification

    Authors: Nikolai Karpov, Qin Zhang

    Abstract: We investigate top-$m$ arm identification, a basic problem in bandit theory, in a multi-agent learning model in which agents collaborate to learn an objective function. We are interested in designing collaborative learning algorithms that achieve maximum speedup (compared to single-agent learning algorithms) using minimum communication cost, as communication is frequently the bottleneck in multi-a…

    Submitted 26 November, 2022; v1 submitted 18 August, 2022; originally announced August 2022.

    Comments: 12 pages, 12 figures

  6. arXiv:2207.08015  [pdf, ps, other]

    cs.LG cs.DS

    Parallel Best Arm Identification in Heterogeneous Environments

    Authors: Nikolai Karpov, Qin Zhang

    Abstract: In this paper, we study the tradeoffs between the time and the number of communication rounds of the best arm identification problem in the heterogeneous collaborative learning model, where multiple agents interact with possibly different environments and they want to learn in parallel an objective function in the aggregated environment. By proving almost tight upper and lower bounds, we show that…

    Submitted 18 April, 2024; v1 submitted 16 July, 2022; originally announced July 2022.

    Comments: 15 pages (published in SPAA 2024)

  7. arXiv:2108.06812  [pdf, other]

    cs.LG

    Batched Thompson Sampling for Multi-Armed Bandits

    Authors: Nikolai Karpov, Qin Zhang

    Abstract: We study Thompson Sampling algorithms for stochastic multi-armed bandits in the batched setting, in which we want to minimize the regret over a sequence of arm pulls using a small number of policy changes (or, batches). We propose two algorithms and demonstrate their effectiveness by experiments on both synthetic and real datasets. We also analyze the proposed algorithms from the theoretical aspec…

    Submitted 15 August, 2021; originally announced August 2021.

    Comments: 9 pages

  8. arXiv:2012.01499  [pdf, ps, other]

    cs.LG cs.DS

    Instance-Sensitive Algorithms for Pure Exploration in Multinomial Logit Bandit

    Authors: Nikolai Karpov, Qin Zhang

    Abstract: Motivated by real-world applications such as fast fashion retailing and online advertising, the Multinomial Logit Bandit (MNL-bandit) is a popular model in online learning and operations research, and has attracted much attention in the past decade. However, it is a bit surprising that pure exploration, a basic problem in bandit theory, has not been well studied in MNL-bandit so far. In this paper…

    Submitted 15 August, 2021; v1 submitted 2 December, 2020; originally announced December 2020.

    Comments: 13 pages

  9. arXiv:2004.09454  [pdf, other]

    cs.DS cs.DC cs.LG

    Collaborative Top Distribution Identifications with Limited Interaction

    Authors: Nikolai Karpov, Qin Zhang, Yuan Zhou

    Abstract: We consider the following problem in this paper: given a set of $n$ distributions, find the top-$m$ ones with the largest means. This problem is also called {\em top-$m$ arm identifications} in the literature of reinforcement learning, and has numerous applications. We study the problem in the collaborative learning model where we have multiple agents who can draw samples from the $n$ distribution…

    Submitted 8 September, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

    Comments: Accepted for presentation at FOCS 2020

  10. arXiv:1903.05617  [pdf, ps, other]

    cs.DS

    Distributed and Streaming Linear Programming in Low Dimensions

    Authors: Sepehr Assadi, Nikolai Karpov, Qin Zhang

    Abstract: We study linear programming and general LP-type problems in several big data (streaming and distributed) models. We mainly focus on low dimensional problems in which the number of constraints is much larger than the number of variables. Low dimensional LP-type problems appear frequently in various machine learning tasks such as robust regression, support vector machines, and core vector machines.…

    Submitted 13 March, 2019; originally announced March 2019.

    Comments: To appear in PODS'19; 28 pages

  11. arXiv:1706.09487  [pdf, other]

    cs.DS

    Parameterized Algorithms for Partitioning Graphs into Highly Connected Clusters

    Authors: Ivan Bliznets, Nikolai Karpov

    Abstract: Clustering is a well-known and important problem with numerous applications. The graph-based model is one of the typical cluster models. In the graph model, clusters are generally defined as cliques. However, such an approach might be too restrictive as in some applications, not all objects from the same cluster must be connected. That is why different types of cliques relaxations often considered…

    Submitted 28 June, 2017; originally announced June 2017.

  12. arXiv:1706.06086  [pdf, other]

    cs.DS

    An exponential lower bound for cut sparsifiers in planar graphs

    Authors: Nikolai Karpov, Marcin Pilipczuk, Anna Zych-Pawlewicz

    Abstract: Given an edge-weighted graph $G$ with a set $Q$ of $k$ terminals, a mimicking network is a graph with the same set of terminals that exactly preserves the sizes of minimum cuts between any partition of the terminals. A natural question in the area of graph compression is to provide as small mimicking networks as possible for input graph $G$ being either an arbitrary graph or coming from a specific…

    Submitted 30 December, 2017; v1 submitted 19 June, 2017; originally announced June 2017.

  13. arXiv:1502.03989  [pdf, other]

    cs.DS

    Parameterized Complexity of Secluded Connectivity Problems

    Authors: Fedor V. Fomin, Petr A. Golovach, Nikolay Karpov, Alexander S. Kulikov

    Abstract: The Secluded Path problem models a situation where a sensitive information has to be transmitted between a pair of nodes along a path in a network. The measure of the quality of a selected path is its exposure, which is the total weight of vertices in its closed neighborhood. In order to minimize the risk of intercepting the information, we are interested in selecting a secluded path, i.e. a path…

    Submitted 21 April, 2015; v1 submitted 13 February, 2015; originally announced February 2015.

    Comments: Minor corrections are done

  14. arXiv:1502.01461  [pdf, ps, other]

    cs.DS

    Parameterized Complexity of Superstring Problems

    Authors: Ivan Bliznets, Fedor V. Fomin, Petr A. Golovach, Nikolay Karpov, Alexander S. Kulikov, Saket Saurabh

    Abstract: In the Shortest Superstring problem we are given a set of strings $S=\{s_1, \ldots, s_n\}$ and integer $\ell$ and the question is to decide whether there is a superstring $s$ of length at most $\ell$ containing all strings of $S$ as substrings. We obtain several parameterized algorithms and complexity results for this problem. In particular, we give an algorithm which in time…

    Submitted 5 February, 2015; originally announced February 2015.

  15. arXiv:cs/0703049  [pdf]

    cs.SD cs.CL

    Algorithm of Segment-Syllabic Synthesis in Speech Recognition Problem

    Authors: Oleg N. Karpov, Olga A. Savenkova

    Abstract: Speech recognition based on the syllable segment is discussed in this paper. The principal search methods in space of states for the speech recognition problem by segment-syllabic parameters trajectory synthesis are investigated. Recognition as comparison the parameters trajectories in chosen speech units on the sections of the segmented speech is realized. Some experimental results are given an…

    Submitted 10 March, 2007; originally announced March 2007.

    Comments: 11 pages, 4 figures