Zum Hauptinhalt springen

Showing 1–15 of 15 results for author: Mai, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00463  [pdf, other

    cs.LG cs.AI cs.CL cs.HC eess.AS

    Open-Source Conversational AI with SpeechBrain 1.0

    Authors: Mirco Ravanelli, Titouan Parcollet, Adel Moumen, Sylvain de Langen, Cem Subakan, Peter Plantinga, Yingzhi Wang, Pooneh Mousavi, Luca Della Libera, Artem Ploujnikov, Francesco Paissan, Davide Borra, Salah Zaiem, Zeyu Zhao, Shucong Zhang, Georgios Karakasidis, Sung-Lin Yeh, Pierre Champion, Aku Rouhe, Rudolf Braun, Florian Mai, Juan Zuluaga-Gomez, Seyed Mahed Mousavi, Andreas Nautsch, Xuechen Liu , et al. (7 additional authors not shown)

    Abstract: SpeechBrain is an open-source Conversational AI toolkit based on PyTorch, focused particularly on speech processing tasks such as speech recognition, speech enhancement, speaker recognition, text-to-speech, and much more. It promotes transparency and replicability by releasing both the pre-trained models and the complete "recipes" of code and algorithms required for training them. This paper prese… ▽ More

    Submitted 18 July, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

    Comments: Submitted to JMLR (Machine Learning Open Source Software)

  2. arXiv:2404.00614  [pdf, other

    cs.CL cs.AI

    Learning to Plan for Language Modeling from Unlabeled Data

    Authors: Nathan Cornille, Marie-Francine Moens, Florian Mai

    Abstract: By training to predict the next token in an unlabeled corpus, large language models learn to perform many tasks without any labeled data. However, their next-token-prediction objective arguably limits their performance in scenarios that require planning, such as writing a coherent article. In this paper, we train a module for planning the future writing process via a self-supervised learning objec… ▽ More

    Submitted 31 July, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: Published at COLM 2024

  3. arXiv:2402.12332  [pdf, other

    cs.CL

    Triple-Encoders: Representations That Fire Together, Wire Together

    Authors: Justus-Jonas Erker, Florian Mai, Nils Reimers, Gerasimos Spanakis, Iryna Gurevych

    Abstract: Search-based dialog models typically re-encode the dialog history at every turn, incurring high cost. Curved Contrastive Learning, a representation learning method that encodes relative distances between utterances into the embedding space via a bi-encoder, has recently shown promising results for dialog modeling at far superior efficiency. While high efficiency is achieved through independently e… ▽ More

    Submitted 13 July, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: accepted at ACL 2024 (main conference)

  4. arXiv:2310.02568  [pdf

    cs.SI cs.AI cs.CY cs.LG

    Stand for Something or Fall for Everything: Predict Misinformation Spread with Stance-Aware Graph Neural Networks

    Authors: Zihan Chen, Jingyi Sun, Rong Liu, Feng Mai

    Abstract: Although pervasive spread of misinformation on social media platforms has become a pressing challenge, existing platform interventions have shown limited success in curbing its dissemination. In this study, we propose a stance-aware graph neural network (stance-aware GNN) that leverages users' stances to proactively predict misinformation spread. As different user stances can form unique echo cham… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: Accepted by the 2023 International Conference on Information Systems (ICIS 2023)

    ACM Class: H.0; J.4; I.2.7

  5. arXiv:2305.18281  [pdf, other

    cs.CL cs.AI cs.LG eess.AS

    HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition

    Authors: Florian Mai, Juan Zuluaga-Gomez, Titouan Parcollet, Petr Motlicek

    Abstract: State-of-the-art ASR systems have achieved promising results by modeling local and global interactions separately. While the former can be computed efficiently, global interactions are usually modeled via attention mechanisms, which are expensive for long input sequences. Here, we address this by extending HyperMixer, an efficient alternative to attention exhibiting linear complexity, to the Confo… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: Florian Mai and Juan Zuluaga-Gomez contributed equally. To appear in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2023

  6. arXiv:2301.03313  [pdf, other

    cs.LG math.OC

    BQ-NCO: Bisimulation Quotienting for Efficient Neural Combinatorial Optimization

    Authors: Darko Drakulic, Sofia Michel, Florian Mai, Arnaud Sors, Jean-Marc Andreoli

    Abstract: Despite the success of neural-based combinatorial optimization methods for end-to-end heuristic learning, out-of-distribution generalization remains a challenge. In this paper, we present a novel formulation of Combinatorial Optimization Problems (COPs) as Markov Decision Processes (MDPs) that effectively leverages common symmetries of COPs to improve out-of-distribution robustness. Starting from… ▽ More

    Submitted 28 September, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

  7. arXiv:2203.03691  [pdf, other

    cs.CL cs.AI cs.LG

    HyperMixer: An MLP-based Low Cost Alternative to Transformers

    Authors: Florian Mai, Arnaud Pannatier, Fabio Fehr, Haolin Chen, Francois Marelli, Francois Fleuret, James Henderson

    Abstract: Transformer-based architectures are the model of choice for natural language understanding, but they come at a significant cost, as they have quadratic complexity in the input length, require a lot of training data, and can be difficult to tune. In the pursuit of lower costs, we investigate simple MLP-based architectures. We find that existing architectures such as MLPMixer, which achieves token m… ▽ More

    Submitted 13 November, 2023; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: Published at ACL 2023

    Journal ref: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

  8. arXiv:2110.07002  [pdf, other

    cs.CL cs.AI cs.LG

    Bag-of-Vectors Autoencoders for Unsupervised Conditional Text Generation

    Authors: Florian Mai, James Henderson

    Abstract: Text autoencoders are often used for unsupervised conditional text generation by applying mappings in the latent space to change attributes to the desired values. Recently, Mai et al. (2020) proposed Emb2Emb, a method to learn these mappings in the embedding space of an autoencoder. However, their method is restricted to autoencoders with a single-vector embedding, which limits how much informatio… ▽ More

    Submitted 4 February, 2023; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: Published at AACL 2022

    Journal ref: In Proceedings of AACL/IJCNLP 2022, pages 468-488. Association of Computational Linguistics (2022)

  9. arXiv:2010.02983  [pdf, other

    cs.CL cs.AI

    Plug and Play Autoencoders for Conditional Text Generation

    Authors: Florian Mai, Nikolaos Pappas, Ivan Montero, Noah A. Smith, James Henderson

    Abstract: Text autoencoders are commonly used for conditional generation tasks such as style transfer. We propose methods which are plug and play, where any pretrained autoencoder can be used, and only require learning a mapping within the autoencoder's embedding space, training embedding-to-embedding (Emb2Emb). This reduces the need for labeled training data for the task and makes the training procedure mo… ▽ More

    Submitted 12 October, 2020; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: To be published in EMNLP 2020

  10. arXiv:1910.11758  [pdf, other

    cs.LG stat.ML

    Optimizer Benchmarking Needs to Account for Hyperparameter Tuning

    Authors: Prabhu Teja Sivaprasad, Florian Mai, Thijs Vogels, Martin Jaggi, François Fleuret

    Abstract: The performance of optimizers, particularly in deep learning, depends considerably on their chosen hyperparameter configuration. The efficacy of optimizers is often studied under near-optimal problem-specific hyperparameters, and finding these settings may be prohibitively costly for practitioners. In this work, we argue that a fair assessment of optimizers' performance must take the computational… ▽ More

    Submitted 15 August, 2020; v1 submitted 25 October, 2019; originally announced October 2019.

    Comments: published at International Conference on Machine Learning (ICML 2020)

  11. arXiv:1909.09758  [pdf, other

    cs.AI

    Empirical Analysis of Multi-Task Learning for Reducing Model Bias in Toxic Comment Detection

    Authors: Ameya Vaidya, Feng Mai, Yue Ning

    Abstract: With the recent rise of toxicity in online conversations on social media platforms, using modern machine learning algorithms for toxic comment detection has become a central focus of many online applications. Researchers and companies have developed a variety of models to identify toxicity in online conversations, reviews, or comments with mixed successes. However, many existing approaches have le… ▽ More

    Submitted 27 March, 2020; v1 submitted 20 September, 2019; originally announced September 2019.

    Comments: ICWSM 2020

  12. arXiv:1907.12366  [pdf, other

    cs.IR cs.LG stat.ML

    Multi-Modal Adversarial Autoencoders for Recommendations of Citations and Subject Labels

    Authors: Lukas Galke, Florian Mai, Iacopo Vagliano, Ansgar Scherp

    Abstract: We present multi-modal adversarial autoencoders for recommendation and evaluate them on two different tasks: citation recommendation and subject label recommendation. We analyze the effects of adversarial regularization, sparsity, and different input modalities. By conducting 408 experiments, we show that adversarial regularization consistently improves the performance of autoencoders for recommen… ▽ More

    Submitted 22 July, 2019; originally announced July 2019.

    Comments: Published in: UMAP '18 Proceedings of the 26th Conference on User Modeling, Adaptation and Personalization Pages 197-205

  13. arXiv:1902.06423  [pdf, other

    cs.CL cs.LG

    CBOW Is Not All You Need: Combining CBOW with the Compositional Matrix Space Model

    Authors: Florian Mai, Lukas Galke, Ansgar Scherp

    Abstract: Continuous Bag of Words (CBOW) is a powerful text embedding method. Due to its strong capabilities to encode word content, CBOW embeddings perform well on a wide range of downstream tasks while being efficient to compute. However, CBOW is not capable of capturing the word order. The reason is that the computation of CBOW's word embeddings is commutative, i.e., embeddings of XYZ and ZYX are the sam… ▽ More

    Submitted 18 February, 2019; originally announced February 2019.

    Comments: Conference paper at ICLR 2019

    Journal ref: In International Conference on Learning Representations 2019

  14. Using Deep Learning for Title-Based Semantic Subject Indexing to Reach Competitive Performance to Full-Text

    Authors: Florian Mai, Lukas Galke, Ansgar Scherp

    Abstract: For (semi-)automated subject indexing systems in digital libraries, it is often more practical to use metadata such as the title of a publication instead of the full-text or the abstract. Therefore, it is desirable to have good text mining and text classification algorithms that operate well already on the title of a publication. So far, the classification performance on titles is not competitive… ▽ More

    Submitted 29 May, 2018; v1 submitted 20 January, 2018; originally announced January 2018.

    Comments: Presented at JCDL 2018, 10 pages, code and data at https://github.com/florianmai/Quadflor

    Journal ref: JCDL '18: The 18th ACM/IEEE Joint Conference on Digital Libraries, June 3--7, 2018, Fort Worth, TX, USA

  15. arXiv:1705.05311  [pdf, other

    cs.DL cs.CL

    Using Titles vs. Full-text as Source for Automated Semantic Document Annotation

    Authors: Lukas Galke, Florian Mai, Alan Schelten, Dennis Brunsch, Ansgar Scherp

    Abstract: A significant part of the largest Knowledge Graph today, the Linked Open Data cloud, consists of metadata about documents such as publications, news reports, and other media articles. While the widespread access to the document metadata is a tremendous advancement, it is yet not so easy to assign semantic annotations and organize the documents along semantic concepts. Providing semantic annotation… ▽ More

    Submitted 27 September, 2017; v1 submitted 15 May, 2017; originally announced May 2017.

    Comments: Accepted as SHORT PAPER by K-CAP 2017, 9 pages, 1 figure, 3 tables