Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Chenaghlu, M

.
  1. arXiv:2402.06196  [pdf, other

    cs.CL cs.AI

    Large Language Models: A Survey

    Authors: Shervin Minaee, Tomas Mikolov, Narjes Nikzad, Meysam Chenaghlu, Richard Socher, Xavier Amatriain, Jianfeng Gao

    Abstract: Large Language Models (LLMs) have drawn a lot of attention due to their strong performance on a wide range of natural language tasks, since the release of ChatGPT in November 2022. LLMs' ability of general-purpose language understanding and generation is acquired by training billions of model's parameters on massive amounts of text data, as predicted by scaling laws \cite{kaplan2020scaling,hoffman… ▽ More

    Submitted 20 February, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2401.14423

  2. arXiv:2004.03705  [pdf, other

    cs.CL cs.LG stat.ML

    Deep Learning Based Text Classification: A Comprehensive Review

    Authors: Shervin Minaee, Nal Kalchbrenner, Erik Cambria, Narjes Nikzad, Meysam Chenaghlu, Jianfeng Gao

    Abstract: Deep learning based models have surpassed classical machine learning based approaches in various text classification tasks, including sentiment analysis, news categorization, question answering, and natural language inference. In this paper, we provide a comprehensive review of more than 150 deep learning based models for text classification developed in recent years, and discuss their technical c… ▽ More

    Submitted 4 January, 2021; v1 submitted 5 April, 2020; originally announced April 2020.