Showing 1–1 of 1 results for author: Bernadskiy, M

Searching in archive cs.
  1. arXiv:2307.03712  [pdf, other]

    cs.LG cs.CL cs.CV

    INT-FP-QSim: Mixed Precision and Formats For Large Language Models and Vision Transformers

    Authors: Lakshmi Nair, Mikhail Bernadskiy, Arulselvan Madhavan, Craig Chan, Ayon Basumallik, Darius Bunandar

    Abstract: The recent rise of large language models (LLMs) has resulted in increased efforts towards running LLMs at reduced precision. Running LLMs at lower precision supports resource constraints and furthers their democratization, enabling users to run billion-parameter LLMs on their personal devices. To supplement this ongoing effort, we propose INT-FP-QSim: an open-source simulator that enables flexible…

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: This report is supplementary material to the open-source code available at: https://github.com/lightmatter-ai/INT-FP-QSim