Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Tuecke, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.11009  [pdf, other

    cs.CL cs.LG

    CharED: Character-wise Ensemble Decoding for Large Language Models

    Authors: Kevin Gu, Eva Tuecke, Dmitriy Katz, Raya Horesh, David Alvarez-Melis, Mikhail Yurochkin

    Abstract: Large language models (LLMs) have shown remarkable potential for problem solving, with open source models achieving increasingly impressive performance on benchmarks measuring areas from logical reasoning to mathematical ability. Ensembling models can further improve capabilities across a variety of domains. However, conventional methods of combining models at inference time such as shallow fusion… ▽ More

    Submitted 25 June, 2024; originally announced July 2024.

    Comments: 9 pages, 4 figures