
Showing 1–6 of 6 results for author: Terilla, J

Searching in archive cs.
  1. arXiv:2407.11606  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    The Foundations of Tokenization: Statistical and Computational Concerns

    Authors: Juan Luis Gastaldi, John Terilla, Luca Malagutti, Brian DuSell, Tim Vieira, Ryan Cotterell

    Abstract: Tokenization - the practice of converting strings of characters over an alphabet into sequences of tokens over a vocabulary - is a critical yet under-theorized step in the NLP pipeline. Notably, it remains the only major step not fully integrated into widely used end-to-end neural models. This paper aims to address this theoretical gap by laying the foundations of tokenization from a formal perspe…

    Submitted 8 August, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

  2. arXiv:2106.07890  [pdf, ps, other]

    math.CT cs.CL

    An enriched category theory of language: from syntax to semantics

    Authors: Tai-Danae Bradley, John Terilla, Yiannis Vlassopoulos

    Abstract: State of the art language models return a natural language text continuation from any piece of input text. This ability to generate coherent text extensions implies significant sophistication, including a knowledge of grammar and semantics. In this paper, we propose a mathematical framework for passing from probability distributions on extensions of given texts, such as the ones learned by today's…

    Submitted 17 November, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: 29 pages; v2 major revision with new proofs and computations

  3. arXiv:2003.01039  [pdf, other]

    cs.LG quant-ph stat.ML

    Tensor Networks for Probabilistic Sequence Modeling

    Authors: Jacob Miller, Guillaume Rabusseau, John Terilla

    Abstract: Tensor networks are a powerful modeling framework developed for computational many-body physics, which have only recently been applied within machine learning. In this work we utilize a uniform matrix product state (u-MPS) model for probabilistic modeling of sequence data. We first show that u-MPS enable sequence-level parallelism, with length-n sequences able to be evaluated in depth O(log n). We…

    Submitted 23 April, 2021; v1 submitted 2 March, 2020; originally announced March 2020.

    Comments: 18 pages, 2 figures; v4 conference version; v3 link to code for experiments; v2 major revision with new main result on regular expression sampling. International Conference on Artificial Intelligence and Statistics. PMLR, 2021

  4. arXiv:1910.07425  [pdf, other]

    quant-ph cs.LG stat.ML

    Modeling Sequences with Quantum States: A Look Under the Hood

    Authors: Tai-Danae Bradley, E. Miles Stoudenmire, John Terilla

    Abstract: Classical probability distributions on sets of sequences can be modeled using quantum states. Here, we do so with a quantum state that is pure and entangled. Because it is entangled, the reduced densities that describe subsystems also carry information about the complementary subsystem. This is in contrast to the classical marginal distributions on a subsystem in which information about the comple…

    Submitted 16 October, 2019; originally announced October 2019.

    Comments: 27 pages

    Journal ref: 2020 Mach. Learn.: Sci. Technol. 1 035008

  5. arXiv:1902.06888  [pdf, other]

    quant-ph cs.LG stat.ML

    Probabilistic Modeling with Matrix Product States

    Authors: James Stokes, John Terilla

    Abstract: Inspired by the possibility that generative models based on quantum circuits can provide a useful inductive bias for sequence modeling tasks, we propose an efficient training algorithm for a subset of classically simulable quantum circuit models. The gradient-free algorithm, presented as a sequence of exactly solvable effective models, is a modification of the density matrix renormalization group…

    Submitted 18 February, 2019; originally announced February 2019.

  6. arXiv:1711.01416  [pdf, ps, other]

    cs.CL cond-mat.dis-nn cs.LG cs.NE stat.ML

    Language as a matrix product state

    Authors: Vasily Pestun, John Terilla, Yiannis Vlassopoulos

    Abstract: We propose a statistical model for natural language that begins by considering language as a monoid, then representing it in complex matrices with a compatible translation invariant probability measure. We interpret the probability measure as arising via the Born rule from a translation invariant matrix product state.

    Submitted 4 November, 2017; originally announced November 2017.

    Comments: 10 pages