Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Doshi, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.00871  [pdf, other

    cs.LG cs.CL stat.ML

    Pretraining Data Mixtures Enable Narrow Model Selection Capabilities in Transformer Models

    Authors: Steve Yadlowsky, Lyric Doshi, Nilesh Tripuraneni

    Abstract: Transformer models, notably large language models (LLMs), have the remarkable ability to perform in-context learning (ICL) -- to perform new tasks when prompted with unseen input-output examples without any explicit model training. In this work, we study how effectively transformers can bridge between their pretraining data mixture, comprised of multiple distinct task families, to identify and lea… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  2. arXiv:2306.06798  [pdf, other

    cs.DB cs.LG

    Kepler: Robust Learning for Faster Parametric Query Optimization

    Authors: Lyric Doshi, Vincent Zhuang, Gaurav Jain, Ryan Marcus, Haoyu Huang, Deniz Altinbüken, Eugene Brevdo, Campbell Fraser

    Abstract: Most existing parametric query optimization (PQO) techniques rely on traditional query optimizer cost models, which are often inaccurate and result in suboptimal query performance. We propose Kepler, an end-to-end learning-based approach to PQO that demonstrates significant speedups in query latency over a traditional query optimizer. Central to our method is Row Count Evolution (RCE), a novel pla… ▽ More

    Submitted 18 October, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

    Comments: SIGMOD 2023

  3. arXiv:2012.12501  [pdf, other

    cs.DB cs.DC cs.LG

    Learned Indexes for a Google-scale Disk-based Database

    Authors: Hussam Abu-Libdeh, Deniz Altınbüken, Alex Beutel, Ed H. Chi, Lyric Doshi, Tim Kraska, Xiaozhou, Li, Andy Ly, Christopher Olston

    Abstract: There is great excitement about learned index structures, but understandable skepticism about the practicality of a new method uprooting decades of research on B-Trees. In this paper, we work to remove some of that uncertainty by demonstrating how a learned index can be integrated in a distributed, disk-based database system: Google's Bigtable. We detail several design decisions we made to integra… ▽ More

    Submitted 23 December, 2020; originally announced December 2020.

    Comments: 4 pages, Presented at Workshop on ML for Systems at NeurIPS 2020

  4. arXiv:1208.4173  [pdf, other

    cs.DB

    The Vertica Analytic Database: C-Store 7 Years Later

    Authors: Andrew Lamb, Matt Fuller, Ramakrishna Varadarajan, Nga Tran, Ben Vandier, Lyric Doshi, Chuck Bear

    Abstract: This paper describes the system architecture of the Vertica Analytic Database (Vertica), a commercialization of the design of the C-Store research prototype. Vertica demonstrates a modern commercial RDBMS system that presents a classical relational interface while at the same time achieving the high performance expected from modern "web scale" analytic systems by making appropriate architectural c… ▽ More

    Submitted 20 August, 2012; originally announced August 2012.

    Comments: VLDB2012

    Journal ref: Proceedings of the VLDB Endowment (PVLDB), Vol. 5, No. 12, pp. 1790-1801 (2012)