Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Siegel, Z S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.12883  [pdf, other

    cs.CL cs.AI cs.IR

    BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

    Authors: Hongjin Su, Howard Yen, Mengzhou Xia, Weijia Shi, Niklas Muennighoff, Han-yu Wang, Haisu Liu, Quan Shi, Zachary S. Siegel, Michael Tang, Ruoxi Sun, Jinsung Yoon, Sercan O. Arik, Danqi Chen, Tao Yu

    Abstract: Existing retrieval benchmarks primarily consist of information-seeking queries (e.g., aggregated questions from search engines) where keyword or semantic-based retrieval is usually sufficient. However, many complex real-world queries require in-depth reasoning to identify relevant documents that go beyond surface form matching. For example, finding documentation for a coding question requires unde… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 50 pages

  2. arXiv:2407.01502  [pdf, other

    cs.LG cs.AI

    AI Agents That Matter

    Authors: Sayash Kapoor, Benedikt Stroebl, Zachary S. Siegel, Nitya Nadgir, Arvind Narayanan

    Abstract: AI agents are an exciting new research direction, and agent development is driven by benchmarks. Our analysis of current agent benchmarks and evaluation practices reveals several shortcomings that hinder their usefulness in real-world applications. First, there is a narrow focus on accuracy without attention to other metrics. As a result, SOTA agents are needlessly complex and costly, and the comm… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2312.08566  [pdf, other

    cs.AI cs.CL cs.RO

    Learning adaptive planning representations with natural language guidance

    Authors: Lionel Wong, Jiayuan Mao, Pratyusha Sharma, Zachary S. Siegel, Jiahai Feng, Noa Korneev, Joshua B. Tenenbaum, Jacob Andreas

    Abstract: Effective planning in the real world requires not only world knowledge, but the ability to leverage that knowledge to build the right representation of the task at hand. Decades of hierarchical planning techniques have used domain-specific temporal action abstractions to support efficient and accurate planning, almost always relying on human priors and domain knowledge to decompose hard tasks into… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  4. arXiv:2206.05794  [pdf, other

    cs.LG stat.ML

    Characterizing the Implicit Bias of Regularized SGD in Rank Minimization

    Authors: Tomer Galanti, Zachary S. Siegel, Aparna Gupte, Tomaso Poggio

    Abstract: We study the bias of Stochastic Gradient Descent (SGD) to learn low-rank weight matrices when training deep neural networks. Our results show that training neural networks with mini-batch SGD and weight decay causes a bias towards rank minimization over the weight matrices. Specifically, we show, both theoretically and empirically, that this bias is more pronounced when using smaller batch sizes,… ▽ More

    Submitted 25 October, 2023; v1 submitted 12 June, 2022; originally announced June 2022.