Showing 1–7 of 7 results for author: Weissman, J

  1. arXiv:2407.21783  [pdf, other]

    cs.AI cs.CL cs.CV

    The Llama 3 Herd of Models

    Authors: Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang, et al. (510 additional authors not shown)

    Abstract: Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical…

    Submitted 15 August, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

  2. arXiv:2311.15838  [pdf, other]

    cs.LG cs.AI

    Utilizing Explainability Techniques for Reinforcement Learning Model Assurance

    Authors: Alexander Tapley, Kyle Gatesman, Luis Robaina, Brett Bissey, Joseph Weissman

    Abstract: Explainable Reinforcement Learning (XRL) can provide transparency into the decision-making process of a Deep Reinforcement Learning (DRL) model and increase user trust and adoption in real-world use cases. By utilizing XRL techniques, researchers can identify potential vulnerabilities within a trained DRL model prior to deployment, therefore limiting the potential for mission failure or mistakes b…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 9 pages, 8 figures including appendices (A, B, C). Accepted as a poster presentation in the demo track at the "XAI in Action: Past, Present, and Future Applications" workshop at NeurIPS 2023. MITRE Public Release Case Number 23-3095

  3. arXiv:2212.01984  [pdf, other]

    cs.DC

    Locality, Latency and Spatial-Aware Data Placement Strategies at the Edge

    Authors: N. Sreekumar, A. Chandra, J. B. Weissman

    Abstract: The vast data deluge at the network's edge is raising multiple challenges for the edge computing community. One of them is identifying edge storage servers where data from edge devices/sensors have to be stored to ensure low latency access services to emerging edge applications. Existing data placement algorithms mainly focus on locality, latency, and zoning to select edge storage servers under mu…

    Submitted 6 April, 2023; v1 submitted 4 December, 2022; originally announced December 2022.

  4. arXiv:2201.12394  [pdf, other]

    cs.DC

    Constellation: An Edge-Based Semantic Runtime System for Internet of Things Applications

    Authors: Mitch Terrell, Yixuan Wang, Matt Dorow, Soumya Agrawal, Bhaargav Sriraman, Zach Leidall, Abhishek Chandra, Jon Weissman

    Abstract: With the global Internet of Things (IoT) market size predicted to grow to over 1 trillion dollars in the next 5 years, many large corporations are scrambling to solidify their product line as the de facto device suite for consumers. This has led to each corporation developing their devices in a siloed environment with unique protocols and runtime frameworks that explicitly exclude the ability to work…

    Submitted 28 January, 2022; originally announced January 2022.

    Comments: 15 pages, 11 figures, 2 tables

  5. arXiv:2111.12002  [pdf, other]

    cs.DC

    Armada: A Robust Latency-Sensitive Edge Cloud in Heterogeneous Edge-Dense Environments

    Authors: Lei Huang, Zhiying Liang, Nikhil Sreekumar, Sumanth Kaushik Vishwanath, Cody Perakslis, Abhishek Chandra, Jon Weissman

    Abstract: Edge computing has enabled a large set of emerging edge applications by exploiting data proximity and offloading latency-sensitive and computation-intensive workloads to nearby edge servers. However, supporting edge application users at scale in wide-area environments poses challenges due to limited point-of-presence edge sites and constrained elasticity. In this paper, we introduce Armada: a dens…

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: 13 pages, 13 figures

    ACM Class: C.2.4; D.4.5; D.4.7

  6. arXiv:1504.04720  [pdf, other]

    cs.DC

    Integrating Abstractions to Enhance the Execution of Distributed Applications

    Authors: Matteo Turilli, Feng Liu, Zhao Zhang, Andre Merzky, Michael Wilde, Jon Weissman, Daniel S. Katz, Shantenu Jha

    Abstract: One of the factors that limits the scale, performance, and sophistication of distributed applications is the difficulty of concurrently executing them on multiple distributed computing resources. In part, this is due to a poor understanding of the general properties and performance of the coupling between applications and dynamic resources. This paper addresses this issue by integrating abstractio…

    Submitted 18 February, 2016; v1 submitted 18 April, 2015; originally announced April 2015.

  7. arXiv:1208.2649  [pdf, other]

    cs.DC

    Survey and Analysis of Production Distributed Computing Infrastructures

    Authors: Daniel S. Katz, Shantenu Jha, Manish Parashar, Omer Rana, Jon Weissman

    Abstract: This report has two objectives. First, we describe a set of the production distributed infrastructures currently available, so that the reader has a basic understanding of them. This includes explaining why each infrastructure was created and made available and how it has succeeded and failed. The set is not complete, but we believe it is representative. Second, we describe the infrastructures i…

    Submitted 13 August, 2012; originally announced August 2012.

    Report number: Computation Institute, University of Chicago, Technical Report CI-TR-7-0811

    ACM Class: C.2.4; C.5.0; K.6.0