Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Wasil, A R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.16074  [pdf, other

    cs.CY cs.AI

    Verification methods for international AI agreements

    Authors: Akash R. Wasil, Tom Reed, Jack William Miller, Peter Barnett

    Abstract: What techniques can be used to verify compliance with international agreements about advanced AI development? In this paper, we examine 10 verification methods that could detect two types of potential violations: unauthorized AI training (e.g., training runs above a certain FLOP threshold) and unauthorized data centers. We divide the verification methods into three categories: (a) national technic… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

  2. arXiv:2406.15371  [pdf, ps, other

    cs.CY cs.AI

    Affirmative safety: An approach to risk management for high-risk AI

    Authors: Akash R. Wasil, Joshua Clymer, David Krueger, Emily Dardaman, Simeon Campos, Evan R. Murphy

    Abstract: Prominent AI experts have suggested that companies developing high-risk AI systems should be required to show that such systems are safe before they can be developed or deployed. The goal of this paper is to expand on this idea and explore its implications for risk management. We argue that entities developing or deploying high-risk AI systems should be required to present evidence of affirmative… ▽ More

    Submitted 14 April, 2024; originally announced June 2024.