Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Roberts, S T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.07786  [pdf, ps, other

    cs.HC cs.AI cs.CY

    The Human Factor in AI Red Teaming: Perspectives from Social and Collaborative Computing

    Authors: Alice Qian Zhang, Ryland Shaw, Jacy Reese Anthis, Ashlee Milton, Emily Tseng, Jina Suh, Lama Ahmad, Ram Shankar Siva Kumar, Julian Posada, Benjamin Shestakofsky, Sarah T. Roberts, Mary L. Gray

    Abstract: Rapid progress in general-purpose AI has sparked significant interest in "red teaming," a practice of adversarial testing originating in military and cybersecurity applications. AI red teaming raises many questions about the human factor, such as how red teamers are selected, biases and blindspots in how tests are conducted, and harmful content's psychological effects on red teamers. A growing bod… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Workshop proposal accepted to CSCW 2024