Showing 1–1 of 1 results for author: Risher, B

  1. arXiv:2404.16251  [pdf, other]

    cs.CR cs.AI cs.CL

    Prompt Leakage effect and defense strategies for multi-turn LLM interactions

    Authors: Divyansh Agarwal, Alexander R. Fabbri, Ben Risher, Philippe Laban, Shafiq Joty, Chien-Sheng Wu

    Abstract: Prompt leakage poses a compelling security and privacy threat in LLM applications. Leakage of system prompts may compromise intellectual property, and act as adversarial reconnaissance for an attacker. A systematic evaluation of prompt leakage threats and mitigation strategies is lacking, especially for multi-turn LLM interactions. In this paper, we systematically investigate LLM vulnerabilities a…

    Submitted 29 July, 2024; v1 submitted 24 April, 2024; originally announced April 2024.