
Showing 1–5 of 5 results for author: Marshall, K O

Searching in archive cs.
  1. arXiv:2407.04180

    cs.CV

    Slice-100K: A Multimodal Dataset for Extrusion-based 3D Printing

    Authors: Anushrut Jignasu, Kelly O. Marshall, Ankush Kumar Mishra, Lucas Nerone Rillo, Baskar Ganapathysubramanian, Aditya Balu, Chinmay Hegde, Adarsh Krishnamurthy

    Abstract: G-code (Geometric code) or RS-274 is the most widely used computer numerical control (CNC) and 3D printing programming language. G-code provides machine instructions for the movement of the 3D printer, especially for the nozzle, stage, and extrusion of material for extrusion-based additive manufacturing. Currently there does not exist a large repository of curated CAD models along with their corre…

    Submitted 11 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

    Comments: Replaced "SLICE-100K" with "Slice-100K", added acknowledgements, and updated main figure to better capture shadows

  2. arXiv:2404.03631

    cs.CV

    Robust Concept Erasure Using Task Vectors

    Authors: Minh Pham, Kelly O. Marshall, Chinmay Hegde, Niv Cohen

    Abstract: With the rapid growth of text-to-image models, a variety of techniques have been suggested to prevent undesirable image generations. Yet, these methods often only protect against specific user prompts and have been shown to allow unsafe generations with other inputs. Here we focus on unconditionally erasing a concept from a text-to-image model rather than conditioning the erasure on the user's pro…

    Submitted 4 April, 2024; originally announced April 2024.

  3. arXiv:2402.18085

    cs.SD cs.CR eess.AS

    AI-assisted Tagging of Deepfake Audio Calls using Challenge-Response

    Authors: Govind Mittal, Arthur Jakobsson, Kelly O. Marshall, Chinmay Hegde, Nasir Memon

    Abstract: Scammers are aggressively leveraging AI voice-cloning technology for social engineering attacks, a situation significantly worsened by the advent of audio Real-time Deepfakes (RTDFs). RTDFs can clone a target's voice in real-time over phone calls, making these interactions highly interactive and thus far more convincing. Our research confidently addresses the gap in the existing literature on deep…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: Dataset will be made public by end of March 2024

  4. arXiv:2308.01508

    cs.LG cs.CR cs.CV

    Circumventing Concept Erasure Methods For Text-to-Image Generative Models

    Authors: Minh Pham, Kelly O. Marshall, Niv Cohen, Govind Mittal, Chinmay Hegde

    Abstract: Text-to-image generative models can produce photo-realistic images for an extremely broad range of concepts, and their usage has proliferated widely among the general public. On the flip side, these models have numerous drawbacks, including their potential to generate images featuring sexually explicit content, mirror artistic styles without permission, or even hallucinate (or deepfake) the likene…

    Submitted 8 October, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

  5. arXiv:2306.08183

    cs.CV

    ZeroForge: Feedforward Text-to-Shape Without 3D Supervision

    Authors: Kelly O. Marshall, Minh Pham, Ameya Joshi, Anushrut Jignasu, Aditya Balu, Adarsh Krishnamurthy, Chinmay Hegde

    Abstract: Current state-of-the-art methods for text-to-shape generation either require supervised training using a labeled dataset of pre-defined 3D shapes, or perform expensive inference-time optimization of implicit neural representations. In this work, we present ZeroForge, an approach for zero-shot text-to-shape generation that avoids both pitfalls. To achieve open-vocabulary shape generation, we requir…

    Submitted 15 June, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: 19 pages, high-resolution figures needed to demonstrate 3D results