Showing 1–8 of 8 results for author: Bowen, R S

Searching in archive cs.
  1. arXiv:2404.03145

    cs.CV

    DreamWalk: Style Space Exploration using Diffusion Guidance

    Authors: Michelle Shu, Charles Herrmann, Richard Strong Bowen, Forrester Cole, Ramin Zabih

    Abstract: Text-conditioned diffusion models can generate impressive images, but fall short when it comes to fine-grained control. Unlike direct-editing tools like Photoshop, text conditioned models require the artist to perform "prompt engineering," constructing special text sentences to control the style or amount of a particular subject present in the output image. Our goal is to provide fine-grained cont…

    Submitted 3 April, 2024; originally announced April 2024.

  2. arXiv:2112.01502

    cs.CV

    Dimensions of Motion: Monocular Prediction through Flow Subspaces

    Authors: Richard Strong Bowen, Richard Tucker, Ramin Zabih, Noah Snavely

    Abstract: We introduce a way to learn to estimate a scene representation from a single image by predicting a low-dimensional subspace of optical flow for each training example, which encompasses the variety of possible camera and object movement. Supervision is provided by a novel loss which measures the distance between this predicted flow subspace and an observed optical flow. This provides a new approach…

    Submitted 26 October, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: Project page at https://dimensions-of-motion.github.io/

  3. arXiv:2108.09641

    eess.IV cs.CV

    Deep survival analysis with longitudinal X-rays for COVID-19

    Authors: Michelle Shu, Richard Strong Bowen, Charles Herrmann, Gengmo Qi, Michele Santacatterina, Ramin Zabih

    Abstract: Time-to-event analysis is an important statistical tool for allocating clinical resources such as ICU beds. However, classical techniques like the Cox model cannot directly incorporate images due to their high dimensionality. We propose a deep learning approach that naturally incorporates multiple, time-dependent imaging studies as well as non-imaging data into time-to-event analysis. Our techniqu…

    Submitted 22 August, 2021; originally announced August 2021.

  4. arXiv:2011.11789

    cs.CV

    Object-centered image stitching

    Authors: Charles Herrmann, Chen Wang, Richard Strong Bowen, Emil Keyder, Ramin Zabih

    Abstract: Image stitching is typically decomposed into three phases: registration, which aligns the source images with a common target image; seam finding, which determines for each target pixel the source image it should come from; and blending, which smooths transitions over the seams. As described in [1], the seam finding phase attempts to place seams between pixels where the transition between source im…

    Submitted 23 November, 2020; originally announced November 2020.

    Comments: ECCV 2018

  5. arXiv:2011.11784

    cs.CV

    Robust image stitching with multiple registrations

    Authors: Charles Herrmann, Chen Wang, Richard Strong Bowen, Emil Keyder, Michael Krainin, Ce Liu, Ramin Zabih

    Abstract: Panorama creation is one of the most widely deployed techniques in computer vision. In addition to industry applications such as Google Street View, it is also used by millions of consumers in smartphones and other cameras. Traditionally, the problem is decomposed into three phases: registration, which picks a single transformation of each source image to align it to the other inputs, seam finding…

    Submitted 23 November, 2020; originally announced November 2020.

    Comments: ECCV 2018

  6. arXiv:2004.12260

    cs.CV

    Learning to Autofocus

    Authors: Charles Herrmann, Richard Strong Bowen, Neal Wadhwa, Rahul Garg, Qiurui He, Jonathan T. Barron, Ramin Zabih

    Abstract: Autofocus is an important task for digital cameras, yet current approaches often exhibit poor performance. We propose a learning-based approach to this problem, and provide a realistic dataset of sufficient size for effective learning. Our dataset is labeled with per-pixel depths obtained from multi-view stereo, following "Learning single camera depth estimation using dual-pixels". Using this data…

    Submitted 2 May, 2020; v1 submitted 25 April, 2020; originally announced April 2020.

    Comments: CVPR 2020

  7. arXiv:1812.04180

    cs.CV

    Channel selection using Gumbel Softmax

    Authors: Charles Herrmann, Richard Strong Bowen, Ramin Zabih

    Abstract: Important applications such as mobile computing require reducing the computational costs of neural network inference. Ideally, applications would specify their preferred tradeoff between accuracy and speed, and the network would optimize this end-to-end, using classification error to remove parts of the network. Increasing speed can be done either during training - e.g., pruning filters - or durin…

    Submitted 23 November, 2020; v1 submitted 10 December, 2018; originally announced December 2018.

    Comments: ECCV 2020

  8. arXiv:0902.2407

    cs.CC cs.SC

    Group-Theoretic Partial Matrix Multiplication

    Authors: Richard Strong Bowen, Bo Chen, Hendrik Orem, Martijn van Schaardenburg

    Abstract: A generalization of recent group-theoretic matrix multiplication algorithms to an analogue of the theory of partial matrix multiplication is presented. We demonstrate that the added flexibility of this approach can in some cases improve upper bounds on the exponent of matrix multiplication yielded by group-theoretic full matrix multiplication. The group theory behind our partial matrix multiplic…

    Submitted 13 February, 2009; originally announced February 2009.

    Comments: 14 pages, 3 figures