Showing 1–19 of 19 results for author: Reda, F

  1. arXiv:2306.08276  [pdf, other]

    cs.CV cs.GR

    TryOnDiffusion: A Tale of Two UNets

    Authors: Luyang Zhu, Dawei Yang, Tyler Zhu, Fitsum Reda, William Chan, Chitwan Saharia, Mohammad Norouzi, Ira Kemelmacher-Shlizerman

    Abstract: Given two images depicting a person and a garment worn by another person, our goal is to generate a visualization of how the garment might look on the input person. A key challenge is to synthesize a photorealistic detail-preserving visualization of the garment, while warping the garment to accommodate a significant body pose and shape change across the subjects. Previous methods either focus on g…

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: CVPR 2023. Project page: https://tryondiffusion.github.io/

  2. arXiv:2301.00245  [pdf]

    physics.optics

    Reprogrammable holograms from maskless surface photo-morphing

    Authors: Francesco Reda, Marcella Salvatore, Marco Astarita, Fabio Borbone, Stefano L. Oscurato

    Abstract: Holographic technologies have the potentiality to impact our everyday life in many sectors including science, education, entertainment, art, and healthcare. Although holographic screens and projectors are part of common imagination since long time, they are still at initial stages of development and integration. Recent achievements of metasurface and flat optics research gave an unprecedented stre…

    Submitted 31 December, 2022; originally announced January 2023.

  3. arXiv:2203.14863  [pdf, other]

    cs.CV cs.MM

    HIME: Efficient Headshot Image Super-Resolution with Multiple Exemplars

    Authors: Xiaoyu Xiang, Jon Morton, Fitsum A Reda, Lucas Young, Federico Perazzi, Rakesh Ranjan, Amit Kumar, Andrea Colaco, Jan Allebach

    Abstract: A promising direction for recovering the lost information in low-resolution headshot images is utilizing a set of high-resolution exemplars from the same identity. Complementary images in the reference set can improve the generated headshot quality across many different views and poses. However, it is challenging to make the best use of multiple exemplars: the quality and alignment of each exempla…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Technical Report

  4. arXiv:2203.10712  [pdf, other]

    cs.CV

    Disentangling Architecture and Training for Optical Flow

    Authors: Deqing Sun, Charles Herrmann, Fitsum Reda, Michael Rubinstein, David Fleet, William T. Freeman

    Abstract: How important are training details and datasets to recent optical flow models like RAFT? And do they generalize? To explore these questions, rather than develop a new model, we revisit three prominent models, PWC-Net, IRR-PWC and RAFT, with a common set of modern training techniques and datasets, and observe significant performance gains, demonstrating the importance and generality of these traini…

    Submitted 19 September, 2022; v1 submitted 20 March, 2022; originally announced March 2022.

    Comments: Accepted to ECCV22. 33 pages, including supplementals. Website at: https://autoflow-google.github.io/

  5. arXiv:2202.04901  [pdf, other]

    cs.CV

    FILM: Frame Interpolation for Large Motion

    Authors: Fitsum Reda, Janne Kontkanen, Eric Tabellion, Deqing Sun, Caroline Pantofaru, Brian Curless

    Abstract: We present a frame interpolation algorithm that synthesizes multiple intermediate frames from two input images with large in-between motion. Recent methods use multiple networks to estimate optical flow or depth and a separate network dedicated to frame synthesis. This is often complex and requires scarce optical flow or depth ground-truth. In this work, we present a single unified network, distin…

    Submitted 16 July, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: Accepted to ECCV 2022. Project website: https://film-net.github.io. Code: https://github.com/google-research/frame-interpolation. YouTube: https://www.youtube.com/watch?v=OAD-BieIjH4

  6. arXiv:2107.12786  [pdf]

    physics.optics

    Shapeshifting diffractive optical devices

    Authors: S. L. Oscurato, F. Reda, M. Salvatore, F. Borbone, P. Maddalena, A. Ambrosio

    Abstract: In optical devices like diffraction gratings and Fresnel lenses, light wavefront is engineered through the structuring of device surface morphology, within thicknesses comparable to the light wavelength. Fabrication of such diffractive optical elements involves highly accurate multi-step lithographic processes that in fact set into stone both the device morphology and optical functionality. In thi…

    Submitted 27 July, 2021; originally announced July 2021.

    Comments: 26 pages, 12 figures

  7. arXiv:2103.01524  [pdf, other]

    eess.IV cs.CV cs.LG

    Feature-Align Network with Knowledge Distillation for Efficient Denoising

    Authors: Lucas D. Young, Fitsum A. Reda, Rakesh Ranjan, Jon Morton, Jun Hu, Yazhu Ling, Xiaoyu Xiang, David Liu, Vikas Chandra

    Abstract: We propose an efficient neural network for RAW image denoising. Although neural network-based denoising has been extensively studied for image restoration, little attention has been given to efficient denoising for compute limited and power sensitive devices, such as smartphones and smartwatches. In this paper, we present a novel architecture and a suite of training techniques for high quality den…

    Submitted 17 March, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

    MSC Class: 94A08 (Primary) 68T07; 65D19 (Secondary) ACM Class: I.4.5; I.2.6

  8. arXiv:2012.02228  [pdf, other]

    cs.CV cs.LG eess.IV

    EVRNet: Efficient Video Restoration on Edge Devices

    Authors: Sachin Mehta, Amit Kumar, Fitsum Reda, Varun Nasery, Vikram Mulukutla, Rakesh Ranjan, Vikas Chandra

    Abstract: Video transmission applications (e.g., conferencing) are gaining momentum, especially in times of global health pandemic. Video signals are transmitted over lossy channels, resulting in low-quality received signals. To restore videos on recipient edge devices in real-time, we introduce an efficient video restoration network, EVRNet. EVRNet efficiently allocates parameters inside the network using…

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: Technical report

  9. arXiv:2007.07243  [pdf, other]

    cs.CV cs.GR

    Transposer: Universal Texture Synthesis Using Feature Maps as Transposed Convolution Filter

    Authors: Guilin Liu, Rohan Taori, Ting-Chun Wang, Zhiding Yu, Shiqiu Liu, Fitsum A. Reda, Karan Sapra, Andrew Tao, Bryan Catanzaro

    Abstract: Conventional CNNs for texture synthesis consist of a sequence of (de)-convolution and up/down-sampling layers, where each layer operates locally and lacks the ability to capture the long-term structural dependency required by texture synthesis. Thus, they often simply enlarge the input texture, rather than perform reasonable synthesis. As a compromise, many recent methods sacrifice generalizabilit…

    Submitted 14 July, 2020; originally announced July 2020.

  10. arXiv:1912.11683  [pdf, other]

    cs.CV cs.LG eess.IV

    Neural ODEs for Image Segmentation with Level Sets

    Authors: Rafael Valle, Fitsum Reda, Mohammad Shoeybi, Patrick Legresley, Andrew Tao, Bryan Catanzaro

    Abstract: We propose a novel approach for image segmentation that combines Neural Ordinary Differential Equations (NODEs) and the Level Set method. Our approach parametrizes the evolution of an initial contour with a NODE that implicitly learns from data a speed function describing the evolution. In addition, for cases where an initial contour is not available and to alleviate the need for careful choice or…

    Submitted 25 December, 2019; originally announced December 2019.

  11. arXiv:1906.05928  [pdf, other]

    cs.CV

    Unsupervised Video Interpolation Using Cycle Consistency

    Authors: Fitsum A. Reda, Deqing Sun, Aysegul Dundar, Mohammad Shoeybi, Guilin Liu, Kevin J. Shih, Andrew Tao, Jan Kautz, Bryan Catanzaro

    Abstract: Learning to synthesize high frame rate videos via interpolation requires large quantities of high frame rate training videos, which, however, are scarce, especially at high resolutions. Here, we propose unsupervised techniques to synthesize high frame rate videos directly from low frame rate videos using cycle consistency. For a triplet of consecutive frames, we optimize models to minimize the dis…

    Submitted 27 March, 2021; v1 submitted 13 June, 2019; originally announced June 2019.

    Comments: Published in ICCV 2019. Codes are available at https://github.com/NVIDIA/unsupervised-video-interpolation. Project website https://nv-adlr.github.io/publication/2019-UnsupervisedVideoInterpolation

  12. arXiv:1812.01593  [pdf, other]

    cs.CV cs.AI cs.MM cs.RO

    Improving Semantic Segmentation via Video Propagation and Label Relaxation

    Authors: Yi Zhu, Karan Sapra, Fitsum A. Reda, Kevin J. Shih, Shawn Newsam, Andrew Tao, Bryan Catanzaro

    Abstract: Semantic segmentation requires large amounts of pixel-wise annotations to learn accurate models. In this paper, we present a video prediction-based methodology to scale up training sets by synthesizing new training samples in order to improve the accuracy of semantic segmentation networks. We exploit video prediction models' ability to predict future frames in order to also predict future labels.…

    Submitted 2 July, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: CVPR 2019 Oral. Code link: https://github.com/NVIDIA/semantic-segmentation. YouTube link: https://www.youtube.com/watch?v=aEbXjGZDZSQ

  13. arXiv:1811.11718  [pdf, other]

    cs.CV

    Partial Convolution based Padding

    Authors: Guilin Liu, Kevin J. Shih, Ting-Chun Wang, Fitsum A. Reda, Karan Sapra, Zhiding Yu, Andrew Tao, Bryan Catanzaro

    Abstract: In this paper, we present a simple yet effective padding scheme that can be used as a drop-in module for existing convolutional neural networks. We call it partial convolution based padding, with the intuition that the padded region can be treated as holes and the original input as non-holes. Specifically, during the convolution operation, the convolution results are re-weighted near image borders…

    Submitted 28 November, 2018; originally announced November 2018.

    Comments: 11 pages; code is available at https://github.com/NVIDIA/partialconv

  14. arXiv:1811.00684  [pdf, other]

    cs.CV

    SDCNet: Video Prediction Using Spatially-Displaced Convolution

    Authors: Fitsum A. Reda, Guilin Liu, Kevin J. Shih, Robert Kirby, Jon Barker, David Tarjan, Andrew Tao, Bryan Catanzaro

    Abstract: We present an approach for high-resolution video frame prediction by conditioning on both past frames and past optical flows. Previous approaches rely on resampling past frames, guided by a learned future optical flow, or on direct generation of pixels. Resampling based on flow is insufficient because it cannot deal with disocclusions. Generative models currently lead to blurry results. Recent app…

    Submitted 27 March, 2021; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: Published in ECCV 2018. Codes available at https://github.com/NVIDIA/semantic-segmentation/tree/sdcnet/sdcnet. Project page available at https://nv-adlr.github.io/publication/2018-SDCNet

  15. arXiv:1804.07723  [pdf, other]

    cs.CV

    Image Inpainting for Irregular Holes Using Partial Convolutions

    Authors: Guilin Liu, Fitsum A. Reda, Kevin J. Shih, Ting-Chun Wang, Andrew Tao, Bryan Catanzaro

    Abstract: Existing deep learning based image inpainting methods use a standard convolutional network over the corrupted image, using convolutional filter responses conditioned on both valid pixels as well as the substitute values in the masked holes (typically the mean value). This often leads to artifacts such as color discrepancy and blurriness. Post-processing is usually used to reduce such artifacts, bu…

    Submitted 15 December, 2018; v1 submitted 20 April, 2018; originally announced April 2018.

    Comments: Update: camera-ready; L1 loss is size-averaged; code of partial conv layer: https://github.com/NVIDIA/partialconv. Published at ECCV 2018

  16. arXiv:1601.05472  [pdf, other]

    cs.CL

    Hierarchical Latent Word Clustering

    Authors: Halid Ziya Yerebakan, Fitsum Reda, Yiqiang Zhan, Yoshihisa Shinagawa

    Abstract: This paper presents a new Bayesian non-parametric model by extending the usage of Hierarchical Dirichlet Allocation to extract tree structured word clusters from text data. The inference algorithm of the model collects words in a cluster if they share similar distribution over documents. In our experiments, we observed meaningful hierarchical structures on NIPS corpus and radiology reports collect…

    Submitted 20 January, 2016; originally announced January 2016.

  17. The spatially resolved stellar populations of isolated early-type galaxies

    Authors: Fatma M. Reda, Robert N. Proctor, Duncan A. Forbes, George K. T. Hau, Søren S. Larsen

    Abstract: We present radial stellar population parameters for a subsample of 12 galaxies from the 36 isolated early-type galaxies of Reda et al. Using new long-slit spectra, central values and radial gradients for the stellar age, metallicity [Z/H] and alpha-element abundance [E/Fe] are measured. Similarly, the central stellar population parameters are derived for a further 5 isolated early-type galaxies…

    Submitted 20 March, 2007; originally announced March 2007.

    Comments: The paper contains 8 figures and 5 tables. Accepted for publication in MNRAS

    Journal ref: Mon.Not.Roy.Astron.Soc.377:1772-1784,2007

  18. The fundamental plane of isolated early-type galaxies

    Authors: Fatma M. Reda, Duncan A. Forbes, George K. T. Hau

    Abstract: Here we present new measurements of effective radii, surface brightnesses and internal velocity dispersions for 23 isolated early-type galaxies. The photometric properties are derived from new multi-colour imaging of 10 galaxies, whereas the central kinematics for 7 galaxies are taken from forthcoming work by Hau & Forbes. These are supplemented with data from the literature. We reproduce the co…

    Submitted 12 May, 2005; originally announced May 2005.

    Comments: 11 pages, 7 Postscript figures, 2 JPEG figures. Accepted for publication in MNRAS

  19. The Photometric Properties of Isolated Early-Type Galaxies

    Authors: Fatma M. Reda, Duncan A. Forbes, Michael A. Beasley, Ewan J. O'Sullivan, Paul Goudfrooij

    Abstract: Isolated galaxies are important since they probe the lowest density regimes inhabited by galaxies. We define a sample of 36 nearby isolated early-type galaxies for further study. Our isolation criteria require them to have no comparable-mass neighbours within 2 B-band magnitudes, 0.67 Mpc in the plane of the sky and 700 km/s in recession velocity. New wide-field optical imaging of 10 isolated ga…

    Submitted 23 August, 2004; originally announced August 2004.

    Comments: 16 pages, LaTeX, 17 figures, 6 tables, MNRAS in press

    Journal ref: Mon.Not.Roy.Astron.Soc. 354 (2004) 851-869