Showing 1–50 of 63 results for author: Jacobs, D

  1. arXiv:2407.02489  [pdf, other]

    cs.CV cs.AI cs.GR cs.HC cs.LG

    Magic Insert: Style-Aware Drag-and-Drop

    Authors: Nataniel Ruiz, Yuanzhen Li, Neal Wadhwa, Yael Pritch, Michael Rubinstein, David E. Jacobs, Shlomi Fruchter

    Abstract: We present Magic Insert, a method for dragging-and-dropping subjects from a user-provided image into a target image of a different style in a physically plausible manner while matching the style of the target image. This work formalizes the problem of style-aware drag-and-drop and presents a method for tackling it by addressing two sub-problems: style-aware personalization and realistic object ins…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Project page: https://magicinsert.github.io/

  2. arXiv:2406.09417  [pdf, other]

    cs.CV cs.GR cs.LG

    Rethinking Score Distillation as a Bridge Between Image Distributions

    Authors: David McAllister, Songwei Ge, Jia-Bin Huang, David W. Jacobs, Alexei A. Efros, Aleksander Holynski, Angjoo Kanazawa

    Abstract: Score distillation sampling (SDS) has proven to be an important tool, enabling the use of large-scale diffusion priors for tasks operating in data-poor domains. Unfortunately, SDS has a number of characteristic artifacts that limit its usefulness in general-purpose applications. In this paper, we make progress toward understanding the behavior of SDS and its variants by viewing them as solving an…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Project webpage: https://sds-bridge.github.io/

  3. arXiv:2405.08813  [pdf, other]

    cs.CV cs.LG cs.MM

    CinePile: A Long Video Question Answering Dataset and Benchmark

    Authors: Ruchit Rawal, Khalid Saifullah, Ronen Basri, David Jacobs, Gowthami Somepalli, Tom Goldstein

    Abstract: Current datasets for long-form video understanding often fall short of providing genuine long-form comprehension challenges, as many tasks derived from these datasets can be successfully tackled by analyzing just one or a few random frames from a video. To address this issue, we present a novel dataset and benchmark, CinePile, specifically designed for authentic long-form video understanding. This…

    Submitted 14 June, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: Project page with all the artifacts - https://ruchitrawal.github.io/cinepile/. Updated version with results on Gemini Flash model and additional related work

  4. arXiv:2403.15651  [pdf, other]

    cs.CV

    GaNI: Global and Near Field Illumination Aware Neural Inverse Rendering

    Authors: Jiaye Wu, Saeed Hadadan, Geng Lin, Matthias Zwicker, David Jacobs, Roni Sengupta

    Abstract: In this paper, we present GaNI, a Global and Near-field Illumination-aware neural inverse rendering technique that can reconstruct geometry, albedo, and roughness parameters from images of a scene captured with co-located light and camera. Existing inverse rendering techniques with co-located light-camera focus on single objects only, without modeling global illumination and near-field lighting mo…

    Submitted 22 March, 2024; originally announced March 2024.

  5. arXiv:2402.17745  [pdf, other]

    physics.comp-ph cs.CV physics.optics

    Low-light phase retrieval with implicit generative priors

    Authors: Raunak Manekar, Elisa Negrini, Minh Pham, Daniel Jacobs, Jaideep Srivastava, Stanley J. Osher, Jianwei Miao

    Abstract: Phase retrieval (PR) is fundamentally important in scientific imaging and is crucial for nanoscale techniques like coherent diffractive imaging (CDI). Low radiation dose imaging is essential for applications involving radiation-sensitive samples. However, most PR methods struggle in low-dose scenarios due to high shot noise. Recent advancements in optical data acquisition setups, such as in-situ C…

    Submitted 23 August, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    MSC Class: 68T10 68T07 78A46

  6. arXiv:2310.18702  [pdf, other]

    cs.LG

    Towards Combinatorial Generalization for Catalysts: A Kohn-Sham Charge-Density Approach

    Authors: Phillip Pope, David Jacobs

    Abstract: The Kohn-Sham equations underlie many important applications such as the discovery of new catalysts. Recent machine learning work on catalyst modeling has focused on prediction of the energy, but has so far not yet demonstrated significant out-of-distribution generalization. Here we investigate another approach based on the pointwise learning of the Kohn-Sham charge-density. On a new dataset of bu…

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Published at NeurIPS 2023

  7. arXiv:2309.16668  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    RealFill: Reference-Driven Generation for Authentic Image Completion

    Authors: Luming Tang, Nataniel Ruiz, Qinghao Chu, Yuanzhen Li, Aleksander Holynski, David E. Jacobs, Bharath Hariharan, Yael Pritch, Neal Wadhwa, Kfir Aberman, Michael Rubinstein

    Abstract: Recent advances in generative imagery have brought forth outpainting and inpainting models that can produce high-quality, plausible image content in unknown regions. However, the content these models hallucinate is necessarily inauthentic, since they are unaware of the true scene. In this work, we propose RealFill, a novel generative approach for image completion that fills in missing regions of a…

    Submitted 14 May, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: SIGGRAPH 2024 (Journal Track). Project page: https://realfill.github.io

  8. Development and validation of an interpretable machine learning-based calculator for predicting 5-year weight trajectories after bariatric surgery: a multinational retrospective cohort SOPHIA study

    Authors: Patrick Saux, Pierre Bauvin, Violeta Raverdy, Julien Teigny, Hélène Verkindt, Tomy Soumphonphakdy, Maxence Debert, Anne Jacobs, Daan Jacobs, Valerie Monpellier, Phong Ching Lee, Chin Hong Lim, Johanna C Andersson-Assarsson, Lena Carlsson, Per-Arne Svensson, Florence Galtier, Guelareh Dezfoulian, Mihaela Moldovanu, Severine Andrieux, Julien Couster, Marie Lepage, Erminia Lembo, Ornella Verrastro, Maud Robert, Paulina Salminen, et al. (9 additional authors not shown)

    Abstract: Background: Weight loss trajectories after bariatric surgery vary widely between individuals, and predicting weight loss before the operation remains challenging. We aimed to develop a model using machine learning to provide individual preoperative prediction of 5-year weight loss trajectories after surgery. Methods: In this multinational retrospective observational study we enrolled adult participa…

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: The Lancet Digital Health, 2023

  9. arXiv:2308.01379  [pdf, other]

    cs.CV cs.GR cs.LG

    Computational Long Exposure Mobile Photography

    Authors: Eric Tabellion, Nikhil Karnad, Noa Glaser, Ben Weiss, David E. Jacobs, Yael Pritch

    Abstract: Long exposure photography produces stunning imagery, representing moving elements in a scene with motion-blur. It is generally employed in two modalities, producing either a foreground or a background blur effect. Foreground blur images are traditionally captured on a tripod-mounted camera and portray blurred moving foreground elements, such as silky water or light trails, over a perfectly sharp b…

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: 15 pages, 17 figures

    ACM Class: I.4; I.3.3; I.2.10

    Journal ref: ACM Trans. Graph. 42, 4, Article 48 (August 2023)

  10. arXiv:2306.15662  [pdf, other]

    cs.CV

    Measured Albedo in the Wild: Filling the Gap in Intrinsics Evaluation

    Authors: Jiaye Wu, Sanjoy Chowdhury, Hariharmano Shanmugaraja, David Jacobs, Soumyadip Sengupta

    Abstract: Intrinsic image decomposition and inverse rendering are long-standing problems in computer vision. To evaluate albedo recovery, most algorithms report their quantitative performance with a mean Weighted Human Disagreement Rate (WHDR) metric on the IIW dataset. However, WHDR focuses only on relative albedo values and often fails to capture overall quality of the albedo. In order to comprehensively…

    Submitted 29 June, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: Accepted into ICCP2023

  11. arXiv:2305.10474  [pdf, other]

    cs.CV cs.GR cs.LG

    Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models

    Authors: Songwei Ge, Seungjun Nah, Guilin Liu, Tyler Poon, Andrew Tao, Bryan Catanzaro, David Jacobs, Jia-Bin Huang, Ming-Yu Liu, Yogesh Balaji

    Abstract: Despite tremendous progress in generating high-quality images using diffusion models, synthesizing a sequence of animated frames that are both photorealistic and temporally coherent is still in its infancy. While off-the-shelf billion-scale datasets for image generation are available, collecting similar video data of the same scale is still challenging. Also, training a video diffusion model is co…

    Submitted 25 March, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: ICCV 2023. Project webpage: https://research.nvidia.com/labs/dir/pyoco

  12. arXiv:2304.00387  [pdf, other]

    cs.CV

    HaLP: Hallucinating Latent Positives for Skeleton-based Self-Supervised Learning of Actions

    Authors: Anshul Shah, Aniket Roy, Ketul Shah, Shlok Kumar Mishra, David Jacobs, Anoop Cherian, Rama Chellappa

    Abstract: Supervised learning of skeleton sequence encoders for action recognition has received significant attention in recent times. However, learning such encoders without labels continues to be a challenging problem. While prior works have shown promising results by applying contrastive learning to pose sequences, the quality of the learned representations is often observed to be closely tied to data au…

    Submitted 1 April, 2023; originally announced April 2023.

    Comments: To be presented at CVPR 2023

  13. arXiv:2303.12343  [pdf, other]

    cs.CV

    LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation

    Authors: Koutilya Pnvr, Bharat Singh, Pallabi Ghosh, Behjat Siddiquie, David Jacobs

    Abstract: Large-scale pre-training tasks like image classification, captioning, or self-supervised techniques do not incentivize learning the semantic boundaries of objects. However, recent generative foundation models built using text-based latent diffusion techniques may learn semantic boundaries. This is because they have to synthesize intricate details about all objects in an image based on a text descr…

    Submitted 23 August, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: Supplementary material is included in the paper following the references section

  14. arXiv:2212.00653  [pdf, other]

    cs.CV cs.LG

    Hyperbolic Contrastive Learning for Visual Representations beyond Objects

    Authors: Songwei Ge, Shlok Mishra, Simon Kornblith, Chun-Liang Li, David Jacobs

    Abstract: Although self-/un-supervised methods have led to rapid progress in visual representation learning, these methods generally treat objects and scenes using the same lens. In this paper, we focus on learning representations for objects and scenes that preserve the structure among them. Motivated by the observation that visually similar objects are close in the representation space, we argue that th…

    Submitted 1 December, 2022; originally announced December 2022.

  15. arXiv:2210.16870  [pdf, other]

    cs.CV cs.LG

    A simple, efficient and scalable contrastive masked autoencoder for learning visual representations

    Authors: Shlok Mishra, Joshua Robinson, Huiwen Chang, David Jacobs, Aaron Sarna, Aaron Maschinot, Dilip Krishnan

    Abstract: We introduce CAN, a simple, efficient and scalable method for self-supervised learning of visual representations. Our framework is a minimal and conceptually clean synthesis of (C) contrastive learning, (A) masked autoencoders, and (N) the noise prediction approach used in diffusion models. The learning mechanisms are complementary to one another: contrastive learning shapes the embedding space ac…

    Submitted 30 October, 2022; originally announced October 2022.

    Comments: Mishra and Robinson contributed equally

  16. arXiv:2206.03693  [pdf, other]

    cs.LG cs.CR

    Autoregressive Perturbations for Data Poisoning

    Authors: Pedro Sandoval-Segura, Vasu Singla, Jonas Geiping, Micah Goldblum, Tom Goldstein, David W. Jacobs

    Abstract: The prevalence of data scraping from social media as a means to obtain datasets has led to growing concerns regarding unauthorized use of data. Data poisoning attacks have been proposed as a bulwark against scraping, as they make data "unlearnable" by adding small, imperceptible perturbations. Unfortunately, existing methods require knowledge of both the target architecture and the complete datase…

    Submitted 13 October, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: Accepted to NeurIPS 2022. Code available at https://github.com/psandovalsegura/autoregressive-poisoning

  17. arXiv:2204.08615  [pdf, other]

    cs.LG cs.CR

    Poisons that are learned faster are more effective

    Authors: Pedro Sandoval-Segura, Vasu Singla, Liam Fowl, Jonas Geiping, Micah Goldblum, David Jacobs, Tom Goldstein

    Abstract: Imperceptible poisoning attacks on entire datasets have recently been touted as methods for protecting data privacy. However, among a number of defenses preventing the practical use of these techniques, early-stopping stands out as a simple, yet effective defense. To gauge poisons' vulnerability to early-stopping, we benchmark error-minimizing, error-maximizing, and synthetic poisons in terms of p…

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 8 pages, 4 figures. Accepted to CVPR 2022 Art of Robustness Workshop

  18. arXiv:2204.03638  [pdf, other]

    cs.CV cs.AI

    Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer

    Authors: Songwei Ge, Thomas Hayes, Harry Yang, Xi Yin, Guan Pang, David Jacobs, Jia-Bin Huang, Devi Parikh

    Abstract: Videos are created to express emotion, exchange information, and share experiences. Video synthesis has intrigued researchers for a long time. Despite the rapid progress driven by advances in visual synthesis, most existing studies focus on improving the frames' quality and the transitions between them, while little progress has been made in generating longer videos. In this paper, we present a me…

    Submitted 24 September, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: In ECCV 2022

  19. arXiv:2203.16515  [pdf, other]

    cs.CV

    Fast Light-Weight Near-Field Photometric Stereo

    Authors: Daniel Lichy, Soumyadip Sengupta, David W. Jacobs

    Abstract: We introduce the first end-to-end learning-based solution to near-field Photometric Stereo (PS), where the light sources are close to the object of interest. This setup is especially useful for reconstructing large immobile objects. Our method is fast, producing a mesh from 52 512×384 resolution images in about 1 second on a commodity GPU, thus potentially unlocking several AR/VR applicatio…

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

  20. arXiv:2203.09255  [pdf, ps, other]

    cs.LG cs.AI

    On the Spectral Bias of Convolutional Neural Tangent and Gaussian Process Kernels

    Authors: Amnon Geifman, Meirav Galun, David Jacobs, Ronen Basri

    Abstract: We study the properties of various over-parametrized convolutional neural architectures through their respective Gaussian process and neural tangent kernels. We prove that, with normalized multi-channel input and ReLU activation, the eigenfunctions of these kernels with the uniform measure are formed by products of spherical harmonics, defined over the channels of the different pixels. We next use…

    Submitted 17 March, 2022; originally announced March 2022.

  21. arXiv:2201.00889  [pdf, other]

    cs.LG physics.bio-ph q-bio.BM stat.ML

    Biased Hypothesis Formation From Projection Pursuit

    Authors: John Patterson, Chris Avery, Tyler Grear, Donald J. Jacobs

    Abstract: The effect of bias on hypothesis formation is characterized for an automated data-driven projection pursuit neural network to extract and select features for binary classification of data streams. This intelligent exploratory process partitions a complete vector state space into disjoint subspaces to create working hypotheses quantified by similarities and differences observed between two groups o…

    Submitted 3 January, 2022; originally announced January 2022.

    Comments: 12 pages, 5 figures

    Journal ref: Advances in Artificial Intelligence and Machine Learning. 2021;3:213-224

  22. arXiv:2112.00319  [pdf, other]

    cs.CV cs.LG

    Object-Aware Cropping for Self-Supervised Learning

    Authors: Shlok Mishra, Anshul Shah, Ankan Bansal, Abhyuday Jagannatha, Janit Anjaria, Abhishek Sharma, David Jacobs, Dilip Krishnan

    Abstract: A core component of the recent success of self-supervised learning is cropping data augmentation, which selects sub-regions of an image to be used as positive views in the self-supervised loss. The underlying assumption is that randomly cropped and resized regions of a given image share information about the objects of interest, which the learned representation will capture. This assumption is mos…

    Submitted 6 April, 2023; v1 submitted 1 December, 2021; originally announced December 2021.

    Journal ref: Transactions on Machine Learning Research 2022

  23. arXiv:2110.14189  [pdf, other]

    cs.CV cs.LG

    Robust Contrastive Learning Using Negative Samples with Diminished Semantics

    Authors: Songwei Ge, Shlok Mishra, Haohan Wang, Chun-Liang Li, David Jacobs

    Abstract: Unsupervised learning has recently made exceptional progress because of the development of more effective contrastive learning methods. However, CNNs are prone to depend on low-level features that humans deem non-semantic. This dependency has been conjectured to induce a lack of robustness to image perturbations or domain shift. In this paper, we show that by generating carefully designed negative…

    Submitted 3 January, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: NeurIPS 2021

  24. arXiv:2109.01980  [pdf, other]

    cs.CV cs.GR cs.LG

    Deep Saliency Prior for Reducing Visual Distraction

    Authors: Kfir Aberman, Junfeng He, Yossi Gandelsman, Inbar Mosseri, David E. Jacobs, Kai Kohlhoff, Yael Pritch, Michael Rubinstein

    Abstract: Using only a model that was trained to predict where people look at images, and no additional training data, we can produce a range of powerful editing effects for reducing distraction in images. Given an image and a mask specifying the region to edit, we backpropagate through a state-of-the-art saliency model to parameterize a differentiable editing operator, such that the saliency within the mas…

    Submitted 4 September, 2021; originally announced September 2021.

    Comments: https://deep-saliency-prior.github.io/

  25. arXiv:2108.11503  [pdf, other]

    cs.AI cs.CV cs.DC cs.PF

    Maneuver Identification Challenge

    Authors: Kaira Samuel, Vijay Gadepally, David Jacobs, Michael Jones, Kyle McAlpin, Kyle Palko, Ben Paulk, Sid Samsi, Ho Chit Siu, Charles Yee, Jeremy Kepner

    Abstract: AI algorithms that identify maneuvers from trajectory data could play an important role in improving flight safety and pilot training. AI challenges allow diverse teams to work together to solve hard problems and are an effective tool for developing AI solutions. AI challenges are also a key driver of AI computational requirements. The Maneuver Identification Challenge hosted at maneuver-id.mit.ed…

    Submitted 25 August, 2021; originally announced August 2021.

    Comments: 7 pages, 8 figures, 1 table, 33 references, accepted to IEEE HPEC 2021

  26. arXiv:2104.06397  [pdf, other]

    cs.CV

    Shape and Material Capture at Home

    Authors: Daniel Lichy, Jiaye Wu, Soumyadip Sengupta, David W. Jacobs

    Abstract: In this paper, we present a technique for estimating the geometry and reflectance of objects using only a camera, flashlight, and optionally a tripod. We propose a simple data capture technique in which the user goes around the object, illuminating it with a flashlight and capturing only a few images. Our main technical contribution is the introduction of a recursive neural architecture, which can…

    Submitted 13 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR 2021

  27. arXiv:2103.12201  [pdf, other]

    cs.CV

    Improved Detection of Face Presentation Attacks Using Image Decomposition

    Authors: Shlok Kumar Mishra, Kuntal Sengupta, Max Horowitz-Gelb, Wen-Sheng Chu, Sofien Bouaziz, David Jacobs

    Abstract: Presentation attack detection (PAD) is a critical component in secure face authentication. We present a PAD algorithm to distinguish face spoofs generated by a photograph of a subject from live images. Our method uses an image decomposition network to extract albedo and normal. The domain gap between the real and spoof face images leads to easily identifiable differences, especially between the re…

    Submitted 1 December, 2022; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: Conference - IJCB

    Journal ref: 2022 IEEE international joint conference on biometrics (IJCB) (ORAL)

  28. arXiv:2103.02695  [pdf, other]

    cs.LG

    Shift Invariance Can Reduce Adversarial Robustness

    Authors: Songwei Ge, Vasu Singla, Ronen Basri, David Jacobs

    Abstract: Shift invariance is a critical property of CNNs that improves performance on classification. However, we show that invariance to circular shifts can also lead to greater sensitivity to adversarial attacks. We first characterize the margin between classes when a shift-invariant linear classifier is used. We show that the margin can only depend on the DC component of the signals. Then, using results…

    Submitted 22 November, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: Published as a conference paper at NeurIPS 2021

  29. arXiv:2102.07861  [pdf, other]

    cs.LG

    Low Curvature Activations Reduce Overfitting in Adversarial Training

    Authors: Vasu Singla, Sahil Singla, David Jacobs, Soheil Feizi

    Abstract: Adversarial training is one of the most effective defenses against adversarial attacks. Previous works suggest that overfitting is a dominant phenomenon in adversarial training leading to a large generalization gap between test and train accuracy in neural networks. In this work, we show that the observed generalization gap is closely related to the choice of the activation function. In particular…

    Submitted 18 August, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: Accepted at ICCV 2021

  30. arXiv:2011.01901  [pdf, other]

    cs.CV cs.LG

    Learning Visual Representations for Transfer Learning by Suppressing Texture

    Authors: Shlok Mishra, Anshul Shah, Ankan Bansal, Janit Anjaria, Jonghyun Choi, Abhinav Shrivastava, Abhishek Sharma, David Jacobs

    Abstract: Recent literature has shown that features obtained from supervised training of CNNs may over-emphasize texture rather than encoding high-level information. In self-supervised learning in particular, texture as a low-level cue may provide shortcuts that prevent the network from learning higher level representations. To address these problems we propose to use classic methods based on anisotropic di…

    Submitted 26 January, 2023; v1 submitted 3 November, 2020; originally announced November 2020.

    Journal ref: BMVC 2022

  31. arXiv:2007.01580  [pdf, ps, other]

    cs.LG stat.ML

    On the Similarity between the Laplace and Neural Tangent Kernels

    Authors: Amnon Geifman, Abhay Yadav, Yoni Kasten, Meirav Galun, David Jacobs, Ronen Basri

    Abstract: Recent theoretical work has shown that massively overparameterized neural networks are equivalent to kernel regressors that use Neural Tangent Kernels (NTK). Experiments show that these kernel methods perform similarly to real neural networks. Here we show that NTK for fully connected networks is closely related to the standard Laplace kernel. We show theoretically that for normalized data on the h…

    Submitted 14 November, 2020; v1 submitted 3 July, 2020; originally announced July 2020.

  32. arXiv:2006.04026  [pdf, other]

    cs.CV

    SharinGAN: Combining Synthetic and Real Data for Unsupervised Geometry Estimation

    Authors: Koutilya PNVR, Hao Zhou, David Jacobs

    Abstract: We propose a novel method for combining synthetic and real images when training networks to determine geometric information from a single image. We suggest a method for mapping both image types into a single, shared domain. This is connected to a primary network for end-to-end training. Ideally, this results in images from two domains that present shared information to the primary network. Our exp…

    Submitted 6 June, 2020; originally announced June 2020.

    Comments: Accepted to CVPR 2020. Supplementary material added towards the end instead of a separate file. A GitHub link to the code is also provided in this submission

  33. arXiv:2005.08925  [pdf, other]

    cs.CV cs.GR

    Portrait Shadow Manipulation

    Authors: Xuaner Cecilia Zhang, Jonathan T. Barron, Yun-Ta Tsai, Rohit Pandey, Xiuming Zhang, Ren Ng, David E. Jacobs

    Abstract: Casually-taken portrait photographs often suffer from unflattering lighting and shadowing because of suboptimal conditions in the environment. Aesthetic qualities such as the position and softness of shadows and the lighting ratio between the bright and dark parts of the face are frequently determined by the constraints of the environment rather than by the photographer. Professionals address this…

    Submitted 20 May, 2020; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: (updated version); SIGGRAPH 2020; Project webpage: https://people.eecs.berkeley.edu/~cecilia77/project-pages/portrait Video: https://youtu.be/M_qYTXhzyac

  34. arXiv:2004.05109  [pdf, ps, other]

    cs.CL

    Towards Automatic Generation of Questions from Long Answers

    Authors: Shlok Kumar Mishra, Pranav Goel, Abhishek Sharma, Abhyuday Jagannatha, David Jacobs, Hal Daumé III

    Abstract: Automatic question generation (AQG) has broad applicability in domains such as tutoring systems, conversational agents, healthcare literacy, and information retrieval. Existing efforts at AQG have been limited to short answer lengths of up to two or three sentences. However, several real-world applications require question generation from answers that span several sentences. Therefore, we propose…

    Submitted 15 April, 2020; v1 submitted 10 April, 2020; originally announced April 2020.

  35. arXiv:2003.04560  [pdf, other]

    cs.LG stat.ML

    Frequency Bias in Neural Networks for Input of Non-Uniform Density

    Authors: Ronen Basri, Meirav Galun, Amnon Geifman, David Jacobs, Yoni Kasten, Shira Kritchman

    Abstract: Recent works have partly attributed the generalization ability of over-parameterized neural networks to frequency bias -- networks trained with gradient descent on data drawn from a uniform distribution find a low frequency fit before high frequency ones. As realistic training sets are not drawn from a uniform distribution, we here use the Neural Tangent Kernel (NTK) model to explore the effect of…

    Submitted 10 March, 2020; originally announced March 2020.

  36. arXiv:1906.00425  [pdf, ps, other]

    cs.LG eess.SP stat.ML

    The Convergence Rate of Neural Networks for Learned Functions of Different Frequencies

    Authors: Ronen Basri, David Jacobs, Yoni Kasten, Shira Kritchman

    Abstract: We study the relationship between the frequency of a function and the speed at which a neural network learns it. We build on recent results that show that the dynamics of overparameterized neural networks trained with gradient descent can be well approximated by a linear system. When normalized training data is uniformly distributed on a hypersphere, the eigenfunctions of this linear system are sp…

    Submitted 2 December, 2019; v1 submitted 2 June, 2019; originally announced June 2019.

    Journal ref: in Advances in Neural Information Processing Systems 32 (NIPS 2019)

  37. arXiv:1905.08232  [pdf, other]

    cs.LG cs.CR cs.CV stat.ML

    Adversarially robust transfer learning

    Authors: Ali Shafahi, Parsa Saadatpanah, Chen Zhu, Amin Ghiasi, Christoph Studer, David Jacobs, Tom Goldstein

    Abstract: Transfer learning, in which a network is trained on one task and re-purposed on another, is often used to produce neural network classifiers when data is scarce or full-scale training is too costly. When the goal is to produce a model that is not only accurate but also adversarially robust, data scarcity and computational limitations become even more cumbersome. We consider robust transfer learnin…

    Submitted 21 February, 2020; v1 submitted 20 May, 2019; originally announced May 2019.

  38. arXiv:1901.02453  [pdf, other]

    cs.CV

    Neural Inverse Rendering of an Indoor Scene from a Single Image

    Authors: Soumyadip Sengupta, Jinwei Gu, Kihwan Kim, Guilin Liu, David W. Jacobs, Jan Kautz

    Abstract: Inverse rendering aims to estimate physical attributes of a scene, e.g., reflectance, geometry, and lighting, from image(s). Inverse rendering has been studied primarily for single objects or with methods that solve for only one of the scene attributes. We propose the first learning-based approach that jointly estimates albedo, normals, and lighting of an indoor scene from a single image. Our key…

    Submitted 14 September, 2019; v1 submitted 8 January, 2019; originally announced January 2019.

  39. arXiv:1901.01499  [pdf, other]

    cs.LG cs.CV stat.ML

    Understanding the (un)interpretability of natural image distributions using generative models

    Authors: Ryen Krusinga, Sohil Shah, Matthias Zwicker, Tom Goldstein, David Jacobs

    Abstract: Probability density estimation is a classical and well studied problem, but standard density estimation methods have historically lacked the power to model complex and high-dimensional image distributions. More recent generative models leverage the power of neural networks to implicitly learn and represent probability models over complex images. We describe methods to extract explicit probability…

    Submitted 25 February, 2019; v1 submitted 5 January, 2019; originally announced January 2019.

  40. Synthetic Depth-of-Field with a Single-Camera Mobile Phone

    Authors: Neal Wadhwa, Rahul Garg, David E. Jacobs, Bryan E. Feldman, Nori Kanazawa, Robert Carroll, Yair Movshovitz-Attias, Jonathan T. Barron, Yael Pritch, Marc Levoy

    Abstract: Shallow depth-of-field is commonly used by photographers to isolate a subject from a distracting background. However, standard cell phone cameras cannot produce such images optically, as their short focal lengths and small apertures capture nearly all-in-focus images. We present a system to computationally synthesize shallow depth-of-field images with a single mobile camera and a single button pre…

    Submitted 11 June, 2018; originally announced June 2018.

    Comments: Accepted to SIGGRAPH 2018. Basis for Portrait Mode on Google Pixel 2 and Pixel 2 XL

  41. arXiv:1712.06584  [pdf, other]

    cs.CV

    End-to-end Recovery of Human Shape and Pose

    Authors: Angjoo Kanazawa, Michael J. Black, David W. Jacobs, Jitendra Malik

    Abstract: We describe Human Mesh Recovery (HMR), an end-to-end framework for reconstructing a full 3D mesh of a human body from a single RGB image. In contrast to most current methods that compute 2D or 3D joint locations, we produce a richer and more useful mesh representation that is parameterized by shape and 3D joint angles. The main objective is to minimize the reprojection loss of keypoints, which all…

    Submitted 23 June, 2018; v1 submitted 18 December, 2017; originally announced December 2017.

    Comments: CVPR 2018, Project page with code: https://akanazawa.github.io/hmr/

  42. arXiv:1712.01261  [pdf, other]

    cs.CV

    SfSNet: Learning Shape, Reflectance and Illuminance of Faces in the Wild

    Authors: Soumyadip Sengupta, Angjoo Kanazawa, Carlos D. Castillo, David Jacobs

    Abstract: We present SfSNet, an end-to-end learning framework for producing an accurate decomposition of an unconstrained human face image into shape, reflectance and illuminance. SfSNet is designed to reflect a physical lambertian rendering model. SfSNet learns from a mixture of labeled synthetic and unlabeled real world images. This allows the network to capture low frequency variations from synthetic and…

    Submitted 19 April, 2018; v1 submitted 1 December, 2017; originally announced December 2017.

    Comments: Accepted to CVPR 2018 (Spotlight)

  43. arXiv:1709.01993  [pdf, other]

    cs.CV

    Label Denoising Adversarial Network (LDAN) for Inverse Lighting of Face Images

    Authors: Hao Zhou, Jin Sun, Yaser Yacoob, David W. Jacobs

    Abstract: Lighting estimation from face images is an important task and has applications in many areas such as image editing, intrinsic image decomposition, and image forgery detection. We propose to train a deep Convolutional Neural Network (CNN) to regress lighting parameters from a single face image. Lacking massive ground truth lighting labels for face images in the wild, we use an existing method to es…

    Submitted 6 September, 2017; originally announced September 2017.

  44. arXiv:1705.07364  [pdf, other]

    cs.LG cs.CV math.NA

    Stabilizing Adversarial Nets With Prediction Methods

    Authors: Abhay Yadav, Sohil Shah, Zheng Xu, David Jacobs, Tom Goldstein

    Abstract: Adversarial neural networks solve many important problems in data science, but are notoriously difficult to train. These difficulties come from the fact that optimal weights for adversarial nets correspond to saddle points, and not minimizers, of the loss function. The alternating stochastic gradient methods typically used for such problems do not reliably converge to saddle points, and when conve…

    Submitted 8 February, 2018; v1 submitted 20 May, 2017; originally announced May 2017.

    Comments: Accepted at ICLR 2018

  45. arXiv:1702.07971  [pdf, other]

    cs.CV

    Seeing What Is Not There: Learning Context to Determine Where Objects Are Missing

    Authors: Jin Sun, David W. Jacobs

    Abstract: Most of computer vision focuses on what is in an image. We propose to train a standalone object-centric context representation to perform the opposite task: seeing what is not there. Given an image, our context model can predict where objects should exist, even when no object instances are present. Combined with object detection results, we can perform a novel vision task: finding where objects ar…

    Submitted 25 February, 2017; originally announced February 2017.

  46. arXiv:1702.03023  [pdf, other]

    cs.CV

    A New Rank Constraint on Multi-view Fundamental Matrices, and its Application to Camera Location Recovery

    Authors: Soumyadip Sengupta, Tal Amir, Meirav Galun, Tom Goldstein, David W. Jacobs, Amit Singer, Ronen Basri

    Abstract: Accurate estimation of camera matrices is an important step in structure from motion algorithms. In this paper we introduce a novel rank constraint on collections of fundamental matrices in multi-view settings. We show that in general, with the selection of proper scale factors, a matrix formed by stacking fundamental matrices between pairs of images has rank 6. Moreover, this matrix forms the sym…

    Submitted 9 February, 2017; originally announced February 2017.

  47. arXiv:1702.00506  [pdf, other]

    cs.CV

    Solving Uncalibrated Photometric Stereo Using Fewer Images by Jointly Optimizing Low-rank Matrix Completion and Integrability

    Authors: Soumyadip Sengupta, Hao Zhou, Walter Forkel, Ronen Basri, Tom Goldstein, David W. Jacobs

    Abstract: We introduce a new, integrated approach to uncalibrated photometric stereo. We perform 3D reconstruction of Lambertian objects using multiple images produced by unknown, directional light sources. We show how to formulate a single optimization that includes rank and integrability constraints, allowing also for missing data. We then solve this optimization using the Alternate Direction Method of Mu…

    Submitted 1 February, 2017; originally announced February 2017.

  48. arXiv:1611.07700  [pdf, other]

    cs.CV

    3D Menagerie: Modeling the 3D shape and pose of animals

    Authors: Silvia Zuffi, Angjoo Kanazawa, David Jacobs, Michael J. Black

    Abstract: There has been significant work on learning realistic, articulated, 3D models of the human body. In contrast, there are few such models of animals, despite many applications. The main challenge is that animals are much less cooperative than humans. The best human body models are learned from thousands of 3D scans of people in specific poses, which is infeasible with live animals. Consequently, we…

    Submitted 12 April, 2017; v1 submitted 23 November, 2016; originally announced November 2016.

    Comments: Accepted at CVPR 2017 (camera ready version)

  49. arXiv:1610.05792  [pdf, other]

    cs.LG math.NA math.OC stat.ML

    Big Batch SGD: Automated Inference using Adaptive Batch Sizes

    Authors: Soham De, Abhay Yadav, David Jacobs, Tom Goldstein

    Abstract: Classical stochastic gradient methods for optimization rely on noisy gradient approximations that become progressively less accurate as iterates approach a solution. The large noise and small signal in the resulting gradients make it difficult to use them for adaptive stepsize selection and automatic stopping. We propose alternative "big batch" SGD schemes that adaptively grow the batch size over…

    Submitted 6 April, 2017; v1 submitted 18 October, 2016; originally announced October 2016.

    Comments: A preliminary version of this paper appears in AISTATS 2017 (International Conference on Artificial Intelligence and Statistics)

  50. arXiv:1605.09527  [pdf, other]

    cs.CV math.NA math.OC

    Biconvex Relaxation for Semidefinite Programming in Computer Vision

    Authors: Sohil Shah, Abhay Kumar, Carlos Castillo, David Jacobs, Christoph Studer, Tom Goldstein

    Abstract: Semidefinite programming is an indispensable tool in computer vision, but general-purpose solvers for semidefinite programs are often too slow and memory intensive for large-scale problems. We propose a general framework to approximately solve large-scale semidefinite problems (SDPs) at low complexity. Our approach, referred to as biconvex relaxation (BCR), transforms a general SDP into a specific…

    Submitted 8 August, 2016; v1 submitted 31 May, 2016; originally announced May 2016.