Skip to main content

Showing 1–26 of 26 results for author: Kim, Y M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.11347  [pdf, other

    cs.CV

    I$^2$-SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM

    Authors: Gwangtak Bae, Changwoon Choi, Hyeongjun Heo, Sang Min Kim, Young Min Kim

    Abstract: We present an inverse image-formation module that can enhance the robustness of existing visual SLAM pipelines for casually captured scenarios. Casual video captures often suffer from motion blur and varying appearances, which degrade the final quality of coherent 3D visual representation. We propose integrating the physical imaging into the SLAM system, which employs linear HDR radiance maps to c… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  2. arXiv:2406.08292  [pdf, other

    cs.CV

    Outdoor Scene Extrapolation with Hierarchical Generative Cellular Automata

    Authors: Dongsu Zhang, Francis Williams, Zan Gojcic, Karsten Kreis, Sanja Fidler, Young Min Kim, Amlan Kar

    Abstract: We aim to generate fine-grained 3D geometry from large-scale sparse LiDAR scans, abundantly captured by autonomous vehicles (AV). Contrary to prior work on AV scene completion, we aim to extrapolate fine geometry from unlabeled and beyond spatial limits of LiDAR scans, taking a step towards generating realistic, high-resolution simulation-ready 3D street environments. We propose hierarchical Gener… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted to CVPR 2024 as highlight

  3. arXiv:2403.19904  [pdf, other

    cs.CV

    Fully Geometric Panoramic Localization

    Authors: Junho Kim, Jiwon Jeong, Young Min Kim

    Abstract: We introduce a lightweight and accurate localization method that only utilizes the geometry of 2D-3D lines. Given a pre-captured 3D map, our approach localizes a panorama image, taking advantage of the holistic 360 view. The system mitigates potential privacy breaches or domain discrepancies by avoiding trained or hand-crafted visual descriptors. However, as lines alone can be ambiguous, we expres… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  4. arXiv:2402.03690  [pdf, other

    cs.CV

    3Doodle: Compact Abstraction of Objects with 3D Strokes

    Authors: Changwoon Choi, Jaeah Lee, Jaesik Park, Young Min Kim

    Abstract: While free-hand sketching has long served as an efficient representation to convey characteristics of an object, they are often subjective, deviating significantly from realistic representations. Moreover, sketches are not consistent for arbitrary viewpoints, making it hard to catch 3D shapes. We propose 3Dooole, generating descriptive and view-consistent sketch images given multi-view images of t… ▽ More

    Submitted 29 April, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: SIGGRAPH 2024 (Transactions on Graphics)

  5. arXiv:2308.16880  [pdf, other

    cs.CV

    Text2Scene: Text-driven Indoor Scene Stylization with Part-aware Details

    Authors: Inwoo Hwang, Hyeonwoo Kim, Young Min Kim

    Abstract: We propose Text2Scene, a method to automatically create realistic textures for virtual scenes composed of multiple objects. Guided by a reference image and text descriptions, our pipeline adds detailed texture on labeled 3D geometries in the room such that the generated colors respect the hierarchical structure or semantic parts that are often composed of similar materials. Instead of applying fla… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted to CVPR 2023

  6. arXiv:2308.14005  [pdf, other

    cs.CV

    Calibrating Panoramic Depth Estimation for Practical Localization and Mapping

    Authors: Junho Kim, Eun Sun Lee, Young Min Kim

    Abstract: The absolute depth values of surrounding environments provide crucial cues for various assistive technologies, such as localization, navigation, and 3D structure estimation. We propose that accurate depth estimated from panoramic images can serve as a powerful and light-weight input for a wide range of downstream tasks requiring 3D information. While panoramic images can easily capture the surroun… ▽ More

    Submitted 2 February, 2024; v1 submitted 27 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023

  7. arXiv:2308.13989  [pdf, other

    cs.CV

    LDL: Line Distance Functions for Panoramic Localization

    Authors: Junho Kim, Changwoon Choi, Hojun Jang, Young Min Kim

    Abstract: We introduce LDL, a fast and robust algorithm that localizes a panorama to a 3D map using line segments. LDL focuses on the sparse structural information of lines in the scene, which is robust to illumination changes and can potentially enable efficient computation. While previous line-based localization approaches tend to sacrifice accuracy or computation time, our method effectively observes the… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023

  8. arXiv:2307.01896  [pdf, other

    cs.CL

    Transformed Protoform Reconstruction

    Authors: Young Min Kim, Kalvin Chang, Chenxuan Cui, David Mortensen

    Abstract: Protoform reconstruction is the task of inferring what morphemes or words appeared like in the ancestral languages of a set of daughter languages. Meloni et al. (2021) achieved the state-of-the-art on Latin protoform reconstruction with an RNN-based encoder-decoder with attention model. We update their model with the state-of-the-art seq2seq model: the Transformer. Our model outperforms their mode… ▽ More

    Submitted 5 July, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

    Comments: Accepted at ACL 2023

  9. arXiv:2305.03249  [pdf, other

    cs.GR cs.LG

    PMP: Learning to Physically Interact with Environments using Part-wise Motion Priors

    Authors: Jinseok Bae, Jungdam Won, Donggeun Lim, Cheol-Hui Min, Young Min Kim

    Abstract: We present a method to animate a character incorporating multiple part-wise motion priors (PMP). While previous works allow creating realistic articulated motions from reference data, the range of motion is largely limited by the available samples. Especially for the interaction-rich scenarios, it is impractical to attempt acquiring every possible interacting motion, as the combination of physical… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: 13 pages, 11 figures

  10. arXiv:2303.12408  [pdf, other

    cs.CV

    Balanced Spherical Grid for Egocentric View Synthesis

    Authors: Changwoon Choi, Sang Min Kim, Young Min Kim

    Abstract: We present EgoNeRF, a practical solution to reconstruct large-scale real-world environments for VR assets. Given a few seconds of casually captured 360 video, EgoNeRF can efficiently build neural radiance fields which enable high-quality rendering from novel viewpoints. Motivated by the recent acceleration of NeRF using feature grids, we adopt spherical coordinate instead of conventional Cartesian… ▽ More

    Submitted 24 March, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023

  11. arXiv:2212.03177  [pdf, other

    cs.CV

    Privacy-Preserving Visual Localization with Event Cameras

    Authors: Junho Kim, Young Min Kim, Yicheng Wu, Ramzi Zahreddine, Weston A. Welge, Gurunandan Krishnan, Sizhuo Ma, Jian Wang

    Abstract: We present a robust, privacy-preserving visual localization algorithm using event cameras. While event cameras can potentially make robust localization due to high dynamic range and small motion blur, the sensors exhibit large domain gaps making it difficult to directly apply conventional image-based localization algorithms. To mitigate the gap, we propose applying event-to-image conversion prior… ▽ More

    Submitted 8 December, 2022; v1 submitted 4 December, 2022; originally announced December 2022.

  12. arXiv:2211.15992  [pdf, other

    cs.RO cs.CV

    MoDA: Map style transfer for self-supervised Domain Adaptation of embodied agents

    Authors: Eun Sun Lee, Junho Kim, SangWon Park, Young Min Kim

    Abstract: We propose a domain adaptation method, MoDA, which adapts a pretrained embodied agent to a new, noisy environment without ground-truth supervision. Map-based memory provides important contextual information for visual navigation, and exhibits unique spatial structure mainly composed of flat walls and rectangular obstacles. Our adaptation approach encourages the inherent regularities on the estimat… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: ECCV 2022

  13. arXiv:2210.08202  [pdf, other

    cs.CV

    IBL-NeRF: Image-Based Lighting Formulation of Neural Radiance Fields

    Authors: Changwoon Choi, Juhyeon Kim, Young Min Kim

    Abstract: We propose IBL-NeRF, which decomposes the neural radiance fields (NeRF) of large-scale indoor scenes into intrinsic components. Recent approaches further decompose the baked radiance of the implicit volume into intrinsic components such that one can partially approximate the rendering equation. However, they are limited to representing isolated objects with a shared environment lighting, and suffe… ▽ More

    Submitted 11 September, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: Computer Graphics Forum (Pacific Graphics 2023)

  14. arXiv:2207.05317  [pdf, other

    cs.CV

    CPO: Change Robust Panorama to Point Cloud Localization

    Authors: Junho Kim, Hojun Jang, Changwoon Choi, Young Min Kim

    Abstract: We present CPO, a fast and robust algorithm that localizes a 2D panorama with respect to a 3D point cloud of a scene possibly containing changes. To robustly handle scene changes, our approach deviates from conventional feature point matching, and focuses on the spatial context provided from panorama images. Specifically, we propose efficient color histogram generation and subsequent robust locali… ▽ More

    Submitted 1 February, 2024; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022

  15. arXiv:2206.12455  [pdf, other

    cs.CV

    Ev-NeRF: Event Based Neural Radiance Field

    Authors: Inwoo Hwang, Junho Kim, Young Min Kim

    Abstract: We present Ev-NeRF, a Neural Radiance Field derived from event data. While event cameras can measure subtle brightness changes in high frame rates, the measurements in low lighting or extreme motion suffer from significant domain discrepancy with complex noise. As a result, the performance of event-based vision tasks does not transfer to challenging environments, where the event cameras are expect… ▽ More

    Submitted 5 March, 2023; v1 submitted 24 June, 2022; originally announced June 2022.

    Comments: Accepted to WACV 2023

  16. arXiv:2204.01264  [pdf, other

    cs.CV

    Probabilistic Implicit Scene Completion

    Authors: Dongsu Zhang, Changwoon Choi, Inbum Park, Young Min Kim

    Abstract: We propose a probabilistic shape completion method extended to the continuous geometry of large-scale 3D scenes. Real-world scans of 3D scenes suffer from a considerable amount of missing data cluttered with unsegmented objects. The problem of shape completion is inherently ill-posed, and high-quality result requires scalable solutions that consider multiple possible outcomes. We employ the Genera… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: Accepted to ICLR 2022 as spotlight, code available at https://github.com/96lives/gca

  17. arXiv:2203.12247  [pdf, other

    cs.CV

    Ev-TTA: Test-Time Adaptation for Event-Based Object Recognition

    Authors: Junho Kim, Inwoo Hwang, Young Min Kim

    Abstract: We introduce Ev-TTA, a simple, effective test-time adaptation algorithm for event-based object recognition. While event cameras are proposed to provide measurements of scenes with fast motions or drastic illumination changes, many existing event-based recognition algorithms suffer from performance deterioration under extreme conditions due to significant domain shifts. Ev-TTA mitigates the severe… ▽ More

    Submitted 28 March, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

  18. arXiv:2202.08418  [pdf, other

    cs.CV cs.AI

    Neural Marionette: Unsupervised Learning of Motion Skeleton and Latent Dynamics from Volumetric Video

    Authors: Jinseok Bae, Hojun Jang, Cheol-Hui Min, Hyungun Choi, Young Min Kim

    Abstract: We present Neural Marionette, an unsupervised approach that discovers the skeletal structure from a dynamic sequence and learns to generate diverse motions that are consistent with the observed motion dynamics. Given a video stream of point cloud observation of an articulated body under arbitrary motion, our approach discovers the unknown low-dimensional skeletal relationship that can effectively… ▽ More

    Submitted 16 February, 2022; originally announced February 2022.

    Comments: 7 pages (main), 10 pages (appendix) and to be appeared in AAAI2022

  19. arXiv:2112.01041  [pdf, other

    cs.CV

    N-ImageNet: Towards Robust, Fine-Grained Object Recognition with Event Cameras

    Authors: Junho Kim, Jaehyeok Bae, Gangin Park, Dongsu Zhang, Young Min Kim

    Abstract: We introduce N-ImageNet, a large-scale dataset targeted for robust, fine-grained object recognition with event cameras. The dataset is collected using programmable hardware in which an event camera consistently moves around a monitor displaying images from ImageNet. N-ImageNet serves as a challenging benchmark for event-based object recognition, due to its large number of classes and samples. We e… ▽ More

    Submitted 27 March, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: Accepted to ICCV 2021

  20. arXiv:2110.07184  [pdf, other

    cs.CV cs.RO

    Self-Supervised Domain Adaptation for Visual Navigation with Global Map Consistency

    Authors: Eun Sun Lee, Junho Kim, Young Min Kim

    Abstract: We propose a light-weight, self-supervised adaptation for a visual navigation agent to generalize to unseen environment. Given an embodied agent trained in a noiseless environment, our objective is to transfer the agent to a noisy environment where actuation and odometry sensor noise is present. Our method encourages the agent to maximize the consistency between the global maps generated at differ… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: Accepted to WACV 2022

  21. arXiv:2110.07171  [pdf, other

    cs.CV cs.RO

    SGoLAM: Simultaneous Goal Localization and Mapping for Multi-Object Goal Navigation

    Authors: Junho Kim, Eun Sun Lee, Mingi Lee, Donsu Zhang, Young Min Kim

    Abstract: We present SGoLAM, short for simultaneous goal localization and mapping, which is a simple and efficient algorithm for Multi-Object Goal navigation. Given an agent equipped with an RGB-D camera and a GPS/Compass sensor, our objective is to have the agent navigate to a sequence of target objects in realistic 3D environments. Our pipeline fully leverages the strength of classical approaches for visu… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

  22. arXiv:2108.06545  [pdf, other

    cs.CV

    PICCOLO: Point Cloud-Centric Omnidirectional Localization

    Authors: Junho Kim, Changwoon Choi, Hojun Jang, Young Min Kim

    Abstract: We present PICCOLO, a simple and efficient algorithm for omnidirectional localization. Given a colored point cloud and a 360 panorama image of a scene, our objective is to recover the camera pose at which the panorama image is taken. Our pipeline works in an off-the-shelf manner with a single image given as a query and does not require any training of neural networks or collecting ground-truth pos… ▽ More

    Submitted 2 February, 2024; v1 submitted 14 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021

  23. arXiv:2104.04275  [pdf, other

    cs.CV cs.LG cs.RO

    GATSBI: Generative Agent-centric Spatio-temporal Object Interaction

    Authors: Cheol-Hui Min, Jinseok Bae, Junho Lee, Young Min Kim

    Abstract: We present GATSBI, a generative model that can transform a sequence of raw observations into a structured latent representation that fully captures the spatio-temporal context of the agent's actions. In vision-based decision-making scenarios, an agent faces complex high-dimensional observations where multiple entities interact with each other. The agent requires a good scene representation of the… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: accepted to CVPR'2021 as an oral presentation. Code and video will be released soon

  24. arXiv:2103.04130  [pdf, other

    cs.CV

    Learning to Generate 3D Shapes with Generative Cellular Automata

    Authors: Dongsu Zhang, Changwoon Choi, Jeonghwan Kim, Young Min Kim

    Abstract: We present a probabilistic 3D generative model, named Generative Cellular Automata, which is able to produce diverse and high quality shapes. We formulate the shape generation process as sampling from the transition kernel of a Markov chain, where the sampling chain eventually evolves to the full shape of the learned distribution. The transition kernel employs the local update rules of cellular au… ▽ More

    Submitted 6 March, 2021; originally announced March 2021.

    Comments: ICLR 2021

  25. arXiv:2007.03169  [pdf, other

    cs.CV

    Spatial Semantic Embedding Network: Fast 3D Instance Segmentation with Deep Metric Learning

    Authors: Dongsu Zhang, Junha Chun, Sang Kyun Cha, Young Min Kim

    Abstract: We propose spatial semantic embedding network (SSEN), a simple, yet efficient algorithm for 3D instance segmentation using deep metric learning. The raw 3D reconstruction of an indoor environment suffers from occlusions, noise, and is produced without any meaningful distinction between individual entities. For high-level intelligent tasks from a large scale scene, 3D instance segmentation recogniz… ▽ More

    Submitted 6 July, 2020; originally announced July 2020.

  26. arXiv:1904.12304  [pdf, other

    cs.CV cs.AI

    RL-GAN-Net: A Reinforcement Learning Agent Controlled GAN Network for Real-Time Point Cloud Shape Completion

    Authors: Muhammad Sarmad, Hyunjoo Jenny Lee, Young Min Kim

    Abstract: We present RL-GAN-Net, where a reinforcement learning (RL) agent provides fast and robust control of a generative adversarial network (GAN). Our framework is applied to point cloud shape completion that converts noisy, partial point cloud data into a high-fidelity completed shape by controlling the GAN. While a GAN is unstable and hard to train, we circumvent the problem by (1) training the GAN on… ▽ More

    Submitted 28 April, 2019; originally announced April 2019.

    Comments: Accepted to IEEE CVPR 2019