Skip to main content

Showing 1–50 of 91 results for author: Park, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.12508  [pdf, other

    cs.CL cs.AI cs.CV

    MERLIN: Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank Pipeline

    Authors: Donghoon Han, Eunhwan Park, Gisang Lee, Adam Lee, Nojun Kwak

    Abstract: The rapid expansion of multimedia content has made accurately retrieving relevant videos from large collections increasingly challenging. Recent advancements in text-video retrieval have focused on cross-modal interactions, large-scale foundation model training, and probabilistic modeling, yet often neglect the crucial user perspective, leading to discrepancies between user queries and the content… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: Work in progress

  2. arXiv:2406.18459  [pdf, other

    cs.CV

    DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance

    Authors: Younghyun Kim, Geunmin Hwang, Junyu Zhang, Eunbyung Park

    Abstract: Recent surge in large-scale generative models has spurred the development of vast fields in computer vision. In particular, text-to-image diffusion models have garnered widespread adoption across diverse domain due to their potential for high-fidelity image generation. Nonetheless, existing large-scale diffusion models are confined to generate images of up to 1K resolution, which is far from meeti… ▽ More

    Submitted 11 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

  3. arXiv:2406.15102  [pdf, other

    cs.CV cs.LG

    HLQ: Fast and Efficient Backpropagation via Hadamard Low-rank Quantization

    Authors: Seonggon Kim, Eunhyeok Park

    Abstract: With the rapid increase in model size and the growing importance of various fine-tuning applications, lightweight training has become crucial. Since the backward pass is twice as expensive as the forward pass, optimizing backpropagation is particularly important. However, modifications to this process can lead to suboptimal convergence, so training optimization should minimize perturbations, which… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  4. arXiv:2406.13251  [pdf, other

    cs.CV cs.GR eess.IV

    Freq-Mip-AA : Frequency Mip Representation for Anti-Aliasing Neural Radiance Fields

    Authors: Youngin Park, Seungtae Nam, Cheul-hee Hahm, Eunbyung Park

    Abstract: Neural Radiance Fields (NeRF) have shown remarkable success in representing 3D scenes and generating novel views. However, they often struggle with aliasing artifacts, especially when rendering images from different camera distances from the training views. To address the issue, Mip-NeRF proposed using volumetric frustums to render a pixel and suggested integrated positional encoding (IPE). While… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted to ICIP 2024, 7 pages, 3 figures

  5. arXiv:2405.17083  [pdf, other

    cs.CV

    F-3DGS: Factorized Coordinates and Representations for 3D Gaussian Splatting

    Authors: Xiangyu Sun, Joo Chan Lee, Daniel Rho, Jong Hwan Ko, Usman Ali, Eunbyung Park

    Abstract: The neural radiance field (NeRF) has made significant strides in representing 3D scenes and synthesizing novel views. Despite its advancements, the high computational costs of NeRF have posed challenges for its deployment in resource-constrained environments and real-time applications. As an alternative to NeRF-like neural rendering methods, 3D Gaussian Splatting (3DGS) offers rapid rendering spee… ▽ More

    Submitted 28 May, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: Our project page including code is available at https://xiangyu1sun.github.io/Factorize-3DGS/

  6. arXiv:2404.19381  [pdf, other

    cs.AR

    Low-overhead General-purpose Near-Data Processing in CXL Memory Expanders

    Authors: Hyungkyu Ham, Jeongmin Hong, Geonwoo Park, Yunseon Shin, Okkyun Woo, Wonhyuk Yang, Jinhoon Bae, Eunhyeok Park, Hyojin Sung, Euicheol Lim, Gwangsun Kim

    Abstract: To overcome the memory capacity wall of large-scale AI and big data applications, Compute Express Link (CXL) enables cost-efficient memory expansion beyond the local DRAM of processors. While its CXL.mem protocol stack minimizes interconnect latency, CXL memory accesses can still result in significant slowdowns for memory-bound applications. While near-data processing (NDP) in CXL memory can overc… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  7. arXiv:2404.14687  [pdf, other

    cs.MM cs.AI cs.CL cs.CV

    Pegasus-v1 Technical Report

    Authors: Raehyuk Jung, Hyojun Go, Jaehyuk Yi, Jiho Jang, Daniel Kim, Jay Suh, Aiden Lee, Cooper Han, Jae Lee, Jeff Kim, Jin-Young Kim, Junwan Kim, Kyle Park, Lucas Lee, Mars Ha, Minjoon Seo, Abraham Jo, Ed Park, Hassan Kianinejad, SJ Kim, Tony Moon, Wade Jeong, Andrei Popescu, Esther Kim, EK Yoon , et al. (19 additional authors not shown)

    Abstract: This technical report introduces Pegasus-1, a multimodal language model specialized in video content understanding and interaction through natural language. Pegasus-1 is designed to address the unique challenges posed by video data, such as interpreting spatiotemporal information, to offer nuanced video content comprehension across various lengths. This technical report overviews Pegasus-1's archi… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  8. arXiv:2404.04913  [pdf, other

    cs.CV

    CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis

    Authors: Gyeongjin Kang, Younggeun Lee, Seungjun Oh, Eunbyung Park

    Abstract: Neural Radiance Fields (NeRF) have achieved huge success in effectively capturing and representing 3D objects and scenes. However, several factors have impeded its further proliferation as next-generation 3D media. To establish a ubiquitous presence in everyday media formats, such as images and videos, it is imperative to devise a solution that effectively fulfills three key objectives: fast encod… ▽ More

    Submitted 28 May, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

    Comments: Project page: https://gynjn.github.io/Codec-NeRF/

  9. arXiv:2404.01745  [pdf, other

    cs.CV cs.AI

    Unleash the Potential of CLIP for Video Highlight Detection

    Authors: Donghoon Han, Seunghyeon Seo, Eunhwan Park, Seong-Uk Nam, Nojun Kwak

    Abstract: Multimodal and large language models (LLMs) have revolutionized the utilization of open-world knowledge, unlocking novel potentials across various tasks and applications. Among these domains, the video domain has notably benefited from their capabilities. In this paper, we present Highlight-CLIP (HL-CLIP), a method designed to excel in the video highlight detection task by leveraging the pre-train… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  10. arXiv:2404.00018  [pdf, other

    cs.HC cs.AI cs.SI

    Can AI Outperform Human Experts in Creating Social Media Creatives?

    Authors: Eunkyung Park, Raymond K. Wong, Junbum Kwon

    Abstract: Artificial Intelligence has outperformed human experts in functional tasks such as chess and baduk. How about creative tasks? This paper evaluates AI's capability in the creative domain compared to human experts, which little research has been conducted so far. We propose a novel Prompt-for-Prompt to generate social media creatives via prompt augmentation by Large Language Models. We take the most… ▽ More

    Submitted 19 March, 2024; originally announced April 2024.

    Comments: 17 pages, 5 figures

    MSC Class: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)

  11. arXiv:2403.15485  [pdf, other

    cs.CL cs.AI cs.LG

    MOGAM: A Multimodal Object-oriented Graph Attention Model for Depression Detection

    Authors: Junyeop Cha, Seoyun Kim, Dongjae Kim, Eunil Park

    Abstract: Early detection plays a crucial role in the treatment of depression. Therefore, numerous studies have focused on social media platforms, where individuals express their emotions, aiming to achieve early detection of depression. However, the majority of existing approaches often rely on specific features, leading to limited scalability across different types of social media datasets, such as text,… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 12 pages, 3 figures, 4 tables

  12. arXiv:2403.13909  [pdf, other

    cs.LG eess.SY

    Sequential Modeling of Complex Marine Navigation: Case Study on a Passenger Vessel (Student Abstract)

    Authors: Yimeng Fan, Pedram Agand, Mo Chen, Edward J. Park, Allison Kennedy, Chanwoo Bae

    Abstract: The maritime industry's continuous commitment to sustainability has led to a dedicated exploration of methods to reduce vessel fuel consumption. This paper undertakes this challenge through a machine learning approach, leveraging a real-world dataset spanning two years of a ferry in west coast Canada. Our focus centers on the creation of a time series forecasting model given the dynamic and static… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 5 pages, 3 figures, AAAI 2024 student abstract

  13. arXiv:2403.06342  [pdf, other

    math.NA cs.LG

    Separable Physics-informed Neural Networks for Solving the BGK Model of the Boltzmann Equation

    Authors: Jaemin Oh, Seung Yeon Cho, Seok-Bae Yun, Eunbyung Park, Youngjoon Hong

    Abstract: In this study, we introduce a method based on Separable Physics-Informed Neural Networks (SPINNs) for effectively solving the BGK model of the Boltzmann equation. While the mesh-free nature of PINNs offers significant advantages in handling high-dimensional partial differential equations (PDEs), challenges arise when applying quadrature rules for accurate integral evaluation in the BGK operator, w… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

    MSC Class: 68T20; 35R09

  14. arXiv:2402.18293  [pdf, other

    cs.CV

    Continuous Memory Representation for Anomaly Detection

    Authors: Joo Chan Lee, Taejune Kim, Eunbyung Park, Simon S. Woo, Jong Hwan Ko

    Abstract: There have been significant advancements in anomaly detection in an unsupervised manner, where only normal images are available for training. Several recent methods aim to detect anomalies based on a memory, comparing or reconstructing the input with directly stored normal features (or trained features with normal images). However, such memory-based approaches operate on a discrete feature space i… ▽ More

    Submitted 10 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: Project page: https://tae-mo.github.io/crad/

  15. arXiv:2402.14604  [pdf, other

    cs.CG

    Embeddings and near-neighbor searching with constant additive error for hyperbolic spaces

    Authors: Eunku Park, Antoine Vigneron

    Abstract: We give an embedding of the Poincaré halfspace $H^D$ into a discrete metric space based on a binary tiling of $H^D$, with additive distortion $O(\log D)$. It yields the following results. We show that any subset $P$ of $n$ points in $H^D$ can be embedded into a graph-metric with $2^{O(D)}n$ vertices and edges, and with additive distortion $O(\log D)$. We also show how to construct, for any $k$, an… ▽ More

    Submitted 1 April, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  16. arXiv:2402.14196  [pdf, other

    cs.CV cs.GR

    Mip-Grid: Anti-aliased Grid Representations for Neural Radiance Fields

    Authors: Seungtae Nam, Daniel Rho, Jong Hwan Ko, Eunbyung Park

    Abstract: Despite the remarkable achievements of neural radiance fields (NeRF) in representing 3D scenes and generating novel view images, the aliasing issue, rendering "jaggies" or "blurry" images at varying camera distances, remains unresolved in most existing approaches. The recently proposed mip-NeRF has addressed this challenge by rendering conical frustums instead of rays. However, it relies on MLP ar… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted to NeurIPS 2023

  17. arXiv:2401.17547  [pdf, other

    cs.CV

    Task-Oriented Diffusion Model Compression

    Authors: Geonung Kim, Beomsu Kim, Eunhyeok Park, Sunghyun Cho

    Abstract: As recent advancements in large-scale Text-to-Image (T2I) diffusion models have yielded remarkable high-quality image generation, diverse downstream Image-to-Image (I2I) applications have emerged. Despite the impressive results achieved by these I2I models, their practical utility is hampered by their large model size and the computational burden of the iterative denoising process. In this paper,… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  18. arXiv:2401.00834  [pdf, other

    cs.CV

    Deblurring 3D Gaussian Splatting

    Authors: Byeonghyeon Lee, Howoong Lee, Xiangyu Sun, Usman Ali, Eunbyung Park

    Abstract: Recent studies in Radiance Fields have paved the robust way for novel view synthesis with their photorealistic rendering quality. Nevertheless, they usually employ neural networks and volumetric rendering, which are costly to train and impede their broad use in various real-time applications due to the lengthy rendering time. Lately 3D Gaussians splatting-based approach has been proposed to model… ▽ More

    Submitted 26 May, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

    Comments: 29 pages, 16 figures

  19. arXiv:2401.00825  [pdf, other

    cs.CV cs.GR eess.IV

    Sharp-NeRF: Grid-based Fast Deblurring Neural Radiance Fields Using Sharpness Prior

    Authors: Byeonghyeon Lee, Howoong Lee, Usman Ali, Eunbyung Park

    Abstract: Neural Radiance Fields (NeRF) have shown remarkable performance in neural rendering-based novel view synthesis. However, NeRF suffers from severe visual quality degradation when the input images have been captured under imperfect conditions, such as poor illumination, defocus blurring, and lens aberrations. Especially, defocus blur is quite common in the images when they are normally captured usin… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: Accepted to WACV 2024

  20. arXiv:2312.07905  [pdf, ps, other

    cs.CV

    Plant Disease Recognition Datasets in the Age of Deep Learning: Challenges and Opportunities

    Authors: Mingle Xu, Ji Eun Park, Jaehwan Lee, Jucheng Yang, Sook Yoon

    Abstract: Plant disease recognition has witnessed a significant improvement with deep learning in recent years. Although plant disease datasets are essential and many relevant datasets are public available, two fundamental questions exist. First, how to differentiate datasets and further choose suitable public datasets for specific applications? Second, what kinds of characteristics of datasets are desired… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: Submission v1 to a journal

  21. arXiv:2312.03517  [pdf, other

    cs.CV cs.AI

    FRDiff : Feature Reuse for Universal Training-free Acceleration of Diffusion Models

    Authors: Junhyuk So, Jungwon Lee, Eunhyeok Park

    Abstract: The substantial computational costs of diffusion models, especially due to the repeated denoising steps necessary for high-quality image generation, present a major obstacle to their widespread adoption. While several studies have attempted to address this issue by reducing the number of score function evaluations (NFE) using advanced ODE solvers without fine-tuning, the decreased number of denois… ▽ More

    Submitted 2 April, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Work in progress. Project page : https://jungwon-lee.github.io/Project_FRDiff/

  22. arXiv:2311.14993  [pdf, other

    cs.CV

    Coordinate-Aware Modulation for Neural Fields

    Authors: Joo Chan Lee, Daniel Rho, Seungtae Nam, Jong Hwan Ko, Eunbyung Park

    Abstract: Neural fields, mapping low-dimensional input coordinates to corresponding signals, have shown promising results in representing various signals. Numerous methodologies have been proposed, and techniques employing MLPs and grid representations have achieved substantial success. MLPs allow compact and high expressibility, yet often suffer from spectral bias and slow convergence speed. On the other h… ▽ More

    Submitted 25 November, 2023; originally announced November 2023.

    Comments: Project page: http://maincold2.github.io/cam/

  23. arXiv:2311.13681  [pdf, other

    cs.CV cs.GR

    Compact 3D Gaussian Representation for Radiance Field

    Authors: Joo Chan Lee, Daniel Rho, Xiangyu Sun, Jong Hwan Ko, Eunbyung Park

    Abstract: Neural Radiance Fields (NeRFs) have demonstrated remarkable potential in capturing complex 3D scenes with high fidelity. However, one persistent challenge that hinders the widespread adoption of NeRFs is the computational bottleneck due to the volumetric rendering. On the other hand, 3D Gaussian splatting (3DGS) has recently emerged as an alternative representation that leverages a 3D Gaussisan-ba… ▽ More

    Submitted 15 February, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: Project page: http://maincold2.github.io/c3dgs/

  24. Fuel Consumption Prediction for a Passenger Ferry using Machine Learning and In-service Data: A Comparative Study

    Authors: Pedram Agand, Allison Kennedy, Trevor Harris, Chanwoo Bae, Mo Chen, Edward J Park

    Abstract: As the importance of eco-friendly transportation increases, providing an efficient approach for marine vessel operation is essential. Methods for status monitoring with consideration to the weather condition and forecasting with the use of in-service data from ships requires accurate and complete models for predicting the energy efficiency of a ship. The models need to effectively process all the… ▽ More

    Submitted 23 October, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: 20 pages, 11 figures, 7 tables

    Journal ref: Ocean Engineering 284 (2023): 115271

  25. arXiv:2309.16884  [pdf, other

    cs.RO eess.SY

    An MCTS-DRL Based Obstacle and Occlusion Avoidance Methodology in Robotic Follow-Ahead Applications

    Authors: Sahar Leisiazar, Edward J. Park, Angelica Lim, Mo Chen

    Abstract: We propose a novel methodology for robotic follow-ahead applications that address the critical challenge of obstacle and occlusion avoidance. Our approach effectively navigates the robot while ensuring avoidance of collisions and occlusions caused by surrounding objects. To achieve this, we developed a high-level decision-making algorithm that generates short-term navigational goals for the mobile… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

  26. arXiv:2309.06922  [pdf, other

    cs.CV

    Hydra: Multi-head Low-rank Adaptation for Parameter Efficient Fine-tuning

    Authors: Sanghyeon Kim, Hyunmo Yang, Younghyun Kim, Youngjoon Hong, Eunbyung Park

    Abstract: The recent surge in large-scale foundation models has spurred the development of efficient methods for adapting these models to various downstream tasks. Low-rank adaptation methods, such as LoRA, have gained significant attention due to their outstanding parameter efficiency and no additional inference latency. This paper investigates a more general form of adapter module based on the analysis th… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

  27. arXiv:2307.09455  [pdf, other

    cs.CL

    Pseudo Outlier Exposure for Out-of-Distribution Detection using Pretrained Transformers

    Authors: Jaeyoung Kim, Kyuheon Jung, Dongbin Na, Sion Jang, Eunbin Park, Sungchul Choi

    Abstract: For real-world language applications, detecting an out-of-distribution (OOD) sample is helpful to alert users or reject such unreliable samples. However, modern over-parameterized language models often produce overconfident predictions for both in-distribution (ID) and OOD samples. In particular, language models suffer from OOD samples with a similar semantic representation to ID samples since the… ▽ More

    Submitted 19 July, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: 12 pages, 2 figures

    MSC Class: 68T50

    Journal ref: Findings of the Association for Computational Linguistics: ACL 2023 (2023) 1469-1482

  28. arXiv:2306.15969  [pdf, other

    cs.LG cs.AI

    Separable Physics-Informed Neural Networks

    Authors: Junwoo Cho, Seungtae Nam, Hyunmo Yang, Seok-Bae Yun, Youngjoon Hong, Eunbyung Park

    Abstract: Physics-informed neural networks (PINNs) have recently emerged as promising data-driven PDE solvers showing encouraging results on various PDEs. However, there is a fundamental limitation of training PINNs to solve multi-dimensional PDEs and approximate highly complex solution functions. The number of training points (collocation points) required on these challenging PDEs grows substantially, but… ▽ More

    Submitted 31 October, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: To appear in NeurIPS 2023 (28 pages, 13 figures). workshop paper: arXiv:2211.08761

  29. arXiv:2306.02316  [pdf, other

    cs.CV

    Temporal Dynamic Quantization for Diffusion Models

    Authors: Junhyuk So, Jungwon Lee, Daehyun Ahn, Hyungjun Kim, Eunhyeok Park

    Abstract: The diffusion model has gained popularity in vision applications due to its remarkable generative performance and versatility. However, high storage and computation demands, resulting from the model size and iterative generation, hinder its use on mobile devices. Existing quantization techniques struggle to maintain performance even in 8-bit precision due to the diffusion model's unique property o… ▽ More

    Submitted 11 December, 2023; v1 submitted 4 June, 2023; originally announced June 2023.

  30. arXiv:2306.02272  [pdf, other

    cs.CL

    OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models

    Authors: Changhun Lee, Jungyu Jin, Taesu Kim, Hyungjun Kim, Eunhyeok Park

    Abstract: Large language models (LLMs) with hundreds of billions of parameters require powerful server-grade GPUs for inference, limiting their practical deployment. To address this challenge, we introduce the outlier-aware weight quantization (OWQ) method, which aims to minimize LLM's footprint through low-precision representation. OWQ prioritizes a small subset of structured weights sensitive to quantizat… ▽ More

    Submitted 23 January, 2024; v1 submitted 4 June, 2023; originally announced June 2023.

    Comments: Accepted at AAAI 2024 (oral presentation)

  31. arXiv:2304.02330  [pdf, other

    cs.CV

    SMPConv: Self-moving Point Representations for Continuous Convolution

    Authors: Sanghyeon Kim, Eunbyung Park

    Abstract: Continuous convolution has recently gained prominence due to its ability to handle irregularly sampled data and model long-term dependency. Also, the promising experimental results of using large convolutional kernels have catalyzed the development of continuous convolution since they can construct large kernels very efficiently. Leveraging neural networks, more specifically multilayer perceptrons… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPR 2023

  32. arXiv:2303.15699  [pdf, other

    cs.CV

    Enhancing Breast Cancer Risk Prediction by Incorporating Prior Images

    Authors: Hyeonsoo Lee, Junha Kim, Eunkyung Park, Minjeong Kim, Taesoo Kim, Thijs Kooi

    Abstract: Recently, deep learning models have shown the potential to predict breast cancer risk and enable targeted screening strategies, but current models do not consider the change in the breast over time. In this paper, we present a new method, PRIME+, for breast cancer risk prediction that leverages prior mammograms using a transformer decoder, outperforming a state-of-the-art risk prediction method th… ▽ More

    Submitted 28 August, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: MICCAI 2023 accepted

  33. FFNeRV: Flow-Guided Frame-Wise Neural Representations for Videos

    Authors: Joo Chan Lee, Daniel Rho, Jong Hwan Ko, Eunbyung Park

    Abstract: Neural fields, also known as coordinate-based or implicit neural representations, have shown a remarkable capability of representing, generating, and manipulating various forms of signals. For video representations, however, mapping pixel-wise coordinates to RGB colors has shown relatively low compression performance and slow convergence and inference speed. Frame-wise video representation, which… ▽ More

    Submitted 6 August, 2023; v1 submitted 23 December, 2022; originally announced December 2022.

    Comments: Our project page including code is available at https://maincold2.github.io/ffnerv/

  34. arXiv:2212.09069  [pdf, other

    cs.CV cs.GR

    Masked Wavelet Representation for Compact Neural Radiance Fields

    Authors: Daniel Rho, Byeonghyeon Lee, Seungtae Nam, Joo Chan Lee, Jong Hwan Ko, Eunbyung Park

    Abstract: Neural radiance fields (NeRF) have demonstrated the potential of coordinate-based neural representation (neural fields or implicit neural representation) in neural rendering. However, using a multi-layer perceptron (MLP) to represent a 3D scene or object requires enormous computational resources and time. There have been recent studies on how to reduce these computational inefficiencies by using a… ▽ More

    Submitted 21 March, 2023; v1 submitted 18 December, 2022; originally announced December 2022.

    Comments: Accepted to CVPR 2023

  35. arXiv:2212.05081  [pdf, other

    hep-ex cs.LG physics.comp-ph

    FAIR AI Models in High Energy Physics

    Authors: Javier Duarte, Haoyang Li, Avik Roy, Ruike Zhu, E. A. Huerta, Daniel Diaz, Philip Harris, Raghav Kansal, Daniel S. Katz, Ishaan H. Kavoori, Volodymyr V. Kindratenko, Farouk Mokhtar, Mark S. Neubauer, Sang Eon Park, Melissa Quinnan, Roger Rusack, Zhizhen Zhao

    Abstract: The findable, accessible, interoperable, and reusable (FAIR) data principles provide a framework for examining, evaluating, and improving how data is shared to facilitate scientific discovery. Generalizing these principles to research software and other digital products is an active area of research. Machine learning (ML) models -- algorithms that have been trained on data without being explicitly… ▽ More

    Submitted 29 December, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: 34 pages, 9 figures, 10 tables

    Journal ref: Mach. Learn.: Sci. Technol. 4 (2023) 045062

  36. arXiv:2211.08761  [pdf, other

    cs.LG

    Separable PINN: Mitigating the Curse of Dimensionality in Physics-Informed Neural Networks

    Authors: Junwoo Cho, Seungtae Nam, Hyunmo Yang, Seok-Bae Yun, Youngjoon Hong, Eunbyung Park

    Abstract: Physics-informed neural networks (PINNs) have emerged as new data-driven PDE solvers for both forward and inverse problems. While promising, the expensive computational costs to obtain solutions often restrict their broader applicability. We demonstrate that the computations in automatic differentiation (AD) can be significantly reduced by leveraging forward-mode AD when training PINN. However, a… ▽ More

    Submitted 2 November, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: To appear in NeurIPS 2022 Workshop on The Symbiosis of Deep Learning and Differential Equations (DLDE) - II, 12 pages, 5 figures, full paper: arXiv:2306.15969

  37. arXiv:2208.05484  [pdf, other

    hep-ph cs.LG hep-ex

    Neural Embedding: Learning the Embedding of the Manifold of Physics Data

    Authors: Sang Eon Park, Philip Harris, Bryan Ostdiek

    Abstract: In this paper, we present a method of embedding physics data manifolds with metric structure into lower dimensional spaces with simpler metrics, such as Euclidean and Hyperbolic spaces. We then demonstrate that it can be a powerful step in the data analysis pipeline for many applications. Using progressively more realistic simulated collisions at the Large Hadron Collider, we show that this embedd… ▽ More

    Submitted 14 August, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

  38. arXiv:2208.00338  [pdf, other

    cs.LG cs.AI

    Symmetry Regularization and Saturating Nonlinearity for Robust Quantization

    Authors: Sein Park, Yeongsang Jang, Eunhyeok Park

    Abstract: Robust quantization improves the tolerance of networks for various implementations, allowing reliable output in different bit-widths or fragmented low-precision arithmetic. In this work, we perform extensive analyses to identify the sources of quantization error and present three insights to robustify a network against quantization: reduction of error propagation, range clamping for error minimiza… ▽ More

    Submitted 30 July, 2022; originally announced August 2022.

  39. arXiv:2207.12800  [pdf, other

    cs.LG math.NA

    PIXEL: Physics-Informed Cell Representations for Fast and Accurate PDE Solvers

    Authors: Namgyu Kang, Byeonghyeon Lee, Youngjoon Hong, Seok-Bae Yun, Eunbyung Park

    Abstract: With the increases in computational power and advances in machine learning, data-driven learning-based methods have gained significant attention in solving PDEs. Physics-informed neural networks (PINNs) have recently emerged and succeeded in various forward and inverse PDE problems thanks to their excellent properties, such as flexibility, mesh-free solutions, and unsupervised training. However, t… ▽ More

    Submitted 4 February, 2023; v1 submitted 26 July, 2022; originally announced July 2022.

    Comments: Accepted to the 37th AAAI Conference on Artificial Intelligence (AAAI 2023) Main Track, DLDE-II NeurIPS 2022 Workshop (Spotlight), https://namgyukang.github.io/PIXEL/ (Project Page), https://github.com/NamGyuKang/CosineSampler (Customized CUDA) (17 pages, 17 figures)

  40. arXiv:2207.09663  [pdf, other

    cs.CV

    Streamable Neural Fields

    Authors: Junwoo Cho, Seungtae Nam, Daniel Rho, Jong Hwan Ko, Eunbyung Park

    Abstract: Neural fields have emerged as a new data representation paradigm and have shown remarkable success in various signal representations. Since they preserve signals in their network parameters, the data transfer by sending and receiving the entire model parameters prevents this emerging technology from being used in many practical scenarios. We propose streamable neural fields, a single model that co… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: To appear in ECCV 2022

  41. arXiv:2207.07597  [pdf, other

    cs.CL cs.AI cs.LG

    OASYS: Domain-Agnostic Automated System for Constructing Knowledge Base from Unstructured Text

    Authors: Minsang Kim, Sang-hyun Je, Eunjoo Park

    Abstract: In recent years, creating and managing knowledge bases have become crucial to the retail product and enterprise domains. We present an automatic knowledge base construction system that mines data from documents. This system can generate training data during the training process without human intervention. Therefore, it is domain-agnostic trainable using only the target domain text corpus and a pre… ▽ More

    Submitted 29 June, 2022; originally announced July 2022.

    Comments: ACM SIGKDD Workshop on Mining and Learning with Graphs 2022, Accepted

  42. arXiv:2206.00820  [pdf, other

    cs.LG

    NIPQ: Noise proxy-based Integrated Pseudo-Quantization

    Authors: Juncheol Shin, Junhyuk So, Sein Park, Seungyeop Kang, Sungjoo Yoo, Eunhyeok Park

    Abstract: Straight-through estimator (STE), which enables the gradient flow over the non-differentiable function via approximation, has been favored in studies related to quantization-aware training (QAT). However, STE incurs unstable convergence during QAT, resulting in notable quality degradation in low precision. Recently, pseudoquantization training has been proposed as an alternative approach to updati… ▽ More

    Submitted 1 July, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

  43. arXiv:2205.11179  [pdf, other

    cs.CV

    Online Hybrid Lightweight Representations Learning: Its Application to Visual Tracking

    Authors: Ilchae Jung, Minji Kim, Eunhyeok Park, Bohyung Han

    Abstract: This paper presents a novel hybrid representation learning framework for streaming data, where an image frame in a video is modeled by an ensemble of two distinct deep neural networks; one is a low-bit quantized network and the other is a lightweight full-precision network. The former learns coarse primary information with low cost while the latter conveys residual information for high fidelity to… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: 7 pages, 1 figure, accepted at IJCAI2022

  44. Federated Learning Enables Big Data for Rare Cancer Boundary Detection

    Authors: Sarthak Pati, Ujjwal Baid, Brandon Edwards, Micah Sheller, Shih-Han Wang, G Anthony Reina, Patrick Foley, Alexey Gruzdev, Deepthi Karkada, Christos Davatzikos, Chiharu Sako, Satyam Ghodasara, Michel Bilello, Suyash Mohan, Philipp Vollmuth, Gianluca Brugnara, Chandrakanth J Preetha, Felix Sahm, Klaus Maier-Hein, Maximilian Zenk, Martin Bendszus, Wolfgang Wick, Evan Calabrese, Jeffrey Rudie, Javier Villanueva-Meyer , et al. (254 additional authors not shown)

    Abstract: Although machine learning (ML) has shown promise in numerous domains, there are concerns about generalizability to out-of-sample data. This is currently addressed by centrally sharing ample, and importantly diverse, data from multiple sites. However, such centralization is challenging to scale (or even not feasible) due to various limitations. Federated ML (FL) provides an alternative to train acc… ▽ More

    Submitted 25 April, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: federated learning, deep learning, convolutional neural network, segmentation, brain tumor, glioma, glioblastoma, FeTS, BraTS

  45. arXiv:2204.07439  [pdf, other

    cs.CV cs.LG

    INSTA-BNN: Binary Neural Network with INSTAnce-aware Threshold

    Authors: Changhun Lee, Hyungjun Kim, Eunhyeok Park, Jae-Joon Kim

    Abstract: Binary Neural Networks (BNNs) have emerged as a promising solution for reducing the memory footprint and compute costs of deep neural networks, but they suffer from quality degradation due to the lack of freedom as activations and weights are constrained to the binary values. To compensate for the accuracy drop, we propose a novel BNN design called Binary Neural Network with INSTAnce-aware thresho… ▽ More

    Submitted 19 October, 2023; v1 submitted 15 April, 2022; originally announced April 2022.

    Comments: Accepted at ICCV 2023

  46. arXiv:2203.10996  [pdf, other

    cs.IR cs.AI cs.MM

    Technologies for AI-Driven Fashion Social Networking Service with E-Commerce

    Authors: Jinseok Seol, Seongjae Kim, Sungchan Park, Holim Lim, Hyunsoo Na, Eunyoung Park, Dohee Jung, Soyoung Park, Kangwoo Lee, Sang-goo Lee

    Abstract: The rapid growth of the online fashion market brought demands for innovative fashion services and commerce platforms. With the recent success of deep learning, many applications employ AI technologies such as visual search and recommender systems to provide novel and beneficial services. In this paper, we describe applied technologies for AI-driven fashion social networking service that incorporat… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: 16 pages, accepted in International Semantic Intelligence Conference (ISIC) 2022, The Applications and Deployment Track

  47. Are you aware of what you are watching? Role of machine heuristic in online content recommendations

    Authors: Soyoung Oh, Eunil Park

    Abstract: Since recommender systems have been created and developed to automate the recommendation process, users can easily consume their desired video content on online platforms. In this line, several content recommendation algorithms are introduced and employed to allow users to encounter content of their interests, effectively. However, the recommendation systems sometimes regrettably recommend inappro… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

  48. arXiv:2201.04329  [pdf, other

    cs.CV

    Neural Residual Flow Fields for Efficient Video Representations

    Authors: Daniel Rho, Junwoo Cho, Jong Hwan Ko, Eunbyung Park

    Abstract: Neural fields have emerged as a powerful paradigm for representing various signals, including videos. However, research on improving the parameter efficiency of neural fields is still in its early stages. Even though neural fields that map coordinates to colors can be used to encode video signals, this scheme does not exploit the spatial and temporal redundancy of video signals. Inspired by standa… ▽ More

    Submitted 5 October, 2022; v1 submitted 12 January, 2022; originally announced January 2022.

    Comments: Accepted for ACCV 2022, codes are available at https://github.com/daniel03c1/eff_video_representation

  49. arXiv:2112.14370  [pdf, other

    cs.IR cs.LG

    On the Overlooked Significance of Underutilized Contextual Features in Recent News Recommendation Models

    Authors: Sungmin Cho, Hongjun Lim, Keunchan Park, Sungjoo Yoo, Eunhyeok Park

    Abstract: Personalized news recommendation aims to provide attractive articles for readers by predicting their likelihood of clicking on a certain article. To accurately predict this probability, plenty of studies have been proposed that actively utilize content features of articles, such as words, categories, or entities. However, we observed that the articles' contextual features, such as CTR (click-throu… ▽ More

    Submitted 28 December, 2021; originally announced December 2021.

  50. arXiv:2111.03146  [pdf, other

    cs.LG cs.SD eess.AS

    Generating Diverse Realistic Laughter for Interactive Art

    Authors: M. Mehdi Afsar, Eric Park, Étienne Paquette, Gauthier Gidel, Kory W. Mathewson, Eilif Muller

    Abstract: We propose an interactive art project to make those rendered invisible by the COVID-19 crisis and its concomitant solitude reappear through the welcome melody of laughter, and connections created and explored through advanced laughter synthesis approaches. However, the unconditional generation of the diversity of human emotional responses in high-quality auditory synthesis remains an open problem,… ▽ More

    Submitted 29 July, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

    Comments: Presented at Machine Learning for Creativity and Design workshop at NeurIPS 2021, 6 pages