Showing 1–43 of 43 results for author: Im, J

Searching in archive cs.
  1. arXiv:2408.11915  [pdf, other]

    cs.SD cs.CV cs.LG cs.MM eess.AS

    Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound

    Authors: Junwon Lee, Jaekwon Im, Dabin Kim, Juhan Nam

    Abstract: Foley sound synthesis is crucial for multimedia production, enhancing user experience by synchronizing audio and video both temporally and semantically. Recent studies on automating this labor-intensive process through video-to-sound generation face significant challenges. Systems lacking explicit temporal features suffer from poor controllability and alignment, while timestamp-based models requir…

    Submitted 21 August, 2024; originally announced August 2024.

  2. arXiv:2404.07217  [pdf, other]

    eess.SP cs.AI cs.CV cs.LG

    Attention-aware Semantic Communications for Collaborative Inference

    Authors: Jiwoong Im, Nayoung Kwon, Taewoo Park, Jiheon Woo, Jaeho Lee, Yongjune Kim

    Abstract: We propose a communication-efficient collaborative inference framework in the domain of edge inference, focusing on the efficient use of vision transformer (ViT) models. The partitioning strategy of conventional collaborative inference fails to reduce communication cost because of the inherent architecture of ViTs maintaining consistent layer dimensions across the entire transformer encoder. There…

    Submitted 31 May, 2024; v1 submitted 23 February, 2024; originally announced April 2024.

  3. arXiv:2404.02072  [pdf, other]

    cs.CV cs.LG

    EGTR: Extracting Graph from Transformer for Scene Graph Generation

    Authors: Jinbae Im, JeongYeon Nam, Nokyung Park, Hyungmin Lee, Seunghyun Park

    Abstract: Scene Graph Generation (SGG) is a challenging task of detecting objects and predicting relationships between objects. After DETR was developed, one-stage SGG models based on a one-stage object detector have been actively studied. However, complex modeling is used to predict the relationship between objects, and the inherent relationship between object queries learned in the multi-head self-attenti…

    Submitted 24 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: CVPR 2024 (Best paper award candidate)

  4. arXiv:2404.01954  [pdf, other]

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han, et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  5. arXiv:2403.10384  [pdf, other]

    cs.GT cs.MA eess.SY

    Coordination in Noncooperative Multiplayer Matrix Games via Reduced Rank Correlated Equilibria

    Authors: Jaehan Im, Yue Yu, David Fridovich-Keil, Ufuk Topcu

    Abstract: Coordination in multiplayer games enables players to avoid the lose-lose outcome that often arises at Nash equilibria. However, designing a coordination mechanism typically requires the consideration of the joint actions of all players, which becomes intractable in large-scale games. We develop a novel coordination mechanism, termed reduced rank correlated equilibria, which reduces the number of j…

    Submitted 12 June, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

  6. arXiv:2401.08102  [pdf, other]

    cs.SD eess.AS

    DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech

    Authors: Jaekwon Im, Juhan Nam

    Abstract: Properly setting up recording conditions, including microphone type and placement, room acoustics, and ambient noise, is essential to obtaining the desired acoustic characteristics of speech. In this paper, we propose Diff-R-EN-T, a Diffusion model for Recording ENvironment Transfer which transforms the input speech to have the recording conditions of a reference speech while preserving the speech…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 4 pages, 2 figures

  7. arXiv:2309.12333  [pdf, other]

    cs.CY cs.LG

    Onchain Sports Betting using UBET Automated Market Maker

    Authors: Daniel Jiwoong Im, Alexander Kondratskiy, Vincent Harvey, Hsuan-Wei Fu

    Abstract: The paper underscores how decentralization in sports betting addresses the drawbacks of traditional centralized platforms, ensuring transparency, security, and lower fees. Non-custodial solutions empower bettors with ownership of funds, bypassing geographical restrictions. Decentralized platforms enhance security, privacy, and democratic decision-making. However, decentralized sports betting neces…

    Submitted 17 August, 2023; originally announced September 2023.

  8. arXiv:2309.00311  [pdf, other]

    q-bio.GN cs.DB

    BRCA Gene Mutations in dbSNP: A Visual Exploration of Genetic Variants

    Authors: Woowon Jang, Shiwoo Koak, Jiwon Im, Utku Ozbulak, Joris Vankerschaver

    Abstract: BRCA genes, comprising BRCA1 and BRCA2 play indispensable roles in preserving genomic stability and facilitating DNA repair mechanisms. The presence of germline mutations in these genes has been associated with increased susceptibility to various cancers, notably breast and ovarian cancers. Recent advancements in cost-effective sequencing technologies have revolutionized the landscape of cancer ge…

    Submitted 1 September, 2023; originally announced September 2023.

  9. arXiv:2308.09248  [pdf, other]

    cs.LG stat.ML

    Active and Passive Causal Inference Learning

    Authors: Daniel Jiwoong Im, Kyunghyun Cho

    Abstract: This paper serves as a starting point for machine learning researchers, engineers and students who are interested in but not yet familiar with causal inference. We start by laying out an important set of assumptions that are collectively needed for causal identification, such as exchangeability, positivity, consistency and the absence of interference. From these assumptions, we build out a set of…

    Submitted 25 August, 2024; v1 submitted 17 August, 2023; originally announced August 2023.

  10. arXiv:2308.06375  [pdf, ps, other]

    cs.LG cs.CE q-fin.CP

    UAMM: Price-oracle based Automated Market Maker

    Authors: Daniel Jiwoong Im, Alexander Kondratskiy, Vincent Harvey, Hsuan-Wei Fu

    Abstract: Automated market makers (AMMs) are pricing mechanisms utilized by decentralized exchanges (DEX). Traditional AMM approaches are constrained by pricing solely based on their own liquidity pool, without consideration of external markets or risk management for liquidity providers. In this paper, we propose a new approach known as UBET AMM (UAMM), which calculates prices by considering external market…

    Submitted 25 August, 2024; v1 submitted 11 August, 2023; originally announced August 2023.

  11. arXiv:2306.08204  [pdf, other]

    cs.AI cs.LG

    Unraveling the ARC Puzzle: Mimicking Human Solutions with Object-Centric Decision Transformer

    Authors: Jaehyun Park, Jaegyun Im, Sanha Hwang, Mintaek Lim, Sabina Ualibekova, Sejin Kim, Sundong Kim

    Abstract: In the pursuit of artificial general intelligence (AGI), we tackle Abstraction and Reasoning Corpus (ARC) tasks using a novel two-pronged approach. We employ the Decision Transformer in an imitation learning paradigm to model human problem-solving, and introduce an object detection algorithm, the Push and Pull clustering method. This dual strategy enhances AI's ARC problem-solving skills and provi…

    Submitted 13 June, 2023; originally announced June 2023.

  12. arXiv:2304.12521  [pdf, other]

    cs.SD eess.AS

    Foley Sound Synthesis at the DCASE 2023 Challenge

    Authors: Keunwoo Choi, Jaekwon Im, Laurie Heller, Brian McFee, Keisuke Imoto, Yuki Okamoto, Mathieu Lagrange, Shinosuke Takamichi

    Abstract: The addition of Foley sound effects during post-production is a common technique used to enhance the perceived acoustic properties of multimedia content. Traditionally, Foley sound has been produced by human Foley artists, which involves manual recording and mixing of sound. However, recent advances in sound synthesis and generative models have generated interest in machine-assisted or automatic F…

    Submitted 28 September, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: DCASE 2023 Challenge - Task 7 - Technical Report (Submitted to DCASE 2023 Workshop)

  13. Women's Perspectives on Harm and Justice after Online Harassment

    Authors: Jane Im, Sarita Schoenebeck, Marilyn Iriarte, Gabriel Grill, Daricia Wilkinson, Amna Batool, Rahaf Alharbi, Audrey Funwie, Tergel Gankhuu, Eric Gilbert, Mustafa Naseem

    Abstract: Social media platforms aspire to create online experiences where users can participate safely and equitably. However, women around the world experience widespread online harassment, including insults, stalking, aggression, threats, and non-consensual sharing of sexual photos. This article describes women's perceptions of harm associated with online harassment and preferred platform responses to th…

    Submitted 27 January, 2023; originally announced January 2023.

  14. arXiv:2301.07163  [pdf, other]

    cs.CY cs.HC

    AppealMod: Inducing Friction to Reduce Moderator Workload of Handling User Appeals

    Authors: Shubham Atreja, Jane Im, Paul Resnick, Libby Hemphill

    Abstract: As content moderation becomes a central aspect of all social media platforms and online communities, interest has grown in how to make moderation decisions contestable. On social media platforms where individual communities moderate their own activities, the responsibility to address user appeals falls on volunteers from within the community. While there is a growing body of work devoted to unders…

    Submitted 9 January, 2024; v1 submitted 17 January, 2023; originally announced January 2023.

    Comments: accepted at CSCW'24

  15. arXiv:2211.15948  [pdf, other]

    cs.SD eess.AS

    Neural Vocoder Feature Estimation for Dry Singing Voice Separation

    Authors: Jaekwon Im, Soonbeom Choi, Sangeon Yong, Juhan Nam

    Abstract: Singing voice separation (SVS) is a task that separates singing voice audio from its mixture with instrumental audio. Previous SVS studies have mainly employed the spectrogram masking method which requires a large dimensionality in predicting the binary masks. In addition, they focused on extracting a vocal stem that retains the wet sound with the reverberation effect. This result may hinder the r…

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: 6 pages, 4 figures

    Journal ref: 14th Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2022

  16. arXiv:2207.01749  [pdf, other]

    cs.SE cs.HC

    Human-AI Guidelines in Practice: Leaky Abstractions as an Enabler in Collaborative Software Teams

    Authors: Hariharan Subramonyam, Jane Im, Colleen Seifert, Eytan Adar

    Abstract: In conventional software development, user experience (UX) designers and engineers collaborate through separation of concerns (SoC): designers create human interface specifications, and engineers build to those specifications. However, we argue that Human-AI systems thwart SoC because human needs must shape the design of the AI interface, the underlying AI sub-components, and training data. How do…

    Submitted 4 July, 2022; originally announced July 2022.

  17. arXiv:2112.00956  [pdf, other]

    cs.LG cs.MA cs.NI cs.RO

    Personalized Federated Learning of Driver Prediction Models for Autonomous Driving

    Authors: Manabu Nakanoya, Junha Im, Hang Qiu, Sachin Katti, Marco Pavone, Sandeep Chinchali

    Abstract: Autonomous vehicles (AVs) must interact with a diverse set of human drivers in heterogeneous geographic areas. Ideally, fleets of AVs should share trajectory data to continually re-train and improve trajectory forecasting models from collective experience using cloud-based distributed learning. At the same time, these robots should ideally avoid uploading raw driver interaction data in order to pr…

    Submitted 1 December, 2021; originally announced December 2021.

  18. arXiv:2111.08656  [pdf, other]

    cs.LG

    Causal Effect Variational Autoencoder with Uniform Treatment

    Authors: Daniel Jiwoong Im, Kyunghyun Cho, Narges Razavian

    Abstract: Domain adaptation and covariate shift are big issues in deep learning and they ultimately affect any causal inference algorithms that rely on deep neural networks. Causal effect variational autoencoder (CEVAE) is trained to predict the outcome given observational treatment data and it suffers from the distribution shift at test time. In this paper, we introduce uniform treatment variational autoen…

    Submitted 20 September, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

  19. Searching For or Reviewing Evidence Improves Crowdworkers' Misinformation Judgments and Reduces Partisan Bias

    Authors: Paul Resnick, Aljohara Alfayez, Jane Im, Eric Gilbert

    Abstract: Can crowd workers be trusted to judge whether news-like articles circulating on the Internet are misleading, or does partisanship and inexperience get in the way? And can the task be structured in a way that reduces partisanship? We assembled pools of both liberal and conservative crowd raters and tested three ways of asking them to make judgments about 374 articles. In a no research condition, th…

    Submitted 10 April, 2023; v1 submitted 17 August, 2021; originally announced August 2021.

    Comments: Revised title and framing to focus on difference results for different experimental conditions; new Fig. 1 that provides overview of the performance comparison process; other improvements in response to reviewer feedback

    Journal ref: Collective Intelligence, 2(2) (2023)

  20. arXiv:2105.13135  [pdf, other]

    cs.CL cs.AI cs.LG

    Self-Supervised Multimodal Opinion Summarization

    Authors: Jinbae Im, Moonki Kim, Hoyeop Lee, Hyunsouk Cho, Sehee Chung

    Abstract: Recently, opinion summarization, which is the generation of a summary from multiple reviews, has been conducted in a self-supervised manner by considering a sampled review as a pseudo summary. However, non-text data such as image and metadata related to reviews have been considered less often. To use the abundant information contained in non-text data, we propose a self-supervised multimodal opini…

    Submitted 27 May, 2021; originally announced May 2021.

    Comments: ACL 2021

  21. arXiv:2102.07813  [pdf, other]

    cs.LG

    Online hyperparameter optimization by real-time recurrent learning

    Authors: Daniel Jiwoong Im, Cristina Savin, Kyunghyun Cho

    Abstract: Conventional hyperparameter optimization methods are computationally intensive and hard to generalize to scenarios that require dynamically adapting hyperparameters, such as life-long learning. Here, we propose an online hyperparameter optimization algorithm that is asymptotically exact and computationally tractable, both theoretically and practically. Our framework takes advantage of the analogy…

    Submitted 8 April, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

  22. arXiv:2102.07645  [pdf, other]

    cs.IR cs.AI cs.LG

    Freudian and Newtonian Recurrent Cell for Sequential Recommendation

    Authors: Hoyeop Lee, Jinbae Im, Chang Ouk Kim, Sehee Chung

    Abstract: A sequential recommender system aims to recommend attractive items to users based on behaviour patterns. The predominant sequential recommendation models are based on natural language processing models, such as the gated recurrent unit, that embed items in some defined space and grasp the user's long-term and short-term preferences based on the item embeddings. However, these approaches lack funda…

    Submitted 11 February, 2021; originally announced February 2021.

  23. Tunnel Facility-based Vehicle Localization in Highway Tunnel using 3D LIDAR

    Authors: Kyuwon Kim, Junhyuck Im, Gyuin Jee

    Abstract: Vehicle localization in highway tunnels is a challenging issue for autonomous vehicle navigation. Since GPS signals from satellites cannot be received inside a highway tunnel, map-aided localization is essential. However, the environment around the tunnel is composed mostly of an elliptical wall. Thereby, the unique feature points for map matching are few unlike the case outdoors. As a result, it…

    Submitted 24 December, 2020; originally announced December 2020.

    Comments: 16 pages, 25 figures

  24. arXiv:2007.12298  [pdf, other]

    cs.LG stat.ML

    Evaluation metrics for behaviour modeling

    Authors: Daniel Jiwoong Im, Iljung Kwak, Kristin Branson

    Abstract: A primary difficulty with unsupervised discovery of structure in large data sets is a lack of quantitative evaluation criteria. In this work, we propose and investigate several metrics for evaluating and comparing generative models of behavior learned using imitation learning. Compared to the commonly-used model log-likelihood, these criteria look at longer temporal relationships in behavior, are…

    Submitted 23 July, 2020; originally announced July 2020.

    Comments: 17 pages

  25. arXiv:2001.01647  [pdf, other]

    cs.NE cs.LG stat.ML

    Are skip connections necessary for biologically plausible learning rules?

    Authors: Daniel Jiwoong Im, Rutuja Patil, Kristin Branson

    Abstract: Backpropagation is the workhorse of deep learning, however, several other biologically-motivated learning rules have been introduced, such as random feedback alignment and difference target propagation. None of these methods have produced a competitive performance against backpropagation. In this paper, we show that biologically-motivated learning rules with skip connections between intermediate l…

    Submitted 4 December, 2019; originally announced January 2020.

  26. arXiv:1910.07368  [pdf, other]

    cs.LG stat.ML

    Model-Agnostic Meta-Learning using Runge-Kutta Methods

    Authors: Daniel Jiwoong Im, Yibo Jiang, Nakul Verma

    Abstract: Meta-learning has emerged as an important framework for learning new tasks from just a few examples. The success of any meta-learning model depends on (i) its fast adaptation to new tasks, as well as (ii) having a shared representation across similar tasks. Here we extend the model-agnostic meta-learning (MAML) framework introduced by Finn et al. (2017) to achieve improved performance by analyzing…

    Submitted 17 October, 2019; v1 submitted 16 October, 2019; originally announced October 2019.

  27. arXiv:1908.00413  [pdf, other]

    cs.IR cs.AI cs.LG

    MeLU: Meta-Learned User Preference Estimator for Cold-Start Recommendation

    Authors: Hoyeop Lee, Jinbae Im, Seongwon Jang, Hyunsouk Cho, Sehee Chung

    Abstract: This paper proposes a recommender system to alleviate the cold-start problem that can estimate user preferences based on only a small number of items. To identify a user's preference in the cold state, existing recommender systems, such as Netflix, initially provide items to a user; we call those items evidence candidates. Recommendations are then made based on the items selected by the user. Prev…

    Submitted 31 July, 2019; originally announced August 2019.

    Comments: Accepted as a full paper at KDD 2019

  28. arXiv:1907.12850  [pdf, other]

    cs.CL

    Confirmatory Aspect-based Opinion Mining Processes

    Authors: Jongho Im, Taikgun Song, Youngsu Lee, Jewoo Kim

    Abstract: A new opinion extraction method is proposed to summarize unstructured, user-generated content (i.e., online customer reviews) in the fixed topic domains. To differentiate the current approach from other opinion extraction approaches, which are often exposed to a sparsity problem and lack of sentiment scores, a confirmatory aspect-based opinion mining framework is introduced along with its practica…

    Submitted 30 July, 2019; originally announced July 2019.

  29. arXiv:1906.03214  [pdf, other]

    cs.LG eess.IV stat.ML

    Importance Weighted Adversarial Variational Autoencoders for Spike Inference from Calcium Imaging Data

    Authors: Daniel Jiwoong Im, Sridhama Prakhya, Jinyao Yan, Srinivas Turaga, Kristin Branson

    Abstract: The Importance Weighted Auto Encoder (IWAE) objective has been shown to improve the training of generative models over the standard Variational Auto Encoder (VAE) objective. Here, we derive importance weighted extensions to AVB and AAE. These latent variable models use implicitly defined inference networks whose approximate posterior density q_φ(z|x) cannot be directly evaluated, an essential ingr…

    Submitted 22 October, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

  30. arXiv:1901.11162  [pdf, other]

    cs.SI cs.CY

    Still out there: Modeling and Identifying Russian Troll Accounts on Twitter

    Authors: Jane Im, Eshwar Chandrasekharan, Jackson Sargent, Paige Lighthammer, Taylor Denby, Ankit Bhargava, Libby Hemphill, David Jurgens, Eric Gilbert

    Abstract: There is evidence that Russia's Internet Research Agency attempted to interfere with the 2016 U.S. election by running fake accounts on Twitter - often referred to as "Russian trolls". In this work, we: 1) develop machine learning models that predict whether a Twitter account is a Russian troll within a set of 170K control accounts; and, 2) demonstrate that it is possible to use this model to find…

    Submitted 30 January, 2019; originally announced January 2019.

  31. arXiv:1811.01247  [pdf, other]

    cs.LG stat.ML

    Stochastic Neighbor Embedding under f-divergences

    Authors: Daniel Jiwoong Im, Nakul Verma, Kristin Branson

    Abstract: The t-distributed Stochastic Neighbor Embedding (t-SNE) is a powerful and popular method for visualizing high-dimensional data. It minimizes the Kullback-Leibler (KL) divergence between the original and embedded data distributions. In this work, we propose extending this method to other f-divergences. We analytically and empirically evaluate the types of latent structure-manifold, cluster, and hie…

    Submitted 3 November, 2018; originally announced November 2018.

  32. arXiv:1803.01045  [pdf, other]

    cs.LG

    Quantitatively Evaluating GANs With Divergences Proposed for Training

    Authors: Daniel Jiwoong Im, He Ma, Graham Taylor, Kristin Branson

    Abstract: Generative adversarial networks (GANs) have been extremely effective in approximating complex distributions of high-dimensional, input data samples, and substantial progress has been made in understanding and improving GAN performance in terms of both theory and application. However, we currently lack quantitative methods for model assessment. Because of this, while many GAN variants are being pro…

    Submitted 28 April, 2018; v1 submitted 2 March, 2018; originally announced March 2018.

    Comments: ICLR 2018

  33. arXiv:1712.02047  [pdf, other]

    cs.CL cs.AI

    Distance-based Self-Attention Network for Natural Language Inference

    Authors: Jinbae Im, Sungzoon Cho

    Abstract: Attention mechanism has been used as an ancillary means to help RNN or CNN. However, the Transformer (Vaswani et al., 2017) recently recorded the state-of-the-art performance in machine translation with a dramatic reduction in training time by solely using attention. Motivated by the Transformer, Directional Self Attention Network (Shen et al., 2017), a fully attention-based sentence encoder, was…

    Submitted 6 December, 2017; originally announced December 2017.

    Comments: 12 pages, 13 figures

  34. arXiv:1706.07518  [pdf, other]

    cs.CL

    Neural Machine Translation with Gumbel-Greedy Decoding

    Authors: Jiatao Gu, Daniel Jiwoong Im, Victor O. K. Li

    Abstract: Previous neural machine translation models used some heuristic search algorithms (e.g., beam search) in order to avoid solving the maximum a posteriori problem over translation sentences at test time. In this paper, we propose the Gumbel-Greedy Decoding which trains a generative network to predict translation under a trained model. We solve such a problem using the Gumbel-Softmax reparameterizatio…

    Submitted 22 June, 2017; originally announced June 2017.

  35. arXiv:1612.04021  [pdf, other]

    cs.LG stat.ML

    Generative Adversarial Parallelization

    Authors: Daniel Jiwoong Im, He Ma, Chris Dongjoo Kim, Graham Taylor

    Abstract: Generative Adversarial Networks have become one of the most studied frameworks for unsupervised learning due to their intuitive formulation. They have also been shown to be capable of generating convincing examples in limited domains, such as low-resolution images. However, they still prove difficult to train in practice and tend to ignore modes of the data generating distribution. Quantitatively…

    Submitted 12 December, 2016; originally announced December 2016.

  36. arXiv:1612.04010  [pdf, other]

    cs.LG

    An empirical analysis of the optimization of deep network loss surfaces

    Authors: Daniel Jiwoong Im, Michael Tao, Kristin Branson

    Abstract: The success of deep neural networks hinges on our ability to accurately and efficiently optimize high-dimensional, non-convex functions. In this paper, we empirically investigate the loss functions of state-of-the-art networks, and how commonly-used stochastic gradient descent variants optimize these loss functions. To do this, we visualize the loss function by projecting them down to low-dimensio…

    Submitted 7 December, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

  37. arXiv:1607.03050  [pdf, other]

    cs.LG stat.ML

    Learning a metric for class-conditional KNN

    Authors: Daniel Jiwoong Im, Graham W. Taylor

    Abstract: Naive Bayes Nearest Neighbour (NBNN) is a simple and effective framework which addresses many of the pitfalls of K-Nearest Neighbour (KNN) classification. It has yielded competitive results on several computer vision benchmarks. Its central tenet is that during NN search, a query is not compared to every example in a database, ignoring class information. Instead, NN searches are performed within e…

    Submitted 11 July, 2016; originally announced July 2016.

  38. arXiv:1602.05110  [pdf, other]

    cs.LG cs.CV

    Generating images with recurrent adversarial networks

    Authors: Daniel Jiwoong Im, Chris Dongjoo Kim, Hui Jiang, Roland Memisevic

    Abstract: Gatys et al. (2015) showed that optimizing pixels to match features in a convolutional network with respect reference image features is a way to render images of high visual quality. We show that unrolling this gradient-based optimization yields a recurrent computation that creates images by incrementally adding onto a visual "canvas". We propose a recurrent generative model inspired by this view,…

    Submitted 12 December, 2016; v1 submitted 16 February, 2016; originally announced February 2016.

  39. arXiv:1511.06406  [pdf, other]

    cs.LG

    Denoising Criterion for Variational Auto-Encoding Framework

    Authors: Daniel Jiwoong Im, Sungjin Ahn, Roland Memisevic, Yoshua Bengio

    Abstract: Denoising autoencoders (DAE) are trained to reconstruct their clean inputs with noise injected at the input level, while variational autoencoders (VAE) are trained with noise injected in their stochastic hidden layer, with a regularizer that encourages this noise injection. In this paper, we show that injecting noise both in input and in the stochastic hidden layer can be advantageous and we propo…

    Submitted 4 January, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: ICLR conference submission

  40. arXiv:1506.07643  [pdf, other]

    cs.LG

    Conservativeness of untied auto-encoders

    Authors: Daniel Jiwoong Im, Mohamed Ishmael Diwan Belghazi, Roland Memisevic

    Abstract: We discuss necessary and sufficient conditions for an auto-encoder to define a conservative vector field, in which case it is associated with an energy function akin to the unnormalized log-probability of the data. We show that the conditions for conservativeness are more general than for encoder and decoder weights to be the same ("tied weights"), and that they also depend on the form of the hidd…

    Submitted 21 September, 2015; v1 submitted 25 June, 2015; originally announced June 2015.

  41. arXiv:1412.6630  [pdf, other]

    cs.LG cs.NE stat.ML

    Neural Network Regularization via Robust Weight Factorization

    Authors: Jan Rudy, Weiguang Ding, Daniel Jiwoong Im, Graham W. Taylor

    Abstract: Regularization is essential when training large neural networks. As deep neural networks can be mathematically interpreted as universal function approximators, they are effective at memorizing sampling noise in the training data. This results in poor generalization to unseen data. Therefore, it is no surprise that a new regularization technique, Dropout, was partially responsible for the now-ubiqu…

    Submitted 5 January, 2015; v1 submitted 20 December, 2014; originally announced December 2014.

  42. arXiv:1412.6617  [pdf, other]

    cs.LG

    Understanding Minimum Probability Flow for RBMs Under Various Kinds of Dynamics

    Authors: Daniel Jiwoong Im, Ethan Buchman, Graham W. Taylor

    Abstract: Energy-based models are popular in machine learning due to the elegance of their formulation and their relationship to statistical physics. Among these, the Restricted Boltzmann Machine (RBM), and its staple training algorithm contrastive divergence (CD), have been the prototype for some recent advancements in the unsupervised training of deep neural networks. However, CD has limited theoretical m…

    Submitted 7 April, 2015; v1 submitted 20 December, 2014; originally announced December 2014.

    Comments: Nine pages including the reference page plus one page appendix. Appeared at ICLR2015 workshop track

  43. arXiv:1412.6610  [pdf, other]

    cs.LG cs.NE

    Scoring and Classifying with Gated Auto-encoders

    Authors: Daniel Jiwoong Im, Graham W. Taylor

    Abstract: Auto-encoders are perhaps the best-known non-probabilistic methods for representation learning. They are conceptually simple and easy to train. Recent theoretical work has shed light on their ability to capture manifold structure, and drawn connections to density modelling. This has motivated researchers to seek ways of auto-encoder scoring, which has furthered their use in classification. Gated a…

    Submitted 14 June, 2015; v1 submitted 20 December, 2014; originally announced December 2014.