Zum Hauptinhalt springen

Showing 151–200 of 344 results for author: Yang, E

.
  1. arXiv:2105.01044  [pdf, other

    cs.IR cs.CL

    Goldilocks: Just-Right Tuning of BERT for Technology-Assisted Review

    Authors: Eugene Yang, Sean MacAvaney, David D. Lewis, Ophir Frieder

    Abstract: Technology-assisted review (TAR) refers to iterative active learning workflows for document review in high recall retrieval (HRR) tasks. TAR research and most commercial TAR software have applied linear models such as logistic regression to lexical features. Transformer-based models with supervised tuning are known to improve effectiveness on many text classification tasks, suggesting their use in… ▽ More

    Submitted 19 January, 2022; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: 6 pages, 1 figure, accepted at ECIR 2022

  2. arXiv:2105.00795  [pdf, other

    cs.LG

    RetCL: A Selection-based Approach for Retrosynthesis via Contrastive Learning

    Authors: Hankook Lee, Sungsoo Ahn, Seung-Woo Seo, You Young Song, Eunho Yang, Sung-Ju Hwang, Jinwoo Shin

    Abstract: Retrosynthesis, of which the goal is to find a set of reactants for synthesizing a target product, is an emerging research area of deep learning. While the existing approaches have shown promising results, they currently lack the ability to consider availability (e.g., stability or purchasability) of the reactants or generalize to unseen reaction templates (i.e., chemical reaction rules). In this… ▽ More

    Submitted 3 June, 2021; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: Accepted to IJCAI 2021. Short version was accepted to Machine Learning for Molecules Workshop at NeurIPS 2020

  3. arXiv:2104.08314  [pdf, other

    cs.CV

    High Performance Convolution Using Sparsity and Patterns for Inference in Deep Convolutional Neural Networks

    Authors: Hossam Amer, Ahmed H. Salamah, Ahmad Sajedi, En-hui Yang

    Abstract: Deploying deep Convolutional Neural Networks (CNNs) is impacted by their memory footprint and speed requirements, which mainly come from convolution. Widely-used convolution algorithms, im2col and MEC, produce a lowered matrix from an activation map by redundantly storing the map's elements included at horizontal and/or vertical kernel overlappings without considering the sparsity of the map. Usin… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

    Comments: 34 pages

  4. arXiv:2103.14302  [pdf, other

    cs.CL cs.SD eess.AS

    Mutually-Constrained Monotonic Multihead Attention for Online ASR

    Authors: Jaeyun Song, Hajin Shim, Eunho Yang

    Abstract: Despite the feature of real-time decoding, Monotonic Multihead Attention (MMA) shows comparable performance to the state-of-the-art offline methods in machine translation and automatic speech recognition (ASR) tasks. However, the latency of MMA is still a major issue in ASR and should be combined with a technique that can reduce the test latency at inference time, such as head-synchronous beam sea… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: Accepted at IEEE ICASSP 2021

  5. arXiv:2103.13151  [pdf, other

    cs.CV

    Learning Polar Encodings for Arbitrary-Oriented Ship Detection in SAR Images

    Authors: Yishan He, Fei Gao, Jun Wang, Amir Hussain, Erfu Yang, Huiyu Zhou

    Abstract: Common horizontal bounding box (HBB)-based methods are not capable of accurately locating slender ship targets with arbitrary orientations in synthetic aperture radar (SAR) images. Therefore, in recent years, methods based on oriented bounding box (OBB) have gradually received attention from researchers. However, most of the recently proposed deep learning-based methods for OBB detection encounter… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

  6. arXiv:2103.01328  [pdf, other

    cs.CL

    ToxCCIn: Toxic Content Classification with Interpretability

    Authors: Tong Xiang, Sean MacAvaney, Eugene Yang, Nazli Goharian

    Abstract: Despite the recent successes of transformer-based models in terms of effectiveness on a variety of tasks, their decisions often remain opaque to humans. Explanations are particularly important for tasks like offensive language or toxicity detection on social media because a manual appeal process is often in place to dispute automatically flagged content. In this work, we propose a technique to imp… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: Long paper accepted to WASSA2021@EACL

  7. arXiv:2102.03866  [pdf, other

    cs.LG

    Model-Augmented Q-learning

    Authors: Youngmin Oh, Jinwoo Shin, Eunho Yang, Sung Ju Hwang

    Abstract: In recent years, $Q$-learning has become indispensable for model-free reinforcement learning (MFRL). However, it suffers from well-known problems such as under- and overestimation bias of the value, which may adversely affect the policy learning. To resolve this issue, we propose a MFRL framework that is augmented with the components of model-based RL. Specifically, we propose to estimate not only… ▽ More

    Submitted 7 February, 2021; originally announced February 2021.

  8. arXiv:2102.02386  [pdf, other

    astro-ph.CO astro-ph.IM

    Analysis of Temperature-to-Polarization Leakage in BICEP3 and Keck CMB Data from 2016 to 2018

    Authors: The BICEP/Keck Collaboration, :, T. St. Germaine, P. A. R. Ade, Z. Ahmed, M. Amiri, D. Barkats, R. Basu Thakur, C. A. Bischoff, J. J. Bock, H. Boenish, E. Bullock, V. Buza, J. R. Cheshire, J. Connors, J. Cornelison, M. Crumrine, A. Cukierman, E. Denison, M. Dierickx, L. Duband, M. Eiben, S. Fatigoni, J. P. Filippini, S. Fliescher , et al. (64 additional authors not shown)

    Abstract: The BICEP/Keck Array experiment is a series of small-aperture refracting telescopes observing degree-scale Cosmic Microwave Background polarization from the South Pole in search of a primordial $B$-mode signature. As a pair differencing experiment, an important systematic that must be controlled is the differential beam response between the co-located, orthogonally polarized detectors. We use high… ▽ More

    Submitted 3 February, 2021; originally announced February 2021.

    Comments: 9 pages, 4 figures

    Journal ref: Proc. SPIE 11453, Millimeter, Submillimeter, and Far-Infrared Detectors and Instrumentation for Astronomy X, 114532E (15 December 2020)

  9. arXiv:2101.12409  [pdf, other

    cs.CL

    Few-Shot Domain Adaptation for Grammatical Error Correction via Meta-Learning

    Authors: Shengsheng Zhang, Yaping Huang, Yun Chen, Liner Yang, Chencheng Wang, Erhong Yang

    Abstract: Most existing Grammatical Error Correction (GEC) methods based on sequence-to-sequence mainly focus on how to generate more pseudo data to obtain better performance. Few work addresses few-shot GEC domain adaptation. In this paper, we treat different GEC domains as different GEC tasks and propose to extend meta-learning to few-shot GEC domain adaptation without using any pseudo data. We exploit a… ▽ More

    Submitted 29 January, 2021; originally announced January 2021.

  10. arXiv:2101.12149  [pdf, other

    physics.comp-ph cs.DC physics.acc-ph

    Porting WarpX to GPU-accelerated platforms

    Authors: A. Myers, A. Almgren, L. D. Amorim, J. Bell, L. Fedeli, L. Ge, K. Gott, D. P. Grote, M. Hogan, A. Huebl, R. Jambunathan, R. Lehe, C. Ng, M. Rowan, O. Shapoval, M. Thévenet, J. -L. Vay, H. Vincenti, E. Yang, N. Zaïm, W. Zhang, Y. Zhao, E. Zoni

    Abstract: WarpX is a general purpose electromagnetic particle-in-cell code that was originally designed to run on many-core CPU architectures. We describe the strategy followed to allow WarpX to use the GPU-accelerated nodes on OLCF's Summit supercomputer, a strategy we believe will extend to the upcoming machines Frontier and Aurora. We summarize the challenges encountered, lessons learned, and give curren… ▽ More

    Submitted 2 September, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: 11 pages, 5 figures, accepted by Parallel Computing. Minor revisions, results unchanged

    Journal ref: Parallel Computing, Volume 108, 2021, 102833

  11. arXiv:2101.09294  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Censorship of Online Encyclopedias: Implications for NLP Models

    Authors: Eddie Yang, Margaret E. Roberts

    Abstract: While artificial intelligence provides the backbone for many tools people use around the world, recent work has brought to attention that the algorithms powering AI are not free of politics, stereotypes, and bias. While most work in this area has focused on the ways in which AI can exacerbate existing inequalities and discrimination, very little work has studied how governments actively shape trai… ▽ More

    Submitted 22 January, 2021; originally announced January 2021.

    Comments: Accepted for publication at ACM FAccT 2021

  12. arXiv:2012.09363  [pdf, other

    astro-ph.CO astro-ph.IM

    Observing low elevation sky and the CMB Cold Spot with BICEP3 at the South Pole

    Authors: J. Kang, P. A. R. Ade, Z. Ahmed, M. Amiri, D. Barkats, R. Basu Thakur, C. A. Bischoff, J. J. Bock, H. Boenish, E. Bullock, V. Buza, J. R. Cheshire, J. Connors, J. Cornelison, M. Crumrine, A. Cukierman, E. Denison, M. Dierickx, L. Duband, M. Eiben, S. Fatigoni, J. P. Filippini, S. Fliescher, N. Goeckner-Wald, D. C. Goldfinger , et al. (62 additional authors not shown)

    Abstract: BICEP3 is a 520 mm aperture on-axis refracting telescope at the South Pole, which observes the polarization of the cosmic microwave background (CMB) at 95 GHz to search for the B-mode signal from inflationary gravitational waves. In addition to this main target, we have developed a low-elevation observation strategy to extend coverage of the Southern sky at the South Pole, where BICEP3 can quickly… ▽ More

    Submitted 17 December, 2020; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: 12 pages, 10 figures; Figure 7 shows the correct file

    Journal ref: Proc. SPIE 11453, Millimeter, Submillimeter, and Far-Infrared Detectors and Instrumentation for Astronomy X, 114532D (13 December 2020)

  13. arXiv:2012.05934  [pdf, other

    astro-ph.IM

    Polarization Calibration of the BICEP3 CMB polarimeter at the South Pole

    Authors: J. Cornelison, P. A. R. Ade, Z. Ahmed, M. Amiri, D. Barkats, R. Basu Thakur, C. A. Bischoff, J. J. Bock, H. Boenish, E. Bullock, V. Buza, J. R. Cheshire, J. Connors, M. Crumrine, A. Cukierman, E. Denison, M. Dierickx, L. Duband, M. Eiben, S. Fatigoni, J. P. Filippini, S. Fliescher, N. Goeckner-Wald, D. C. Goldfinger, J. A. Grayson , et al. (62 additional authors not shown)

    Abstract: The BICEP3 CMB Polarimeter is a small-aperture refracting telescope located at the South Pole and is specifically designed to search for the possible signature of inflationary gravitational waves in the Cosmic Microwave Background (CMB). The experiment measures polarization on the sky by differencing the signal of co-located, orthogonally polarized antennas coupled to Transition Edge Sensor (TES)… ▽ More

    Submitted 10 December, 2020; originally announced December 2020.

    Comments: Proceedings submitted to SPIE 2020 (AS111). 12 pages, 5 figures, 2 tables

  14. arXiv:2012.04047  [pdf, other

    astro-ph.IM astro-ph.CO physics.ins-det

    Receiver development for BICEP Array, a next-generation CMB polarimeter at the South Pole

    Authors: L. Moncelsi, P. A. R. Ade, Z. Ahmed, M. Amiri, D. Barkats, R. Basu Thakur, C. A. Bischoff, J. J. Bock, V. Buza, J. Cheshire, J. Connors, J. Cornelison, M. Crumrine, A. Cukierman, E. V. Denison, M. Dierickx, L. Duband, M. Eiben, S. Fatigoni, J. P. Filippini, N. Goeckner-Wald, D. C. Goldfinger, J. Grayson, P. Grimes, G. Hall , et al. (50 additional authors not shown)

    Abstract: A detection of curl-type ($B$-mode) polarization of the primary CMB would be direct evidence for the inflationary paradigm of the origin of the Universe. The BICEP/Keck Array (BK) program targets the degree angular scales, where the power from primordial $B$-mode polarization is expected to peak, with ever-increasing sensitivity and has published the most stringent constraints on inflation to date… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

    Comments: Proceedings of SPIE 2020 (AS111). This article supersedes arXiv:1808.00568 and arXiv:2002.05228

  15. arXiv:2011.14203  [pdf, other

    cs.AR cs.CL

    EdgeBERT: Sentence-Level Energy Optimizations for Latency-Aware Multi-Task NLP Inference

    Authors: Thierry Tambe, Coleman Hooper, Lillian Pentecost, Tianyu Jia, En-Yu Yang, Marco Donato, Victor Sanh, Paul N. Whatmough, Alexander M. Rush, David Brooks, Gu-Yeon Wei

    Abstract: Transformer-based language models such as BERT provide significant accuracy improvement for a multitude of natural language processing (NLP) tasks. However, their hefty computational and memory demands make them challenging to deploy to resource-constrained edge platforms with strict latency requirements. We present EdgeBERT, an in-depth algorithm-hardware co-design for latency-aware energy optimi… ▽ More

    Submitted 5 September, 2021; v1 submitted 28 November, 2020; originally announced November 2020.

    Comments: 12 pages plus references. Paper to appear at the 54th IEEE/ACM International Symposium on Microarchitecture (MICRO 2021)

  16. arXiv:2011.13720  [pdf, other

    cond-mat.str-el cond-mat.mes-hall hep-th quant-ph

    Topological entanglement entropy of interacting disordered zigzag graphene ribbons

    Authors: Young Heon Kim, Hye Jeong Lee, S. -R. Eric Yang

    Abstract: Interacting disordered zigzag graphene nanoribbons have fractional charges, are quasi-one-dimensional, and display an exponentially small gap. Our numerical computations showed that the topological entanglement entropy of these systems has a small finite but universal value, independent of the strength of the interaction and the disorder. The result that was obtained for the topological entangleme… ▽ More

    Submitted 21 March, 2021; v1 submitted 27 November, 2020; originally announced November 2020.

    Comments: 4 pages, 4 figures, new figures added; to be published in PRB

    Journal ref: Phys. Rev. B 103, 115151 (2021)

  17. arXiv:2011.00730  [pdf

    cond-mat.mtrl-sci

    Thermal Conductivities and Interfacial Thermal Conductance of 1- to 3-Layer WSe$_2$

    Authors: Elham Easy, Yuan Gao, Yingtao Wang, Dingkai Yan, Seyed M. Goushehgir, Eui-Hyeok Yang, Baoxing Xu, Xian Zhang

    Abstract: Atomically thin materials such as graphene and semiconducting transition metal dichalcogenides have attracted extensive interest in recent years, motivating investigation into multiple properties. In this work, we used the opto thermal Raman technique to measure the thermal transport properties of a popular TMDC material WSe$_2$, in single atomic layer, bilayer, and trilayer forms.

    Submitted 7 March, 2021; v1 submitted 30 October, 2020; originally announced November 2020.

  18. arXiv:2010.15269  [pdf, other

    eess.IV cs.CV cs.LG

    GloFlow: Global Image Alignment for Creation of Whole Slide Images for Pathology from Video

    Authors: Viswesh Krishna, Anirudh Joshi, Philip L. Bulterys, Eric Yang, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: The application of deep learning to pathology assumes the existence of digital whole slide images of pathology slides. However, slide digitization is bottlenecked by the high cost of precise motor stages in slide scanners that are needed for position information used for slide stitching. We propose GloFlow, a two-stage method for creating a whole slide image using optical flow-based image registra… ▽ More

    Submitted 12 November, 2020; v1 submitted 28 October, 2020; originally announced October 2020.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2020 - Extended Abstract

  19. arXiv:2010.15054  [pdf, other

    cs.LG cs.AI cs.CV

    Attribution Preservation in Network Compression for Reliable Network Interpretation

    Authors: Geondo Park, June Yong Yang, Sung Ju Hwang, Eunho Yang

    Abstract: Neural networks embedded in safety-sensitive applications such as self-driving cars and wearable health monitors rely on two important techniques: input attribution for hindsight analysis and network compression to reduce its size for edge-computing. In this paper, we show that these seemingly unrelated techniques conflict with each other as network compression deforms the produced attributions, w… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: NeurIPS 2020. Code: https://github.com/GeondoPark/attribute-preserve

  20. arXiv:2010.08776  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    The NVIDIA PilotNet Experiments

    Authors: Mariusz Bojarski, Chenyi Chen, Joyjit Daw, Alperen Değirmenci, Joya Deri, Bernhard Firner, Beat Flepp, Sachin Gogri, Jesse Hong, Lawrence Jackel, Zhenhua Jia, BJ Lee, Bo Liu, Fei Liu, Urs Muller, Samuel Payne, Nischal Kota Nagendra Prasad, Artem Provodin, John Roach, Timur Rvachov, Neha Tadimeti, Jesper van Engelen, Haiguang Wen, Eric Yang, Zongyi Yang

    Abstract: Four years ago, an experimental system known as PilotNet became the first NVIDIA system to steer an autonomous car along a roadway. This system represents a departure from the classical approach for self-driving in which the process is manually decomposed into a series of modules, each performing a different task. In PilotNet, on the other hand, a single deep neural network (DNN) takes pixels as i… ▽ More

    Submitted 17 October, 2020; originally announced October 2020.

  21. arXiv:2010.05533  [pdf, other

    cs.CL

    Toward Cross-Lingual Definition Generation for Language Learners

    Authors: Cunliang Kong, Liner Yang, Tianzuo Zhang, Qinan Fan, Zhenghao Liu, Yun Chen, Erhong Yang

    Abstract: Generating dictionary definitions automatically can prove useful for language learners. However, it's still a challenging task of cross-lingual definition generation. In this work, we propose to generate definitions in English for words in various languages. To achieve this, we present a simple yet effective approach based on publicly available pretrained language models. In this approach, models… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

  22. arXiv:2010.02727  [pdf

    cs.PL cs.LG cs.SE

    Symbolic Techniques for Deep Learning: Challenges and Opportunities

    Authors: Belinda Fang, Elaine Yang, Fei Xie

    Abstract: As the number of deep learning frameworks increase and certain ones gain popularity, it spurs the discussion of what methodologies are employed by these frameworks and the reasoning behind them. The goal of this survey is to study how symbolic techniques are utilized in deep learning. To do this, we look at some of the most popular deep learning frameworks being used today, including TensorFlow, K… ▽ More

    Submitted 1 October, 2020; originally announced October 2020.

  23. CMB-S4: Forecasting Constraints on Primordial Gravitational Waves

    Authors: CMB-S4 Collaboration, :, Kevork Abazajian, Graeme E. Addison, Peter Adshead, Zeeshan Ahmed, Daniel Akerib, Aamir Ali, Steven W. Allen, David Alonso, Marcelo Alvarez, Mustafa A. Amin, Adam Anderson, Kam S. Arnold, Peter Ashton, Carlo Baccigalupi, Debbie Bard, Denis Barkats, Darcy Barron, Peter S. Barry, James G. Bartlett, Ritoban Basu Thakur, Nicholas Battaglia, Rachel Bean, Chris Bebek , et al. (212 additional authors not shown)

    Abstract: CMB-S4---the next-generation ground-based cosmic microwave background (CMB) experiment---is set to significantly advance the sensitivity of CMB measurements and enhance our understanding of the origin and evolution of the Universe, from the highest energies at the dawn of time through the growth of structure to the present day. Among the science cases pursued with CMB-S4, the quest for detecting p… ▽ More

    Submitted 27 August, 2020; originally announced August 2020.

    Comments: 24 pages, 8 figures, 9 tables, submitted to ApJ. arXiv admin note: text overlap with arXiv:1907.04473

  24. Reconstructing Highly-twisted Magnetic Fields

    Authors: Victor M. Demcsak, Michael S. Wheatland, Alpha Mastrano, Kai E. Yang

    Abstract: We investigate the ability of a nonlinear force-free code to calculate highly-twisted magnetic field configurations using the Titov and Démoulin (1999) equilibrium field as a test case. The code calculates a force-free field using boundary conditions on the normal component of the field in the lower boundary, and the normal component of the current density over one polarity of the field in the low… ▽ More

    Submitted 7 August, 2020; originally announced August 2020.

    Comments: 23 pages, 6 figures, accepted by Solar Physics

  25. arXiv:2008.02956  [pdf, other

    cs.LG stat.ML

    Bootstrapping Neural Processes

    Authors: Juho Lee, Yoonho Lee, Jungtaek Kim, Eunho Yang, Sung Ju Hwang, Yee Whye Teh

    Abstract: Unlike in the traditional statistical modeling for which a user typically hand-specify a prior, Neural Processes (NPs) implicitly define a broad class of stochastic processes with neural networks. Given a data stream, NP learns a stochastic process that best describes the data. While this "data-driven" way of learning stochastic processes has proven to handle various types of data, NPs still rely… ▽ More

    Submitted 27 October, 2020; v1 submitted 6 August, 2020; originally announced August 2020.

    Comments: Published in Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS 2020) Code is available at https://github.com/juho-lee/bnp

  26. arXiv:2008.02953  [pdf, other

    cs.LG stat.ML

    Neural Complexity Measures

    Authors: Yoonho Lee, Juho Lee, Sung Ju Hwang, Eunho Yang, Seungjin Choi

    Abstract: While various complexity measures for deep neural networks exist, specifying an appropriate measure capable of predicting and explaining generalization in deep networks has proven challenging. We propose Neural Complexity (NC), a meta-learning framework for predicting generalization. Our model learns a scalar complexity measure through interactions with many heterogeneous tasks in a data-driven wa… ▽ More

    Submitted 23 October, 2020; v1 submitted 6 August, 2020; originally announced August 2020.

    Comments: Published in Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS 2020) Code is available at https://github.com/yoonholee/neural-complexity

  27. arXiv:2007.14477  [pdf, ps, other

    cs.CL

    GUIR at SemEval-2020 Task 12: Domain-Tuned Contextualized Models for Offensive Language Detection

    Authors: Sajad Sotudeh, Tong Xiang, Hao-Ren Yao, Sean MacAvaney, Eugene Yang, Nazli Goharian, Ophir Frieder

    Abstract: Offensive language detection is an important and challenging task in natural language processing. We present our submissions to the OffensEval 2020 shared task, which includes three English sub-tasks: identifying the presence of offensive language (Sub-task A), identifying the presence of target in offensive language (Sub-task B), and identifying the categories of the target (Sub-task C). Our expe… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: SemEval 2020

  28. arXiv:2007.12020  [pdf, other

    cs.LG cs.AI stat.ML

    Few-shot Visual Reasoning with Meta-analogical Contrastive Learning

    Authors: Youngsung Kim, Jinwoo Shin, Eunho Yang, Sung Ju Hwang

    Abstract: While humans can solve a visual puzzle that requires logical reasoning by observing only few samples, it would require training over large amount of data for state-of-the-art deep reasoning models to obtain similar performance on the same task. In this work, we propose to solve such a few-shot (or low-shot) visual reasoning problem, by resorting to analogical reasoning, which is a unique human abi… ▽ More

    Submitted 23 July, 2020; originally announced July 2020.

  29. arXiv:2007.11362  [pdf, other

    cs.LG physics.comp-ph stat.ML

    Time-Reversal Symmetric ODE Network

    Authors: In Huh, Eunho Yang, Sung Ju Hwang, Jinwoo Shin

    Abstract: Time-reversal symmetry, which requires that the dynamics of a system should not change with the reversal of time axis, is a fundamental property that frequently holds in classical and quantum mechanics. In this paper, we propose a novel loss function that measures how well our ordinary differential equation (ODE) networks comply with this time-reversal symmetry; it is formally defined by the discr… ▽ More

    Submitted 6 January, 2021; v1 submitted 22 July, 2020; originally announced July 2020.

    Comments: 15 pages; accepted to NeurIPS 2020; Code is available at https://github.com/inhuh/trs-oden; v3: references added, typo corrected

  30. arXiv:2007.08844  [pdf, other

    cs.LG stat.ML

    Distribution Aligning Refinery of Pseudo-label for Imbalanced Semi-supervised Learning

    Authors: Jaehyung Kim, Youngbum Hur, Sejun Park, Eunho Yang, Sung Ju Hwang, Jinwoo Shin

    Abstract: While semi-supervised learning (SSL) has proven to be a promising way for leveraging unlabeled data when labeled data is scarce, the existing SSL algorithms typically assume that training class distributions are balanced. However, these SSL algorithms trained under imbalanced class distributions can severely suffer when generalizing to a balanced testing criterion, since they utilize biased pseudo… ▽ More

    Submitted 13 September, 2021; v1 submitted 17 July, 2020; originally announced July 2020.

    Comments: 19 pages; NeurIPS 2020

  31. arXiv:2007.07484  [pdf, other

    cs.LG math.OC stat.ML

    A General Family of Stochastic Proximal Gradient Methods for Deep Learning

    Authors: Jihun Yun, Aurelie C. Lozano, Eunho Yang

    Abstract: We study the training of regularized neural networks where the regularizer can be non-smooth and non-convex. We propose a unified framework for stochastic proximal gradient descent, which we term ProxGen, that allows for arbitrary positive preconditioners and lower semi-continuous regularizers. Our framework encompasses standard stochastic proximal gradient methods without preconditioners as speci… ▽ More

    Submitted 15 July, 2020; originally announced July 2020.

    Comments: 21 pages

  32. arXiv:2007.07358  [pdf, other

    cs.LG stat.ML

    Learning to Sample with Local and Global Contexts in Experience Replay Buffer

    Authors: Youngmin Oh, Kimin Lee, Jinwoo Shin, Eunho Yang, Sung Ju Hwang

    Abstract: Experience replay, which enables the agents to remember and reuse experience from the past, has played a significant role in the success of off-policy reinforcement learning (RL). To utilize the experience replay efficiently, the existing sampling methods allow selecting out more meaningful experiences by imposing priorities on them based on certain metrics (e.g. TD-error). However, they may resul… ▽ More

    Submitted 7 April, 2021; v1 submitted 14 July, 2020; originally announced July 2020.

  33. arXiv:2007.00884   

    cs.LG stat.ML

    A Revision of Neural Tangent Kernel-based Approaches for Neural Networks

    Authors: Kyung-Su Kim, Aurélie C. Lozano, Eunho Yang

    Abstract: Recent theoretical works based on the neural tangent kernel (NTK) have shed light on the optimization and generalization of over-parameterized networks, and partially bridge the gap between their practical success and classical learning theory. Especially, using the NTK-based approach, the following three representative results were obtained: (1) A training error bound was derived to show that net… ▽ More

    Submitted 6 August, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: We spotted an error in the proof of Lemma A.4 and are investigating whether this can be corrected. Furthermore, the authors of the original paper have informed us that they are fixing the lemma upon which our theorem 3.2 builds. Therefore, we are removing the current version of our paper

  34. arXiv:2007.00873  [pdf, other

    cs.LG stat.ML

    Compressed Sensing via Measurement-Conditional Generative Models

    Authors: Kyung-Su Kim, Jung Hyun Lee, Eunho Yang

    Abstract: A pre-trained generator has been frequently adopted in compressed sensing (CS) due to its ability to effectively estimate signals with the prior of NNs. In order to further refine the NN-based prior, we propose a framework that allows the generator to utilize additional information from a given measurement for prior learning, thereby yielding more accurate prediction for signals. As our framework… ▽ More

    Submitted 2 November, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

  35. arXiv:2006.14222  [pdf, other

    cs.LG stat.ML

    Set Based Stochastic Subsampling

    Authors: Bruno Andreis, Seanie Lee, A. Tuan Nguyen, Juho Lee, Eunho Yang, Sung Ju Hwang

    Abstract: Deep models are designed to operate on huge volumes of high dimensional data such as images. In order to reduce the volume of data these models must process, we propose a set-based two-stage end-to-end neural subsampling model that is jointly optimized with an \textit{arbitrary} downstream task network (e.g. classifier). In the first stage, we efficiently subsample \textit{candidate elements} usin… ▽ More

    Submitted 30 May, 2022; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: 20 pages

  36. arXiv:2006.12777  [pdf, other

    cs.LG stat.ML

    Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Learning

    Authors: A. Tuan Nguyen, Hyewon Jeong, Eunho Yang, Sung Ju Hwang

    Abstract: Although recent multi-task learning methods have shown to be effective in improving the generalization of deep neural networks, they should be used with caution for safety-critical applications, such as clinical risk prediction. This is because even if they achieve improved task-average performance, they may still yield degraded performance on individual tasks, which may be critical (e.g., predict… ▽ More

    Submitted 18 February, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: AAAI 2021. The first two authors contributed equally to this work. 10 pages, 4 figures, 4 tables

  37. arXiv:2006.12139  [pdf, other

    cs.LG stat.ML

    Rapid Structural Pruning of Neural Networks with Set-based Task-Adaptive Meta-Pruning

    Authors: Minyoung Song, Jaehong Yoon, Eunho Yang, Sung Ju Hwang

    Abstract: As deep neural networks are growing in size and being increasingly deployed to more resource-limited devices, there has been a recent surge of interest in network pruning methods, which aim to remove less important weights or activations of a given network. A common limitation of most existing pruning techniques, is that they require pre-training of the network at least once before pruning, and th… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

  38. arXiv:2006.12097  [pdf, other

    cs.LG stat.ML

    Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint Learning

    Authors: Wonyong Jeong, Jaehong Yoon, Eunho Yang, Sung Ju Hwang

    Abstract: While existing federated learning approaches mostly require that clients have fully-labeled data to train on, in realistic settings, data obtained at the client-side often comes without any accompanying labels. Such deficiency of labels may result from either high labeling cost, or difficulty of annotation due to the requirement of expert knowledge. Thus the private data at each client may be eith… ▽ More

    Submitted 29 March, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Journal ref: International Conference on Learning Representations (ICLR 2021), International Workshop on Federated Learning for User Privacy and Data Confidentiality in Conjunction with ICML 2020 (FL-ICML'20)

  39. arXiv:2006.05419  [pdf, other

    cs.LG cs.HC stat.ML

    Cost-effective Interactive Attention Learning with Neural Attention Processes

    Authors: Jay Heo, Junhyeon Park, Hyewon Jeong, Kwang Joon Kim, Juho Lee, Eunho Yang, Sung Ju Hwang

    Abstract: We propose a novel interactive learning framework which we refer to as Interactive Attention Learning (IAL), in which the human supervisors interactively manipulate the allocated attentions, to correct the model's behavior by updating the attention-generating network. However, such a model is prone to overfitting due to scarcity of human annotations, and requires costly retraining. Moreover, it is… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

  40. arXiv:2005.00304  [pdf

    cond-mat.str-el cond-mat.mtrl-sci

    Thickness dependence of electronic and crystal structures in VO$_2$ ultrathin films: suppression of the collaborative Mott-Peierls transition

    Authors: D. Shiga, B. E. Yang, N. Hasegawa, T. Kanda, R. Tokunaga, K. Yoshimatsu, R. Yukawa, M. Kitamura, K. Horiba, H. Kumigashira

    Abstract: Through ${in~situ}$ photoemission spectroscopy, we investigated the change in the electronic and crystal structures of dimensionality-controlled VO$_2$ films coherently grown on TiO$_2$(001) substrates. In the nanostructured films, the balance between the instabilities of a bandlike Peierls transition and a Mott transition is controlled as a function of thickness. The characteristic spectral chang… ▽ More

    Submitted 1 May, 2020; originally announced May 2020.

    Comments: 30 pages, 4 main figures, 4 supplementary figures

    Journal ref: Phys. Rev. B 102, 115114 (2020)

  41. arXiv:2004.14125  [pdf, ps, other

    cond-mat.str-el cond-mat.mes-hall cond-mat.stat-mech hep-th

    Topologically ordered zigzag nanoribbon: $e/2$ fractional edge charge, spin-charge separation, and ground state degeneracy

    Authors: S. -R. Eric Yang, Min-Chul Cha, Hye Jeong Lee, Young Heon Kim

    Abstract: We numerically compute the density of states (DOS) of interacting disordered zigzag graphene nanoribbon (ZGNR) having midgap states showing $e/2$ fractional edge charges. The computed Hartree-Fock DOS is linear at the critical disorder strength where the gap vanishes. This implies an $I\mbox{-}V$ curve of $I\propto V^2$. Thus, $I\mbox{-}V$ curve measurement may yield evidence of fractional charges… ▽ More

    Submitted 22 July, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: 10 pages, 18 figures, Published version, Phys. Rev. Research 2, 033109 (2020)

  42. Self-consistent Nonlinear Force-free Field Reconstruction from Weighted Boundary Conditions

    Authors: Alpha Mastrano, Kai E. Yang, Michael S. Wheatland

    Abstract: Vector magnetogram data are often used as photospheric boundary conditions for force-free coronal magnetic field extrapolations. In general, however, vector magnetogram data are not consistent with the force-free assumption. In this article, we demonstrate a way to deal with inconsistent boundary data, by generalizing the "self-consistency procedure" of Wheatland & Regnier (2009). In that procedur… ▽ More

    Submitted 26 April, 2020; originally announced April 2020.

    Comments: 12 pages, 5 figures, submitted to Solar Physics

  43. Relative Magnetic Helicity Based on a Periodic Potential Field

    Authors: Kai E. Yang, Michael S. Wheatland, Stuart A. Gilchrist

    Abstract: Magnetic helicity is conserved under ideal magnetohydrodynamics (MHD) and quasi-conserved even under a resistive process. The standard definition for magnetic helicity cannot be applied directly to an open magnetic field in a volume, because it is gauge-dependent. Instead, the relative magnetic helicity is widely used. We find that the energy of a potential magnetic field in a rectangular domain w… ▽ More

    Submitted 18 April, 2020; originally announced April 2020.

    Comments: 33 pages, 7 figures, accepted by ApJ

  44. arXiv:2004.07955  [pdf, other

    cs.CR cs.CV cs.LG

    Targeted Attack for Deep Hashing based Retrieval

    Authors: Jiawang Bai, Bin Chen, Yiming Li, Dongxian Wu, Weiwei Guo, Shu-tao Xia, En-hui Yang

    Abstract: The deep hashing based retrieval method is widely adopted in large-scale image and video retrieval. However, there is little investigation on its security. In this paper, we propose a novel method, dubbed deep hashing targeted attack (DHTA), to study the targeted attack on such retrieval. Specifically, we first formulate the targeted attack as a point-to-set optimization, which minimizes the avera… ▽ More

    Submitted 23 July, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

    Comments: Accepted by ECCV 2020 as Oral

  45. arXiv:2003.03196  [pdf, other

    cs.LG stat.ML

    Federated Continual Learning with Weighted Inter-client Transfer

    Authors: Jaehong Yoon, Wonyong Jeong, Giwoong Lee, Eunho Yang, Sung Ju Hwang

    Abstract: There has been a surge of interest in continual learning and federated learning, both of which are important in deep neural networks in real-world scenarios. Yet little research has been done regarding the scenario where each client learns on a sequence of tasks from a private local data stream. This problem of federated continual learning poses new challenges to continual learning, such as utiliz… ▽ More

    Submitted 14 June, 2021; v1 submitted 6 March, 2020; originally announced March 2020.

    Comments: ICML 2021

  46. arXiv:2002.06561  [pdf, other

    cs.LG cs.IR stat.ML

    Generalized Embedding Machines for Recommender Systems

    Authors: Enneng Yang, Xin Xin, Li Shen, Guibing Guo

    Abstract: Factorization machine (FM) is an effective model for feature-based recommendation which utilizes inner product to capture second-order feature interactions. However, one of the major drawbacks of FM is that it couldn't capture complex high-order interaction signals. A common solution is to change the interaction function, such as stacking deep neural networks on the top of FM. In this work, we pro… ▽ More

    Submitted 16 February, 2020; originally announced February 2020.

    Comments: 8 pages

    Journal ref: Machine Intelligence Research (2024): 1-14

  47. Optical Design and Characterization of 40-GHz Detector and Module for the BICEP Array

    Authors: A. Soliman, P. A. R. Ade, Z. Ahmed, M. Amiri, D. Barkats, R. Basu Thakur, C. A. Bischoff, J. J. Bock, H. Boenish, E. Bullock, V. Buza, J. Cheshire, J. Connors, J. Cornelison, M. Crumrine, A. Cukierman, M. Dierickx, L. Duband, S. Fatigoni, J. P. Filippini, G. Hall, M. Halpern, S. Harrison, S. Henderson, S. R. Hildebrandt , et al. (44 additional authors not shown)

    Abstract: Families of cosmic inflation models predict a primordial gravitational-wave background that imprints B-mode polarization pattern in the Cosmic Microwave Background (CMB). High sensitivity instruments with wide frequency coverage and well-controlled systematic errors are needed to constrain the faint B-mode amplitude. We have developed antenna-coupled Transition Edge Sensor (TES) arrays for high-se… ▽ More

    Submitted 12 February, 2020; originally announced February 2020.

    Comments: 8 pages, 7 figures, Accepted by the Journal of Low Temperature Physics (Proceedings of the 18th International Workshop on Low Temperature Detectors)

  48. arXiv:2002.05228  [pdf, other

    astro-ph.IM astro-ph.CO

    Design and performance of the first BICEP Array receiver

    Authors: A. Schillaci, P. A. R. Ade, Z. Ahmed, M. Amiri, D. Barkats, R. Basu Thakur, C. A. Bischoff, J. J. Bock, H. Boenish, E. Bullock, V. Buza, J. Cheshire, J. Connors, J. Cornelison, M. Crumrine, A. Cukierman, M. Dierickx, L. Duband, S. Fatigoni, J. P. Filippini, G. Hall, M. Halpern, S. Harrison, S. Henderson, S. R. Hildebrandt , et al. (44 additional authors not shown)

    Abstract: Branches of cosmic inflationary models, such as slow-roll inflation, predict a background of primordial gravitational waves that imprints a unique odd-parity B-mode pattern in the Cosmic Microwave Background (CMB) at amplitudes that are within experimental reach. The BICEP/Keck (BK) experiment targets this primordial signature, the amplitude of which is parameterized by the tensor-to-scalar ratio… ▽ More

    Submitted 12 February, 2020; originally announced February 2020.

    Comments: 9 pages, 5 figures, presented at LTD18 in Milan (July 2019), accepted on JLTP (February 2020)

  49. arXiv:2002.05219  [pdf, other

    astro-ph.IM astro-ph.CO physics.ins-det

    Characterizing the Sensitivity of 40 GHz TES Bolometers for BICEP Array

    Authors: C. Zhang, P. A. R. Ade, Z. Ahmed, M. Amiri, D. Barkats, R. Basu Thakur, C. A. Bischoff, J. J. Bock, H. Boenish, E. Bullock, V. Buza, J. Cheshire, J. Connors, J. Cornelison, M. Crumrine, A. Cukierman, M. Dierickx, L. Duband, S. Fatigoni, J. P. Filippini, G. Hall, M. Halpern, S. Harrison, S. Henderson, S. R. Hildebrandt , et al. (44 additional authors not shown)

    Abstract: The BICEP/Keck (BK) experiment aims to detect the imprint of primordial gravitational waves in the Cosmic Microwave Background polarization, which would be direct evidence of the inflation theory. While the tensor-to-scalar ratio has been constrained to be r_0.05 < 0.06 at 95% c.l., further improvements on this upper limit are hindered by polarized Galactic foreground emissions and removal of grav… ▽ More

    Submitted 12 February, 2020; originally announced February 2020.

    Comments: Accepted for publication in Journal of Low Temperature Physics

  50. arXiv:2002.05197  [pdf, other

    astro-ph.IM astro-ph.GA

    Optical characterization of the Keck Array and BICEP3 CMB Polarimeters from 2016 to 2019

    Authors: The BICEP/Keck Collaboration, :, T. St Germaine, P. A. R. Ade, Z. Ahmed, M. Amiri, D. Barkats, R. Basu Thakur, C. A. Bischoff, J. J. Bock, H. Boenish, E. Bullock, V. Buza, J. Cheshire, J. Connors, J. Cornelison, M. Crumrine, A. Cukierman, M. Dierickx, L. Duband, S. Fatigoni, J. P. Filippini, S. Fliescher, J. A. Grayson, G. Hall , et al. (50 additional authors not shown)

    Abstract: The BICEP/Keck experiment (BK) is a series of small-aperture refracting telescopes observing degree-scale Cosmic Microwave Background (CMB) polarization from the South Pole in search of a primordial $B$-mode signature. This $B$-mode signal arises from primordial gravitational waves interacting with the CMB, and has amplitude parametrized by the tensor-to-scalar ratio $r$. Since 2016, BICEP3 and th… ▽ More

    Submitted 12 February, 2020; originally announced February 2020.

    Comments: 8 pages, 3 figures. Accepted by the Journal of Low Temperature Physics (Proceedings of the 18th International Workshop on Low Temperature Detectors)