Zum Hauptinhalt springen

Showing 1–22 of 22 results for author: Hwang, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.14203  [pdf, other

    cs.LG cs.AI physics.chem-ph

    GLaD: Synergizing Molecular Graphs and Language Descriptors for Enhanced Power Conversion Efficiency Prediction in Organic Photovoltaic Devices

    Authors: Thao Nguyen, Tiara Torres-Flores, Changhyun Hwang, Carl Edwards, Ying Diao, Heng Ji

    Abstract: This paper presents a novel approach for predicting Power Conversion Efficiency (PCE) of Organic Photovoltaic (OPV) devices, called GLaD: synergizing molecular Graphs and Language Descriptors for enhanced PCE prediction. Due to the lack of high-quality experimental data, we collect a dataset consisting of 500 pairs of OPV donor and acceptor molecules along with their corresponding PCE values, whic… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: In progress

  2. arXiv:2402.06787  [pdf, other

    cs.NI cs.DC cs.LG

    ForestColl: Efficient Collective Communications on Heterogeneous Network Fabrics

    Authors: Liangyu Zhao, Saeed Maleki, Ziyue Yang, Hossein Pourreza, Aashaka Shah, Changho Hwang, Arvind Krishnamurthy

    Abstract: As modern DNN models grow ever larger, collective communications between the accelerators (allreduce, etc.) emerge as a significant performance bottleneck. Designing efficient communication schedules is challenging given today's highly diverse and heterogeneous network fabrics. In this paper, we present ForestColl, a tool that generates efficient schedules for any network topology. ForestColl cons… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2305.18461

  3. arXiv:2310.11654  [pdf, other

    cs.LG stat.ML

    Subject-specific Deep Neural Networks for Count Data with High-cardinality Categorical Features

    Authors: Hangbin Lee, Il Do Ha, Changha Hwang, Youngjo Lee

    Abstract: There is a growing interest in subject-specific predictions using deep neural networks (DNNs) because real-world data often exhibit correlations, which has been typically overlooked in traditional DNN frameworks. In this paper, we propose a novel hierarchical likelihood learning framework for introducing gamma random effects into the Poisson DNN, so as to improve the prediction performance by capt… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  4. arXiv:2309.02685  [pdf, other

    cs.RO cs.AI cs.LG

    Diffusion-EDFs: Bi-equivariant Denoising Generative Modeling on SE(3) for Visual Robotic Manipulation

    Authors: Hyunwoo Ryu, Jiwoo Kim, Hyunseok An, Junwoo Chang, Joohwan Seo, Taehan Kim, Yubin Kim, Chaewon Hwang, Jongeun Choi, Roberto Horowitz

    Abstract: Diffusion generative modeling has become a promising approach for learning robotic manipulation tasks from stochastic human demonstrations. In this paper, we present Diffusion-EDFs, a novel SE(3)-equivariant diffusion-based approach for visual robotic manipulation tasks. We show that our proposed method achieves remarkable data efficiency, requiring only 5 to 10 human demonstrations for effective… ▽ More

    Submitted 28 November, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: 31 pages, 13 figures

  5. arXiv:2308.12066  [pdf, other

    cs.LG cs.AI cs.AR

    Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference

    Authors: Ranggi Hwang, Jianyu Wei, Shijie Cao, Changho Hwang, Xiaohu Tang, Ting Cao, Mao Yang

    Abstract: Large language models (LLMs) based on transformers have made significant strides in recent years, the success of which is driven by scaling up their model size. Despite their high algorithmic performance, the computational and memory requirements of LLMs present unprecedented challenges. To tackle the high compute requirements of LLMs, the Mixture-of-Experts (MoE) architecture was introduced which… ▽ More

    Submitted 27 April, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

  6. arXiv:2206.03382  [pdf, other

    cs.DC cs.CL cs.CV

    Tutel: Adaptive Mixture-of-Experts at Scale

    Authors: Changho Hwang, Wei Cui, Yifan Xiong, Ziyue Yang, Ze Liu, Han Hu, Zilong Wang, Rafael Salas, Jithin Jose, Prabhat Ram, Joe Chau, Peng Cheng, Fan Yang, Mao Yang, Yongqiang Xiong

    Abstract: Sparsely-gated mixture-of-experts (MoE) has been widely adopted to scale deep learning models to trillion-plus parameters with fixed computational cost. The algorithmic performance of MoE relies on its token routing mechanism that forwards each input token to the right sub-models or experts. While token routing dynamically determines the amount of expert workload at runtime, existing systems suffe… ▽ More

    Submitted 5 June, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

  7. Transformer Network-based Reinforcement Learning Method for Power Distribution Network (PDN) Optimization of High Bandwidth Memory (HBM)

    Authors: Hyunwook Park, Minsu Kim, Seongguk Kim, Keunwoo Kim, Haeyeon Kim, Taein Shin, Keeyoung Son, Boogyo Sim, Subin Kim, Seungtaek Jeong, Chulsoon Hwang, Joungho Kim

    Abstract: In this article, for the first time, we propose a transformer network-based reinforcement learning (RL) method for power distribution network (PDN) optimization of high bandwidth memory (HBM). The proposed method can provide an optimal decoupling capacitor (decap) design to maximize the reduction of PDN self- and transfer impedance seen at multiple ports. An attention-based transformer network is… ▽ More

    Submitted 23 August, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: 15 pages, 14 figures, Under review as a journal paper at IEEE Transactions on Microwave and Theory and Techniques (TMTT) Fig. 10 revised; Fig. 14 added

  8. arXiv:2106.10693  [pdf, other

    cs.LG cs.AI

    Fast PDN Impedance Prediction Using Deep Learning

    Authors: Ling Zhang, Jack Juang, Zurab Kiguradze, Bo Pu, Shuai Jin, Songping Wu, Zhiping Yang, Chulsoon Hwang

    Abstract: Modeling and simulating a power distribution network (PDN) for printed circuit boards (PCBs) with irregular board shapes and multi-layer stackup is computationally inefficient using full-wave simulations. This paper presents a new concept of using deep learning for PDN impedance prediction. A boundary element method (BEM) is applied to efficiently calculate the impedance for arbitrary board shape… ▽ More

    Submitted 20 June, 2021; originally announced June 2021.

  9. arXiv:2102.01932  [pdf, other

    cs.RO

    Roughly Collected Dataset for Contact Force Sensing Catheter

    Authors: Seunghyuk Cho, Minsoo Koo, Dongwoo Kim, Juyong Lee, Yeonwoo Jung, Kibyung Nam, Changmo Hwang

    Abstract: With rise of interventional cardiology, Catheter Ablation Therapy (CAT) has established itself as a first-line solution to treat cardiac arrhythmia. Although CAT is a promising technique, cardiologist lacks vision inside the body during the procedure, which may cause serious clinical syndromes. To support accurate clinical procedure, Contact Force Sensing (CFS) system is developed to find a positi… ▽ More

    Submitted 3 February, 2021; originally announced February 2021.

    Comments: 7 pages, 6 figures

  10. arXiv:2006.05148  [pdf, other

    cs.LG cs.CR eess.SP

    XOR Mixup: Privacy-Preserving Data Augmentation for One-Shot Federated Learning

    Authors: MyungJae Shin, Chihoon Hwang, Joongheon Kim, Jihong Park, Mehdi Bennis, Seong-Lyun Kim

    Abstract: User-generated data distributions are often imbalanced across devices and labels, hampering the performance of federated learning (FL). To remedy to this non-independent and identically distributed (non-IID) data problem, in this work we develop a privacy-preserving XOR based mixup data augmentation technique, coined XorMixup, and thereby propose a novel one-shot FL framework, termed XorMixFL. The… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

  11. arXiv:1906.10910  [pdf, other

    cs.LG cs.CL stat.ML

    Creating A Neural Pedagogical Agent by Jointly Learning to Review and Assess

    Authors: Youngnam Lee, Youngduck Choi, Junghyun Cho, Alexander R. Fabbri, Hyunbin Loh, Chanyou Hwang, Yongku Lee, Sang-Wook Kim, Dragomir Radev

    Abstract: Machine learning plays an increasing role in intelligent tutoring systems as both the amount of data available and specialization among students grow. Nowadays, these systems are frequently deployed on mobile applications. Users on such mobile education platforms are dynamic, frequently being added, accessing the application with varying levels of focus, and changing while using the service. The e… ▽ More

    Submitted 1 July, 2019; v1 submitted 26 June, 2019; originally announced June 2019.

    Comments: 9 pages, 9 figures, 7 tables

  12. arXiv:1711.08679  [pdf

    cs.NE cs.LG

    Markov chain Hebbian learning algorithm with ternary synaptic units

    Authors: Guhyun Kim, Vladimir Kornijcuk, Dohun Kim, Inho Kim, Jaewook Kim, Hyo Cheon Woo, Ji Hun Kim, Cheol Seong Hwang, Doo Seok Jeong

    Abstract: In spite of remarkable progress in machine learning techniques, the state-of-the-art machine learning algorithms often keep machines from real-time learning (online learning) due in part to computational complexity in parameter optimization. As an alternative, a learning algorithm to train a memory in real time is proposed, which is named as the Markov chain Hebbian learning algorithm. The algorit… ▽ More

    Submitted 23 November, 2017; originally announced November 2017.

    Comments: 25 pages, 4 figures

  13. arXiv:1706.03475  [pdf, other

    cs.LG stat.ML

    Confident Multiple Choice Learning

    Authors: Kimin Lee, Changho Hwang, KyoungSoo Park, Jinwoo Shin

    Abstract: Ensemble methods are arguably the most trustworthy techniques for boosting the performance of machine learning models. Popular independent ensembles (IE) relying on naive averaging/voting scheme have been of typical choice for most applications involving deep neural networks, but they do not consider advanced collaboration among ensemble models. In this paper, we propose new ensemble methods speci… ▽ More

    Submitted 22 September, 2017; v1 submitted 12 June, 2017; originally announced June 2017.

    Comments: Accepted in ICML 2017

  14. arXiv:1512.05437  [pdf

    cs.IR

    A Method of Passage-Based Document Retrieval in Question Answering System

    Authors: Man-Hung Jong, Chong-Han Ri, Hyok-Chol Choe, Chol-Jun Hwang

    Abstract: We propose a method for using the scoring values of passages to effectively retrieve documents in a Question Answering system. For this, we suggest evaluation function that considers proximity between each question terms in passage. And using this evaluation function , we extract a documents which involves scoring values in the highest collection, as a suitable document for question. The propo… ▽ More

    Submitted 16 December, 2015; originally announced December 2015.

    Comments: 4 pages

  15. arXiv:1512.04653  [pdf

    cs.SE cs.NI

    Using Pi-calculus to Model Dynamic Web Services Composition Based on the Authority Model

    Authors: Sok-Min Han, Un-Chol Pang, Hyok-Chol Choe, Chol-Jun Hwang

    Abstract: There are lots of research works on web service, composition, modeling, verification and other problems. Theses research works are done on the basis of formal methods, such as petri-net, pi-calculus, automata theory, and so on. Pi-calculus is a natural vehicle to model mobility aspect in dynamic web services composition (DWSC). However, it has recently been shown that pi-calculus needs to be exten… ▽ More

    Submitted 15 December, 2015; originally announced December 2015.

    Comments: 11 pages, 3 figures

  16. arXiv:1511.02435  [pdf

    cs.CL

    A Chinese POS Decision Method Using Korean Translation Information

    Authors: Son-Il Kwak, O-Chol Kown, Chang-Sin Kim, Yong-Il Pak, Gum-Chol Son, Chol-Jun Hwang, Hyon-Chol Kim, Hyok-Chol Sin, Gyong-Il Hyon, Sok-Min Han

    Abstract: In this paper we propose a method that imitates a translation expert using the Korean translation information and analyse the performance. Korean is good at tagging than Chinese, so we can use this property in Chinese POS tagging.

    Submitted 7 November, 2015; originally announced November 2015.

    Comments: 6 pages, 0 figures

  17. arXiv:1511.02432  [pdf

    cs.AI

    A Study of an Modeling Method of T-S fuzzy System Based on Moving Fuzzy Reasoning and Its Application

    Authors: Son-Il Kwak, Gang Choe, In-Song Kim, Gyong-Ho Jo, Chol-Jun Hwang

    Abstract: To improve the effectiveness of the fuzzy identification, a structure identification method based on moving rate is proposed for T-S fuzzy model. The proposed method is called "T-S modeling (or T-S fuzzy identification method) based on moving rate". First, to improve the shortcomings of existing fuzzy reasoning methods based on matching degree, the moving rates for s-type, z-type and trapezoidal m… ▽ More

    Submitted 7 November, 2015; originally announced November 2015.

    Comments: 24 pages, 11 figures

  18. arXiv:0910.1639  [pdf, other

    cs.IT

    On the Fundamental Limits of Interweaved Cognitive Radios

    Authors: G. Chung, S. Vishwanath, C. S. Hwang

    Abstract: This paper considers the problem of channel sensing in cognitive radios. The system model considered is a set of N parallel (dis-similar) channels, where each channel at any given time is either available or occupied by a legitimate user. The cognitive radio is permitted to sense channels to determine each of their states as available or occupied. The end goal of this paper is to select the best… ▽ More

    Submitted 8 October, 2009; originally announced October 2009.

    Comments: 7 pages, 3 figures, IEEE Radio and Wireless Symposium, 2010

  19. arXiv:0812.4985  [pdf, other

    cs.IT

    On the Capacity of Partially Cognitive Radios

    Authors: G. Chung, S. Sridharan, S. Vishwanath, C. S. Hwang

    Abstract: This paper considers the problem of cognitive radios with partial-message information. Here, an interference channel setting is considered where one transmitter (the "cognitive" one) knows the message of the other ("legitimate" user) partially. An outer bound on the capacity region of this channel is found for the "weak" interference case (where the interference from the cognitive transmitter to… ▽ More

    Submitted 29 December, 2008; originally announced December 2008.

    Comments: 7 pages,2 figures

  20. arXiv:0810.0882  [pdf, ps, other

    cs.IT

    Asymptotic Eigenvalue Moments of Wishart-Type Random Matrix Without Ergodicity in One Channel Realization

    Authors: Chien-Hwa Hwang

    Abstract: Consider a random matrix whose variance profile is random. This random matrix is ergodic in one channel realization if, for each column and row, the empirical distribution of the squared magnitudes of elements therein converges to a nonrandom distribution. In this paper, noncrossing partition theory is employed to derive expressions for several asymptotic eigenvalue moments (AEM) related quantit… ▽ More

    Submitted 6 October, 2008; originally announced October 2008.

    Comments: 36 pages, 6 figures, submitted to IEEE Transactions on Information Theory, Oct. 2008

  21. arXiv:0709.0259  [pdf, ps, other

    cs.IT

    Spectrum Sensing in Wideband OFDM Cognitive Radios

    Authors: Chien-Hwa Hwang, Shih-Chang Chen

    Abstract: In this paper, detection of the primary user (PU) signal in an orthogonal frequency division multiplexing (OFDM) based cognitive radio (CR) system is addressed. According to the prior knowledge of the PU signal known to the detector, three detection algorithms based on the Neyman-Pearson philosophy are proposed. In the first case, a Gaussian PU signal with completely known probability density fu… ▽ More

    Submitted 6 October, 2008; v1 submitted 3 September, 2007; originally announced September 2007.

    Comments: 30 pages, 7 figures, submitted to IEEE Transactions on Signal Processing, Aug. 2007

  22. arXiv:cs/0609076  [pdf, ps, other

    cs.IT

    Asymptotic Spectral Distribution of Crosscorrelation Matrix in Asynchronous CDMA

    Authors: Chien-Hwa Hwang

    Abstract: Asymptotic spectral distribution (ASD) of the crosscorrelation matrix is investigated for a random spreading short/long-code asynchronous direct sequence-code division multiple access (DS-CDMA) system. The discrete-time decision statistics are obtained as the output samples of a bank of symbol matched filters of all users. The crosscorrelation matrix is studied when the number of symbols transmi… ▽ More

    Submitted 6 October, 2008; v1 submitted 13 September, 2006; originally announced September 2006.

    Comments: 63 pages, 8 figures, submitted to IEEE Transactions on Information Theory, Sept. 2006