Skip to main content

Showing 1–40 of 40 results for author: Sohn, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10478  [pdf, other

    cs.CL cs.AI cs.GR

    From Words to Worlds: Transforming One-line Prompt into Immersive Multi-modal Digital Stories with Communicative LLM Agent

    Authors: Samuel S. Sohn, Danrui Li, Sen Zhang, Che-Jui Chang, Mubbasir Kapadia

    Abstract: Digital storytelling, essential in entertainment, education, and marketing, faces challenges in production scalability and flexibility. The StoryAgent framework, introduced in this paper, utilizes Large Language Models and generative tools to automate and refine digital storytelling. Employing a top-down story drafting and bottom-up asset generation approach, StoryAgent tackles key issues such as… ▽ More

    Submitted 21 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: 16 pages, 13 figures

  2. arXiv:2406.05963  [pdf, other

    cs.CV cs.AI

    Solution for SMART-101 Challenge of CVPR Multi-modal Algorithmic Reasoning Task 2024

    Authors: Jinwoo Ahn, Junhyeok Park, Min-Jun Kim, Kang-Hyeon Kim, So-Yeong Sohn, Yun-Ji Lee, Du-Seong Chang, Yu-Jung Heo, Eun-Sol Kim

    Abstract: In this paper, the solution of HYU MLLAB KT Team to the Multimodal Algorithmic Reasoning Task: SMART-101 CVPR 2024 Challenge is presented. Beyond conventional visual question-answering problems, the SMART-101 challenge aims to achieve human-level multimodal understanding by tackling complex visio-linguistic puzzles designed for children in the 6-8 age group. To solve this problem, we suggest two m… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  3. arXiv:2406.05431  [pdf

    cs.CL

    MaTableGPT: GPT-based Table Data Extractor from Materials Science Literature

    Authors: Gyeong Hoon Yi, Jiwoo Choi, Hyeongyun Song, Olivia Miano, Jaewoong Choi, Kihoon Bang, Byungju Lee, Seok Su Sohn, David Buttler, Anna Hiszpanski, Sang Soo Han, Donghun Kim

    Abstract: Efficiently extracting data from tables in the scientific literature is pivotal for building large-scale databases. However, the tables reported in materials science papers exist in highly diverse forms; thus, rule-based extractions are an ineffective approach. To overcome this challenge, we present MaTableGPT, which is a GPT-based table data extractor from the materials science literature. MaTabl… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  4. arXiv:2404.13027  [pdf, other

    cs.RO

    An Analysis of Driver-Initiated Takeovers during Assisted Driving and their Effect on Driver Satisfaction

    Authors: Robin Schwager, Michael Grimm, Xin Liu, Lukas Ewecker, Tim Bruehl, Tin Stribor Sohn, Soeren Hohmann

    Abstract: During the use of Advanced Driver Assistance Systems (ADAS), drivers can intervene in the active function and take back control due to various reasons. However, the specific reasons for driver-initiated takeovers in naturalistic driving are still not well understood. In order to get more information on the reasons behind these takeovers, a test group study was conducted. There, 17 participants use… ▽ More

    Submitted 10 June, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

  5. arXiv:2403.08978  [pdf, other

    cs.CL cs.LG

    AutoGuide: Automated Generation and Selection of State-Aware Guidelines for Large Language Model Agents

    Authors: Yao Fu, Dong-Ki Kim, Jaekyeom Kim, Sungryull Sohn, Lajanugen Logeswaran, Kyunghoon Bae, Honglak Lee

    Abstract: The primary limitation of large language models (LLMs) is their restricted understanding of the world. This poses significant difficulties for LLM-based agents, particularly in domains where pre-trained LLMs lack sufficient knowledge. In this paper, we introduce a novel framework, called AutoGuide, that bridges the knowledge gap in pre-trained LLMs by leveraging implicit knowledge in offline exper… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  6. arXiv:2401.06709  [pdf, other

    cs.CL cs.AI

    Reliability Analysis of Psychological Concept Extraction and Classification in User-penned Text

    Authors: Muskan Garg, MSVPJ Sathvik, Amrit Chadha, Shaina Raza, Sunghwan Sohn

    Abstract: The social NLP research community witness a recent surge in the computational advancements of mental health analysis to build responsible AI models for a complex interplay between language use and self-perception. Such responsible AI models aid in quantifying the psychological concepts from user-penned texts on social media. On thinking beyond the low-level (classification) task, we advance the ex… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  7. arXiv:2401.05018  [pdf, other

    cs.CV

    AdvMT: Adversarial Motion Transformer for Long-term Human Motion Prediction

    Authors: Sarmad Idrees, Jongeun Choi, Seokman Sohn

    Abstract: To achieve seamless collaboration between robots and humans in a shared environment, accurately predicting future human movements is essential. Human motion prediction has traditionally been approached as a sequence prediction problem, leveraging historical human motion data to estimate future poses. Beginning with vanilla recurrent networks, the research community has investigated a variety of me… ▽ More

    Submitted 19 February, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: The paper is under consideration at Pattern Recognition Letters

  8. arXiv:2312.04668  [pdf, other

    cs.CL cs.AI cs.LG

    TOD-Flow: Modeling the Structure of Task-Oriented Dialogues

    Authors: Sungryull Sohn, Yiwei Lyu, Anthony Liu, Lajanugen Logeswaran, Dong-Ki Kim, Dongsub Shim, Honglak Lee

    Abstract: Task-Oriented Dialogue (TOD) systems have become crucial components in interactive artificial intelligence applications. While recent advances have capitalized on pre-trained language models (PLMs), they exhibit limitations regarding transparency and controllability. To address these challenges, we propose a novel approach focusing on inferring the TOD-Flow graph from dialogue data annotated with… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  9. arXiv:2311.12404  [pdf, other

    cs.CL cs.IR

    InterPrompt: Interpretable Prompting for Interrelated Interpersonal Risk Factors in Reddit Posts

    Authors: MSVPJ Sathvik, Surjodeep Sarkar, Chandni Saxena, Sunghwan Sohn, Muskan Garg

    Abstract: Mental health professionals and clinicians have observed the upsurge of mental disorders due to Interpersonal Risk Factors (IRFs). To simulate the human-in-the-loop triaging scenario for early detection of mental health disorders, we recognized textual indications to ascertain these IRFs : Thwarted Belongingness (TBe) and Perceived Burdensomeness (PBu) within personal narratives. In light of this,… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

    Comments: 5 pages

  10. arXiv:2311.09601  [pdf, other

    cs.AI

    Code Models are Zero-shot Precondition Reasoners

    Authors: Lajanugen Logeswaran, Sungryull Sohn, Yiwei Lyu, Anthony Zhe Liu, Dong-Ki Kim, Dongsub Shim, Moontae Lee, Honglak Lee

    Abstract: One of the fundamental skills required for an agent acting in an environment to complete tasks is the ability to understand what actions are plausible at any given point. This work explores a novel use of code representations to reason about action preconditions for sequential decision making tasks. Code representations offer the flexibility to model procedural activities and associated constraint… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: Neurips Foundation Models for Decision Making Workshop 2023

  11. arXiv:2310.18364  [pdf, other

    cs.CL cs.AI

    From Heuristic to Analytic: Cognitively Motivated Strategies for Coherent Physical Commonsense Reasoning

    Authors: Zheyuan Zhang, Shane Storks, Fengyuan Hu, Sungryull Sohn, Moontae Lee, Honglak Lee, Joyce Chai

    Abstract: Pre-trained language models (PLMs) have shown impressive performance in various language tasks. However, they are prone to spurious correlations, and often generate illusory information. In real-world applications, PLMs should justify decisions with formalized, coherent reasoning chains, but this challenge remains under-explored. Cognitive psychology theorizes that humans are capable of utilizing… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Main Conference

  12. arXiv:2310.16730  [pdf, other

    cs.LG

    MultiPrompter: Cooperative Prompt Optimization with Multi-Agent Reinforcement Learning

    Authors: Dong-Ki Kim, Sungryull Sohn, Lajanugen Logeswaran, Dongsub Shim, Honglak Lee

    Abstract: Recently, there has been an increasing interest in automated prompt optimization based on reinforcement learning (RL). This approach offers important advantages, such as generating interpretable prompts and being compatible with black-box foundation models. However, the substantial prompt space size poses challenges for RL-based methods, often leading to suboptimal policy convergence. This paper i… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

  13. arXiv:2309.15311  [pdf, other

    cs.HC cs.AI cs.GR

    The Importance of Multimodal Emotion Conditioning and Affect Consistency for Embodied Conversational Agents

    Authors: Che-Jui Chang, Samuel S. Sohn, Sen Zhang, Rajath Jayashankar, Muhammad Usman, Mubbasir Kapadia

    Abstract: Previous studies regarding the perception of emotions for embodied virtual agents have shown the effectiveness of using virtual characters in conveying emotions through interactions with humans. However, creating an autonomous embodied conversational agent with expressive behaviors presents two major challenges. The first challenge is the difficulty of synthesizing the conversational behaviors for… ▽ More

    Submitted 6 December, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

  14. arXiv:2306.16772  [pdf, other

    cs.CV cs.AI cs.LG

    M3Act: Learning from Synthetic Human Group Activities

    Authors: Che-Jui Chang, Danrui Li, Deep Patel, Parth Goel, Honglu Zhou, Seonghyeon Moon, Samuel S. Sohn, Sejong Yoon, Vladimir Pavlovic, Mubbasir Kapadia

    Abstract: The study of complex human interactions and group activities has become a focal point in human-centric computer vision. However, progress in related tasks is often hindered by the challenges of obtaining large-scale labeled datasets from real-world scenarios. To address the limitation, we introduce M3Act, a synthetic data generator for multi-view multi-group multi-person human atomic actions and g… ▽ More

    Submitted 2 May, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

  15. arXiv:2306.05596  [pdf, other

    cs.CL

    LOST: A Mental Health Dataset of Low Self-esteem in Reddit Posts

    Authors: Muskan Garg, Manas Gaur, Raxit Goswami, Sunghwan Sohn

    Abstract: Low self-esteem and interpersonal needs (i.e., thwarted belongingness (TB) and perceived burdensomeness (PB)) have a major impact on depression and suicide attempts. Individuals seek social connectedness on social media to boost and alleviate their loneliness. Social media platforms allow people to express their thoughts, experiences, beliefs, and emotions. Prior studies on mental health from soci… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  16. arXiv:2306.04059  [pdf, other

    cs.CL cs.CY

    Augmenting Reddit Posts to Determine Wellness Dimensions impacting Mental Health

    Authors: Chandreen Liyanage, Muskan Garg, Vijay Mago, Sunghwan Sohn

    Abstract: Amid ongoing health crisis, there is a growing necessity to discern possible signs of Wellness Dimensions (WD) manifested in self-narrated text. As the distribution of WD on social media data is intrinsically imbalanced, we experiment the generative NLP models for data augmentation to enable further improvement in the pre-screening task of classifying WD. To this end, we propose a simple yet effec… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

  17. arXiv:2303.09031  [pdf, other

    cs.CL cs.AI cs.LG

    A Picture is Worth a Thousand Words: Language Models Plan from Pixels

    Authors: Anthony Z. Liu, Lajanugen Logeswaran, Sungryull Sohn, Honglak Lee

    Abstract: Planning is an important capability of artificial agents that perform long-horizon tasks in real-world environments. In this work, we explore the use of pre-trained language models (PLMs) to reason about plan sequences from text instructions in embodied visual environments. Prior PLM based approaches for planning either assume observations are available in the form of text (e.g., provided by a cap… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  18. arXiv:2302.09173  [pdf, other

    cs.AI cs.CL cs.LG

    Unsupervised Task Graph Generation from Instructional Video Transcripts

    Authors: Lajanugen Logeswaran, Sungryull Sohn, Yunseok Jang, Moontae Lee, Honglak Lee

    Abstract: This work explores the problem of generating task graphs of real-world activities. Different from prior formulations, we consider a setting where text transcripts of instructional videos performing a real-world activity (e.g., making coffee) are provided and the goal is to identify the key steps relevant to the task as well as the dependency relationship between these key steps. We propose a novel… ▽ More

    Submitted 2 May, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: Findings of ACL 2023

  19. arXiv:2302.08672  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Multimodal Subtask Graph Generation from Instructional Videos

    Authors: Yunseok Jang, Sungryull Sohn, Lajanugen Logeswaran, Tiange Luo, Moontae Lee, Honglak Lee

    Abstract: Real-world tasks consist of multiple inter-dependent subtasks (e.g., a dirty pan needs to be washed before it can be used for cooking). In this work, we aim to model the causal dependencies between such subtasks from instructional videos describing the task. This is a challenging problem since complete information about the world is often inaccessible from videos, which demands robust learning mec… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

  20. arXiv:2212.04673  [pdf, other

    cs.CV

    MSI: Maximize Support-Set Information for Few-Shot Segmentation

    Authors: Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, Mubbasir Kapadia

    Abstract: FSS(Few-shot segmentation) aims to segment a target class using a small number of labeled images(support set). To extract information relevant to the target class, a dominant approach in best-performing FSS methods removes background features using a support mask. We observe that this feature excision through a limiting support mask introduces an information bottleneck in several challenging FSS c… ▽ More

    Submitted 10 November, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: ICCV 2023

  21. arXiv:2211.00817  [pdf, other

    cs.LG cs.MA

    An Information-Theoretic Approach for Estimating Scenario Generalization in Crowd Motion Prediction

    Authors: Gang Qiao, Kaidong Hu, Seonghyeon Moon, Samuel S. Sohn, Sejong Yoon, Mubbasir Kapadia, Vladimir Pavlovic

    Abstract: Learning-based approaches to modeling crowd motion have become increasingly successful but require training and evaluation on large datasets, coupled with complex model selection and parameter tuning. To circumvent this tremendously time-consuming process, we propose a novel scoring method, which characterizes generalization of models trained on source crowd scenarios and applied to target crowd s… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

  22. arXiv:2205.12648  [pdf, other

    cs.LG cs.AI

    Fast Inference and Transfer of Compositional Task Structures for Few-shot Task Generalization

    Authors: Sungryull Sohn, Hyunjae Woo, Jongwook Choi, lyubing qiang, Izzeddin Gur, Aleksandra Faust, Honglak Lee

    Abstract: We tackle real-world problems with complex structures beyond the pixel-based game or simulator. We formulate it as a few-shot reinforcement learning problem where a task is characterized by a subtask graph that defines a set of subtasks and their dependencies that are unknown to the agent. Different from the previous meta-rl methods trying to directly infer the unstructured task embedding, our mul… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted to UAI 2022 as an oral presentation

  23. arXiv:2205.09075  [pdf

    cond-mat.mtrl-sci cs.LG

    Predicting failure characteristics of structural materials via deep learning based on nondestructive void topology

    Authors: Leslie Ching Ow Tiong, Gunjick Lee, Seok Su Sohn, Donghun Kim

    Abstract: Accurate predictions of the failure progression of structural materials is critical for preventing failure-induced accidents. Despite considerable mechanics modeling-based efforts, accurate prediction remains a challenging task in real-world environments due to unexpected damage factors and defect evolutions. Here, we report a novel method for predicting material failure characteristics that uniqu… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

  24. arXiv:2203.15034  [pdf, other

    cs.LG cs.AI

    Learning Parameterized Task Structure for Generalization to Unseen Entities

    Authors: Anthony Z. Liu, Sungryull Sohn, Mahdi Qazwini, Honglak Lee

    Abstract: Real world tasks are hierarchical and compositional. Tasks can be composed of multiple subtasks (or sub-goals) that are dependent on each other. These subtasks are defined in terms of entities (e.g., "apple", "pear") that can be recombined to form new subtasks (e.g., "pickup apple", and "pickup pear"). To solve these tasks efficiently, an agent must infer subtask dependencies (e.g. an agent must e… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Published in AAAI 2022

  25. arXiv:2203.12826  [pdf, other

    cs.CV

    HM: Hybrid Masking for Few-Shot Segmentation

    Authors: Seonghyeon Moon, Samuel S. Sohn, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Muhammad Haris Khan, Mubbasir Kapadia

    Abstract: We study few-shot semantic segmentation that aims to segment a target object from a query image when provided with a few annotated support images of the target class. Several recent methods resort to a feature masking (FM) technique to discard irrelevant feature activations which eventually facilitates the reliable prediction of segmentation mask. A fundamental limitation of FM is the inability to… ▽ More

    Submitted 24 July, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: 14 pages

    MSC Class: 68T45

  26. arXiv:2201.07189  [pdf, other

    cs.CV

    MUSE-VAE: Multi-Scale VAE for Environment-Aware Long Term Trajectory Prediction

    Authors: Mihee Lee, Samuel S. Sohn, Seonghyeon Moon, Sejong Yoon, Mubbasir Kapadia, Vladimir Pavlovic

    Abstract: Accurate long-term trajectory prediction in complex scenes, where multiple agents (e.g., pedestrians or vehicles) interact with each other and the environment while attempting to accomplish diverse and often unknown goals, is a challenging stochastic forecasting problem. In this work, we propose MUSE, a new probabilistic modeling framework based on a cascade of Conditional VAEs, which tackles the… ▽ More

    Submitted 18 January, 2022; originally announced January 2022.

  27. arXiv:2112.11734  [pdf, other

    cs.LG cs.AI

    D-HYPR: Harnessing Neighborhood Modeling and Asymmetry Preservation for Digraph Representation Learning

    Authors: Honglu Zhou, Advith Chegu, Samuel S. Sohn, Zuohui Fu, Gerard de Melo, Mubbasir Kapadia

    Abstract: Digraph Representation Learning (DRL) aims to learn representations for directed homogeneous graphs (digraphs). Prior work in DRL is largely constrained (e.g., limited to directed acyclic graphs), or has poor generalizability across tasks (e.g., evaluated solely on one task). Most Graph Neural Networks (GNNs) exhibit poor performance on digraphs due to the neglect of modeling neighborhoods and pre… ▽ More

    Submitted 28 September, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

    Comments: CIKM 2022

  28. arXiv:2112.02409  [pdf, other

    cs.LG cs.AI

    Understanding Dynamic Spatio-Temporal Contexts in Long Short-Term Memory for Road Traffic Speed Prediction

    Authors: Won Kyung Lee, Deuk Sin Kwon, So Young Sohn

    Abstract: Reliable traffic flow prediction is crucial to creating intelligent transportation systems. Many big-data-based prediction approaches have been developed but they do not reflect complicated dynamic interactions between roads considering time and location. In this study, we propose a dynamically localised long short-term memory (LSTM) model that involves both spatial and temporal dependence between… ▽ More

    Submitted 16 June, 2023; v1 submitted 4 December, 2021; originally announced December 2021.

    Comments: 10pages, 2 tables, 4 figures, 2017 KDD Cup

  29. arXiv:2111.09858  [pdf, other

    cs.LG cs.AI cs.CV cs.RO

    Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement Learning

    Authors: Christopher Hoang, Sungryull Sohn, Jongwook Choi, Wilka Carvalho, Honglak Lee

    Abstract: Operating in the real-world often requires agents to learn about a complex environment and apply this understanding to achieve a breadth of goals. This problem, known as goal-conditioned reinforcement learning (GCRL), becomes especially challenging for long-horizon goals. Current methods have tackled this problem by augmenting goal-conditioned policies with graph-based planning algorithms. However… ▽ More

    Submitted 18 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021. Video and code at https://2016choang.github.io/sfl

  30. arXiv:2107.06405  [pdf, other

    cs.LG cs.AI cs.RO

    Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks

    Authors: Sungryull Sohn, Sungtae Lee, Jongwook Choi, Harm van Seijen, Mehdi Fatemi, Honglak Lee

    Abstract: We propose the k-Shortest-Path (k-SP) constraint: a novel constraint on the agent's trajectory that improves the sample efficiency in sparse-reward MDPs. We show that any optimal policy necessarily satisfies the k-SP constraint. Notably, the k-SP constraint prevents the policy from exploring state-action pairs along the non-k-SP trajectories (e.g., going back and forth). However, in practice, excl… ▽ More

    Submitted 13 July, 2021; originally announced July 2021.

    Comments: In proceedings of ICML 2021

  31. Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in a First-person Simulated 3D Environment

    Authors: Wilka Carvalho, Anthony Liang, Kimin Lee, Sungryull Sohn, Honglak Lee, Richard L. Lewis, Satinder Singh

    Abstract: First-person object-interaction tasks in high-fidelity, 3D, simulated environments such as the AI2Thor virtual home-environment pose significant sample-efficiency challenges for reinforcement learning (RL) agents learning from sparse task rewards. To alleviate these challenges, prior work has provided extensive supervision via a combination of reward-shaping, ground-truth object-information, and e… ▽ More

    Submitted 20 May, 2021; v1 submitted 28 October, 2020; originally announced October 2020.

    Comments: Accepted to IJCAI 2021

    Journal ref: Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI 2021)

  32. arXiv:2002.05522  [pdf, other

    cs.LG cs.AI stat.ML

    BRPO: Batch Residual Policy Optimization

    Authors: Sungryull Sohn, Yinlam Chow, Jayden Ooi, Ofir Nachum, Honglak Lee, Ed Chi, Craig Boutilier

    Abstract: In batch reinforcement learning (RL), one often constrains a learned policy to be close to the behavior (data-generating) policy, e.g., by constraining the learned action distribution to differ from the behavior policy by some maximum degree that is the same at each state. This can cause batch RL to be overly conservative, unable to exploit large policy changes at frequently-visited, high-confiden… ▽ More

    Submitted 28 March, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

  33. arXiv:2001.00248  [pdf, other

    cs.LG cs.AI stat.ML

    Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies

    Authors: Sungryull Sohn, Hyunjae Woo, Jongwook Choi, Honglak Lee

    Abstract: We propose and address a novel few-shot RL problem, where a task is characterized by a subtask graph which describes a set of subtasks and their dependencies that are unknown to the agent. The agent needs to quickly adapt to the task over few episodes during adaptation phase to maximize the return in the test phase. Instead of directly learning a meta-policy, we develop a Meta-learner with Subtask… ▽ More

    Submitted 13 April, 2020; v1 submitted 1 January, 2020; originally announced January 2020.

    Comments: Published in ICLR 2020

  34. Clinical Concept Extraction: a Methodology Review

    Authors: Sunyang Fu, David Chen, Huan He, Sijia Liu, Sungrim Moon, Kevin J Peterson, Feichen Shen, Liwei Wang, Yanshan Wang, Andrew Wen, Yiqing Zhao, Sunghwan Sohn, Hongfang Liu

    Abstract: Background Concept extraction, a subdomain of natural language processing (NLP) with a focus on extracting concepts of interest, has been adopted to computationally extract clinical information from text for a wide range of applications ranging from clinical decision support to care quality improvement. Objectives In this literature review, we provide a methodology review of clinical concept ext… ▽ More

    Submitted 10 August, 2020; v1 submitted 24 October, 2019; originally announced October 2019.

    Journal ref: Journal of Biomedical Informatics (2020): 103526

  35. arXiv:1910.05810  [pdf, other

    cs.AI cs.CV

    Deep Crowd-Flow Prediction in Built Environments

    Authors: Samuel S. Sohn, Seonghyeon Moon, Honglu Zhou, Sejong Yoon, Vladimir Pavlovic, Mubbasir Kapadia

    Abstract: Predicting the behavior of crowds in complex environments is a key requirement in a multitude of application areas, including crowd and disaster management, architectural design, and urban planning. Given a crowd's immediate state, current approaches simulate crowd movement to arrive at a future state. However, most applications require the ability to predict hundreds of possible simulation outcom… ▽ More

    Submitted 13 October, 2019; originally announced October 2019.

  36. arXiv:1910.00767  [pdf

    cs.MA cs.AI

    Cognitive Agent Based Simulation Model For Improving Disaster Response Procedures

    Authors: Rohit K. Dubey, Samuel S. Sohn, Christoph Hoelscher, Mubbasir Kapadia

    Abstract: In the event of a disaster, saving human lives is of utmost importance. For developing proper evacuation procedures and guidance systems, behavioural data on how people respond during panic and stress is crucial. In the absence of real human data on building evacuation, there is a need for a crowd simulator to model egress and decision-making under uncertainty. In this paper, we propose an agent-b… ▽ More

    Submitted 1 October, 2019; originally announced October 2019.

  37. arXiv:1807.07665  [pdf, other

    cs.LG cs.AI stat.ML

    Hierarchical Reinforcement Learning for Zero-shot Generalization with Subtask Dependencies

    Authors: Sungryull Sohn, Junhyuk Oh, Honglak Lee

    Abstract: We introduce a new RL problem where the agent is required to generalize to a previously-unseen environment characterized by a subtask graph which describes a set of subtasks and their dependencies. Unlike existing hierarchical multitask RL approaches that explicitly describe what the agent should do at a high level, our problem only describes properties of subtasks and relationships among them, wh… ▽ More

    Submitted 24 May, 2019; v1 submitted 19 July, 2018; originally announced July 2018.

    Comments: In NeurIPS 2018

  38. arXiv:1804.07814  [pdf

    cs.IR

    A Deep Representation Empowered Distant Supervision Paradigm for Clinical Information Extraction

    Authors: Yanshan Wang, Sunghwan Sohn, Sijia Liu, Feichen Shen, Liwei Wang, Elizabeth J. Atkinson, Shreyasee Amin, Hongfang Liu

    Abstract: Objective: To automatically create large labeled training datasets and reduce the efforts of feature engineering for training accurate machine learning models for clinical information extraction. Materials and Methods: We propose a distant supervision paradigm empowered by deep representation for extracting information from clinical text. In this paradigm, the rule-based NLP algorithms are utilize… ▽ More

    Submitted 20 April, 2018; originally announced April 2018.

  39. Detection of Surgical Site Infection Utilizing Automated Feature Generation in Clinical Notes

    Authors: Feichen Shen, David W Larson, James M. Naessens, Elizabeth B. Habermann, Hongfang Liu, Sunghwan Sohn

    Abstract: Postsurgical complications (PSCs) are known as a deviation from the normal postsurgical course and categorized by severity and treatment requirements. Surgical site infection (SSI) is one of major PSCs and the most common healthcare-associated infection, resulting in increased length of hospital stay and cost. In this work, we assessed an automated way to generate lexicon (i.e., keyword features)… ▽ More

    Submitted 26 March, 2018; v1 submitted 23 March, 2018; originally announced March 2018.

  40. arXiv:1704.05831  [pdf, other

    cs.CV

    Learning to Generate Long-term Future via Hierarchical Prediction

    Authors: Ruben Villegas, Jimei Yang, Yuliang Zou, Sungryull Sohn, Xunyu Lin, Honglak Lee

    Abstract: We propose a hierarchical approach for making long-term predictions of future frames. To avoid inherent compounding errors in recursive pixel-level prediction, we propose to first estimate high-level structure in the input frames, then predict how that structure evolves in the future, and finally by observing a single frame from the past and the predicted high-level structure, we construct the fut… ▽ More

    Submitted 7 January, 2018; v1 submitted 19 April, 2017; originally announced April 2017.

    Comments: International Conference on Machine Learning (ICML) 2017