Skip to main content

Showing 1–50 of 59 results for author: Du, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.01928  [pdf, other

    cs.RO

    History-Aware Planning for Risk-free Autonomous Navigation on Unknown Uneven Terrain

    Authors: Yinchuan Wang, Nianfei Du, Yongsen Qin, Xiang Zhang, Rui Song, Chaoqun Wang

    Abstract: It is challenging for the mobile robot to achieve autonomous and mapless navigation in the unknown environment with uneven terrain. In this study, we present a layered and systematic pipeline. At the local level, we maintain a tree structure that is dynamically extended with the navigation. This structure unifies the planning with the terrain identification. Besides, it contributes to explicitly i… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: This paper has been accepted by 2024 IEEE International Conference on Robotics and Automation (ICRA 2024)

  2. arXiv:2405.15052  [pdf, other

    cs.LG cs.AI

    Revisiting MoE and Dense Speed-Accuracy Comparisons for LLM Training

    Authors: Xianzhi Du, Tom Gunter, Xiang Kong, Mark Lee, Zirui Wang, Aonan Zhang, Nan Du, Ruoming Pang

    Abstract: Mixture-of-Experts (MoE) enjoys performance gain by increasing model capacity while keeping computation cost constant. When comparing MoE to dense models, prior work typically adopt the following setting: 1) use FLOPs or activated parameters as a measure of model complexity; 2) train all models to the same number of tokens. We argue that this setting favors MoE as FLOPs and activated parameters do… ▽ More

    Submitted 28 June, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 8 pages

  3. arXiv:2405.13640  [pdf, other

    cs.CL cs.AI cs.LG

    Knowledge Graph Reasoning with Self-supervised Reinforcement Learning

    Authors: Ying Ma, Owen Burns, Mingqiu Wang, Gang Li, Nan Du, Laurent El Shafey, Liqiang Wang, Izhak Shafran, Hagen Soltau

    Abstract: Reinforcement learning (RL) is an effective method of finding reasoning pathways in incomplete knowledge graphs (KGs). To overcome the challenges of a large action space, a self-supervised pre-training method is proposed to warm up the policy network before the RL training stage. To alleviate the distributional mismatch issue in general self-supervised RL (SSRL), in our supervised learning (SL) st… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 17 pages, 11 figures

  4. arXiv:2404.10642  [pdf, other

    cs.CL cs.LG

    Self-playing Adversarial Language Game Enhances LLM Reasoning

    Authors: Pengyu Cheng, Tianhao Hu, Han Xu, Zhisong Zhang, Yong Dai, Lei Han, Nan Du

    Abstract: We explore the self-play training procedure of large language models (LLMs) in a two-player adversarial language game called Adversarial Taboo. In this game, an attacker and a defender communicate around a target word only visible to the attacker. The attacker aims to induce the defender to speak the target word unconsciously, while the defender tries to infer the target word from the attacker's u… ▽ More

    Submitted 23 May, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: Preprint

  5. arXiv:2403.09611  [pdf, other

    cs.CV cs.CL cs.LG

    MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

    Authors: Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Ankur Jain, Hongyu Hè, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman , et al. (7 additional authors not shown)

    Abstract: In this work, we discuss building performant Multimodal Large Language Models (MLLMs). In particular, we study the importance of various architecture components and data choices. Through careful and comprehensive ablations of the image encoder, the vision language connector, and various pre-training data choices, we identified several crucial design lessons. For example, we demonstrate that for la… ▽ More

    Submitted 18 April, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  6. arXiv:2402.16696  [pdf, other

    cs.CL

    Look Before You Leap: Towards Decision-Aware and Generalizable Tool-Usage for Large Language Models

    Authors: Anchun Gui, Jian Li, Yong Dai, Nan Du, Han Xiao

    Abstract: Tool-augmented large language models (LLMs) are attracting widespread attention when accessing up-to-date knowledge and alleviating hallucination issues. Nowadays, advanced closed-source LLMs (e.g., ChatGPT) have demonstrated surprising tool-usage capabilities through prompting and in-context learning techniques. To empower the capabilities of open-source LLMs (e.g., LLaMA) in manipulating tools,… ▽ More

    Submitted 27 February, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: 20 pages, 18 figures

  7. arXiv:2402.15572  [pdf, other

    cs.AI cs.CV cs.RO

    Improving Explainable Object-induced Model through Uncertainty for Automated Vehicles

    Authors: Shihong Ling, Yue Wan, Xiaowei Jia, Na Du

    Abstract: The rapid evolution of automated vehicles (AVs) has the potential to provide safer, more efficient, and comfortable travel options. However, these systems face challenges regarding reliability in complex driving scenarios. Recent explainable AV architectures neglect crucial information related to inherent uncertainties while providing explanations for actions. To overcome such challenges, our stud… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: In Proceedings of the 2024 ACM / IEEE International Conference on Human-Robot Interaction (HRI '24), March 11--14, 2024, Boulder, CO, USA. ACM, New York, NY, USA, 9 pages

  8. arXiv:2402.02101  [pdf, other

    cs.CL cs.AI

    Are Large Language Models Good Prompt Optimizers?

    Authors: Ruotian Ma, Xiaolei Wang, Xin Zhou, Jian Li, Nan Du, Tao Gui, Qi Zhang, Xuanjing Huang

    Abstract: LLM-based Automatic Prompt Optimization, which typically utilizes LLMs as Prompt Optimizers to self-reflect and refine prompts, has shown promising performance in recent studies. Despite the success, the underlying mechanism of this approach remains unexplored, and the true effectiveness of LLMs as Prompt Optimizers requires further validation. In this work, we conducted a comprehensive study to u… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

  9. arXiv:2312.07401  [pdf, other

    cs.AI

    On Diversified Preferences of Large Language Model Alignment

    Authors: Dun Zeng, Yong Dai, Pengyu Cheng, Longyue Wang, Tianhao Hu, Wanshun Chen, Nan Du, Zenglin Xu

    Abstract: Aligning large language models (LLMs) with human preferences has been recognized as the key to improving LLMs' interaction quality. However, in this pluralistic world, human preferences can be diversified due to annotators' different tastes, which hinders the effectiveness of LLM alignment methods. This paper presents the first quantitative analysis of commonly used human feedback datasets to inve… ▽ More

    Submitted 17 April, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: preprint

  10. arXiv:2312.01170  [pdf, other

    cs.CR

    Power-balanced Memristive Cryptographic Implementation Against Side Channel Attacks

    Authors: Ziang Chen, Li-Wei Chen, Xianyue Zhao, Kefeng Li, Heidemarie Schmidt, Ilia Polian, Nan Du

    Abstract: Memristors, as emerging nano-devices, offer promising performance and exhibit rich electrical dynamic behavior. Having already found success in applications such as neuromorphic and in-memory computing, researchers are now exploring their potential for cryptographic implementations. In this study, we present a novel power-balanced hiding strategy utilizing memristor groups to conceal power consump… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  11. arXiv:2311.15436  [pdf, other

    cs.CL

    Learning to Skip for Language Modeling

    Authors: Dewen Zeng, Nan Du, Tao Wang, Yuanzhong Xu, Tao Lei, Zhifeng Chen, Claire Cui

    Abstract: Overparameterized large-scale language models have impressive generalization performance of in-context few-shot learning. However, most language models allocate the same amount of parameters or computation to each token, disregarding the complexity or importance of the input data. We argue that in language model pretraining, a variable amount of computation should be assigned to different tokens,… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  12. arXiv:2311.08045  [pdf, other

    cs.CL cs.AI cs.LG

    Adversarial Preference Optimization: Enhancing Your Alignment via RM-LLM Game

    Authors: Pengyu Cheng, Yifan Yang, Jian Li, Yong Dai, Tianhao Hu, Peixin Cao, Nan Du, Xiaolong Li

    Abstract: Human preference alignment is essential to improve the interaction quality of large language models (LLMs). Existing alignment methods depend on manually annotated preference data to guide the LLM optimization directions. However, continuously updating LLMs for alignment raises a distribution gap between model-generated samples and human-annotated responses, hindering training effectiveness. To mi… ▽ More

    Submitted 3 June, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: Accepted by ACL2024 findings

  13. TDPP: Two-Dimensional Permutation-Based Protection of Memristive Deep Neural Networks

    Authors: Minhui Zou, Zhenhua Zhu, Tzofnat Greenberg-Toledo, Orian Leitersdorf, Jiang Li, Junlong Zhou, Yu Wang, Nan Du, Shahar Kvatinsky

    Abstract: The execution of deep neural network (DNN) algorithms suffers from significant bottlenecks due to the separation of the processing and memory units in traditional computer systems. Emerging memristive computing systems introduce an in situ approach that overcomes this bottleneck. The non-volatility of memristive devices, however, may expose the DNN weights stored in memristive crossbars to potenti… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 14 pages, 11 figures

  14. arXiv:2309.03126  [pdf, other

    cs.CL

    Everyone Deserves A Reward: Learning Customized Human Preferences

    Authors: Pengyu Cheng, Jiawen Xie, Ke Bai, Yong Dai, Nan Du

    Abstract: Reward models (RMs) are essential for aligning large language models (LLMs) with human preferences to improve interaction quality. However, the real world is pluralistic, which leads to diversified human preferences with respect to different religions, politics, cultures, etc. Moreover, each individual can have their unique preferences on various topics. Neglecting the diversity of human preferenc… ▽ More

    Submitted 15 September, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

  15. arXiv:2308.13191  [pdf, other

    cs.CL cs.AI

    Chunk, Align, Select: A Simple Long-sequence Processing Method for Transformers

    Authors: Jiawen Xie, Pengyu Cheng, Xiao Liang, Yong Dai, Nan Du

    Abstract: Although dominant in natural language processing, transformer-based models remain challenged by the task of long-sequence processing, because the computational cost of self-attention operations in transformers swells quadratically with the input sequence length. To alleviate the complexity of long-sequence processing, we propose a simple framework to enable the offthe-shelf pre-trained transformer… ▽ More

    Submitted 5 July, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: ACL 2024

  16. arXiv:2306.00008  [pdf, other

    cs.LG cs.CL

    Brainformers: Trading Simplicity for Efficiency

    Authors: Yanqi Zhou, Nan Du, Yanping Huang, Daiyi Peng, Chang Lan, Da Huang, Siamak Shakeri, David So, Andrew Dai, Yifeng Lu, Zhifeng Chen, Quoc Le, Claire Cui, James Laudon, Jeff Dean

    Abstract: Transformers are central to recent successes in natural language processing and computer vision. Transformers have a mostly uniform backbone where layers alternate between feed-forward and self-attention in order to build a deep network. Here we investigate this design choice and find that more complex blocks that have different permutations of layer primitives can be more efficient. Using this in… ▽ More

    Submitted 25 April, 2024; v1 submitted 29 May, 2023; originally announced June 2023.

  17. arXiv:2305.14705  [pdf, other

    cs.CL

    Mixture-of-Experts Meets Instruction Tuning:A Winning Combination for Large Language Models

    Authors: Sheng Shen, Le Hou, Yanqi Zhou, Nan Du, Shayne Longpre, Jason Wei, Hyung Won Chung, Barret Zoph, William Fedus, Xinyun Chen, Tu Vu, Yuexin Wu, Wuyang Chen, Albert Webson, Yunxuan Li, Vincent Zhao, Hongkun Yu, Kurt Keutzer, Trevor Darrell, Denny Zhou

    Abstract: Sparse Mixture-of-Experts (MoE) is a neural architecture design that can be utilized to add learnable parameters to Large Language Models (LLMs) without increasing inference cost. Instruction tuning is a technique for training LLMs to follow instructions. We advocate combining these two approaches, as we find that MoE models benefit more from instruction tuning than dense models. In particular, we… ▽ More

    Submitted 5 July, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Preprint

  18. arXiv:2305.12281  [pdf, other

    cs.CL cs.LG

    Lifelong Language Pretraining with Distribution-Specialized Experts

    Authors: Wuyang Chen, Yanqi Zhou, Nan Du, Yanping Huang, James Laudon, Zhifeng Chen, Claire Cu

    Abstract: Pretraining on a large-scale corpus has become a standard method to build general language models (LMs). Adapting a model to new data distributions targeting different downstream tasks poses significant challenges. Naive fine-tuning may incur catastrophic forgetting when the over-parameterized LMs overfit the new data but fail to preserve the pretrained features. Lifelong learning (LLL) aims to en… ▽ More

    Submitted 20 May, 2023; originally announced May 2023.

    Comments: ICML 2023

  19. arXiv:2305.10429  [pdf, other

    cs.CL cs.LG

    DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining

    Authors: Sang Michael Xie, Hieu Pham, Xuanyi Dong, Nan Du, Hanxiao Liu, Yifeng Lu, Percy Liang, Quoc V. Le, Tengyu Ma, Adams Wei Yu

    Abstract: The mixture proportions of pretraining data domains (e.g., Wikipedia, books, web text) greatly affect language model (LM) performance. In this paper, we propose Domain Reweighting with Minimax Optimization (DoReMi), which first trains a small proxy model using group distributionally robust optimization (Group DRO) over domains to produce domain weights (mixture proportions) without knowledge of do… ▽ More

    Submitted 20 November, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  20. arXiv:2305.10403  [pdf, other

    cs.CL cs.AI

    PaLM 2 Technical Report

    Authors: Rohan Anil, Andrew M. Dai, Orhan Firat, Melvin Johnson, Dmitry Lepikhin, Alexandre Passos, Siamak Shakeri, Emanuel Taropa, Paige Bailey, Zhifeng Chen, Eric Chu, Jonathan H. Clark, Laurent El Shafey, Yanping Huang, Kathy Meier-Hellstern, Gaurav Mishra, Erica Moreira, Mark Omernick, Kevin Robinson, Sebastian Ruder, Yi Tay, Kefan Xiao, Yuanzhong Xu, Yujing Zhang, Gustavo Hernandez Abrego , et al. (103 additional authors not shown)

    Abstract: We introduce PaLM 2, a new state-of-the-art language model that has better multilingual and reasoning capabilities and is more compute-efficient than its predecessor PaLM. PaLM 2 is a Transformer-based model trained using a mixture of objectives. Through extensive evaluations on English and multilingual language, and reasoning tasks, we demonstrate that PaLM 2 has significantly improved quality on… ▽ More

    Submitted 13 September, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  21. arXiv:2304.04947  [pdf, other

    cs.CL

    Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference

    Authors: Tao Lei, Junwen Bai, Siddhartha Brahma, Joshua Ainslie, Kenton Lee, Yanqi Zhou, Nan Du, Vincent Y. Zhao, Yuexin Wu, Bo Li, Yu Zhang, Ming-Wei Chang

    Abstract: We propose Conditional Adapter (CoDA), a parameter-efficient transfer learning method that also improves inference efficiency. CoDA generalizes beyond standard adapter approaches to enable a new way of balancing speed and accuracy using conditional computation. Starting with an existing dense pretrained model, CoDA adds sparse activation together with a small number of new parameters and a light-w… ▽ More

    Submitted 26 November, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: NeurIPS camera ready version

  22. arXiv:2302.08917  [pdf, other

    cs.CL cs.LG

    Massively Multilingual Shallow Fusion with Large Language Models

    Authors: Ke Hu, Tara N. Sainath, Bo Li, Nan Du, Yanping Huang, Andrew M. Dai, Yu Zhang, Rodrigo Cabrera, Zhifeng Chen, Trevor Strohman

    Abstract: While large language models (LLM) have made impressive progress in natural language processing, it remains unclear how to utilize them in improving automatic speech recognition (ASR). In this work, we propose to train a single multilingual language model (LM) for shallow fusion in multiple languages. We push the limits of the multilingual LM to cover up to 84 languages by scaling up using a mixtur… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: Accepted to IEEE ICASSP 2023

  23. arXiv:2212.09347  [pdf, other

    cs.CR cs.ET

    Review of security techniques for memristor computing systems

    Authors: Minhui Zou, Nan Du, Shahar Kvatinsky

    Abstract: Neural network (NN) algorithms have become the dominant tool in visual object recognition, natural language processing, and robotics. To enhance the computational efficiency of these algorithms, in comparison to the traditional von Neuman computing architectures, researchers have been focusing on memristor computing systems. A major drawback when using memristor computing systems today is that, in… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

    Comments: 15 pages, 5 figures

    Journal ref: Front. Electron. Mater, 19 December 2022, Sec. Semiconducting Materials and Devices Sec. Semiconducting Materials and Devices

  24. arXiv:2210.03629  [pdf, other

    cs.CL cs.AI cs.LG

    ReAct: Synergizing Reasoning and Acting in Language Models

    Authors: Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik Narasimhan, Yuan Cao

    Abstract: While large language models (LLMs) have demonstrated impressive capabilities across tasks in language understanding and interactive decision making, their abilities for reasoning (e.g. chain-of-thought prompting) and acting (e.g. action plan generation) have primarily been studied as separate topics. In this paper, we explore the use of LLMs to generate both reasoning traces and task-specific acti… ▽ More

    Submitted 9 March, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: v3 is the ICLR camera ready version with some typos fixed. Project site with code: https://react-lm.github.io

  25. arXiv:2210.03465  [pdf, other

    cs.ET cond-mat.mes-hall cs.CR physics.comp-ph

    Physics inspired compact modelling of BiFeO$_3$ based memristors for hardware security applications

    Authors: Sahitya Yarragolla, Nan Du, Torben Hemke, Xianyue Zhao, Ziang Chen, Ilia Polian, Thomas Mussenbrock

    Abstract: With the advent of the Internet of Things, nanoelectronic devices or memristors have been the subject of significant interest for use as new hardware security primitives. Among the several available memristors, BiFe$\rm O_{3}$ (BFO)-based electroforming-free memristors have attracted considerable attention due to their excellent properties, such as long retention time, self-rectification, intrinsi… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

    Comments: 13 pages and 8 figures

  26. arXiv:2204.02311  [pdf, other

    cs.CL

    PaLM: Scaling Language Modeling with Pathways

    Authors: Aakanksha Chowdhery, Sharan Narang, Jacob Devlin, Maarten Bosma, Gaurav Mishra, Adam Roberts, Paul Barham, Hyung Won Chung, Charles Sutton, Sebastian Gehrmann, Parker Schuh, Kensen Shi, Sasha Tsvyashchenko, Joshua Maynez, Abhishek Rao, Parker Barnes, Yi Tay, Noam Shazeer, Vinodkumar Prabhakaran, Emily Reif, Nan Du, Ben Hutchinson, Reiner Pope, James Bradbury, Jacob Austin , et al. (42 additional authors not shown)

    Abstract: Large language models have been shown to achieve remarkable performance across a variety of natural language tasks using few-shot learning, which drastically reduces the number of task-specific training examples needed to adapt the model to a particular application. To further our understanding of the impact of scale on few-shot learning, we trained a 540-billion parameter, densely activated, Tran… ▽ More

    Submitted 5 October, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

  27. arXiv:2202.09368  [pdf, other

    cs.LG cs.AI

    Mixture-of-Experts with Expert Choice Routing

    Authors: Yanqi Zhou, Tao Lei, Hanxiao Liu, Nan Du, Yanping Huang, Vincent Zhao, Andrew Dai, Zhifeng Chen, Quoc Le, James Laudon

    Abstract: Sparsely-activated Mixture-of-experts (MoE) models allow the number of parameters to greatly increase while keeping the amount of computation for a given token or a given sample unchanged. However, a poor expert routing strategy (e.g. one resulting in load imbalance) can cause certain experts to be under-trained, leading to an expert being under or over-specialized. Prior work allocates a fixed nu… ▽ More

    Submitted 13 October, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

  28. arXiv:2202.08906  [pdf, other

    cs.CL cs.LG

    ST-MoE: Designing Stable and Transferable Sparse Expert Models

    Authors: Barret Zoph, Irwan Bello, Sameer Kumar, Nan Du, Yanping Huang, Jeff Dean, Noam Shazeer, William Fedus

    Abstract: Scale has opened new frontiers in natural language processing -- but at a high cost. In response, Mixture-of-Experts (MoE) and Switch Transformers have been proposed as an energy efficient path to even larger and more capable language models. But advancing the state-of-the-art across a broad set of natural language tasks has been hindered by training instabilities and uncertain quality during fine… ▽ More

    Submitted 29 April, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

    Comments: 25 pages main text, 39 pages overall

  29. arXiv:2112.06905  [pdf, other

    cs.CL

    GLaM: Efficient Scaling of Language Models with Mixture-of-Experts

    Authors: Nan Du, Yanping Huang, Andrew M. Dai, Simon Tong, Dmitry Lepikhin, Yuanzhong Xu, Maxim Krikun, Yanqi Zhou, Adams Wei Yu, Orhan Firat, Barret Zoph, Liam Fedus, Maarten Bosma, Zongwei Zhou, Tao Wang, Yu Emma Wang, Kellie Webster, Marie Pellat, Kevin Robinson, Kathleen Meier-Hellstern, Toju Duke, Lucas Dixon, Kun Zhang, Quoc V Le, Yonghui Wu , et al. (2 additional authors not shown)

    Abstract: Scaling language models with more data, compute and parameters has driven significant progress in natural language processing. For example, thanks to scaling, GPT-3 was able to achieve strong results on in-context learning tasks. However, training these large dense models requires significant amounts of computing resources. In this paper, we propose and develop a family of language models named GL… ▽ More

    Submitted 1 August, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: Accepted to ICML 2022

  30. arXiv:2109.01652  [pdf, other

    cs.CL

    Finetuned Language Models Are Zero-Shot Learners

    Authors: Jason Wei, Maarten Bosma, Vincent Y. Zhao, Kelvin Guu, Adams Wei Yu, Brian Lester, Nan Du, Andrew M. Dai, Quoc V. Le

    Abstract: This paper explores a simple method for improving the zero-shot learning abilities of language models. We show that instruction tuning -- finetuning language models on a collection of tasks described via instructions -- substantially improves zero-shot performance on unseen tasks. We take a 137B parameter pretrained language model and instruction-tune it on over 60 NLP tasks verbalized via natur… ▽ More

    Submitted 8 February, 2022; v1 submitted 3 September, 2021; originally announced September 2021.

    Comments: Version 5. Find list of changes in Appendix F (page 35)

  31. arXiv:2107.09545  [pdf, other

    cs.LG cs.HC cs.RO

    Predicting Driver Takeover Time in Conditionally Automated Driving

    Authors: Jackie Ayoub, Na Du, X. Jessie Yang, Feng Zhou

    Abstract: It is extremely important to ensure a safe takeover transition in conditionally automated driving. One of the critical factors that quantifies the safe takeover transition is takeover time. Previous studies identified the effects of many factors on takeover time, such as takeover lead time, non-driving tasks, modalities of the takeover requests (TORs), and scenario urgency. However, there is a lac… ▽ More

    Submitted 20 July, 2021; originally announced July 2021.

  32. arXiv:2105.04645  [pdf, other

    cs.CL

    R2D2: Relational Text Decoding with Transformers

    Authors: Aryan Arbabi, Mingqiu Wang, Laurent El Shafey, Nan Du, Izhak Shafran

    Abstract: We propose a novel framework for modeling the interaction between graphical structures and the natural language text associated with their nodes and edges. Existing approaches typically fall into two categories. On group ignores the relational structure by converting them into linear sequences and then utilize the highly successful Seq2Seq models. The other side ignores the sequential nature of th… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

  33. arXiv:2010.03047  [pdf, other

    cs.HC

    Psychophysiological responses to takeover requests in conditionally automated driving

    Authors: Na Du, X. Jessie Yang, Feng Zhou

    Abstract: In SAE Level 3 automated driving, taking over control from automation raises significant safety concerns because drivers out of the vehicle control loop have difficulty negotiating takeover transitions. Existing studies on takeover transitions have focused on drivers' behavioral responses to takeover requests (TORs). As a complement, this exploratory study aimed to examine drivers' psychophysiolog… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

  34. arXiv:2008.01051  [pdf, other

    cs.HC

    Enhancing autonomy transparency: an option-centric rationale approach

    Authors: Ruikun Luo, Na Du, X. Jessie Yang

    Abstract: While the advances in artificial intelligence and machine learning empower a new generation of autonomous systems for assisting human performance, one major concern arises from the human factors perspective: Humans have difficulty deciphering autonomy-generated solutions and increasingly perceive autonomy as a mysterious black box. The lack of transparency contributes to the lack of trust in auton… ▽ More

    Submitted 3 August, 2020; originally announced August 2020.

  35. arXiv:2007.06199  [pdf, other

    eess.IV cs.CV cs.LG

    CheXphoto: 10,000+ Photos and Transformations of Chest X-rays for Benchmarking Deep Learning Robustness

    Authors: Nick A. Phillips, Pranav Rajpurkar, Mark Sabini, Rayan Krishnan, Sharon Zhou, Anuj Pareek, Nguyet Minh Phu, Chris Wang, Mudit Jain, Nguyen Duong Du, Steven QH Truong, Andrew Y. Ng, Matthew P. Lungren

    Abstract: Clinical deployment of deep learning algorithms for chest x-ray interpretation requires a solution that can integrate into the vast spectrum of clinical workflows across the world. An appealing approach to scaled deployment is to leverage the ubiquity of smartphones by capturing photos of x-rays to share with clinicians using messaging services like WhatsApp. However, the application of chest x-ra… ▽ More

    Submitted 11 December, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

  36. arXiv:2003.11531  [pdf, other

    cs.CL

    The Medical Scribe: Corpus Development and Model Performance Analyses

    Authors: Izhak Shafran, Nan Du, Linh Tran, Amanda Perry, Lauren Keyes, Mark Knichel, Ashley Domin, Lei Huang, Yuhui Chen, Gang Li, Mingqiu Wang, Laurent El Shafey, Hagen Soltau, Justin S. Paul

    Abstract: There is a growing interest in creating tools to assist in clinical note generation using the audio of provider-patient encounters. Motivated by this goal and with the help of providers and medical scribes, we developed an annotation scheme to extract relevant clinical concepts. We used this annotation scheme to label a corpus of about 6k clinical encounters. This was used to train a state-of-the-… ▽ More

    Submitted 11 March, 2020; originally announced March 2020.

    Comments: Extended version of the paper accepted at LREC 2020

    Journal ref: Proceedings of Language Resources and Evaluation, 2020

  37. arXiv:2001.04509  [pdf

    cs.HC cs.AI cs.CY cs.RO

    Examining the Effects of Emotional Valence and Arousal on Takeover Performance in Conditionally Automated Driving

    Authors: Na Du, Feng Zhou, Elizabeth Pulver, Dawn M. Tilbury, Lionel P. Robert, Anuj K. Pradhan, X. Jessie Yang

    Abstract: In conditionally automated driving, drivers have difficulty in takeover transitions as they become increasingly decoupled from the operational level of driving. Factors influencing takeover performance, such as takeover lead time and the engagement of non-driving related tasks, have been studied in the past. However, despite the important role emotions play in human-machine interaction and in manu… ▽ More

    Submitted 13 January, 2020; originally announced January 2020.

    Comments: 28 pages, 7 figures

  38. arXiv:1912.01762  [pdf, ps, other

    cs.LG stat.ML

    Deep Physiological State Space Model for Clinical Forecasting

    Authors: Yuan Xue, Denny Zhou, Nan Du, Andrew Dai, Zhen Xu, Kun Zhang, Claire Cui

    Abstract: Clinical forecasting based on electronic medical records (EMR) can uncover the temporal correlations between patients' conditions and outcomes from sequences of longitudinal clinical measurements. In this work, we propose an intervention-augmented deep state space generative model to capture the interactions among clinical measurements and interventions by explicitly modeling the dynamics of patie… ▽ More

    Submitted 3 December, 2019; originally announced December 2019.

  39. arXiv:1908.11536  [pdf, other

    cs.CL

    Learning to Infer Entities, Properties and their Relations from Clinical Conversations

    Authors: Nan Du, Mingqiu Wang, Linh Tran, Gang Li, Izhak Shafran

    Abstract: Recently we proposed the Span Attribute Tagging (SAT) Model (Du et al., 2019) to infer clinical entities (e.g., symptoms) and their properties (e.g., duration). It tackles the challenge of large label space and limited training data using a hierarchical two-stage approach that identifies the span of interest in a tagging step and assigns labels to the span in a classification step. We extend the… ▽ More

    Submitted 30 August, 2019; originally announced August 2019.

    Journal ref: Proc. Empirical Methods in Natural Language Processing, 2019

  40. Multi-Grained Named Entity Recognition

    Authors: Congying Xia, Chenwei Zhang, Tao Yang, Yaliang Li, Nan Du, Xian Wu, Wei Fan, Fenglong Ma, Philip Yu

    Abstract: This paper presents a novel framework, MGNER, for Multi-Grained Named Entity Recognition where multiple entities or entity mentions in a sentence could be non-overlapping or totally nested. Different from traditional approaches regarding NER as a sequential labeling task and annotate entities consecutively, MGNER detects and recognizes entities on multiple granularities: it is able to recognize na… ▽ More

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: In ACL 2019 as a long paper

  41. arXiv:1906.02239  [pdf, other

    cs.CL cs.LG

    Extracting Symptoms and their Status from Clinical Conversations

    Authors: Nan Du, Kai Chen, Anjuli Kannan, Linh Tran, Yuhui Chen, Izhak Shafran

    Abstract: This paper describes novel models tailored for a new application, that of extracting the symptoms mentioned in clinical conversations along with their status. Lack of any publicly available corpus in this privacy-sensitive domain led us to develop our own corpus, consisting of about 3K conversations annotated by professional medical scribes. We propose two novel deep learning approaches to infer t… ▽ More

    Submitted 5 June, 2019; originally announced June 2019.

    Journal ref: Proceedings of the Annual Meeting of the Association of Computational Linguistics, 2019

  42. arXiv:1905.08878  [pdf

    cs.HC cs.CY cs.RO

    Look Who's Talking Now: Implications of AV's Explanations on Driver's Trust, AV Preference, Anxiety and Mental Workload

    Authors: Na Du, Jacob Haspiel, Qiaoning Zhang, Dawn Tilbury, Anuj K. Pradhan, X. Jessie Yang, Lionel P. Robert Jr

    Abstract: Explanations given by automation are often used to promote automation adoption. However, it remains unclear whether explanations promote acceptance of automated vehicles (AVs). In this study, we conducted a within-subject experiment in a driving simulator with 32 participants, using four different conditions. The four conditions included: (1) no explanation, (2) explanation given before or (3) aft… ▽ More

    Submitted 21 May, 2019; originally announced May 2019.

    Comments: 42 pages, 5 figures, 3 Tables

  43. arXiv:1901.00056  [pdf, other

    cs.CL cs.AI

    Entity Synonym Discovery via Multipiece Bilateral Context Matching

    Authors: Chenwei Zhang, Yaliang Li, Nan Du, Wei Fan, Philip S. Yu

    Abstract: Being able to automatically discover synonymous entities in an open-world setting benefits various tasks such as entity disambiguation or knowledge graph canonicalization. Existing works either only utilize entity features, or rely on structured annotations from a single piece of context where the entity is mentioned. To leverage diverse contexts where entities are mentioned, in this paper, we gen… ▽ More

    Submitted 10 May, 2020; v1 submitted 31 December, 2018; originally announced January 2019.

    Comments: In IJCAI 2020 as a long paper. Code and data are available at https://github.com/czhang99/SynonymNet

  44. arXiv:1812.09471  [pdf, other

    cs.CL

    Joint Slot Filling and Intent Detection via Capsule Neural Networks

    Authors: Chenwei Zhang, Yaliang Li, Nan Du, Wei Fan, Philip S. Yu

    Abstract: Being able to recognize words as slots and detect the intent of an utterance has been a keen issue in natural language understanding. The existing works either treat slot filling and intent detection separately in a pipeline manner, or adopt joint models which sequentially label slots while summarizing the utterance-level intent without explicitly preserving the hierarchical relationship among wor… ▽ More

    Submitted 7 July, 2019; v1 submitted 22 December, 2018; originally announced December 2018.

    Comments: In ACL 2019 as a long paper. Code and data available at https://github.com/czhang99/Capsule-NLU

  45. arXiv:1812.02354  [pdf, other

    cs.CL

    Multi-Task Learning with Multi-View Attention for Answer Selection and Knowledge Base Question Answering

    Authors: Yang Deng, Yuexiang Xie, Yaliang Li, Min Yang, Nan Du, Wei Fan, Kai Lei, Ying Shen

    Abstract: Answer selection and knowledge base question answering (KBQA) are two important tasks of question answering (QA) systems. Existing methods solve these two tasks separately, which requires large number of repetitive work and neglects the rich correlation information between tasks. In this paper, we tackle answer selection and KBQA tasks simultaneously via multi-task learning (MTL), motivated by the… ▽ More

    Submitted 6 December, 2018; originally announced December 2018.

    Comments: Accepted by AAAI 2019

  46. arXiv:1811.11339  [pdf, other

    stat.ML cs.LG

    Statistical Robust Chinese Remainder Theorem for Multiple Numbers: Wrapped Gaussian Mixture Model

    Authors: Nan Du, Zhikang Wang, Hanshen Xiao

    Abstract: Generalized Chinese Remainder Theorem (CRT) has been shown to be a powerful approach to solve the ambiguity resolution problem. However, with its close relationship to number theory, study in this area is mainly from a coding theory perspective under deterministic conditions. Nevertheless, it can be proved that even with the best deterministic condition known, the probability of success in robust… ▽ More

    Submitted 27 November, 2018; originally announced November 2018.

  47. arXiv:1811.05016  [pdf, other

    cs.LG stat.ML

    Learning Temporal Point Processes via Reinforcement Learning

    Authors: Shuang Li, Shuai Xiao, Shixiang Zhu, Nan Du, Yao Xie, Le Song

    Abstract: Social goods, such as healthcare, smart city, and information networks, often produce ordered event data in continuous time. The generative processes of these event data can be very complex, requiring flexible models to capture their dynamics. Temporal point processes offer an elegant framework for modeling event data without discretizing the time. However, the existing maximum-likelihood-estimati… ▽ More

    Submitted 26 December, 2020; v1 submitted 12 November, 2018; originally announced November 2018.

    Comments: Add code link

  48. arXiv:1811.00681  [pdf, other

    cs.AI cs.CL

    On the Generation of Medical Question-Answer Pairs

    Authors: Sheng Shen, Yaliang Li, Nan Du, Xian Wu, Yusheng Xie, Shen Ge, Tao Yang, Kai Wang, Xingzheng Liang, Wei Fan

    Abstract: Question answering (QA) has achieved promising progress recently. However, answering a question in real-world scenarios like the medical domain is still challenging, due to the requirement of external knowledge and the insufficient quantity of high-quality training data. In the light of these challenges, we study the task of generating medical QA pairs in this paper. With the insight that each med… ▽ More

    Submitted 6 December, 2019; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: AAAI 2020

  49. arXiv:1810.05983  [pdf, other

    cs.AI cs.CL

    Finding Similar Medical Questions from Question Answering Websites

    Authors: Yaliang Li, Liuyi Yao, Nan Du, Jing Gao, Qi Li, Chuishi Meng, Chenwei Zhang, Wei Fan

    Abstract: The past few years have witnessed the flourishing of crowdsourced medical question answering (Q&A) websites. Patients who have medical information demands tend to post questions about their health conditions on these crowdsourced Q&A websites and get answers from other users. However, we observe that a large portion of new medical questions cannot be answered in time or receive only few answers fr… ▽ More

    Submitted 14 October, 2018; originally announced October 2018.

  50. arXiv:1809.10404  [pdf, other

    cs.DB

    MedTruth: A Semi-supervised Approach to Discovering Knowledge Condition Information from Multi-Source Medical Data

    Authors: Yang Deng, Yaliang Li, Ying Shen, Nan Du, Wei Fan, Min Yang, Kai Lei

    Abstract: Knowledge Graph (KG) contains entities and the relations between entities. Due to its representation ability, KG has been successfully applied to support many medical/healthcare tasks. However, in the medical domain, knowledge holds under certain conditions. For example, symptom \emph{runny nose} highly indicates the existence of disease \emph{whooping cough} when the patient is a baby rather than… ▽ More

    Submitted 18 August, 2019; v1 submitted 27 September, 2018; originally announced September 2018.

    Comments: Accepted as CIKM2019 long paper