Zum Hauptinhalt springen

Showing 1–40 of 40 results for author: Lai, G

.
  1. arXiv:2405.20563  [pdf, ps, other

    math.DS

    Limit sets, internal chain transitivity and orbital shadowing of tree-shifts defined on Markov-Cayley trees

    Authors: Jung-Chao Ban, Nai-Zhu Huang, Guan-Yu Lai

    Abstract: In this paper, we introduce the concepts of $ω$-limit sets and pseudo orbits for a tree-shift defined on a Markov-Cayley tree, extending the results of tree-shifts defined on $d$-trees [5,6]. Firstly, we establish the relationships between $ω$-limit sets and we introduce a modified definition of $ω$-limit set based on complete prefix sets (Theorems 1.4 and 1.9). Secondly, we introduce the concept… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  2. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1110 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 8 August, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  3. arXiv:2402.19324  [pdf, other

    math.DS math.CO math.NT

    Entropy of axial product of multiplicative subshifts

    Authors: Jung-Chao Ban, Wen-Guei Hu, Guan-Yu Lai, Lingmin Liao

    Abstract: We obtain the entropy and the surface entropy of the axial products on $\mathbb{N}^d$ and the $d$-tree $T^d$ of two types of systems: the subshift and the multiplicative subshift.

    Submitted 29 February, 2024; originally announced February 2024.

  4. arXiv:2402.18822  [pdf, ps, other

    math.DS

    Hausdorff dimensions of affine multiplicative subshifts

    Authors: Jung-Chao Ban, Wen-Guei Hu, Guan-Yu Lai, Lingmin Liao

    Abstract: We calculate the Minkowski and Hausdorff dimensions of affine multiplicative subshifts on $\mathbb{N}$.

    Submitted 28 February, 2024; originally announced February 2024.

  5. arXiv:2401.09695  [pdf

    cs.HC cs.AI

    Should ChatGPT Write Your Breakup Text? Exploring the Role of AI in Relationship Dissolution

    Authors: Yue Fu, Yixin Chen, Zelia Gomes Da Costa Lai, Alexis Hiniker

    Abstract: Relationships are essential to our happiness and wellbeing. The dissolution of a relationship, the final stage of relationship's lifecycle and one of the most stressful events in an individual's life, can have profound and long-lasting impacts on people. With the breakup process increasingly facilitated by computer-mediated communication (CMC), and the likely future influence of AI-mediated commun… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  6. arXiv:2401.05320  [pdf, other

    math.PR

    Hausdorff dimensions of topologically transitive Markov hom tree-shifts

    Authors: Jung-Chao Ban, Guan-Yu Lai, Yu-Liang Wu

    Abstract: This paper features an analog of Sanov's theorem for finite-state Markov chains indexed by rooted d-trees, obtained via the method of types in the classical analysis of large deviations. Along with the theorem comes two applications: an almost-sure type convergence of sample means and a formula for the Hausdorff dimension of the symbolic space associated with the irreducible Markov chain.

    Submitted 10 January, 2024; originally announced January 2024.

    MSC Class: 28A80; 60J10 (Primary) 37B10 (Secondary)

  7. arXiv:2311.10614  [pdf, other

    cs.CL cs.AI

    A Self-enhancement Approach for Domain-specific Chatbot Training via Knowledge Mining and Digest

    Authors: Ruohong Zhang, Luyu Gao, Chen Zheng, Zhen Fan, Guokun Lai, Zheng Zhang, Fangzhou Ai, Yiming Yang, Hongxia Yang

    Abstract: Large Language Models (LLMs), despite their great power in language generation, often encounter challenges when dealing with intricate and knowledge-demanding queries in specific domains. This paper introduces a novel approach to enhance LLMs by effectively extracting the relevant knowledge from domain-specific textual sources, and the adaptive training of a chatbot with domain-specific inquiries.… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: Work in progress

  8. arXiv:2309.00309  [pdf, other

    math.DS

    The strip entropy approximation of Markov shifts on trees

    Authors: Jung-Chao Ban, Guan-Yu Lai, Cheng-Yu Tsai

    Abstract: The strip entropy is studied in this article. We prove that the strip entropy approximation is valid for every ray of a golden-mean tree. This result extends the previous result of [Petersen-Salama, Discrete \& Continuous Dynamical Systems, 2020] on the conventional 2-tree. Lastly, we prove that the strip entropy approximation is valid for eventually periodic rays of a class of Markov-Cayley trees… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  9. arXiv:2303.13011  [pdf, other

    math.DS

    The entropy structures of axial products on $\mathbb{N}^d$ and Trees

    Authors: Jung-Chao Ban, Wen-Guei Hu, Guan-Yu Lai

    Abstract: In this paper, we first concentrate on the possible values and dense property of entropies for isotropic and anisotropic axial products of subshifts of finite type (SFTs) on $\mathbb{N}^d$ and $d$-tree $\mathcal{T}_d$. We prove that the entropies of isotropic and anisotropic axial products of SFTs on $\mathbb{N}^d$ are dense in $[0,\infty)$, and the same result also holds for anisotropic axial pro… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

  10. arXiv:2303.11550  [pdf, ps, other

    nlin.SI

    On the discrete modified KP hierarchy: tau functions, Fay identity and squared eigenfunction symmetries

    Authors: Kelei Tian, Guangmiao Lai, Ge Yi, Ying Xu

    Abstract: In this paper, we prove the existence of tau functions of the discrete modified KP hierarchy and define the squared eigenfunction symmetry. Meanwhile, the Fay identity with its difference form, the squared eigenfunction potentials and the symmetry flow acting on tau functions are obtained.

    Submitted 20 March, 2023; originally announced March 2023.

  11. Boundary complexity and surface entropy of 2-multiplicative integer systems on $\mathbb{N}^d$

    Authors: Jung-Chao Ban, Wen-Guei Hu, Guan-Yu Lai

    Abstract: In this article, we introduce the concept of the boundary complexity and prove that for a 2-multiplicative integer system (2-MIS) $X^{p}_Ω$ on $\mathbb{N}$ (or $X^{\bf p}_Ω$ on $\mathbb{N}^d,d\geq 2$), every point in $[h(X^p_Ω), \log r]$ can be realized as a boundary complexity of a 2-MIS with a specific speed, where r stands for the number of the alphabets. The result is new and quite different f… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

  12. Uniformly convex neural networks and non-stationary iterated network Tikhonov (iNETT) method

    Authors: Davide Bianchi, Guanghao Lai, Wenbin Li

    Abstract: We propose a non-stationary iterated network Tikhonov (iNETT) method for the solution of ill-posed inverse problems. The iNETT employs deep neural networks to build a data-driven regularizer, and it avoids the difficult task of estimating the optimal regularization parameter. To achieve the theoretical convergence of iNETT, we introduce uniformly convex neural networks to build the data-driven reg… ▽ More

    Submitted 1 February, 2023; v1 submitted 7 October, 2022; originally announced October 2022.

    MSC Class: 47A52; 65F22; 68T07

  13. arXiv:2208.04089  [pdf

    cond-mat.mtrl-sci

    The mechanism of Li deposition on the Cu substrates in the anode-free Li metal batteries

    Authors: Genming Lai, Junyu Jiao, Chi Fang, Liyuan Sheng, Yao Jiang, Chuying Ouyang, Jiaxin Zheng

    Abstract: Due to the rapid growth in the demand for high-energy-density Li batteries and insufficient global Li reserves, the anode-free Li metal batteries are receiving increasing attention. Various strategies, such as surface modification and structural design of Cu current collectors, have been proposed to stabilize the anode-free Li metal batteries. Unfortunately, the mechanism of Li deposition on the C… ▽ More

    Submitted 8 August, 2022; originally announced August 2022.

  14. arXiv:2207.11381  [pdf, other

    math.DS

    On spatial entropy and periodic entropies of Two-dimensional Shifts of Finite Type

    Authors: Wen-Guei Hu, Guan-Yu Lai, Song-Sun Lin

    Abstract: Topological entropy or spatial entropy is a way to measure the complexity of shift spaces. This study investigates the relationships between the spatial entropy and the various periodic entropies which are computed by skew-coordinated systems $γ\in GL_2(\mathbb{Z})$ on two dimensional shifts of finite type.

    Submitted 22 July, 2022; originally announced July 2022.

  15. arXiv:2207.06366  [pdf, other

    cs.CL cs.LG

    N-Grammer: Augmenting Transformers with latent n-grams

    Authors: Aurko Roy, Rohan Anil, Guangda Lai, Benjamin Lee, Jeffrey Zhao, Shuyuan Zhang, Shibo Wang, Ye Zhang, Shen Wu, Rigel Swavely, Tao, Yu, Phuong Dao, Christopher Fifty, Zhifeng Chen, Yonghui Wu

    Abstract: Transformer models have recently emerged as one of the foundational models in natural language processing, and as a byproduct, there is significant recent interest and investment in scaling these models. However, the training and inference costs of these large Transformer language models are prohibitive, thus necessitating more research in identifying more efficient variants. In this work, we prop… ▽ More

    Submitted 13 July, 2022; originally announced July 2022.

    Comments: 8 pages, 2 figures

  16. arXiv:2204.07705  [pdf, other

    cs.CL cs.AI

    Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks

    Authors: Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi, Yeganeh Kordi, Amirreza Mirzaei, Anjana Arunkumar, Arjun Ashok, Arut Selvan Dhanasekaran, Atharva Naik, David Stap, Eshaan Pathak, Giannis Karamanolakis, Haizhi Gary Lai, Ishan Purohit, Ishani Mondal, Jacob Anderson, Kirby Kuznia, Krima Doshi, Maitreya Patel, Kuntal Kumar Pal, Mehrad Moradshahi, Mihir Parmar, Mirali Purohit, Neeraj Varshney, Phani Rohitha Kaza , et al. (15 additional authors not shown)

    Abstract: How well can NLP models generalize to a variety of unseen tasks when provided with task instructions? To address this question, we first introduce Super-NaturalInstructions, a benchmark of 1,616 diverse NLP tasks and their expert-written instructions. Our collection covers 76 distinct task types, including but not limited to classification, extraction, infilling, sequence tagging, text rewriting,… ▽ More

    Submitted 24 October, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

    Comments: Accepted to EMNLP 2022, 25 pages

  17. arXiv:2204.02604  [pdf, other

    cs.NE

    Interactive Evolutionary Multi-Objective Optimization via Learning-to-Rank

    Authors: Ke Li, Guiyu Lai, Xin Yao

    Abstract: In practical multi-criterion decision-making, it is cumbersome if a decision maker (DM) is asked to choose among a set of trade-off alternatives covering the whole Pareto-optimal front. This is a paradox in conventional evolutionary multi-objective optimization (EMO) that always aim to achieve a well balance between convergence and diversity. In essence, the ultimate goal of multi-objective optimi… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

  18. arXiv:2203.08970  [pdf, other

    math.PR math.DS

    Thermodynamic formalism and large deviation principle of multiplicative Ising models

    Authors: Jung-Chao Ban, Wen-Guei Hu, Guan-Yu Lai

    Abstract: The aim of this study is tree-fold. First, we investigate the thermodynamics of the Ising models with respect to 2-multiple Hamiltonians. This extends the previous results of [Chazotte and Redig, Electron. J. Probably., 2014] to $\mathbb{N}^d$. Second, we establish the large deviation principle (LDP) of the average $\frac{1}{N} S_N^G$, where $S_N^G$ is a 2-multiple sum along a semigroup generated… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

  19. Two Decades of Game Jams

    Authors: Gorm Lai, Annakaisa Kultima, Foaad Khosmood, Johanna Pirker, Allan Fowler, Ilaria Vecchi, William Latham, Frederic Fol Leymarie

    Abstract: In less than a year's time, March 2022 will mark the twentieth anniversary of the first documented game jam, the Indie Game Jam, which took place in Oakland, California in 2002. Initially, game jams were widely seen as frivolous activities. Since then, they have taken the world by storm. Game jams have not only become part of the day-to-day process of many game developers, but jams are also used f… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Journal ref: ICGJ 2021: Sixth Annual International Conference on Game Jams, Hackathons, and Game Creation Events

  20. arXiv:2108.12986  [pdf, other

    math.DS

    Characterization and Topological Behavior of Homomorphism Tree-Shifts

    Authors: Jung-Chao Ban, Chih-Hung Chang, Wen-Guei Hu, Guan-Yu Lai, Yu-Liang Wu

    Abstract: The purpose of this article is twofold. On one hand, we reveal the equivalence of shift of finite type between a one-sided shift $X$ and its associated hom tree-shift $\mathcal{T}_{X}$, as well as the equivalence in the sofic shift. On the other hand, we investigate the interrelationship among the comparable mixing properties on tree-shifts as those on multidimensional shift spaces. They include i… ▽ More

    Submitted 30 August, 2021; originally announced August 2021.

    MSC Class: 37B10; 37E25

  21. arXiv:2106.10979  [pdf

    cond-mat.mtrl-sci physics.comp-ph

    Self-healing mechanism of lithium in lithium metal batteries

    Authors: Junyu Jiao, Genming Lai, Liang Zhao, Jiaze Lu, Qidong Li, Xianqi Xu, Yao Jiang, Yan-Bing He, Chuying Ouyang, Feng Pan, Hong Li, Jiaxin Zheng

    Abstract: Li metal is an ideal anode material for use in state-of-the-art secondary batteries. However, Li-dendrite growth is a safety concern and results in low coulombic efficiency, which significantly restricts the commercial application of Li secondary batteries. Unfortunately, the Li deposition (growth) mechanism is poorly understood on the atomic scale. Here, we used machine learning to construct a Li… ▽ More

    Submitted 27 September, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

  22. arXiv:2106.09860  [pdf, other

    math.PR math.DS

    Large Deviation Principle of Multidimensional Multiple Averages on $\mathbb{N}^d$

    Authors: Jung-Chao Ban, Wen-Guei Hu, Guan-Yu Lai

    Abstract: This paper establishs the large deviation principle (LDP) for multiple averages on $\mathbb{N}^d$. We extend the previous work of [Carinci et al., Indag. Math. 2012] to multidimensional lattice $\mathbb{N}^d$ for $d\geq 2$. The same technique is also applicable to the weighted multiple average launched by Fan [Fan, Adv. Math. 2021]. Finally, the boundary conditions are imposed to the multiple sum… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

  23. arXiv:2012.11291  [pdf

    stat.ME

    How to estimate the association between change in a risk factor and a health outcome?

    Authors: Michail Katsoulis, Alvina G Lai, Dimitra-Kleio Kipourou, Reecha Sofat, Manuel Gomes, Amitava Banerjee, Spiros Denaxas, Thomas R Lumbers, Kostas Tsilidis, Harry Hemingway, Karla Diaz-Ordaz

    Abstract: Estimating the effect of a change in a particular risk factor and a chronic disease requires information on the risk factor from two time points; the enrolment and the first follow-up. When using observational data to study the effect of such an exposure (change in risk factor) extra complications arise, namely (i) when is time zero? and (ii) which information on confounders should we account for… ▽ More

    Submitted 21 December, 2020; originally announced December 2020.

    Comments: 13 pages, 2 Tables, 3 Figures

    MSC Class: 62-07 (in MSC2010) or 62R07 (in MSC2020)

  24. arXiv:2009.08595  [pdf, ps, other

    cs.CL

    Unsupervised Parallel Corpus Mining on Web Data

    Authors: Guokun Lai, Zihang Dai, Yiming Yang

    Abstract: With a large amount of parallel data, neural machine translation systems are able to deliver human-level performance for sentence-level translation. However, it is costly to label a large amount of parallel data by humans. In contrast, there is a large-scale of parallel corpus created by humans on the Internet. The major difficulty to utilize them is how to filter them out from the noise website e… ▽ More

    Submitted 17 September, 2020; originally announced September 2020.

  25. arXiv:2006.03236  [pdf, other

    cs.LG cs.CL stat.ML

    Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing

    Authors: Zihang Dai, Guokun Lai, Yiming Yang, Quoc V. Le

    Abstract: With the success of language pretraining, it is highly desirable to develop more efficient architectures of good scalability that can exploit the abundant unlabeled data at a lower cost. To improve the efficiency, we examine the much-overlooked redundancy in maintaining a full-length token-level presentation, especially for tasks that only require a single-vector presentation of the sequence. With… ▽ More

    Submitted 5 June, 2020; originally announced June 2020.

  26. arXiv:2005.09324  [pdf, other

    cs.HC

    Towards Friendly Mixed Initiative Procedural Content Generation: Three Pillars of Industry

    Authors: Gorm Lai, William Latham, Frederic Fol Leymarie

    Abstract: While the games industry is moving towards procedural content generation (PCG) with tools available under popular platforms such as Unreal, Unity or Houdini, and video game titles like No Man's Sky and Horizon Zero Dawn taking advantage of PCG, the gap between academia and industry is as wide as it has ever been, in terms of communication and sharing methods. One of the authors, has worked on both… ▽ More

    Submitted 19 May, 2020; originally announced May 2020.

  27. arXiv:2004.11934  [pdf, other

    cs.LG stat.ML

    Correlation-aware Unsupervised Change-point Detection via Graph Neural Networks

    Authors: Ruohong Zhang, Yu Hao, Donghan Yu, Wei-Cheng Chang, Guokun Lai, Yiming Yang

    Abstract: Change-point detection (CPD) aims to detect abrupt changes over time series data. Intuitively, effective CPD over multivariate time series should require explicit modeling of the dependencies across input variables. However, existing CPD methods either ignore the dependency structures entirely or rely on the (unrealistic) assumption that the correlation structures are static over time. In this pap… ▽ More

    Submitted 13 September, 2020; v1 submitted 24 April, 2020; originally announced April 2020.

    Comments: Accepted for publication in the International Conference on Neural Information Processing (ICONIP) 2020 Original paper is 12 pages, additional appendix is available on arxiv

    MSC Class: I.2.6

    Journal ref: ICONIP 2020: Neural Information Processing

  28. arXiv:2004.01170  [pdf, other

    cs.CV

    DOPS: Learning to Detect 3D Objects and Predict their 3D Shapes

    Authors: Mahyar Najibi, Guangda Lai, Abhijit Kundu, Zhichao Lu, Vivek Rathod, Thomas Funkhouser, Caroline Pantofaru, David Ross, Larry S. Davis, Alireza Fathi

    Abstract: We propose DOPS, a fast single-stage 3D object detection method for LIDAR data. Previous methods often make domain-specific design decisions, for example projecting points into a bird-eye view image in autonomous driving scenarios. In contrast, we propose a general-purpose method that works on both indoor and outdoor scenes. The core novelty of our method is a fast, single-pass architecture that b… ▽ More

    Submitted 6 April, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

    Comments: To appear in CVPR 2020

  29. arXiv:2003.10256  [pdf, ps, other

    math.AP

    Self-similar solutions of the spherically symmetric Euler equations for general equations of state

    Authors: Jianjun Chen, Geng Lai

    Abstract: The study of spherically symmetric motion is important for the theory of explosion waves. In this paper, we construct rigorously self-similar solutions to the Riemann problem of the spherically symmetric Euler equations for general equations of state. We used the assumption of self-similarity to reduce the spherically symmetric Euler equations to a system of nonlinear ordinary differential equatio… ▽ More

    Submitted 23 March, 2020; originally announced March 2020.

  30. arXiv:1911.10000  [pdf, other

    math.DS

    Topologically Mixing Properties of Multiplicative Integer System

    Authors: Jung-Chao Ban, Chih-Hung Chang, Wen-Guei Hu, Guan-Yu Lai, Yu-Liang Wu

    Abstract: Motivated from the study of multiple ergodic average, the investigation of multiplicative shift spaces has drawn much of interest among researchers. This paper focuses on the relation of topologically mixing properties between multiplicative shift spaces and traditional shift spaces. Suppose that $\mathsf{X}_Ω^{(l)}$ is the multiplicative subshift derived from the shift space $Ω$ with given… ▽ More

    Submitted 22 November, 2019; originally announced November 2019.

    Comments: 14 pages, 6 figures

    MSC Class: 37B10

  31. arXiv:1909.07009  [pdf, other

    cs.CL

    Bridging the domain gap in cross-lingual document classification

    Authors: Guokun Lai, Barlas Oguz, Yiming Yang, Veselin Stoyanov

    Abstract: The scarcity of labeled training data often prohibits the internationalization of NLP models to multiple languages. Recent developments in cross-lingual understanding (XLU) has made progress in this area, trying to bridge the language barrier using language universal representations. However, even if the language problem was resolved, models trained in one language would not transfer to another la… ▽ More

    Submitted 20 September, 2019; v1 submitted 16 September, 2019; originally announced September 2019.

  32. Introducing: The Game Jam License

    Authors: Gorm Lai, Kai Erenli, Foaad Khosmood, William Latham

    Abstract: Since their inception at the Indie Game Jam in 2002, a significant part of game jams has been knowledge sharing and showcasing ideas and work to peers. While various licensing mechanisms have been used for game jams throughout the years, there has never been a licence uniquely designed for artifacts created during a game jam. In this paper, we present to the community the Game Jam License (GJL) wh… ▽ More

    Submitted 29 August, 2019; originally announced August 2019.

  33. arXiv:1907.05221  [pdf, ps, other

    math.AP

    Global non-isentropic rotational supersonic flows in a semi-infinite divergent duct

    Authors: Geng Lai

    Abstract: Supersonic flows for the two-dimensional (2D) steady full Euler system are studied. We construct a global non-isentropic rotational supersonic flow in a semi-infinite divergent duct. The flow satisfies the slip condition on the walls of the duct, and the state of the flow is given at the inlet of the duct. The solution is constructed by the method of characteristics. The main difficulty for the gl… ▽ More

    Submitted 23 March, 2020; v1 submitted 11 July, 2019; originally announced July 2019.

  34. arXiv:1902.01388  [pdf, ps, other

    cs.LG stat.ML

    Re-examination of the Role of Latent Variables in Sequence Modeling

    Authors: Zihang Dai, Guokun Lai, Yiming Yang, Shinjae Yoo

    Abstract: With latent variables, stochastic recurrent models have achieved state-of-the-art performance in modeling sound-wave sequence. However, opposite results are also observed in other domains, where standard recurrent networks often outperform stochastic models. To better understand this discrepancy, we re-examine the roles of latent variables in stochastic recurrent models for speech density estimati… ▽ More

    Submitted 16 September, 2019; v1 submitted 4 February, 2019; originally announced February 2019.

    Comments: Code available at https://github.com/zihangdai/reexamine-srnn, accepted by NeurIPS 2019

  35. arXiv:1806.06116  [pdf, other

    cs.LG stat.ML

    Stochastic WaveNet: A Generative Latent Variable Model for Sequential Data

    Authors: Guokun Lai, Bohan Li, Guoqing Zheng, Yiming Yang

    Abstract: How to model distribution of sequential data, including but not limited to speech and human motions, is an important ongoing research problem. It has been demonstrated that model capacity can be significantly enhanced by introducing stochastic latent variables in the hidden states of recurrent neural networks. Simultaneously, WaveNet, equipped with dilated convolutions, achieves astonishing empiri… ▽ More

    Submitted 15 June, 2018; originally announced June 2018.

    Comments: ICML 2018 Workshop

  36. arXiv:1711.03225  [pdf, other

    cs.CL cs.AI

    Large-scale Cloze Test Dataset Created by Teachers

    Authors: Qizhe Xie, Guokun Lai, Zihang Dai, Eduard Hovy

    Abstract: Cloze tests are widely adopted in language exams to evaluate students' language proficiency. In this paper, we propose the first large-scale human-created cloze test dataset CLOTH, containing questions used in middle-school and high-school language exams. With missing blanks carefully created by teachers and candidate choices purposely designed to be nuanced, CLOTH requires a deeper language under… ▽ More

    Submitted 27 August, 2018; v1 submitted 8 November, 2017; originally announced November 2017.

    Comments: EMNLP 2018

  37. arXiv:1710.11577  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Depthwise Separable Graph Convolution from Data Manifold

    Authors: Guokun Lai, Hanxiao Liu, Yiming Yang

    Abstract: Convolution Neural Network (CNN) has gained tremendous success in computer vision tasks with its outstanding ability to capture the local latent features. Recently, there has been an increasing interest in extending convolution operations to the non-Euclidean geometry. Although various types of convolution operations have been proposed for graphs or manifolds, their connections with traditional co… ▽ More

    Submitted 8 November, 2018; v1 submitted 31 October, 2017; originally announced October 2017.

  38. arXiv:1704.04683  [pdf, other

    cs.CL cs.AI cs.LG

    RACE: Large-scale ReAding Comprehension Dataset From Examinations

    Authors: Guokun Lai, Qizhe Xie, Hanxiao Liu, Yiming Yang, Eduard Hovy

    Abstract: We present RACE, a new dataset for benchmark evaluation of methods in the reading comprehension task. Collected from the English exams for middle and high school Chinese students in the age range between 12 to 18, RACE consists of near 28,000 passages and near 100,000 questions generated by human experts (English instructors), and covers a variety of topics which are carefully designed for evaluat… ▽ More

    Submitted 5 December, 2017; v1 submitted 15 April, 2017; originally announced April 2017.

    Comments: EMNLP 2017

  39. arXiv:1703.07015  [pdf, other

    cs.LG

    Modeling Long- and Short-Term Temporal Patterns with Deep Neural Networks

    Authors: Guokun Lai, Wei-Cheng Chang, Yiming Yang, Hanxiao Liu

    Abstract: Multivariate time series forecasting is an important machine learning problem across many domains, including predictions of solar plant energy output, electricity consumption, and traffic jam situation. Temporal data arise in these real-world applications often involves a mixture of long-term and short-term patterns, for which traditional approaches such as Autoregressive models and Gaussian Proce… ▽ More

    Submitted 18 April, 2018; v1 submitted 20 March, 2017; originally announced March 2017.

    Comments: Accepted by SIGIR 2018

  40. arXiv:0906.1276  [pdf

    cond-mat.other

    Ultrafast Imaging and the "Phase Problem" for Inelastic X-Ray Scattering

    Authors: P. Abbamonte, G. C. L. Wong, D. Cahill, J. P. Reed, R. H. Coridan, N. W. Schmidt, G. H. Lai, Y. I. Joe, D. Casa

    Abstract: We describe a new method for imaging ultrafast dynamics in condensed matter using inelastic x-ray scattering (IXS). We use the concepts of causality and irreversibility to construct a general solution to the inverse scattering problem (or "phase problem") for inelastic x-ray scattering, which enables direct imaging of dynamics of the electron density with resolutions of ~1 attosecond (10-18 sec)… ▽ More

    Submitted 12 June, 2009; v1 submitted 6 June, 2009; originally announced June 2009.

    Comments: General-interest paper; 19 pages, 3 figures; submission to Advanced Materials