Showing 1–50 of 210 results for author: Lam, S

  1. arXiv:2408.15232  [pdf, other]

    cs.CL cs.AI cs.IR

    Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations

    Authors: Yucheng Jiang, Yijia Shao, Dekun Ma, Sina J. Semnani, Monica S. Lam

    Abstract: While language model (LM)-powered chatbots and generative search engines excel at answering concrete queries, discovering information in the terrain of unknown unknowns remains challenging for users. To emulate the common educational scenario where children/students learn by listening to and participating in conversations of their parents/teachers, we create Collaborative STORM (Co-STORM). Unlike…

    Submitted 27 August, 2024; originally announced August 2024.

    ACM Class: I.2.7; H.5.2; H.3.3

  2. arXiv:2408.09846  [pdf, other]

    cs.CL

    Continual Dialogue State Tracking via Reason-of-Select Distillation

    Authors: Yujie Feng, Bo Liu, Xiaoyu Dong, Zexin Lu, Li-Ming Zhan, Xiao-Ming Wu, Albert Y. S. Lam

    Abstract: An ideal dialogue system requires continuous skill acquisition and adaptation to new tasks while retaining prior knowledge. Dialogue State Tracking (DST), vital in these systems, often involves learning new services and confronting catastrophic forgetting, along with a critical capability loss termed the "Value Selection Quandary." To address these challenges, we introduce the Reason-of-Select (Ro…

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: Accepted to ACL 2024 Findings

  3. arXiv:2408.08389  [pdf, other]

    physics.atm-clus physics.chem-ph quant-ph

    Differentiating Three-Dimensional Molecular Structures using Laser-induced Coulomb Explosion Imaging

    Authors: Huynh Van Sa Lam, Anbu Selvam Venkatachalam, Surjendu Bhattacharyya, Keyu Chen, Kurtis Borne, Enliang Wang, Rebecca Boll, Till Jahnke, Vinod Kumarappan, Artem Rudenko, Daniel Rolles

    Abstract: Coulomb explosion imaging (CEI) with x-ray free electron lasers has recently been shown to be a powerful method for obtaining detailed structural information of gas-phase planar ring molecules [R. Boll et al. Nat. Phys. 18, 423-428 (2022)]. In this Letter, we investigate the potential of CEI driven by a tabletop laser and extend this approach to differentiating three-dimensional (3D) structures. W…

    Submitted 15 August, 2024; originally announced August 2024.

    Journal ref: Phys. Rev. Lett. 132, 123201 (2024)

  4. arXiv:2408.07958  [pdf, other]

    physics.chem-ph physics.atm-clus physics.optics quant-ph

    Imaging coupled vibrational, rotational, and electronic wave packet dynamics in a triatomic molecule

    Authors: Huynh Van Sa Lam, Van-Hung Hoang, Anbu Selvam Venkatachalam, Surjendu Bhattacharyya, Keyu Chen, Sina Jacob, Sanduni Kudagama, Tu Thanh Nguyen, Daniel Rolles, Uwe Thumm, Artem Rudenko, Vinod Kumarappan

    Abstract: Molecular dynamics triggered by interaction with light often involve the excitation of several electronic, vibrational, and rotational states. Characterizing the resulting coupled electronic and nuclear wave packet motion represents a severe challenge, even for small polyatomic systems. In this Letter, we demonstrate how the interplay between vibrational, rotational, and electronic degrees of free…

    Submitted 15 August, 2024; originally announced August 2024.

  5. arXiv:2407.13519  [pdf, other]

    cs.CV

    GPSFormer: A Global Perception and Local Structure Fitting-based Transformer for Point Cloud Understanding

    Authors: Changshuo Wang, Meiqing Wu, Siew-Kei Lam, Xin Ning, Shangshu Yu, Ruiping Wang, Weijun Li, Thambipillai Srikanthan

    Abstract: Despite the significant advancements in pre-training methods for point cloud understanding, directly capturing intricate shape information from irregular point clouds without reliance on external data remains a formidable challenge. To address this problem, we propose GPSFormer, an innovative Global Perception and Local Structure Fitting-based Transformer, which learns detailed shape information f…

    Submitted 24 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV 2024

  6. arXiv:2407.11417  [pdf, other]

    cs.CL

    SPINACH: SPARQL-Based Information Navigation for Challenging Real-World Questions

    Authors: Shicheng Liu, Sina J. Semnani, Harold Triedman, Jialiang Xu, Isaac Dan Zhao, Monica S. Lam

    Abstract: Recent work integrating Large Language Models (LLMs) has led to significant improvements in the Knowledge Base Question Answering (KBQA) task. However, we posit that existing KBQA datasets that either have simple questions, use synthetically generated logical forms, or are based on small knowledge base (KB) schemas, do not capture the true complexity of KBQA tasks. To address this, we introduce…

    Submitted 16 July, 2024; originally announced July 2024.

  7. arXiv:2407.09943  [pdf, other]

    cs.CL

    Minimizing PLM-Based Few-Shot Intent Detectors

    Authors: Haode Zhang, Xiao-Ming Wu, Albert Y. S. Lam

    Abstract: Recent research has demonstrated the feasibility of training efficient intent detectors based on pre-trained language model (PLM) with limited labeled data. However, deploying these detectors in resource-constrained environments such as mobile devices poses challenges due to their large sizes. In this work, we aim to address this issue by exploring techniques to minimize the size of PLM-based inte…

    Submitted 13 July, 2024; originally announced July 2024.

  8. arXiv:2407.05674  [pdf, other]

    cs.AI cs.CL cs.PL

    LLM-Based Open-Domain Integrated Task and Knowledge Assistants with Programmable Policies

    Authors: Harshit Joshi, Shicheng Liu, James Chen, Robert Weigle, Monica S. Lam

    Abstract: Programming LLM-based knowledge and task assistants that faithfully conform to developer-provided policies is challenging. These agents must retrieve and provide consistent, accurate, and relevant information to address user's queries and needs. Yet such agents generate unfounded responses ("hallucinate"). Traditional dialogue trees can only handle a limited number of conversation flows, making th…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: preprint

  9. arXiv:2407.03585  [pdf, other]

    cs.CL

    Zero-shot Persuasive Chatbots with LLM-Generated Strategies and Information Retrieval

    Authors: Kazuaki Furumai, Roberto Legaspi, Julio Vizcarra, Yudai Yamazaki, Yasutaka Nishimura, Sina J. Semnani, Kazushi Ikeda, Weiyan Shi, Monica S. Lam

    Abstract: Persuasion plays a pivotal role in a wide range of applications from health intervention to the promotion of social good. Persuasive chatbots can accelerate the positive effects of persuasion in such applications. Existing methods rely on fine-tuning persuasive chatbots with task-specific training data which is costly, if not infeasible, to collect. To address this issue, we propose a method to le…

    Submitted 3 July, 2024; originally announced July 2024.

  10. arXiv:2406.00562  [pdf, other]

    cs.CL

    SPAGHETTI: Open-Domain Question Answering from Heterogeneous Data Sources with Retrieval and Semantic Parsing

    Authors: Heidi C. Zhang, Sina J. Semnani, Farhad Ghassemi, Jialiang Xu, Shicheng Liu, Monica S. Lam

    Abstract: We introduce SPAGHETTI: Semantic Parsing Augmented Generation for Hybrid English information from Text Tables and Infoboxes, a hybrid question-answering (QA) pipeline that utilizes information from heterogeneous knowledge sources, including knowledge base, text, tables, and infoboxes. Our LLM-augmented approach achieves state-of-the-art performance on the Compmix dataset, the most comprehensive he…

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: ACL Findings 2024

  11. arXiv:2405.20585  [pdf, other]

    cs.CL cs.AI

    GAMedX: Generative AI-based Medical Entity Data Extractor Using Large Language Models

    Authors: Mohammed-Khalil Ghali, Abdelrahman Farrag, Hajar Sakai, Hicham El Baz, Yu Jin, Sarah Lam

    Abstract: In the rapidly evolving field of healthcare and beyond, the integration of generative AI in Electronic Health Records (EHRs) represents a pivotal advancement, addressing a critical gap in current information extraction techniques. This paper introduces GAMedX, a Named Entity Recognition (NER) approach utilizing Large Language Models (LLMs) to efficiently extract entities from medical narratives an…

    Submitted 30 May, 2024; originally announced May 2024.

  12. arXiv:2405.17840  [pdf, other]

    cs.CL

    Benchmarks Underestimate the Readiness of Multi-lingual Dialogue Agents

    Authors: Andrew H. Lee, Sina J. Semnani, Galo Castillo-López, Gaël de Chalendar, Monojit Choudhury, Ashna Dua, Kapil Rajesh Kavitha, Sungkyun Kim, Prashant Kodali, Ponnurangam Kumaraguru, Alexis Lombard, Mehrad Moradshahi, Gihyun Park, Nasredine Semmar, Jiwon Seo, Tianhao Shen, Manish Shrivastava, Deyi Xiong, Monica S. Lam

    Abstract: Creating multilingual task-oriented dialogue (TOD) agents is challenging due to the high cost of training data acquisition. Following the research trend of improving training data efficiency, we show for the first time, that in-context learning is sufficient to tackle multilingual TOD. To handle the challenging dialogue state tracking (DST) subtask, we break it down to simpler steps that are mor…

    Submitted 16 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  13. arXiv:2405.15367  [pdf]

    physics.chem-ph physics.atom-ph

    X-ray Coulomb explosion imaging reveals role of molecular structure in internal conversion

    Authors: Till Jahnke, Sebastian Mai, Surjendu Bhattacharyya, Keyu Chen, Rebecca Boll, Maria Elena Castellani, Simon Dold, Avijit Duley, Ulrike Frühling, Alice E. Green, Markus Ilchen, Rebecca Ingle, Gregor Kastirke, Huynh Van Sa Lam, Fabiano Lever, Dennis Mayer, Tommaso Mazza, Terence Mullins, Yevheniy Ovcharenko, Björn Senfftleben, Florian Trinter, Atia Tul Noor, Sergey Usenko, Anbu Selvam Venkatachalam, Artem Rudenko, et al. (4 additional authors not shown)

    Abstract: Molecular photoabsorption results in an electronic excitation/ionization which couples to the rearrangement of the nuclei. The resulting intertwined change of nuclear and electronic degrees of freedom determines the conversion of photoenergy into other molecular energy forms. Nucleobases are excellent candidates for studying such dynamics, and great effort has been taken in the past to observe the…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 19 pages, 8 figures

  14. arXiv:2405.10583  [pdf, other]

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Large Fermi surface in pristine kagome metal CsV$_3$Sb$_5$ and enhanced quasiparticle effective masses

    Authors: Wei Zhang, Tsz Fung Poon, Chun Wai Tsang, Wenyan Wang, X. Liu, J. Xie, S. T. Lam, Shanmin Wang, Kwing To Lai, A. Pourret, G. Seyfarth, G. Knebel, Wing Chi Yu, Swee K. Goh

    Abstract: The kagome metal CsV$_3$Sb$_5$ is an ideal platform to study the interplay between topology and electron correlation. To understand the fermiology of CsV$_3$Sb$_5$, intensive quantum oscillation (QO) studies at ambient pressure have been conducted. However, due to the Fermi surface reconstruction by the complicated charge density wave (CDW) order, the QO spectrum is exceedingly complex, hindering…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 4 figures, 1 table. This is the preprint of a published paper in PNAS

    Journal ref: Proc. Natl. Acad. Sci. U.S.A. 121, e2322270121 (2024)

  15. arXiv:2405.10325  [pdf]

    cond-mat.mtrl-sci physics.chem-ph

    Uncertainty and Exploration of Deep Learning-based Atomistic Models for Screening Molten Salt Properties and Compositions

    Authors: Stephen T. Lam, Shubhojit Banerjee, Rajni Chahal

    Abstract: Due to extreme chemical, thermal, and radiation environments, existing molten salt property databases lack the necessary experimental thermal properties of reactor-relevant salt compositions. Meanwhile, simulating these properties directly is typically either computationally expensive or inaccurate. In recent years, deep learning (DL)-based atomistic simulations have emerged as a method for achiev…

    Submitted 30 April, 2024; originally announced May 2024.

  16. Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM

    Authors: Michelle S. Lam, Janice Teoh, James Landay, Jeffrey Heer, Michael S. Bernstein

    Abstract: Data analysts have long sought to turn unstructured text data into meaningful concepts. Though common, topic modeling and clustering focus on lower-level keywords and require significant interpretative work. We introduce concept induction, a computational process that instead produces high-level concepts, defined by explicit inclusion criteria, from unstructured text. For a dataset of toxic online…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: To appear at CHI 2024

  17. arXiv:2403.16825  [pdf, ps, other]

    cs.LG math.OC math.PR stat.ML

    Weak Convergence Analysis of Online Neural Actor-Critic Algorithms

    Authors: Samuel Chun-Hei Lam, Justin Sirignano, Ziheng Wang

    Abstract: We prove that a single-layer neural network trained with the online actor-critic algorithm converges in distribution to a random ordinary differential equation (ODE) as the number of hidden units and the number of training steps $\rightarrow \infty$. In the online actor-critic algorithm, the distribution of the data samples dynamically changes as the model is updated, which is a key challenge for…

    Submitted 25 March, 2024; originally announced March 2024.

  18. arXiv:2403.06049  [pdf]

    cond-mat.mtrl-sci

    X-ray and molecular dynamics study of the temperature-dependent structure of molten NaF-ZrF4

    Authors: Anubhav Wadehra, Rajni Chahal, Shubhojit Banerjee, Alexander Levy, Yifan Zhang, Haoxuan Yan, Daniel Olds, Yu Zhong, Uday Pal, Stephen Lam, Karl Ludwig

    Abstract: The local atomic structure of NaF-ZrF$_4$ (53-47 mol%) molten system and its evolution with temperature are examined with x-ray scattering measurements and compared with ab initio and Neural Network-based molecular dynamics (NNMD) simulations in the temperature range 515-700 °C. The machine-learning enhanced NNMD calculations offer improved efficiency while maintaining accuracy at higher distanc…

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 26 pages, 15 figures, 3 tables

  19. arXiv:2402.16184  [pdf, other]

    cs.LG

    Deep Neural Network Initialization with Sparsity Inducing Activations

    Authors: Ilan Price, Nicholas Daultry Ball, Samuel C. H. Lam, Adam C. Jones, Jared Tanner

    Abstract: Inducing and leveraging sparse activations during training and inference is a promising avenue for improving the computational efficiency of deep networks, which is increasingly important as network sizes continue to grow and their application becomes more widespread. Here we use the large width Gaussian process limit to analyze the behaviour, at random initialization, of nonlinear activations tha…

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: Published in the International Conference on Learning Representations (ICLR) 2024

  20. arXiv:2402.15805  [pdf, other]

    cond-mat.stat-mech

    Distinguishable-particle Glassy Crystal: the simplest molecular model of glass

    Authors: Leo S. I. Lam, Gautham Gopinath, Zichen Zhao, Shuling Wang, Chun-Shing Lee, Hai-Yao Deng, Feng Wang, Yilong Han, Cho-Tung Yip, Chi-Hang Lam

    Abstract: The nature of glassy dynamics and the glass transition are long-standing problems under active debate. In the presence of a structural disorder widely believed to be an essential characteristic of structural glass, identifying and understanding key dynamical behaviors are very challenging. In this work, we demonstrate that an energetic disorder, which usually results from a structural disorder, is…

    Submitted 24 February, 2024; originally announced February 2024.

  21. arXiv:2402.14534  [pdf, other]

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Shubnikov-de Haas oscillations of biaxial-strain-tuned superconductors in pulsed magnetic field up to 60 T

    Authors: King Yau Yip, Lingfei Wang, Tsz Fung Poon, Kai Ham Yu, Siu Tung Lam, Kwing To Lai, John Singleton, Fedor F. Balakirev, Swee K. Goh

    Abstract: Two-dimensional (2D) materials have gained increasing prominence not only in fundamental research but also in daily applications. However, to fully harness their potential, it is crucial to optimize their properties with an external parameter and track the electronic structure simultaneously. Magnetotransport over a wide magnetic field range is a powerful method to probe the electronic structure a…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 6 pages, 4 figures

    Journal ref: APL Mater. 12, 021124 (2024)

  22. arXiv:2402.14207  [pdf, other]

    cs.CL cs.AI

    Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models

    Authors: Yijia Shao, Yucheng Jiang, Theodore A. Kanell, Peter Xu, Omar Khattab, Monica S. Lam

    Abstract: We study how to apply large language models to write grounded and organized long-form articles from scratch, with comparable breadth and depth to Wikipedia pages. This underexplored problem poses new challenges at the pre-writing stage, including how to research the topic and prepare an outline prior to writing. We propose STORM, a writing system for the Synthesis of Topic Outlines through Retriev…

    Submitted 8 April, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 27 pages, NAACL 2024 Main Conference

  23. arXiv:2402.08788  [pdf]

    cs.CL cs.SD eess.AS

    Syllable based DNN-HMM Cantonese Speech to Text System

    Authors: Timothy Wong, Claire Li, Sam Lam, Billy Chiu, Qin Lu, Minglei Li, Dan Xiong, Roy Shing Yu, Vincent T. Y. Ng

    Abstract: This paper reports our work on building up a Cantonese Speech-to-Text (STT) system with a syllable based acoustic model. This is a part of an effort in building a STT system to aid dyslexic students who have cognitive deficiency in writing skills but have no problem expressing their ideas through speech. For Cantonese speech recognition, the basic unit of acoustic models can either be the conventi…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 7 pages, 3 figures, LREC 2016

    MSC Class: 94-06 ACM Class: I.2.7

  24. arXiv:2402.03715  [pdf, other]

    cs.LG cs.AI cs.CL

    Clarify: Improving Model Robustness With Natural Language Corrections

    Authors: Yoonho Lee, Michelle S. Lam, Helena Vasconcelos, Michael S. Bernstein, Chelsea Finn

    Abstract: The standard way to teach models is by feeding them lots of data. However, this approach often teaches models incorrect ideas because they pick up on misleading signals in the data. To prevent such misconceptions, we must necessarily provide additional information beyond the training data. Prior methods incorporate additional instance-level supervision, such as labels for misleading features or ad…

    Submitted 21 August, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: UIST 2024. Interface code available at https://github.com/yoonholee/Clarify

  25. arXiv:2401.16515  [pdf, other]

    cs.ET eess.SP eess.SY physics.optics

    Dynamic Electro-Optic Analog Memory for Neuromorphic Photonic Computing

    Authors: Sean Lam, Ahmed Khaled, Simon Bilodeau, Bicky A. Marquez, Paul R. Prucnal, Lukas Chrostowski, Bhavin J. Shastri, Sudip Shekhar

    Abstract: Artificial intelligence (AI) has seen remarkable advancements across various domains, including natural language processing, computer vision, autonomous vehicles, and biology. However, the rapid expansion of AI technologies has escalated the demand for more powerful computing resources. As digital computing approaches fundamental limits, neuromorphic photonics emerges as a promising platform to co…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 22 pages, 10 figures

  26. arXiv:2401.10477  [pdf, other]

    gr-qc

    Dynamical Property of Black Hole Matter

    Authors: C. S. Lam

    Abstract: Matter loses its original characteristics after entering a black hole, thus becoming a new kind of (black hole) matter. The property of this new matter cannot be measured experimentally, but some of it can be deduced theoretically from the Einstein equations and the conservation laws which it must still satisfy. In a previous paper, this matter is modelled by an ideal fluid, with an equation of st…

    Submitted 18 January, 2024; originally announced January 2024.

  27. arXiv:2312.11681  [pdf, other]

    cs.HC cs.AI cs.CL

    Designing LLM Chains by Adapting Techniques from Crowdsourcing Workflows

    Authors: Madeleine Grunde-McLaughlin, Michelle S. Lam, Ranjay Krishna, Daniel S. Weld, Jeffrey Heer

    Abstract: LLM chains enable complex tasks by decomposing work into a sequence of subtasks. Similarly, the more established techniques of crowdsourcing workflows decompose complex tasks into smaller tasks for human crowdworkers. Chains address LLM errors analogously to the way crowdsourcing workflows address human error. To characterize opportunities for LLM chaining, we survey 107 papers across the crowdsou…

    Submitted 6 May, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  28. arXiv:2311.13537  [pdf]

    cond-mat.mtrl-sci

    ab initio informed inelastic neutron scattering for time-resolved local dynamics in molten MgCl2

    Authors: Shubhojit Banerjee, Rajni Chahal, Alexander S. Ivanov, Santanu Roy, Vyacheslav S. Bryantsev, Yuya Shinohara, Stephen T Lam

    Abstract: Ion dynamics that drive the transport and thermophysical properties of molten salts are poorly understood due to challenges in precisely quantifying the spatial and temporal fluctuations of specific ions in highly disordered systems. While the Van Hove correlation function (VHF) obtained from inelastic neutron scattering (INS) probes these dynamics directly, its interpretation is limited by the in…

    Submitted 22 November, 2023; originally announced November 2023.

  29. arXiv:2311.09818  [pdf, other]

    cs.CL cs.PL

    SUQL: Conversational Search over Structured and Unstructured Data with Large Language Models

    Authors: Shicheng Liu, Jialiang Xu, Wesley Tjangnaka, Sina J. Semnani, Chen Jie Yu, Monica S. Lam

    Abstract: While most conversational agents are grounded on either free-text or structured knowledge, many knowledge corpora consist of hybrid sources. This paper presents the first conversational agent that supports the full generality of hybrid data access for large knowledge corpora, through a language we developed called SUQL (Structured and Unstructured Query Language). Specifically, SUQL extends SQL wi…

    Submitted 13 March, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  30. arXiv:2311.05187  [pdf]

    physics.optics quant-ph

    Ultrafast all-optical second harmonic wavefront shaping

    Authors: A. Sinelnik, S. H. Lam, F. Coviello, S. Klimmer, G. Della Valle, D. -Y. Choi, T. Pertsch, G. Soavi, I. Staude

    Abstract: Optical communication can be revolutionized by encoding data into the orbital angular momentum of light beams. However, state-of-the-art approaches for dynamic control of complex optical wavefronts are mainly based on liquid crystal spatial light modulators or miniaturized mirrors, which suffer from intrinsically slow response times. Here, we experimentally realize a hybrid meta-optical system tha…

    Submitted 9 November, 2023; originally announced November 2023.

  31. arXiv:2311.05099  [pdf]

    physics.chem-ph physics.atm-clus

    Time-Resolved Coulomb Explosion Imaging Unveils Ultrafast Ring Opening of Furan

    Authors: Enliang Wang, Surjendu Bhattacharyya, Keyu Chen, Kurtis Borne, Farzaneh Ziaee, Shashank Pathak, Huynh Van Sa Lam, Anbu Selvam Venkatachalam, Xiangjun Chen, Rebecca Boll, Till Jahnke, Artem Rudenko, Daniel Rolles

    Abstract: Following the changes in molecular structure throughout the entirety of a chemical reaction with atomic resolution is a long-term goal in femtochemistry. Although the development of a plethora of ultrafast techniques has enabled detailed investigations of the electronic and nuclear dynamics on femtosecond time scales, direct and unambiguous imaging of the nuclear motion during a reaction is still a…

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 18 pages, 4 figures

    MSC Class: 81V55; 92E10

  32. arXiv:2309.00261  [pdf, other]

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Suppression of both superconductivity and structural transition in hole-doped MoTe$_2$ induced by Ta substitution

    Authors: Siu Tung Lam, K. Y. Yip, Swee K. Goh, Kwing To Lai

    Abstract: Type-II Weyl semimetal MoTe$_2$ exhibits a first-order structural transition at $T_s \sim 250$ K and superconducts at $T_c \sim 0.1$ K at ambient pressure. Both $T_s$ and $T_c$ can be manipulated by several tuning parameters, such as hydrostatic pressure and chemical substitution. It is often reported that suppressing $T_s$ enhances $T_c$, but our study shows a different behaviour when MoTe$_2$…

    Submitted 1 September, 2023; originally announced September 2023.

    Journal ref: Phys. Rev. Materials 7, 084802 (2023)

  33. arXiv:2308.15768  [pdf, other]

    cs.HC cs.CY

    Sociotechnical Audits: Broadening the Algorithm Auditing Lens to Investigate Targeted Advertising

    Authors: Michelle S. Lam, Ayush Pandit, Colin H. Kalicki, Rachit Gupta, Poonam Sahoo, Danaë Metaxa

    Abstract: Algorithm audits are powerful tools for studying black-box systems. While very effective in examining technical components, the method stops short of a sociotechnical frame, which would also consider users as an integral and dynamic part of the system. Addressing this gap, we propose the concept of sociotechnical auditing: auditing methods that evaluate algorithmic systems at the sociotechnical le…

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: To appear at CSCW 2023

  34. arXiv:2308.15623  [pdf, other]

    astro-ph.EP astro-ph.GA

    Discovery of Spherules of Likely Extrasolar Composition in the Pacific Ocean Site of the CNEOS 2014-01-08 (IM1) Bolide

    Authors: Abraham Loeb, Toby Adamson, Sophie Bergstrom, Richard Cloete, Shai Cohen, Kevin Conrad, Laura Domine, Hairuo Fu, Charles Hoskinson, Eugenia Hyung, Stein Jacobsen, Mike Kelly, Jason Kohn, Edwin Lard, Sebastian Lam, Frank Laukien, Jim Lem, Rob McCallum, Rob Millsap, Christopher Parendo, Michail Pataev, Chaitanya Peddeti, Jeff Pugh, Shmuel Samuha, Dimitar Sasselov, et al. (9 additional authors not shown)

    Abstract: We have conducted an extensive towed-magnetic-sled survey during the period 14-28 June, 2023, over the seafloor centered around the calculated path of the bolide CNEOS 2014-01-08 (IM1) about 85 km north of Manus Island, Papua New Guinea. We found about 700 spherules of diameter 0.05-1.3 millimeters in our samples, of which 57 were analyzed so far. The spherules were significantly concentrated alon…

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: Submitted for publication in a peer-reviewed journal

  35. arXiv:2308.14555  [pdf, other]

    cs.LG math.PR stat.ML

    Kernel Limit of Recurrent Neural Networks Trained on Ergodic Data Sequences

    Authors: Samuel Chun-Hei Lam, Justin Sirignano, Konstantinos Spiliopoulos

    Abstract: Mathematical methods are developed to characterize the asymptotics of recurrent neural networks (RNN) as the number of hidden units, data samples in the sequence, hidden state updates, and training steps simultaneously grow to infinity. In the case of an RNN with a simplified weight matrix, we prove the convergence of the RNN to the solution of an infinite-dimensional ODE coupled with the fixed po…

    Submitted 15 May, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Major revision for lemma 7.1

    MSC Class: 68T07 (Primary); 68T05; 60J20 (Secondary)

  36. Turning hazardous volatile matter compounds into fuel by catalytic steam reforming: An evolutionary machine learning approach

    Authors: Alireza Shafizadeh, Hossein Shahbeik, Mohammad Hossein Nadian, Vijai Kumar Gupta, Abdul-Sattar Nizami, Su Shiung Lam, Wanxi Peng, Junting Pan, Meisam Tabatabaei, Mortaza Aghbashlo

    Abstract: Chemical and biomass processing systems release volatile matter compounds into the environment daily. Catalytic reforming can convert these compounds into valuable fuels, but developing stable and efficient catalysts is challenging. Machine learning can handle complex relationships in big data and optimize reaction conditions, making it an effective solution for addressing the mentioned issues. Th…

    Submitted 25 July, 2023; originally announced August 2023.

  37. arXiv:2307.16278  [pdf, other]

    gr-qc

    A Model of the Black Hole Interior

    Authors: C. S. Lam

    Abstract: A model is proposed for the interior of a neutral non-rotating black hole. It consists of an ideal fluid with density $\rho$ and a negative pressure $p$, obeying an equation of state $p=-\xi\rho$. In order to have a solution, $\xi$ must lie in the narrow range between 0.1429 and 0.1716.

    Submitted 3 December, 2023; v1 submitted 30 July, 2023; originally announced July 2023.

  38. arXiv:2307.13912  [pdf, other]

    cs.HC cs.AI

    Embedding Democratic Values into Social Media AIs via Societal Objective Functions

    Authors: Chenyan Jia, Michelle S. Lam, Minh Chau Mai, Jeff Hancock, Michael S. Bernstein

    Abstract: Can we design artificial intelligence (AI) systems that rank our social media feeds to consider democratic values such as mitigating partisan animosity as part of their objective functions? We introduce a method for translating established, vetted social scientific constructs into AI objective functions, which we term societal objective functions, and demonstrate the method with application to the…

    Submitted 14 February, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: This paper has been accepted to CSCW 2024 and will be published in Proc. ACM Hum.-Comput. Interact. 8, CSCW1, Article 163 (April 2024)

    Journal ref: Proceedings of the ACM: Human-Computer Interaction, 8, CSCW1, Article 163 (2024)

  39. arXiv:2306.17674  [pdf, other]

    cs.CL

    X-RiSAWOZ: High-Quality End-to-End Multilingual Dialogue Datasets and Few-shot Agents

    Authors: Mehrad Moradshahi, Tianhao Shen, Kalika Bali, Monojit Choudhury, Gaël de Chalendar, Anmol Goel, Sungkyun Kim, Prashant Kodali, Ponnurangam Kumaraguru, Nasredine Semmar, Sina J. Semnani, Jiwon Seo, Vivek Seshadri, Manish Shrivastava, Michael Sun, Aditya Yadavalli, Chaobin You, Deyi Xiong, Monica S. Lam

    Abstract: Task-oriented dialogue research has mainly focused on a few popular languages like English and Chinese, due to the high dataset creation cost for a new language. To reduce the cost, we apply manual editing to automatically translated data. We create a new multilingual benchmark, X-RiSAWOZ, by translating the Chinese RiSAWOZ to 4 languages: English, French, Hindi, Korean; and a code-mixed English-H…

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: Accepted by ACL 2023 Findings

  40. ReactGenie: A Development Framework for Complex Multimodal Interactions Using Large Language Models

    Authors: Jackie Junrui Yang, Yingtian Shi, Yuhan Zhang, Karina Li, Daniel Wan Rosli, Anisha Jain, Shuning Zhang, Tianshi Li, James A. Landay, Monica S. Lam

    Abstract: By combining voice and touch interactions, multimodal interfaces can surpass the efficiency of either modality alone. Traditional multimodal frameworks require laborious developer work to support rich multimodal commands where the user's multimodal command involves possibly exponential combinations of actions/function invocations. This paper presents ReactGenie, a programming framework that better…

    Submitted 2 May, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

  41. arXiv:2306.08486  [pdf, other]

    q-bio.GN

    Collection of prokaryotic genome contents expectation rules from scientific literature

    Authors: Serena Lam, Giorgio Gonnella

    Abstract: Shaped by natural selection and other evolutionary forces, an organism's evolutionary history is reflected through its genome sequence, content of functional elements and organization. Consequently, organisms connected through phylogeny, metabolic or morphological traits, geographical proximity, or habitat features are likely to exhibit similarities in their genomes. These similarities give rise t…

    Submitted 14 June, 2023; originally announced June 2023.

  42. arXiv:2306.05278  [pdf, other]

    cs.CL

    Revisit Few-shot Intent Classification with PLMs: Direct Fine-tuning vs. Continual Pre-training

    Authors: Haode Zhang, Haowen Liang, Liming Zhan, Xiao-Ming Wu, Albert Y. S. Lam

    Abstract: We consider the task of few-shot intent detection, which involves training a deep learning model to classify utterances based on their underlying intents using only a small amount of labeled data. The current approach to address this problem is through continual pre-training, i.e., fine-tuning pre-trained language models (PLMs) on external resources (e.g., conversational corpora, public intent det…

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: ACL 2023, Findings

  43. arXiv:2305.16917  [pdf, other]

    cs.CL

    Large Language Models Are Partially Primed in Pronoun Interpretation

    Authors: Suet-Ying Lam, Qingcheng Zeng, Kexun Zhang, Chenyu You, Rob Voigt

    Abstract: While a large body of literature suggests that large language models (LLMs) acquire rich linguistic representations, little is known about whether they adapt to linguistic biases in a human-like way. The present study probes this question by asking whether LLMs display human-like referential biases using stimuli and procedures from real psycholinguistic experiments. Recent psycholinguistic studies…

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted at Findings of ACL 2023

  44. Using evolutionary machine learning to characterize and optimize co-pyrolysis of biomass feedstocks and polymeric wastes

    Authors: Hossein Shahbeik, Alireza Shafizadeh, Mohammad Hossein Nadian, Dorsa Jeddi, Seyedali Mirjalili, Yadong Yang, Su Shiung Lam, Junting Pan, Meisam Tabatabaei, Mortaza Aghbashlo

    Abstract: Co-pyrolysis of biomass feedstocks with polymeric wastes is a promising strategy for improving the quantity and quality parameters of the resulting liquid fuel. Numerous experimental measurements are typically conducted to find the optimal operating conditions. However, performing co-pyrolysis experiments is highly challenging due to the need for costly and lengthy procedures. Machine learning (ML…

    Submitted 24 May, 2023; originally announced May 2023.

    Journal ref: Journal of Cleaner Production, Volume 387, 10 February 2023, 135881

  45. WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia

    Authors: Sina J. Semnani, Violet Z. Yao, Heidi C. Zhang, Monica S. Lam

    Abstract: This paper presents the first few-shot LLM-based chatbot that almost never hallucinates and has high conversationality and low latency. WikiChat is grounded on the English Wikipedia, the largest curated free-text corpus. WikiChat generates a response from an LLM, retains only the grounded facts, and combines them with additional information it retrieves from the corpus to form factual and engagi…

    Submitted 27 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Findings of EMNLP 2023

  46. arXiv:2305.14202  [pdf, other]

    cs.CL

    Fine-tuned LLMs Know More, Hallucinate Less with Few-Shot Sequence-to-Sequence Semantic Parsing over Wikidata

    Authors: Silei Xu, Shicheng Liu, Theo Culhane, Elizaveta Pertseva, Meng-Hsi Wu, Sina J. Semnani, Monica S. Lam

    Abstract: While large language models (LLMs) can answer many questions correctly, they can also hallucinate and give wrong answers. Wikidata, with its over 12 billion facts, can be used to ground LLMs to improve their factuality. This paper presents WikiWebQuestions, a high-quality question answering benchmark for Wikidata. Ported over from WebQuestions for Freebase, it consists of real-world data with SPAR…

    Submitted 5 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 Main

  47. arXiv:2303.02884  [pdf, other]

    cs.HC cs.AI cs.LG

    Model Sketching: Centering Concepts in Early-Stage Machine Learning Model Design

    Authors: Michelle S. Lam, Zixian Ma, Anne Li, Izequiel Freitas, Dakuo Wang, James A. Landay, Michael S. Bernstein

    Abstract: Machine learning practitioners often end up tunneling on low-level technical details like model architectures and performance metrics. Could early model development instead focus on high-level questions of which factors a model ought to pay attention to? Inspired by the practice of sketching in design, which distills ideas to their minimal representation, we introduce model sketching: a technical…

    Submitted 5 March, 2023; originally announced March 2023.

    Comments: To appear at CHI 2023

  48. arXiv:2302.09424  [pdf, other]

    cs.CL

    Zero and Few-Shot Localization of Task-Oriented Dialogue Agents with a Distilled Representation

    Authors: Mehrad Moradshahi, Sina J. Semnani, Monica S. Lam

    Abstract: Task-oriented Dialogue (ToD) agents are mostly limited to a few widely-spoken languages, mainly due to the high cost of acquiring training data for each language. Existing low-cost approaches that rely on cross-lingual embeddings or naive machine translation sacrifice a lot of accuracy for data efficiency, and largely fail in creating a usable dialogue agent. We propose automatic methods that use…

    Submitted 18 February, 2023; originally announced February 2023.

    Comments: Published in EACL 2023

  49. arXiv:2302.02610  [pdf, other]

    cond-mat.supr-con cond-mat.mtrl-sci

    Drastic enhancement of the superconducting temperature in type-II Weyl semimetal candidate MoTe$_2$ via biaxial strain

    Authors: King Yau Yip, Siu Tung Lam, Kai Ham Yu, Wing Shing Chow, Jiayu Zeng, Kwing To Lai, Swee K. Goh

    Abstract: Type-II Weyl semimetal candidate MoTe$_2$, which superconducts at $T_c\sim0.1$ K, is one of the promising candidates for realizing topological superconductivity. However, the exceedingly low $T_c$ is associated with a small upper critical field ($H_{c2}$), implying a fragile superconducting phase that only exists on a small region of the $H$-$T$ phase diagram. Here, we describe a simple and versatile a…

    Submitted 7 February, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: 6 pages, 4 figures. Reference list updated. APL Materials (in press)

    Journal ref: APL Materials 11, 021111 (2023)

  50. arXiv:2301.07374  [pdf, other]

    cond-mat.supr-con cond-mat.mtrl-sci

    Nodeless superconductivity in kagome metal CsV$_{3}$Sb$_{5}$ with and without time reversal symmetry breaking

    Authors: Wei Zhang, Xinyou Liu, Lingfei Wang, Chun Wai Tsang, Zheyu Wang, Siu Tung Lam, Wenyan Wang, Jianyu Xie, Xuefeng Zhou, Yusheng Zhao, Shanmin Wang, Jeff Tallon, Kwing To Lai, Swee K. Goh

    Abstract: The kagome metal CsV$_{3}$Sb$_{5}$ features an unusual competition between the charge-density-wave (CDW) order and superconductivity. Evidence for time-reversal symmetry breaking (TRSB) inside the CDW phase has been accumulating. Hence, the superconductivity in CsV$_{3}$Sb$_{5}$ emerges from a TRSB normal state, potentially resulting in an exotic superconducting state. To reveal the pairing symmet…

    Submitted 18 January, 2023; originally announced January 2023.

    Comments: 8 pages, 4 figures. Nano Letters (in press)

    Journal ref: Nano Lett., 23, 872-879 (2023)