
Searching for GEMS: Characterizing Six Giant Planets around Cool Dwarfs

Authors: Shubham Kanodia, Arvind F. Gupta, Caleb I. Canas, Lia Marta Bernabo, Varghese Reji, Te Han, Madison Brady, Andreas Seifahrt, William D. Cochran, Nidia Morrell, Ritvik Basant, Jacob Bean, Chad F. Bender, Zoe L. de Beurs, Allyson Bieryla, Alexina Birkholz, Nina Brown, Franklin Chapman, David R. Ciardi, Catherine A. Clark, Ethan G. Cotter, Scott A. Diddams, Samuel Halverson, Suzanne Hawley, Leslie Hebb, et al. (20 additional authors not shown)

Abstract: Transiting giant exoplanets around M-dwarf stars (GEMS) are rare, owing to the low-mass host stars. However, the all-sky coverage of TESS has enabled the detection of an increasingly large number of them, enabling statistical surveys like the Searching for GEMS survey. As part of this endeavour, we describe the observations of six transiting giant planets, which include precise mass measurements for two GEMS (K2-419Ab, TOI-6034b) and statistical validation for four systems, which includes validation and mass upper limits for three of them (TOI-5218b, TOI-5616b, TOI-5634Ab), while the fourth one, TOI-5414b, is classified as a `likely planet'. Our observations include radial velocities from the Habitable-zone Planet Finder on the Hobby-Eberly Telescope, and MAROON-X on Gemini-North, along with photometry and high-contrast imaging from multiple ground-based facilities. In addition to TESS photometry, K2-419Ab was also observed and statistically validated as part of the K2 mission in Campaigns 5 and 18, which provides precise orbital and planetary constraints despite the faint host star and long orbital period of $\sim 20.4$ days. With an equilibrium temperature of only 380 K, K2-419Ab is one of the coolest known well-characterized transiting planets. TOI-6034 has a late F-type companion about 40 arcsec away, making it the first GEMS host star to have an earlier main-sequence binary companion. These confirmations add to the existing small sample of confirmed transiting GEMS.

Submitted 27 August, 2024; v1 submitted 26 August, 2024; originally announced August 2024.

Comments: Accepted in AJ

arXiv:2408.13318 [pdf, ps, other]

Earths within Reach: Evaluation of Strategies for Mitigating Solar Variability using 3.5 years of NEID Sun-as-a-Star Observations

Authors: Eric B. Ford, Chad F. Bender, Cullen H. Blake, Arvind F. Gupta, Shubham Kanodia, Andrea S. J. Lin, Sarah E. Logsdon, Jacob K. Luhn, Suvrath Mahadevan, Michael L. Palumbo III, Ryan C. Terrien, Jason T. Wright, Jinglin Zhao, Samuel Halverson, Emily Hunting, Paul Robertson, Arpita Roy, Gudmundur Stefansson

Abstract: We present the results of Sun-as-a-star observations by the NEID Solar Telescope at WIYN Observatory, spanning January 1, 2021 through June 30, 2024. We identify 117,060 observations which are unlikely to be significantly affected by weather, hardware or major calibration issues. We describe several high-level data products being made available to the community to aid in the interpretation and intercomparison of NEID solar observations. Solar observations demonstrate excellent performance of NEID, including radial velocity (RV) accuracy and long-term stability of better than $\simeq 0.37$ m s$^{-1}$ over $\simeq 3.5$ years, even though NEID was not originally designed or optimized for daytime observations of the Sun. Currently, intrinsic stellar variability is the primary barrier to detecting Earth-analog planets for most nearby, Sun-like stars. We present a comparison of the effectiveness of several methods proposed to mitigate the effects of solar variability on the Sun's estimated RV. We find that the Scalpels algorithm performs particularly well and substantially reduces the RMS RV of solar spectra from over 2 m s$^{-1}$ to 0.277 m s$^{-1}$. Even when training on a subset of days with NEID solar observations and testing on a held-out sample, the RMS of cleaned RV is 0.34-0.42 m s$^{-1}$.
This is significantly better than previous attempts at removing solar variability and suggests that the current generation of EPRV instruments is technically capable of detecting Earth-mass planets orbiting a solar twin if provided with sufficient observing time allocations ($\sim 10^3$ nights of observations).

Submitted 23 August, 2024; originally announced August 2024.

Comments: 25 pages, 14 figures. Submitted to AAS Journals. Data release archived at https://zenodo.org/doi/10.5281/zenodo.13363761

arXiv:2408.02873 [pdf, other]

Utilizing Photometry from Multiple Sources to Mitigate Stellar Variability in Precise Radial Velocities: A Case Study of Kepler-21

Authors: Corey Beard, Paul Robertson, Mark R. Giovinazzi, Joseph M. Akana Murphy, Eric B. Ford, Samuel Halverson, Te Han, Rae Holcomb, Jack Lubin, Rafael Luque, Pranav Premnath, Chad F. Bender, Cullen H. Blake, Qian Gong, Howard Isaacson, Shubham Kanodia, Dan Li, Andrea S. J. Lin, Sarah E. Logsdon, Emily Lubar, Michael W. McElwain, Andrew Monson, Joe P. Ninan, Jayadev Rajagopal, Arpita Roy, et al. (4 additional authors not shown)

Abstract: We present a new analysis of Kepler-21, the brightest (V = 8.5) Kepler system with a known transiting exoplanet, Kepler-21 b. Kepler-21 b is a radius valley planet ($R = 1.6\pm 0.2 R_{\oplus}$) with an Earth-like composition (8.38$\pm$1.62 g/cc), though its mass and radius fall in the regime of possible "water worlds." We utilize new Keck/HIRES and WIYN/NEID radial velocity (RV) data in conjunction with Kepler and TESS photometry to perform a detailed study of activity mitigation between photometry and RVs. We additionally refine the system parameters, and we utilize Gaia astrometry to place constraints on a long-term RV trend. Our activity analysis affirms the quality of Kepler photometry for removing correlated noise from RVs, despite its temporal distance, though we reveal some cases where TESS may be superior. Using refined orbital parameters and updated composition curves, we rule out a "water world" scenario for Kepler-21 b, and we identify a long-period super-Jupiter planetary candidate, Kepler-21 (c).

Submitted 5 August, 2024; originally announced August 2024.

arXiv:2407.21075 [pdf, other]

Apple Intelligence Foundation Language Models

Authors: Tom Gunter, Zirui Wang, Chong Wang, Ruoming Pang, Andy Narayanan, Aonan Zhang, Bowen Zhang, Chen Chen, Chung-Cheng Chiu, David Qiu, Deepak Gopinath, Dian Ang Yap, Dong Yin, Feng Nan, Floris Weers, Guoli Yin, Haoshuo Huang, Jianyu Wang, Jiarui Lu, John Peebles, Ke Ye, Mark Lee, Nan Du, Qibin Chen, Quentin Keunebroek, et al. (130 additional authors not shown)

Abstract: We present foundation language models developed to power Apple Intelligence features, including a ~3 billion parameter model designed to run efficiently on devices and a large server-based language model designed for Private Cloud Compute. These models are designed to perform a wide range of tasks efficiently, accurately, and responsibly. This report describes the model architecture, the data used to train the model, the training process, how the models are optimized for inference, and the evaluation results. We highlight our focus on Responsible AI and how the principles are applied throughout the model development.

Submitted 29 July, 2024; originally announced July 2024.

arXiv:2407.17565 [pdf, other]

Periodicity significance testing with null-signal templates: reassessment of PTF's SMBH binary candidates

Authors: Jakob Robnik, Adrian E. Bayer, Maria Charisi, Zoltán Haiman, Allison Lin, Uroš Seljak

Abstract: Periodograms are widely employed for identifying periodicity in time series data, yet they often struggle to accurately quantify the statistical significance of detected periodic signals when the data complexity precludes reliable simulations. We develop a data-driven approach to address this challenge by introducing a null-signal template (NST). The NST is created by carefully randomizing the period of each cycle in the periodogram template, rendering it non-periodic. It has the same frequentist properties as a periodic signal template regardless of the noise probability distribution, and we show with simulations that the distribution of false positives is the same as with the original periodic template, regardless of the underlying data. Thus, performing a periodicity search with the NST acts as an effective simulation of the null (no-signal) hypothesis, without having to simulate the noise properties of the data. We apply the NST method to the supermassive black hole binaries (SMBHB) search in the Palomar Transient Factory (PTF), where Charisi et al. had previously proposed 33 high signal to (white) noise candidates utilizing simulations to quantify their significance. Our approach reveals that these simulations do not capture the complexity of the real data. There are no statistically significant periodic signal detections above the non-periodic background.
To improve the search sensitivity we introduce a Gaussian quadrature based algorithm for the Bayes Factor with correlated noise as a test statistic, in contrast to the standard signal to white noise. We show with simulations that this improves sensitivity to true signals by more than an order of magnitude. However, using the Bayes Factor approach also results in no statistically significant detections in the PTF data.

Submitted 24 July, 2024; originally announced July 2024.

Comments: 13 pages, 12 figures

arXiv:2407.08617 [pdf, other]

Quantum-Train Long Short-Term Memory: Application on Flood Prediction Problem

Authors: Chu-Hsuan Abraham Lin, Chen-Yu Liu, Kuan-Cheng Chen

Abstract: Flood prediction is a critical challenge in the context of climate change, with significant implications for ecosystem preservation, human safety, and infrastructure protection. In this study, we tackle this problem by applying the Quantum-Train (QT) technique to a forecasting Long Short-Term Memory (LSTM) model trained by Quantum Machine Learning (QML) with significant parameter reduction. The QT technique, originally successful in the A Matter of Taste challenge at QHack 2024, leverages QML to reduce the number of trainable parameters to a polylogarithmic function of the number of parameters in a classical neural network (NN). This innovative framework maps classical NN weights to a Hilbert space, altering quantum state probability distributions to adjust NN parameters. Our approach directly processes classical data without the need for quantum embedding and operates independently of quantum computing resources post-training, making it highly practical and accessible for real-world flood prediction applications. This model aims to improve the efficiency of flood forecasts, ultimately contributing to better disaster preparedness and response.

Submitted 11 July, 2024; originally announced July 2024.

Comments: 6 pages, 4 figures

arXiv:2407.06766 [pdf, other]

Relational Perspective on Graph Query Languages

Authors: Diego Figueira, Anthony W. Lin, Liat Peterfreund

Abstract: We study a relational perspective of graph database querying. Such a perspective underlies various graph database systems but very few theoretical investigations have been conducted on it. This perspective offers a powerful and unified framework to study graph database querying, by which algorithms and complexity follow from classical results. We provide two concrete applications. The first is querying property graphs. The property graph data model supersedes previously proposed graph models and underlies the new standard GQL for graph query languages. We show that this standard can be, by and large, expressed by extensions of relational calculus with transitive closure operators (FO[TC]) and existential second-order quantifiers (ESO). With this, we obtain optimal data complexity bounds, along with extensions including schema validation. The second application is incorporating data from concrete domains (e.g., numbers) in graph database querying. We use embedded finite model theory and, by exploiting a generic Restricted Quantifier Collapse (RQC) result for FO[TC] and ESO, we obtain optimal data complexity bounds for GQL with arithmetics and comparisons. Moreover, we show that Regular Data Path Querying with operations on data (i.e. using register automata formalisms) can be captured in FO[TC] over embedded finite graphs while preserving nondeterministic logspace data complexity.

Submitted 9 July, 2024; originally announced July 2024.

arXiv:2407.06103 [pdf, other]

QTRL: Toward Practical Quantum Reinforcement Learning via Quantum-Train

Authors: Chen-Yu Liu, Chu-Hsuan Abraham Lin, Chao-Han Huck Yang, Kuan-Cheng Chen, Min-Hsiu Hsieh

Abstract: Quantum reinforcement learning utilizes quantum layers to process information within a machine learning model. However, both pure and hybrid quantum reinforcement learning face challenges such as data encoding and the use of quantum computers during the inference stage. We apply the Quantum-Train method to reinforcement learning tasks, called QTRL, training the classical policy network model using a quantum machine learning model with polylogarithmic parameter reduction. This QTRL approach eliminates the data encoding issues of conventional quantum machine learning and reduces the training parameters of the corresponding classical policy network. Most importantly, the training result of the QTRL is a classical model, meaning the inference stage only requires a classical computer. This is extremely practical and cost-efficient for reinforcement learning tasks, where low-latency feedback from the policy model is essential.

Submitted 8 July, 2024; originally announced July 2024.

Comments: 6 pages, 1 figure

arXiv:2406.17871 [pdf, other]

Revisiting the Expressiveness Landscape of Data Graph Queries

Authors: Michael Benedikt, Anthony Widjaja Lin, Di-De Yen

Abstract: The study of graph queries in database theory has spanned more than three decades, resulting in a multitude of proposals for graph query languages. These languages differ in the mechanisms. We can identify three main families of languages, with the canonical representatives being: (1) regular path queries, (2) walk logic, and (3) first-order logic with transitive closure operators. This paper provides a complete picture of the expressive power of these languages in the context of data graphs. Specifically, we consider a graph data model that supports querying over both data and topology. For example, "Does there exist a path between two different persons in a social network with the same last name?". We also show that an extension of (1), augmented with transitive closure operators, can unify the expressivity of (1)--(3) without increasing the query evaluation complexity.

Submitted 25 June, 2024; originally announced June 2024.
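
The example query quoted in the abstract above ("Does there exist a path between two different persons in a social network with the same last name?") mixes topology (reachability) with data (comparing last names). A minimal sketch of evaluating that one query by brute force, assuming a directed graph and a hypothetical toy network; the function names, node names and edges are illustrative and are not taken from the paper:

```python
from collections import deque


def reachable(edges, start):
    """Return the set of nodes reachable from start via one or more edges."""
    adjacency = {}
    for src, dst in edges:
        adjacency.setdefault(src, []).append(dst)
    seen = set()
    queue = deque(adjacency.get(start, []))
    while queue:
        node = queue.popleft()
        if node in seen:
            continue
        seen.add(node)
        queue.extend(adjacency.get(node, []))
    return seen


def same_last_name_path_exists(last_names, edges):
    """True if a path links two different persons who share a last name."""
    for person in last_names:
        for other in reachable(edges, person):
            if other != person and last_names[other] == last_names[person]:
                return True
    return False


# Hypothetical data graph: each node carries a data value (a last name).
last_names = {"ann": "Lee", "bob": "Kim", "cy": "Lee", "dee": "Park"}
connected = [("ann", "bob"), ("bob", "cy"), ("dee", "ann")]
disconnected = [("ann", "bob"), ("dee", "cy")]

print(same_last_name_path_exists(last_names, connected))     # True
print(same_last_name_path_exists(last_names, disconnected))  # False
```

The sketch answers a single hard-coded query; it does not implement any of the query languages compared in the paper.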

arXiv:2406.16942 [pdf, other]

Enhancing Diagnostic Reliability of Foundation Model with Uncertainty Estimation in OCT Images

Authors: Yuanyuan Peng, Aidi Lin, Meng Wang, Tian Lin, Ke Zou, Yinglin Cheng, Tingkun Shi, Xulong Liao, Lixia Feng, Zhen Liang, Xinjian Chen, Huazhu Fu, Haoyu Chen

Abstract: Inability to express the confidence level and detect unseen classes has limited the clinical implementation of artificial intelligence in the real world. We developed a foundation model with uncertainty estimation (FMUE) to detect 11 retinal conditions on optical coherence tomography (OCT). In the internal test set, FMUE achieved a higher F1 score of 96.76% than two state-of-the-art algorithms, RETFound and UIOS, and got further improvement with thresholding strategy to 98.44%. In the external test sets obtained from other OCT devices, FMUE achieved an accuracy of 88.75% and 92.73% before and after thresholding. Our model is superior to two ophthalmologists with a higher F1 score (95.17% vs. 61.93% and 71.72%). Besides, our model correctly predicts high uncertainty scores for samples with ambiguous features, of non-target-category diseases, or with low quality, to prompt manual checks and prevent misdiagnosis. FMUE provides a trustworthy method for automatic retinal anomaly detection in the real-world clinical open-set environment.

Submitted 17 June, 2024; originally announced June 2024.

Comments: All codes are available at https://github.com/yuanyuanpeng0129/FMUE

arXiv:2406.09317 [pdf, other]

Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases

Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, Jinming Guo, Xiaolin Chen, Jingcheng Wang, Yih Chung Tham, et al. (24 additional authors not shown)

Abstract: Previous foundation models for retinal images were pre-trained with limited disease categories and knowledge base. Here we introduce RetiZero, a vision-language foundation model that leverages knowledge from over 400 fundus diseases. For RetiZero's pre-training, we compiled 341,896 fundus images paired with text descriptions, sourced from public datasets, ophthalmic literature, and online resources, encompassing a diverse range of diseases across multiple ethnicities and countries. RetiZero exhibits superior performance in several downstream tasks, including zero-shot disease recognition, image-to-image retrieval, and internal- and cross-domain disease identification. In zero-shot scenarios, RetiZero achieves Top5 accuracy scores of 0.8430 for 15 fundus diseases and 0.7561 for 52 fundus diseases. For image retrieval, it achieves Top5 scores of 0.9500 and 0.8860 for the same disease sets, respectively. Clinical evaluations show that RetiZero's Top3 zero-shot performance surpasses the average of 19 ophthalmologists from Singapore, China and the United States. Furthermore, RetiZero significantly enhances clinicians' accuracy in diagnosing fundus disease. These findings underscore the value of integrating the RetiZero foundation model into clinical settings, where a variety of fundus diseases are encountered.

Submitted 30 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

arXiv:2406.06038 [pdf, other]

Navigation and 3D Surface Reconstruction from Passive Whisker Sensing

Authors: Michael A. Lin, Hao Li, Chengyi Xing, Mark R. Cutkosky

Abstract: Whiskers provide a way to sense surfaces in the immediate environment without disturbing it. In this paper we present a method for using highly flexible, curved, passive whiskers mounted along a robot arm to gather sensory data as they brush past objects during normal robot motion. The information is useful both for guiding the robot in cluttered spaces and for reconstructing the exposed faces of objects. Surface reconstruction depends on accurate localization of contact points along each whisker. We present an algorithm based on Bayesian filtering that rapidly converges to within 1\,mm of the actual contact locations. The piecewise-continuous history of contact locations from each whisker allows for accurate reconstruction of curves on object surfaces. Employing multiple whiskers and traces, we are able to produce an occupancy map of proximal objects.

Submitted 10 June, 2024; originally announced June 2024.

Comments: arXiv admin note: text overlap with arXiv:2210.12387

arXiv:2406.02778 [pdf, other]

MS-IMAP -- A Multi-Scale Graph Embedding Approach for Interpretable Manifold Learning

Authors: Shay Deutsch, Lionel Yelibi, Alex Tong Lin, Arjun Ravi Kannan

Abstract: Deriving meaningful representations from complex, high-dimensional data in unsupervised settings is crucial across diverse machine learning applications. This paper introduces a framework for multi-scale graph network embedding based on spectral graph wavelets that employs a contrastive learning approach. A significant feature of the proposed embedding is its capacity to establish a correspondence between the embedding space and the input feature space which aids in deriving feature importance of the original features. We theoretically justify our approach and demonstrate that, in Paley-Wiener spaces on combinatorial graphs, the spectral graph wavelets operator offers greater flexibility and better control over smoothness properties compared to the Laplacian operator. We validate the effectiveness of our proposed graph embedding on a variety of public datasets through a range of downstream tasks, including clustering and unsupervised feature importance.

Submitted 5 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

arXiv:2405.18530 [pdf]

doi 10.18429/JACoW-IPAC2024-THYN1

First results of AUP Nb3Sn quadrupole horizontal tests

Authors: M. Baldini, G. Ambrosio, G. Apollinari, J. Blowers, R. Bossert, R. Carcagno, G. Chlachidze, J. DiMarco, S. Feher, S. Krave, V. Lombardo, L. Martin, C. Narug, T. H. Nicol, V. Nikolic, A. Nobrega, V. Marinozzi, C. Orozco, T. Page, S. Stoynev, T. Strauss, M. Turenne, D. Turrioni, A. Vouris, M. Yu, et al. (26 additional authors not shown)

Abstract: The Large Hadron Collider will soon undergo an upgrade to increase its luminosity by a factor of ~10 [1]. A crucial part of this upgrade will be replacement of the NbTi focusing magnets with Nb3Sn magnets that achieve a ~50% increase in the field strength. This will be the first ever large-scale implementation of Nb3Sn magnets in a particle accelerator. The High-Luminosity LHC Upgrade (HL-LHC) is a CERN project with a world-wide collaboration. It is under construction and utilizes Nb3Sn magnets (named MQXF) as key ingredients to increase tenfold the integrated luminosity delivered to the CMS and ATLAS experiments in the next decade. The HL-LHC AUP is the US effort to contribute approximately 50% of the low-beta focusing magnets and crab cavities for the HL-LHC. This paper will present the program to fabricate the Nb3Sn superconducting magnets. We report the status of the HL-LHC AUP project and present the results from horizontal tests of the first fully assembled cryo-assembly.

Submitted 28 May, 2024; originally announced May 2024.

Comments: IPAC'24 - 15th International Particle Accelerator Conference

Report number: FERMILAB-CONF-24-0273-TD

Journal ref: JACoW IPAC2024 (2024) THYN1

arXiv:2405.18457 [pdf, other]

Improving Linear System Solvers for Hyperparameter Optimisation in Iterative Gaussian Processes

Authors: Jihao Andreas Lin, Shreyas Padhy, Bruno Mlodozeniec, Javier Antorán, José Miguel Hernández-Lobato

Abstract: Scaling hyperparameter optimisation to very large datasets remains an open problem in the Gaussian process community. This paper focuses on iterative methods, which use linear system solvers, like conjugate gradients, alternating projections or stochastic gradient descent, to construct an estimate of the marginal likelihood gradient. We discuss three key improvements which are applicable across solvers: (i) a pathwise gradient estimator, which reduces the required number of solver iterations and amortises the computational cost of making predictions, (ii) warm starting linear system solvers with the solution from the previous step, which leads to faster solver convergence at the cost of negligible bias, (iii) early stopping linear system solvers after a limited computational budget, which synergises with warm starting, allowing solver progress to accumulate over multiple marginal likelihood steps. These techniques provide speed-ups of up to $72\times$ when solving to tolerance, and decrease the average residual norm by up to $7\times$ when stopping early.

Submitted 6 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

Comments: Preprint. arXiv admin note: text overlap with arXiv:2405.18328

arXiv:2405.18328 [pdf, other]

Warm Start Marginal Likelihood Optimisation for Iterative Gaussian Processes

Authors: Jihao Andreas Lin, Shreyas Padhy, Bruno Mlodozeniec, José Miguel Hernández-Lobato

Abstract: Gaussian processes are a versatile probabilistic machine learning model whose effectiveness often depends on good hyperparameters, which are typically learned by maximising the marginal likelihood. In this work, we consider iterative methods, which use iterative linear system solvers to approximate marginal likelihood gradients up to a specified numerical precision, allowing a trade-off between compute time and accuracy of a solution. We introduce a three-level hierarchy of marginal likelihood optimisation for iterative Gaussian processes, and identify that the computational costs are dominated by solving sequential batches of large positive-definite systems of linear equations. We then propose to amortise computations by reusing solutions of linear system solvers as initialisations in the next step, providing a $\textit{warm start}$. Finally, we discuss the necessary conditions and quantify the consequences of warm starts and demonstrate their effectiveness on regression tasks, where warm starts achieve the same results as the conventional procedure while providing up to a $16 \times$ average speed-up among datasets.

Submitted 28 May, 2024; originally announced May 2024.

Comments: Advances in Approximate Bayesian Inference 2024
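
The warm-start idea described in the abstract above (reusing the previous solver solution as the initialisation for the next, slightly changed linear system) can be illustrated with plain conjugate gradients. This is only a sketch under stated assumptions: the tridiagonal matrix is a hypothetical stand-in for a kernel matrix, the small diagonal shift stands in for a hyperparameter update, and none of the names come from the paper.

```python
def matvec(A, x):
    """Dense matrix-vector product for a list-of-lists matrix."""
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def conjugate_gradients(A, b, x0, tol=1e-8, max_iters=1000):
    """Solve A x = b for symmetric positive-definite A, starting from x0.

    Stops when ||b - A x|| <= tol * ||b||. Returns (x, iterations used).
    """
    x = list(x0)
    r = [bi - axi for bi, axi in zip(b, matvec(A, x))]
    p = list(r)
    rs = dot(r, r)
    threshold = tol * tol * dot(b, b)
    iters = 0
    while rs > threshold and iters < max_iters:
        Ap = matvec(A, p)
        alpha = rs / dot(p, Ap)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, Ap)]
        rs_new = dot(r, r)
        p = [ri + (rs_new / rs) * pi for ri, pi in zip(r, p)]
        rs = rs_new
        iters += 1
    return x, iters


def tridiagonal(n, diag, off):
    """Symmetric tridiagonal test matrix (positive definite if diag > 2|off|)."""
    return [[diag if i == j else off if abs(i - j) == 1 else 0.0
             for j in range(n)] for i in range(n)]


n = 40
b = [1.0 + (i % 5) for i in range(n)]
A_old = tridiagonal(n, 4.0, -1.0)      # system at the previous step
A_new = tridiagonal(n, 4.0001, -1.0)   # system after a small (assumed) update

x_old, _ = conjugate_gradients(A_old, b, [0.0] * n)
x_cold, cold_iters = conjugate_gradients(A_new, b, [0.0] * n)  # start from zero
x_warm, warm_iters = conjugate_gradients(A_new, b, x_old)      # warm start

print("cold start iterations:", cold_iters)
print("warm start iterations:", warm_iters)
```

Because the warm start begins with a residual that is already small, the solver needs fewer iterations to reach the same tolerance in this toy setting; the speed-ups reported in the abstract refer to the authors' own experiments, not to this sketch.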

arXiv:2405.16166 [pdf, other]

The Power of Hard Attention Transformers on Data Sequences: A Formal Language Theoretic Perspective

Authors: Pascal Bergsträßer, Chris Köcher, Anthony Widjaja Lin, Georg Zetzsche

Abstract: Formal language theory has recently been successfully employed to unravel the power of transformer encoders. This setting is primarily applicable in Natural Language Processing (NLP), as a token embedding function (where a bounded number of tokens is admitted) is first applied before feeding the input to the transformer. On certain kinds of data (e.g. time series), we want our transformers to be able to handle \emph{arbitrary} input sequences of numbers (or tuples thereof) without \emph{a priori} limiting the values of these numbers. In this paper, we initiate the study of the expressive power of transformer encoders on sequences of data (i.e. tuples of numbers). Our results indicate an increase in expressive power of hard attention transformers over data sequences, in stark contrast to the case of strings. In particular, we prove that Unique Hard Attention Transformers (UHAT) over inputs as data sequences no longer lie within the circuit complexity class $AC^0$ (even without positional encodings), unlike the case of string inputs, but are still within the complexity class $TC^0$ (even with positional encodings). Over strings, UHAT without positional encodings capture only regular languages. In contrast, we show that over data sequences UHAT can capture non-regular properties. Finally, we show that UHAT capture languages definable in an extension of linear temporal logic with unary numeric predicates and arithmetics.

Submitted 25 May, 2024; originally announced May 2024.

arXiv:2405.11304 [pdf, other]

Quantum-Train: Rethinking Hybrid Quantum-Classical Machine Learning in the Model Compression Perspective

Authors: Chen-Yu Liu, En-Jui Kuo, Chu-Hsuan Abraham Lin, Jason Gemsun Young, Yeong-Jar Chang, Min-Hsiu Hsieh, Hsi-Sheng Goan

Abstract: We introduce the Quantum-Train (QT) framework, a novel approach that integrates quantum computing with classical machine learning algorithms to address significant challenges in data encoding, model compression, and inference hardware requirements. Even with a slight decrease in accuracy, QT achieves remarkable results by employing a quantum neural network alongside a classical mapping model, which significantly reduces the parameter count from $M$ to $O(\text{polylog} (M))$ during training. Our experiments demonstrate QT's effectiveness in classification tasks, offering insights into its potential to revolutionize machine learning by leveraging quantum computational advantages. This approach not only improves model efficiency but also reduces generalization errors, showcasing QT's potential across various machine learning applications.

Submitted 10 June, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

Comments: 12 pages, 6 figures

arXiv:2405.06945 [pdf, other]

Direct Learning of Mesh and Appearance via 3D Gaussian Splatting

Authors: Ancheng Lin, Jun Li

Abstract: Accurately reconstructing a 3D scene including explicit geometry information is both attractive and challenging. Geometry reconstruction can benefit from incorporating differentiable appearance models, such as Neural Radiance Fields and 3D Gaussian Splatting (3DGS). In this work, we propose a learnable scene model that incorporates 3DGS with an explicit geometry representation, namely a mesh. Our model learns the mesh and appearance in an end-to-end manner, where we bind 3D Gaussians to the mesh faces and perform differentiable rendering of 3DGS to obtain photometric supervision. The model creates an effective information pathway to supervise the learning of the scene, including the mesh. Experimental results demonstrate that the learned scene model not only achieves state-of-the-art rendering quality but also supports manipulation using the explicit mesh. In addition, our model has a unique advantage in adapting to scene updates, thanks to the end-to-end learning of both mesh and appearance.

Submitted 11 May, 2024; originally announced May 2024.

arXiv:2404.08887 [pdf, other]

doi 10.1007/978-3-031-56069-9_6

Countering Mainstream Bias via End-to-End Adaptive Local Learning

Authors: Jinhao Pan, Ziwei Zhu, Jianling Wang, Allen Lin, James Caverlee

Abstract: Collaborative filtering (CF) based recommendations suffer from mainstream bias -- where mainstream users are favored over niche users, leading to poor recommendation quality for many long-tail users. In this paper, we identify two root causes of this mainstream bias: (i) discrepancy modeling, whereby CF algorithms focus on modeling mainstream users while neglecting niche users with unique preferences; and (ii) unsynchronized learning, where niche users require more training epochs than mainstream users to reach peak performance. Targeting these causes, we propose a novel end-To-end Adaptive Local Learning (TALL) framework to provide high-quality recommendations to both mainstream and niche users. TALL uses a loss-driven Mixture-of-Experts module to adaptively ensemble experts to provide customized local models for different users. Further, it contains an adaptive weight module to synchronize the learning paces of different users by dynamically adjusting weights in the loss. Extensive experiments demonstrate the state-of-the-art performance of the proposed model. Code and data are provided at \url{https://github.com/JP-25/end-To-end-Adaptive-Local-Leanring-TALL-}

Submitted 12 April, 2024; originally announced April 2024.

Comments: ECIR 2024

Journal ref: In European Conference on Information Retrieval 2024, vol 14612 (pp. 75-89)

arXiv:2403.19594 [pdf]

Reproducibility Made Easy: A Tool for Methodological Transparency and Efficient Standardized Reporting based on the proposed MRSinMRS Consensus

Authors: Antonia Susnjar, Antonia Kaiser, Dunja Simicic, Gianna Nossa, Alexander Lin, Georg Oeltzschner, Aaron Gudmundson

Abstract: A recent expert consensus found that non-standard reporting in MRS studies led to poor reproducibility. In order to address this, MRSinMRS guidelines were introduced; however, because of the disparate nomenclature and data formats, adoption has been slow. To get around this problem, REMY, a toolbox that supports major vendor formats, was created. By efficiently filling in important fields in the MRSinMRS table, it improves reproducibility. Even with certain hardware-related restrictions, REMY makes a substantial contribution to the completion of acquisition parameters, which facilitates reporting. Its compatibility and user-friendly interface should promote widespread adoption of MRSinMRS, raising the caliber of MRS research.

Submitted 6 August, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

arXiv:2403.19039 [pdf, other]

Expanding Density-Correlation Machine Learning Representations for Anisotropic Coarse-Grained Particles

Authors: Arthur Y. Lin, Kevin K. Huguenin-Dumittan, Yong-Cheol Cho, Jigyasa Nigam, Rose K. Cersonsky

Abstract: Physics-based, atom-centered machine learning (ML) representations have been instrumental to the effective integration of ML within the atomistic simulation community. Many of these representations build off the idea of atoms as having spherical, or isotropic, interactions. In many communities, there is often a need to represent groups of atoms, either to increase the computational efficiency of simulation via coarse-graining or to understand molecular influences on system behavior. In such cases, atom-centered representations will have limited utility, as groups of atoms may not be well-approximated as spheres. In this work, we extend the popular Smooth Overlap of Atomic Positions (SOAP) ML representation for systems consisting of non-spherical anisotropic particles or clusters of atoms. We show the power of this anisotropic extension of SOAP, which we deem AniSOAP, in accurately characterizing liquid crystal systems and predicting the energetics of Gay-Berne ellipsoids and coarse-grained benzene crystals. With our study of these prototypical anisotropic systems, we derive fundamental insights into how molecular shape influences mesoscale behavior and explain how to reincorporate important atom-atom interactions typically not captured by coarse-grained models. Moving forward, we propose AniSOAP as a flexible, unified framework for coarse-graining in complex, multiscale simulation.

Submitted 27 March, 2024; originally announced March 2024.

Comments: The following article has been submitted to the Journal of Chemical Physics. After it is published, the updated version can be found through their website

arXiv:2402.16465 [pdf, other]

Training Classical Neural Networks by Quantum Machine Learning

Authors: Chen-Yu Liu, En-Jui Kuo, Chu-Hsuan Abraham Lin, Sean Chen, Jason Gemsun Young, Yeong-Jar Chang, Min-Hsiu Hsieh

Abstract: In recent years, advanced deep neural networks have required a large number of parameters for training. Therefore, finding a method to reduce the number of parameters has become crucial for achieving efficient training. This work proposes a training scheme for classical neural networks (NNs) that utilizes the exponentially large Hilbert space of a quantum system. By mapping a classical NN with $M$ parameters to a quantum neural network (QNN) with $O(\text{polylog} (M))$ rotational gate angles, we can significantly reduce the number of parameters. These gate angles can be updated to train the classical NN. Unlike existing quantum machine learning (QML) methods, the results obtained from quantum computers using our approach can be directly used on classical computers. Numerical results on the MNIST and Iris datasets are presented to demonstrate the effectiveness of our approach. Additionally, we investigate the effects of deeper QNNs and the number of measurement shots for the QNN, followed by the theoretical perspective of the proposed method. This work opens a new branch of QML and offers a practical tool that can greatly enhance the influence of QML, as the trained QML results can benefit classical computing in our daily lives.

Submitted 26 February, 2024; originally announced February 2024.

Comments: 7 pages, 3 figures

arXiv:2402.14817 [pdf, other]

Cameras as Rays: Pose Estimation via Ray Diffusion

Authors: Jason Y. Zhang, Amy Lin, Moneish Kumar, Tzu-Hsuan Yang, Deva Ramanan, Shubham Tulsiani

Abstract: Estimating camera poses is a fundamental task for 3D reconstruction and remains challenging given sparsely sampled views (<10). In contrast to existing approaches that pursue top-down prediction of global parametrizations of camera extrinsics, we propose a distributed representation of camera pose that treats a camera as a bundle of rays. This representation allows for a tight coupling with spatial image features improving pose precision. We observe that this representation is naturally suited for set-level transformers and develop a regression-based approach that maps image patches to corresponding rays. To capture the inherent uncertainties in sparse-view pose inference, we adapt this approach to learn a denoising diffusion model which allows us to sample plausible modes while improving performance. Our proposed methods, both regression- and diffusion-based, demonstrate state-of-the-art performance on camera pose estimation on CO3D while generalizing to unseen object categories and in-the-wild captures.

Submitted 4 April, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

Comments: In ICLR 2024 (oral). v2-3: updated references. Project webpage: https://jasonyzhang.com/RayDiffusion

arXiv:2402.09430 [pdf, other]

WiMANS: A Benchmark Dataset for WiFi-based Multi-user Activity Sensing

Authors: Shuokang Huang, Kaihan Li, Di You, Yichong Chen, Arvin Lin, Siying Liu, Xiaohui Li, Julie A. McCann

Abstract: WiFi-based human sensing has exhibited remarkable potential to analyze user behaviors in a non-intrusive and device-free manner, benefiting applications as diverse as smart homes and healthcare. However, most previous works focus on single-user sensing, which has limited practicability in scenarios involving multiple users. Although recent studies have begun to investigate WiFi-based multi-user sensing, there remains a lack of benchmark datasets to facilitate reproducible and comparable research. To bridge this gap, we present WiMANS, to our knowledge, the first dataset for multi-user sensing based on WiFi. WiMANS contains over 9.4 hours of dual-band WiFi Channel State Information (CSI), as well as synchronized videos, monitoring simultaneous activities of multiple users. We exploit WiMANS to benchmark the performance of state-of-the-art WiFi-based human sensing models and video-based models, posing new challenges and opportunities for future work. We believe WiMANS can push the boundaries of current studies and catalyze the research on WiFi-based multi-user sensing.

Submitted 12 March, 2024; v1 submitted 24 January, 2024; originally announced February 2024.

Comments: We present WiMANS, to our knowledge, the first dataset for multi-user activity sensing based on WiFi

arXiv:2402.04946 [pdf, other]

Searching for Giant Exoplanets around M-dwarf Stars (GEMS) I: Survey Motivation

Authors: Shubham Kanodia, Caleb I. Cañas, Suvrath Mahadevan, Eric B. Ford, Ravit Helled, Dana E. Anderson, Alan Boss, William D. Cochran, Megan Delamer, Te Han, Jessica E. Libby-Roberts, Andrea S. J. Lin, Simon Müller, Paul Robertson, Guðmundur Stefánsson, Johanna Teske

Abstract: Recent discoveries of transiting giant exoplanets around M-dwarf stars (GEMS), aided by the all-sky coverage of TESS, are starting to stretch theories of planet formation through the core-accretion scenario. Recent upper limits on their occurrence suggest that they decrease with lower stellar masses, with fewer GEMS around lower-mass stars compared to solar-type. In this paper, we discuss existing GEMS both through confirmed planets, as well as protoplanetary disk observations, and a combination of tests to reconcile these with theoretical predictions. We then introduce the \textit{Searching for GEMS} survey, where we utilize multi-dimensional nonparametric statistics to simulate hypothetical survey scenarios to predict the required sample size of transiting GEMS with mass measurements to robustly compare their bulk-density with canonical hot-Jupiters orbiting FGK stars. Our Monte-Carlo simulations predict that a robust comparison requires about 40 transiting GEMS (compared to the existing sample of $\sim$ 15) with 5-$σ$ mass measurements. Furthermore, we discuss the limitations of existing occurrence estimates for GEMS, and provide a brief description of our planned systematic search to improve the occurrence rate estimates for GEMS.

Submitted 7 February, 2024; originally announced February 2024.

Comments: 16 pages + references, including 7 figures. Accepted in AAS Journals

arXiv:2402.03665 [pdf, ps, other]

Multi-color Wavefront Sensor using Talbot effect for High-order Harmonic Generation

Authors: Yang Du, Kui Li, Jin Niu, Angyi Lin, Jie Li, Zhongwei Fan, Guorong Wu, Xiaoshi Zhang, Fucai Zhang

Abstract: We present a novel method for multi-color wavefront measurement of high-order harmonic generation beams using the Talbot effect, validated both theoretically and experimentally for the first time. Each harmonic maintains a unique wavefront and produces an independent set of self-images along the optical axis. We achieved wavefront reconstruction of three harmonics in a single measurement scan, expanding the spectrally-resolved capability of the conventional Talbot effect wavefront sensor. This breakthrough introduces a novel tool for studying the multi-color wavefront in high-order harmonic generation, unlocking the potential to investigate spatiotemporal ultrafast nonlinear dynamics in attosecond pulse formation on a shot-by-shot basis.

Submitted 5 February, 2024; originally announced February 2024.

arXiv:2402.01695 [pdf, other]

Language-Guided World Models: A Model-Based Approach to AI Control

Authors: Alex Zhang, Khanh Nguyen, Jens Tuyls, Albert Lin, Karthik Narasimhan

Abstract: This paper introduces the concept of Language-Guided World Models (LWMs) -- probabilistic models that can simulate environments by reading texts. Agents equipped with these models provide humans with more extensive and efficient control, allowing them to simultaneously alter agent behaviors in multiple tasks via natural verbal communication. In this work, we take initial steps in developing robust LWMs that can generalize to compositionally novel language descriptions. We design a challenging world modeling benchmark based on the game of MESSENGER (Hanjie et al., 2021), featuring evaluation settings that require varying degrees of compositional generalization. Our experiments reveal the lack of generalizability of the state-of-the-art Transformer model, as it offers marginal improvements in simulation quality over a no-text baseline. We devise a more robust model by fusing the Transformer with the EMMA attention mechanism (Hanjie et al., 2021). Our model substantially outperforms the Transformer and approaches the performance of a model with an oracle semantic parsing and grounding capability. To demonstrate the practicality of this model in improving AI safety and transparency, we simulate a scenario in which the model enables an agent to present plans to a human before execution, and to revise plans based on their language feedback.

Submitted 4 July, 2024; v1 submitted 23 January, 2024; originally announced February 2024.

Comments: SpLU-RoboNLP workshop at ACL 2024

arXiv:2401.02618 [pdf, ps, other]

doi 10.1145/3632864

Regular Abstractions for Array Systems

Authors: Chih-Duo Hong, Anthony W. Lin

Abstract: Verifying safety and liveness over array systems is a highly challenging problem. Array systems naturally capture parameterized systems such as distributed protocols with an unbounded number of processes. Such distributed protocols often exploit process IDs during their computation, resulting in array systems whose element values range over an infinite domain. In this paper, we develop a novel framework for proving safety and liveness over array systems. The crux of the framework is to overapproximate an array system as a string rewriting system (i.e. over a finite alphabet) by means of a new predicate abstraction that exploits the so-called indexed predicates. This allows us to tap into powerful verification methods for string rewriting systems that have been heavily developed in the last few decades (e.g. regular model checking). We demonstrate how our method yields simple, automatically verifiable proofs of safety and liveness properties for challenging examples, including Dijkstra's self-stabilizing protocol and the Chang-Roberts leader election protocol.

Submitted 4 January, 2024; originally announced January 2024.

arXiv:2312.10074 [pdf]

STAGER checklist: Standardized Testing and Assessment Guidelines for Evaluating Generative AI Reliability

Authors: Jinghong Chen, Lingxuan Zhu, Weiming Mou, Zaoqu Liu, Quan Cheng, Anqi Lin, Jian Zhang, Peng Luo

Abstract: Generative Artificial Intelligence (AI) holds immense potential in medical applications. Numerous studies have explored the efficacy of various generative AI models within healthcare contexts, but there is a lack of a comprehensive and systematic evaluation framework. Given that some studies evaluating the ability of generative AI for medical applications have deficiencies in their methodological design, standardized guidelines for their evaluation are also currently lacking. In response, our objective is to devise standardized assessment guidelines tailored for evaluating the performance of generative AI systems in medical contexts. To this end, we conducted a thorough literature review using the PubMed and Google Scholar databases, focusing on research that tests generative AI capabilities in medicine. Our multidisciplinary team, comprising experts in life sciences, clinical medicine, medical engineering, and generative AI users, conducted several discussion sessions and developed a checklist of 23 items. The checklist is designed to encompass the critical evaluation aspects of generative AI in medical applications comprehensively. This checklist, and the broader assessment framework it anchors, address several key dimensions, including question collection, querying methodologies, and assessment techniques. We aim to provide a holistic evaluation of AI systems. The checklist delineates a clear pathway from question gathering to result assessment, offering researchers guidance through potential challenges and pitfalls. Our framework furnishes a standardized, systematic approach for research involving the testing of generative AI's applicability in medicine. It enhances the quality of research reporting and aids in the evolution of generative AI in medicine and life sciences.

Submitted 7 December, 2023; originally announced December 2023.

Comments: 11 pages, 0 figure, 2 tables

arXiv:2312.08604 [pdf, other]

Verification of Neural Reachable Tubes via Scenario Optimization and Conformal Prediction

Authors: Albert Lin, Somil Bansal

Abstract: Learning-based approaches for controlling safety-critical systems are rapidly growing in popularity; thus, it is important to assure their performance and safety. Hamilton-Jacobi (HJ) reachability analysis is a popular formal verification tool for providing such guarantees, since it can handle general nonlinear system dynamics, bounded adversarial system disturbances, and state and input constraints. However, its computational and memory complexity scales exponentially with the state dimension, making it intractable for large-scale systems. To overcome this challenge, neural approaches, such as DeepReach, have been used to synthesize reachable tubes and safety controllers for high-dimensional systems. However, verifying these neural reachable tubes remains challenging. In this work, we propose two verification methods, based on robust scenario optimization and conformal prediction, to provide probabilistic safety guarantees for neural reachable tubes. Our methods allow a direct trade-off between resilience to outlier errors in the neural tube, which are inevitable in a learning-based approach, and the strength of the probabilistic safety guarantee. Furthermore, we show that split conformal prediction, a widely used method in the machine learning community for uncertainty quantification, reduces to a scenario-based approach, making the two methods equivalent not only for verification of neural reachable tubes but also more generally. To our knowledge, our proof is the first in the literature to show a strong relationship between conformal prediction and scenario optimization. Finally, we propose an outlier-adjusted verification approach that uses the error distribution in neural reachable tubes to recover greater safe volumes. We demonstrate the efficacy of the proposed approaches for the high-dimensional problems of multi-vehicle collision avoidance and rocket landing with no-go zones.

Submitted 9 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

Comments: Accepted to 6th Annual Learning for Dynamics & Control Conference. arXiv admin note: text overlap with arXiv:2209.12336

arXiv:2311.17037 [pdf, other]

Concurrent Stochastic Lossy Channel Games

Authors: Daniel Stan, Muhammad Najib, Anthony Widjaja Lin, Parosh Aziz Abdulla

Abstract: Concurrent stochastic games are an important formalism for the rational verification of probabilistic multi-agent systems, which involves verifying whether a temporal logic property is satisfied in some or all game-theoretic equilibria of such systems. In this work, we study the rational verification of probabilistic multi-agent systems where agents can cooperate by communicating over unbounded lossy channels. To model such systems, we present concurrent stochastic lossy channel games (CSLCG) and employ an equilibrium concept from cooperative game theory known as the core, which is the most fundamental and widely studied cooperative equilibrium concept. Our main contribution is twofold. First, we show that the rational verification problem is undecidable for systems whose agents have almost-sure LTL objectives. Second, we provide a decidable fragment of such a class of objectives that subsumes almost-sure reachability and safety. Our techniques involve reductions to solving infinite-state zero-sum games with conjunctions of qualitative objectives. To the best of our knowledge, our result represents the first decidability result on the rational verification of stochastic multi-agent systems on infinite arenas.

Submitted 28 November, 2023; originally announced November 2023.

Comments: To appear at CSL 2024. Extended version

arXiv:2311.16811 [pdf, other]

Four errors students make with inverse-square law vectors

Authors: Colin S. Wallace, Liam Jones, Alex Lin

Abstract: In this paper, we discuss four errors introductory physics students make when attempting to add two inverse-square law vectors. We observe multiple instances in which students 1) add vectors as if they were scalars, 2) project the $r$ (or $r^2$) in the denominator, instead of the entire vector, when attempting to find the vector's components, 3) incorrectly apply the Pythagorean theorem when attempting to calculate the magnitude of the resultant vector, and 4) incorrectly relate the signs of the components of an electric field (or force) to the signs of the electric charges. While these are not the only errors students make, they are the most frequently occurring based on our analysis of 678 exams taken by students in either introductory mechanics or electricity and magnetism (E&M). We then show how these errors can be encoded into a new type of activity or assessment question which we call a "student error task." Introductory physics instructors can use the student error task in this paper as a way to engage or assess their students' understandings of how to add two inverse-square law vectors.

Submitted 28 November, 2023; originally announced November 2023.

Comments: 22 pages, 7 figures, submitted to the European Journal of Physics

arXiv:2311.16237 [pdf, other]

TOI-1670 c, a 40-day Orbital Period Warm Jupiter in a Compact System, is Well-aligned

Authors: Jack Lubin, Xian-Yu Wang, Malena Rice, Jiayin Dong, Songhu Wang, Brandon T. Radzom, Paul Robertson, Gudmundur Stefansson, Jaime A. Alvarado-Montes, Corey Beard, Chad F. Bender, Arvind F. Gupta, Samuel Halverson, Shubham Kanodia, Dan Li, Andrea S. J. Lin, Sarah E. Logsdon, Emily Lubar, Suvrath Mahadevan, Joe P. Ninan, Jayadev Rajagopal, Arpita Roy, Christian Schwab, Jason T. Wright

Abstract: We report the measurement of the sky-projected obliquity angle $λ$ of the Warm Jovian exoplanet TOI-1670 c via the Rossiter-McLaughlin effect as part of the Stellar Obliquities in Long-period Exoplanet Systems (SOLES) project. We observed the transit window during UT 20 April 2023 for 7 continuous hours with NEID on the 3.5 m WIYN Telescope at Kitt Peak National Observatory. TOI-1670 hosts a sub-Neptune (P ~11 days; planet b) interior to the Warm Jovian (P ~40 days; planet c), which presents an opportunity to investigate the dynamics of a Warm Jupiter with an inner companion. Additionally, TOI-1670 c is now among the longest-period planets to date to have its sky-projected obliquity angle measured. We find planet c is well-aligned to the host star, with $λ$ = -0.3 +/- 2.2 degrees. TOI-1670 c joins a growing census of aligned Warm Jupiters around single stars and aligned planets in multi-planet systems.

Submitted 27 November, 2023; originally announced November 2023.

Comments: 11 pages, 2 figures, 1 table. Accepted to ApJ Letters

arXiv:2311.15883 [pdf, other]

Characterising and Verifying the Core in Concurrent Multi-Player Mean-Payoff Games (Full Version)

Authors: Julian Gutierrez, Anthony W. Lin, Muhammad Najib, Thomas Steeples, Michael Wooldridge

Abstract: Concurrent multi-player mean-payoff games are important models for systems of agents with individual, non-dichotomous preferences. Whilst these games have been extensively studied in terms of their equilibria in non-cooperative settings, this paper explores an alternative solution concept: the core from cooperative game theory. This concept is particularly relevant for cooperative AI systems, as it enables the modelling of cooperation among agents, even when their goals are not fully aligned. Our contribution is twofold. First, we provide a characterisation of the core using discrete geometry techniques and establish a necessary and sufficient condition for its non-emptiness. We then use the characterisation to prove the existence of polynomial witnesses in the core. Second, we use the existence of such witnesses to solve key decision problems in rational verification and provide tight complexity bounds for the problem of checking whether some/every equilibrium in a game satisfies a given LTL or GR(1) specification. Our approach is general and can be adapted to handle other specifications expressed in various fragments of LTL without incurring additional computational costs.

Submitted 27 November, 2023; originally announced November 2023.

Comments: This is the full version of the paper with the same title that appears in the CSL'24 proceedings

arXiv:2311.04690 [pdf, other]

Learning Quantum Phase Estimation by Variational Quantum Circuits

Authors: Chen-Yu Liu, Chu-Hsuan Abraham Lin, Kuan-Cheng Chen

Abstract: Quantum Phase Estimation (QPE) stands as a pivotal quantum computing subroutine that necessitates an inverse Quantum Fourier Transform (QFT). However, it is imperative to recognize that enhancing the precision of the estimation inevitably results in a significantly deeper circuit. We developed a variational quantum circuit (VQC) approximation to reduce the depth of the QPE circuit, yielding enhanced performance in noisy simulations and real hardware. Our experiments demonstrated that the VQC outperformed both Noisy QPE and standard QPE on real hardware by reducing circuit noise. This VQC integration into quantum compilers as an intermediate step between input and transpiled circuits holds significant promise for quantum algorithms with deep circuits. Future research will explore its potential applicability across various quantum computing hardware architectures.

Submitted 8 November, 2023; originally announced November 2023.

Comments: 6 pages, 7 figures

arXiv:2311.04031 [pdf, other]

Ramsey Quantifiers in Linear Arithmetics

Authors: Pascal Bergsträßer, Moses Ganardi, Anthony W. Lin, Georg Zetzsche

Abstract: We study Satisfiability Modulo Theories (SMT) enriched with the so-called Ramsey quantifiers, which assert the existence of cliques (complete graphs) in the graph induced by some formulas. The extended framework is known to have applications in proving program termination (in particular, whether a transitive binary predicate is well-founded), and monadic decomposability of SMT formulas. Our main result is a new algorithm for eliminating Ramsey quantifiers from three common SMT theories: Linear Integer Arithmetic (LIA), Linear Real Arithmetic (LRA), and Linear Integer Real Arithmetic (LIRA). In particular, if we work only with existentially quantified formulas, then our algorithm runs in polynomial time and produces a formula of linear size. One immediate consequence is that checking well-foundedness of a given formula in the aforementioned theory defining a transitive predicate can be straightforwardly handled by highly optimized SMT-solvers. We show also how this provides a uniform semi-algorithm for verifying termination and liveness with completeness guarantee (in fact, with an optimal computational complexity) for several well-known classes of infinite-state systems, which include succinct timed systems, one-counter systems, and monotonic counter systems. Another immediate consequence is a solution to an open problem on checking monadic decomposability of a given relation in quantifier-free fragments of LRA and LIRA, which is an important problem in automated reasoning and constraint databases.
Our result immediately implies decidability of this problem with an optimal complexity (coNP-complete) and enables exploitation of SMT-solvers. It also provides a termination guarantee for the generic monadic decomposition algorithm of Veanes et al. for LIA, LRA, and LIRA. We report encouraging experimental results on a prototype implementation of our algorithms on micro-benchmarks.

Submitted 7 November, 2023; originally announced November 2023.

arXiv:2311.03901 [pdf, ps, other]

Parikh's Theorem Made Symbolic

Authors: Matthew Hague, Artur Jeż, Anthony W. Lin

Abstract: Parikh's Theorem is a fundamental result in automata theory with numerous applications in computer science: software verification (e.g. infinite-state verification, string constraints, and theory of arrays), verification of cryptographic protocols (e.g. using Horn clauses modulo equational theories) and database querying (e.g. evaluating path-queries in graph databases). Parikh's Theorem states that the letter-counting abstraction of a language recognized by finite automata or context-free grammars is definable in Presburger Arithmetic. Unfortunately, real-world applications typically require large alphabets - which are well-known to be not amenable to explicit treatment of the alphabets. Symbolic automata have proven in the last decade to be an effective algorithmic framework for handling large finite or even infinite alphabets. A symbolic automaton employs an effective boolean algebra, which offers a symbolic representation of character sets and often lends itself to an exponentially more succinct representation of a language. Instead of letter-counting, Parikh's Theorem for symbolic automata amounts to counting the number of times different predicates are satisfied by an input sequence. Unfortunately, naively applying Parikh's Theorem from classical automata theory to symbolic automata yields existential Presburger formulas of exponential size.
We provide a new construction for Parikh's Theorem for symbolic automata and grammars, which avoids this exponential blowup: our algorithm computes an existential formula in polynomial-time over (quantifier-free) Presburger and the base theory. In fact, our algorithm extends to the model of parametric symbolic grammars, which are one of the most expressive models of languages over infinite alphabets. We have implemented our algorithm and show it can be used to solve string constraints that are difficult to solve by existing solvers.

Submitted 31 July, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

Comments: Accepted to POPL '24

arXiv:2310.20634 [pdf, other]

doi 10.3847/1538-3881/ad09c2

TOI-5344 b: A Saturn-like planet orbiting a super-Solar metallicity M0 dwarf

Authors: Te Han, Paul Robertson, Shubham Kanodia, Caleb Cañas, Andrea S. J. Lin, Guðmundur Stefánsson, Jessica E. Libby-Roberts, Alexander Larsen, Henry A. Kobulnicky, Suvrath Mahadevan, Chad F. Bender, William D. Cochran, Michael Endl, Mark E. Everett, Arvind F. Gupta, Samuel Halverson, Fred Hearty, Andrew Monson, Joe P. Ninan, Arpita Roy, Christian Schwab, Ryan C. Terrien

Abstract: We confirm the planetary nature of TOI-5344 b as a transiting giant exoplanet around an M0 dwarf star. TOI-5344 b was discovered with the Transiting Exoplanet Survey Satellite photometry and confirmed with ground-based photometry (the Red Buttes Observatory 0.6m telescope), radial velocity (the Habitable-zone Planet Finder), and speckle imaging (the NN-Explore Exoplanet Stellar Speckle Imager). TOI-5344 b is a Saturn-like giant planet ($ρ= 0.80^{+0.17}_{-0.15}\ \text{g cm}^{-3}$) with a planetary radius of $9.7 \pm \ 0.5 \ \text{R}_{\oplus}$ ($0.87 \pm \ 0.04 \ \text{R}_{\text{Jup}}$) and a planetary mass of $135^{+17}_{-18} \text{M}_{\oplus}$ ($0.42^{+0.05}_{-0.06} \ \text{M}_{\text{Jup}}$). It has an orbital period of $3.792622 \pm 0.000010$ days and an orbital eccentricity of $0.06^{+0.07}_{-0.04}$. We measure a high metallicity for TOI-5344 of [Fe/H] = $0.48 \pm 0.12$, where the high metallicity is consistent with expectations from formation through core accretion. We compare the metallicity of the M-dwarf hosts of giant exoplanets to that of M-dwarf hosts of non-giants ($\lesssim 8\ \text{R}_{\oplus}$). While the two populations appear to show different metallicity distributions, quantitative tests are prohibited by various sample caveats.

Submitted 7 November, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

Comments: 19 pages, 10 figures, 4 tables, AJ accepted. Added references

Journal ref: AJ 167 4 (2024)

arXiv:2310.20581 [pdf, other]

Stochastic Gradient Descent for Gaussian Processes Done Right

Authors: Jihao Andreas Lin, Shreyas Padhy, Javier Antorán, Austin Tripp, Alexander Terenin, Csaba Szepesvári, José Miguel Hernández-Lobato, David Janz

Abstract: As is well known, both sampling from the posterior and computing the mean of the posterior in Gaussian process regression reduce to solving a large linear system of equations. We study the use of stochastic gradient descent for solving this linear system, and show that when \emph{done right} -- by which we mean using specific insights from the optimisation and kernel communities -- stochastic gradient descent is highly effective. To that end, we introduce a particularly simple \emph{stochastic dual descent} algorithm, explain its design in an intuitive manner and illustrate the design choices through a series of ablation studies. Further experiments demonstrate that our new method is highly competitive. In particular, our evaluations on the UCI regression tasks and on Bayesian optimisation set our approach apart from preconditioned conjugate gradients and variational Gaussian process approximations. Moreover, our method places Gaussian process regression on par with state-of-the-art graph neural networks for molecular binding affinity prediction.

Submitted 28 April, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

arXiv:2310.13898 [pdf, other]

Computational and Systems Biology Advances to Enable Bioagent-Agnostic Signatures

Authors: Andy Lin, Cameron Torres, Errett C. Hobbs, Jaydeep Bardhan, Stephen B. Aley, Charles T. Spencer, Karen L. Taylor, Tony Chiang

Abstract: Enumerated threat agent lists have long driven biodefense priorities. The global SARS-CoV-2 pandemic demonstrated the limitations of searching for known threat agents as compared to a more agnostic approach. Recent technological advances are enabling agent-agnostic biodefense, especially through the integration of multi-modal observations of host-pathogen interactions directed by a human immunological model. Although well-developed technical assays exist for many aspects of human-pathogen interaction, the analytic methods and pipelines to combine and holistically interpret the results of such assays are immature and require further investments to exploit new technologies. In this manuscript, we discuss potential immunologically based bioagent-agnostic approaches and the computational tool gaps the community should prioritize filling.

Submitted 28 February, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

arXiv:2310.11775 [pdf, other]

TOI-2015b: A Warm Neptune with Transit Timing Variations Orbiting an Active mid M Dwarf

Authors: Sinclaire E. Jones, Gudmundur Stefansson, Kento Masuda, Jessica E. Libby-Roberts, Cristilyn N. Gardner, Rae Holcomb, Corey Beard, Paul Robertson, Caleb I. Cañas, Suvrath Mahadevan, Shubham Kanodia, Andrea S. J. Lin, Henry A. Kobulnicky, Brock A. Parker, Chad F. Bender, William D. Cochran, Scott A. Diddams, Rachel B. Fernandes, Arvind F. Gupta, Samuel Halverson, Suzanne L. Hawley, Fred R. Hearty, Leslie Hebb, Adam Kowalski, Jack Lubin , et al. (7 additional authors not shown)

Abstract: We report the discovery of a close-in ($P_{\mathrm{orb}} = 3.349\:\mathrm{days}$) warm Neptune with clear transit timing variations (TTVs) orbiting the nearby ($d=47.3\:\mathrm{pc}$) active M4 star, TOI-2015. We characterize the planet's properties using TESS photometry, precise near-infrared radial velocities (RV) with the Habitable-zone Planet Finder (HPF) Spectrograph, ground-based photometry, and high-contrast imaging. A joint photometry and RV fit yields a radius $R_p~=~3.37_{-0.20}^{+0.15} \:\mathrm{R_\oplus}$, mass $m_p~=~16.4_{-4.1}^{+4.1}\:\mathrm{M_\oplus}$, and density $ρ_p~=~2.32_{-0.37}^{+0.38} \:\mathrm{g cm^{-3}}$ for TOI-2015b, suggesting a likely volatile-rich planet. The young, active host star has a rotation period of $P_{\mathrm{rot}}~=~8.7 \pm~0.9~\mathrm{days}$ and associated rotation-based age estimate of $1.1~\pm~0.1\:\mathrm{Gyr}$. Though no other transiting planets are seen in the TESS data, the system shows clear TTVs of super period $P_{\mathrm{sup}}~\approx~430\:\mathrm{days}$ and amplitude $\sim$$100\:\mathrm{minutes}$. After considering multiple likely period ratio models, we show an outer planet candidate near a 2:1 resonance can explain the observed TTVs while offering a dynamically stable solution. However, other possible two-planet solutions -- including 3:2 and 4:3 resonance -- cannot be conclusively excluded without further observations.
Assuming a 2:1 resonance in the joint TTV-RV modeling suggests a mass of $m_b~=~13.3_{-4.5}^{+4.7}\:\mathrm{M_\oplus}$ for TOI-2015b and $m_c~=~6.8_{-2.3}^{+3.5}\:\mathrm{M_\oplus}$ for the outer candidate. Additional transit and RV observations will be beneficial to explicitly identify the resonance and further characterize the properties of the system.

Submitted 9 May, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

Comments: 29 pages, 15 figures, 6 tables. Accepted for publication in The Astronomical Journal

arXiv:2310.09974 [pdf, other]

Algorithmic Contract Design for Crowdsourced Ranking

Authors: Kiriaki Frangias, Andrew Lin, Ellen Vitercik, Manolis Zampetakis

Abstract: Ranking is fundamental to many areas, such as search engine optimization, human feedback for language models, as well as peer grading. Crowdsourcing, which is often used for these tasks, requires proper incentivization to ensure accurate inputs. In this work, we draw on the field of \emph{contract theory} from Economics to propose a novel mechanism that enables a \emph{principal} to accurately rank a set of items by incentivizing agents to provide pairwise comparisons of the items. Our mechanism implements these incentives by verifying a subset of each agent's comparisons, a task we assume to be costly. The agent is compensated (for example, monetarily or with class credit) based on the accuracy of these comparisons. Our mechanism achieves the following guarantees: (1) it only requires the principal to verify $O(\log s)$ comparisons, where $s$ is the total number of agents, and (2) it provably achieves higher total utility for the principal compared to ranking the items herself with no crowdsourcing.

Submitted 24 January, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

arXiv:2310.08873 [pdf, other]

Interactive Navigation in Environments with Traversable Obstacles Using Large Language and Vision-Language Models

Authors: Zhen Zhang, Anran Lin, Chun Wai Wong, Xiangyu Chu, Qi Dou, K. W. Samuel Au

Abstract: This paper proposes an interactive navigation framework by using large language and vision-language models, allowing robots to navigate in environments with traversable obstacles. We utilize the large language model (GPT-3.5) and the open-set Vision-language Model (Grounding DINO) to create an action-aware costmap to perform effective path planning without fine-tuning. With the large models, we can achieve an end-to-end system from textual instructions like "Can you pass through the curtains to deliver medicines to me?", to bounding boxes (e.g., curtains) with action-aware attributes. They can be used to segment LiDAR point clouds into two parts: traversable and untraversable parts, and then an action-aware costmap is constructed for generating a feasible path. The pre-trained large models have great generalization ability and do not require additional annotated data for training, allowing fast deployment in the interactive navigation tasks. We choose to use multiple traversable objects such as curtains and grasses for verification by instructing the robot to traverse them. Besides, traversing curtains in a medical scenario was tested. All experimental results demonstrated the proposed framework's effectiveness and adaptability to diverse environments.

Submitted 12 March, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

Comments: Accepted by 2024 IEEE International Conference on Robotics and Automation (ICRA), 7 pages, 8 figures

arXiv:2310.07916 [pdf, other]

Dynamic Appearance Particle Neural Radiance Field

Authors: Ancheng Lin, Jun Li

Abstract: Neural Radiance Fields (NeRFs) have shown great potential in modelling 3D scenes. Dynamic NeRFs extend this model by capturing time-varying elements, typically using deformation fields. The existing dynamic NeRFs employ a similar Eulerian representation for both light radiance and deformation fields. This leads to a close coupling of appearance and motion and lacks a physical interpretation. In this work, we propose Dynamic Appearance Particle Neural Radiance Field (DAP-NeRF), which introduces particle-based representation to model the motions of visual elements in a dynamic 3D scene. DAP-NeRF consists of superposition of a static field and a dynamic field. The dynamic field is quantised as a collection of {\em appearance particles}, which carries the visual information of a small dynamic element in the scene and is equipped with a motion model. All components, including the static field, the visual features and motion models of the particles, are learned from monocular videos without any prior geometric knowledge of the scene. We develop an efficient computational framework for the particle-based model. We also construct a new dataset to evaluate motion modelling. Experimental results show that DAP-NeRF is an effective technique to capture not only the appearance but also the physically meaningful motions in a 3D dynamic scene.

Submitted 10 December, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

arXiv:2310.07827 [pdf, other]

Astrometry and Precise Radial Velocities Yield a Complete Orbital Solution for the Nearby Eccentric Brown Dwarf LHS 1610 b

Authors: Evan Fitzmaurice, Gudmundur Stefánsson, Robert D. Kavanagh, Suvrath Mahadevan, Caleb I. Cañas, Joshua N. Winn, Paul Robertson, Joe P. Ninan, Simon Albrecht, J. R. Callingham, William D. Cochran, Megan Delamer, Shubham Kanodia, Andrea S. J. Lin, Marcus L. Marcussen, Benjamin J. S. Pope, Lawrence W. Ramsey, Arpita Roy, Harish Vedantham, Jason T. Wright

Abstract: We characterize the LHS 1610 system, a nearby ($d=9.7$ pc) M5 dwarf hosting a brown dwarf in a $10.6$ day, eccentric ($e \sim 0.37$) orbit. A joint fit of the available Gaia two-body solution, discovery radial velocities (RVs) from TRES, and new RVs obtained with the Habitable-zone Planet Finder, yields an orbital inclination of $117.2\pm0.9^\circ$ and a mass constraint of $50.9\pm0.9$ M$_J$. This gives LHS 1610 b the second most precise mass of brown dwarfs orbiting M stars within 25 pc. We highlight a discrepancy between the Gaia two-body solution eccentricity ($e=0.52 \pm 0.03$) and that from the RVs ($e=0.3702\pm0.0003$), which requires the astrometric time-series release (Gaia DR4) for further diagnostics. With a flare rate of $0.28\pm 0.07$ flares/day from TESS photometry, and a rotation period of $84 \pm 8$ days, LHS 1610 joins other mid M stars -- including Proxima Centauri and YZ Ceti -- as nearby mid M dwarfs with flare rates on the higher end for their long rotation periods. These stars are promising candidates for searching for sub-Alfvénic star-companion interactions, raising the question whether LHS 1610 b could be driving the flares on its host star. However, the available TESS photometry is insufficient to confirm or rule out any orbital phase-dependence of the flares. We show that the LHS 1610 system, as a nearby mid M star with a large, short-period companion, is a promising target to look for evidence of star-companion interactions or auroral emission from the brown dwarf at radio wavelengths.

Submitted 11 October, 2023; originally announced October 2023.

Comments: 24 pages, 7 figures, 3 tables. Submitted to AAS Journals on Oct 11, 2023

arXiv:2310.05126 [pdf, other]

UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model

Authors: Jiabo Ye, Anwen Hu, Haiyang Xu, Qinghao Ye, Ming Yan, Guohai Xu, Chenliang Li, Junfeng Tian, Qi Qian, Ji Zhang, Qin Jin, Liang He, Xin Alex Lin, Fei Huang

Abstract: Text is ubiquitous in our visual world, conveying crucial information, such as in documents, websites, and everyday photographs. In this work, we propose UReader, a first exploration of universal OCR-free visually-situated language understanding based on the Multimodal Large Language Model (MLLM). By leveraging the shallow text recognition ability of the MLLM, we only finetuned 1.2% of the parameters, and the training cost is much lower than previous work following domain-specific pretraining and finetuning paradigms. Concretely, UReader is jointly finetuned on a wide range of Visually-situated Language Understanding tasks via a unified instruction format. To enhance the visual text and semantic understanding, we further apply two auxiliary tasks with the same format, namely text reading and key points generation tasks. We design a shape-adaptive cropping module before the encoder-decoder architecture of MLLM to leverage the frozen low-resolution vision encoder for processing high-resolution images. Without downstream finetuning, our single model achieves state-of-the-art OCR-free performance in 8 out of 10 visually-situated language understanding tasks, across 5 domains: documents, tables, charts, natural images, and webpage screenshots. Codes and instruction-tuning datasets will be released.

Submitted 8 October, 2023; originally announced October 2023.

arXiv:2310.03817 [pdf, ps, other]

Logical Languages Accepted by Transformer Encoders with Hard Attention

Authors: Pablo Barcelo, Alexander Kozachinskiy, Anthony Widjaja Lin, Vladimir Podolskii

Abstract: We contribute to the study of formal languages that can be recognized by transformer encoders. We focus on two self-attention mechanisms: (1) UHAT (Unique Hard Attention Transformers) and (2) AHAT (Average Hard Attention Transformers). UHAT encoders are known to recognize only languages inside the circuit complexity class ${\sf AC}^0$, i.e., accepted by a family of poly-sized and depth-bounded boolean circuits with unbounded fan-ins. On the other hand, AHAT encoders can recognize languages outside ${\sf AC}^0$, but their expressive power still lies within the bigger circuit complexity class ${\sf TC}^0$, i.e., ${\sf AC}^0$-circuits extended by majority gates. We first show a negative result that there is an ${\sf AC}^0$-language that cannot be recognized by a UHAT encoder. On the positive side, we show that UHAT encoders can recognize a rich fragment of ${\sf AC}^0$-languages, namely, all languages definable in first-order logic with arbitrary unary numerical predicates. This logic includes, for example, all regular languages from ${\sf AC}^0$. We then show that AHAT encoders can recognize all languages of our logic even when we enrich it with counting terms. We apply these results to derive new results on the expressive power of UHAT and AHAT up to permutation of letters (a.k.a. Parikh images).

Submitted 5 October, 2023; originally announced October 2023.

arXiv:2310.00420 [pdf, other]

An Efficient Algorithm for Clustered Multi-Task Compressive Sensing

Authors: Alexander Lin, Demba Ba

Abstract: This paper considers clustered multi-task compressive sensing, a hierarchical model that solves multiple compressive sensing tasks by finding clusters of tasks that leverage shared information to mutually improve signal reconstruction. The existing inference algorithm for this model is computationally expensive and does not scale well in high dimensions. The main bottleneck involves repeated matrix inversion and log-determinant computation for multiple large covariance matrices. We propose a new algorithm that substantially accelerates model inference by avoiding the need to explicitly compute these covariance matrices. Our approach combines Monte Carlo sampling with iterative linear solvers. Our experiments reveal that compared to the existing baseline, our algorithm can be up to thousands of times faster and an order of magnitude more memory-efficient.

Submitted 30 September, 2023; originally announced October 2023.

arXiv:2309.12372 [pdf, ps, other]

The Furstenberg property in Puiseux monoids

Authors: Andrew Lin, Henrick Rabinovitz, Qiao Zhang

Abstract: Let $M$ be a commutative monoid. The monoid $M$ is called atomic if every non-invertible element of $M$ factors into atoms (i.e., irreducible elements), while $M$ is called a Furstenberg monoid if every non-invertible element of $M$ is divisible by an atom. Additive submonoids of $\mathbb{Q}$ consisting of nonnegative rationals are called Puiseux monoids, and their atomic structure has been actively studied during the past few years. The primary purpose of this paper is to investigate the property of being Furstenberg in the context of Puiseux monoids. In this direction, we consider some properties weaker than being Furstenberg, and then we connect these properties with some atomic results which have been already established for Puiseux monoids.

Submitted 20 September, 2023; originally announced September 2023.

Comments: 17 pages

MSC Class: Primary: 20M13; 11Y05; Secondary: 20M14; 06F05

Showing 1–50 of 237 results for author: Lin, A