Search | arXiv e-print repository

Implementing engrams from a machine learning perspective: the relevance of a latent space

Abstract: In our previous work, we proposed that engrams in the brain could be biologically implemented as autoencoders over recurrent neural networks. These autoencoders would comprise basic excitatory/inhibitory motifs, with credit assignment deriving from a simple homeostatic criterion. This brief note examines the relevance of the latent space in these autoencoders. We consider the relationship between… ▽ More In our previous work, we proposed that engrams in the brain could be biologically implemented as autoencoders over recurrent neural networks. These autoencoders would comprise basic excitatory/inhibitory motifs, with credit assignment deriving from a simple homeostatic criterion. This brief note examines the relevance of the latent space in these autoencoders. We consider the relationship between the dimensionality of these autoencoders and the complexity of the information being encoded. We discuss how observed differences between species in their connectome could be linked to their cognitive capacities. Finally, we link this analysis with a basic but often overlooked fact: human cognition is likely limited by our own brain structure. However, this limitation does not apply to machine learning systems, and we should be aware of the need to learn how to exploit this augmented vision of the nature. △ Less

Submitted 23 July, 2024; originally announced July 2024.

Comments: 6 pages, 2 figures

arXiv:2406.18630 [pdf, other]

Improving Hyperparameter Optimization with Checkpointed Model Weights

Authors: Nikhil Mehta, Jonathan Lorraine, Steve Masson, Ramanathan Arunachalam, Zaid Pervaiz Bhat, James Lucas, Arun George Zachariah

Abstract: When training deep learning models, the performance depends largely on the selected hyperparameters. However, hyperparameter optimization (HPO) is often one of the most expensive parts of model design. Classical HPO methods treat this as a black-box optimization problem. However, gray-box HPO methods, which incorporate more information about the setup, have emerged as a promising direction for mor… ▽ More When training deep learning models, the performance depends largely on the selected hyperparameters. However, hyperparameter optimization (HPO) is often one of the most expensive parts of model design. Classical HPO methods treat this as a black-box optimization problem. However, gray-box HPO methods, which incorporate more information about the setup, have emerged as a promising direction for more efficient optimization. For example, using intermediate loss evaluations to terminate bad selections. In this work, we propose an HPO method for neural networks using logged checkpoints of the trained weights to guide future hyperparameter selections. Our method, Forecasting Model Search (FMS), embeds weights into a Gaussian process deep kernel surrogate model, using a permutation-invariant graph metanetwork to be data-efficient with the logged network weights. To facilitate reproducibility and further research, we open-source our code at https://github.com/NVlabs/forecasting-model-search. △ Less

Submitted 26 June, 2024; originally announced June 2024.

Comments: See the project website at https://research.nvidia.com/labs/toronto-ai/FMS/

MSC Class: 68T05 ACM Class: I.2.6; G.1.6; D.2.8

arXiv:2406.09940 [pdf, other]

Implementing engrams from a machine learning perspective: XOR as a basic motif

Authors: Jesus Marco de Lucas, Maria Peña Fernandez, Lara Lloret Iglesias

Abstract: We have previously presented the idea of how complex multimodal information could be represented in our brains in a compressed form, following mechanisms similar to those employed in machine learning tools, like autoencoders. In this short comment note we reflect, mainly with a didactical purpose, upon the basic question for a biological implementation: what could be the mechanism working as a los… ▽ More We have previously presented the idea of how complex multimodal information could be represented in our brains in a compressed form, following mechanisms similar to those employed in machine learning tools, like autoencoders. In this short comment note we reflect, mainly with a didactical purpose, upon the basic question for a biological implementation: what could be the mechanism working as a loss function, and how it could be connected to a neuronal network providing the required feedback to build a simple training configuration. We present our initial ideas based on a basic motif that implements an XOR switch, using few excitatory and inhibitory neurons. Such motif is guided by a principle of homeostasis, and it implements a loss function that could provide feedback to other neuronal structures, establishing a control system. We analyse the presence of this XOR motif in the connectome of C.Elegans, and indicate the relationship with the well-known lateral inhibition motif. We then explore how to build a basic biological neuronal structure with learning capacity integrating this XOR motif. Guided by the computational analogy, we show an initial example that indicates the feasibility of this approach, applied to learning binary sequences, like it is the case for simple melodies. In summary, we provide didactical examples exploring the parallelism between biological and computational learning mechanisms, identifying basic motifs and training procedures, and how an engram encoding a melody could be built using a simple recurrent network involving both excitatory and inhibitory neurons. △ Less

Submitted 14 June, 2024; originally announced June 2024.

Comments: 9 pages, short comment

arXiv:2406.01844 [pdf, other]

doi 10.1117/12.3019196

The Simons Observatory: Studies of Detector Yield and Readout Noise From the First Large-Scale Deployment of Microwave Multiplexing at the Large Aperture Telescope

Authors: Thomas P. Satterthwaite, Zeeshan Ahmed, Kyuyoung Bae, Mark Devlin, Simon Dicker, Shannon M. Duff, Daniel Dutcher, Saianeesh K. Haridas, Shawn W. Henderson, Johannes Hubmayr, Bradley R. Johnson, Anna Kofman, Jack Lashner, Michael J. Link, Tammy J. Lucas, Alex Manduca, Michael D. Niemack, John Orlowski-Scherer, Tristan Pinsonneault-Marotte, Max Silva-Feaver, Suzanne Staggs, Eve M. Vavagiakis, Yuhan Wang, Kaiwen Zheng

Abstract: The Simons Observatory is a new ground-based cosmic microwave background experiment, which is currently being commissioned in Chile's Atacama Desert. During its survey, the observatory's small aperture telescopes will map 10% of the sky in bands centered at frequencies ranging from 27 to 280 GHz to constrain cosmic inflation models, and its large aperture telescope will map 40% of the sky in the s… ▽ More The Simons Observatory is a new ground-based cosmic microwave background experiment, which is currently being commissioned in Chile's Atacama Desert. During its survey, the observatory's small aperture telescopes will map 10% of the sky in bands centered at frequencies ranging from 27 to 280 GHz to constrain cosmic inflation models, and its large aperture telescope will map 40% of the sky in the same bands to constrain cosmological parameters and use weak lensing to study large-scale structure. To achieve these science goals, the Simons Observatory is deploying these telescopes' receivers with 60,000 state-of-the-art superconducting transition-edge sensor bolometers for its first five year survey. Reading out this unprecedented number of cryogenic sensors, however, required the development of a novel readout system. The SMuRF electronics were developed to enable high-density readout of superconducting sensors using cryogenic microwave SQUID multiplexing technology. The commissioning of the SMuRF systems at the Simons Observatory is the largest deployment to date of microwave multiplexing technology for transition-edge sensors. In this paper, we show that a significant fraction of the systems deployed so far to the Simons Observatory's large aperture telescope meet baseline specifications for detector yield and readout noise in this early phase of commissioning. △ Less

Submitted 3 June, 2024; originally announced June 2024.

Comments: 10 pages, 5 figures, 1 table. To be presented at SPIE Astronomical Telescopes + Instrumentation 2024

Journal ref: Proc. SPIE 13102, Millimeter, Submillimeter, and Far-Infrared Detectors and Instrumentation for Astronomy XII. 1310223 (2024)

arXiv:2405.06868 [pdf, other]

Simons Observatory: Pre-deployment Performance of a Large Aperture Telescope Optics Tube in the 90 and 150 GHz Spectral Bands

Authors: Carlos E. Sierra, Kathleen Harrington, Shreya Sutariya, Thomas Alford, Anna M. Kofman, Grace E. Chesmore, Jason E. Austermann, Andrew Bazarko, James A. Beall, Tanay Bhandarkar, Mark J. Devlin, Simon R. Dicker, Peter N. Dow, Shannon M. Duff, Daniel Dutcher, Nicholas Galitzki, Joseph E. Golec, John C. Groh, Jon E. Gudmundsson, Saianeesh K. Haridas, Erin Healy, Johannes Hubmayr, Jeffrey Iuliano, Bradley R. Johnson, Claire S. Lessler , et al. (20 additional authors not shown)

Abstract: The Simons Observatory will map the temperature and polarization over half of the sky, at millimeter wavelengths in six spectral bands from the Atacama Desert in Chile. These data will provide new insights into the genesis, content, and history of our Universe; the astrophysics of galaxies and galaxy clusters; objects in our solar system; and time-varying astrophysical phenomena. This ambitious ne… ▽ More The Simons Observatory will map the temperature and polarization over half of the sky, at millimeter wavelengths in six spectral bands from the Atacama Desert in Chile. These data will provide new insights into the genesis, content, and history of our Universe; the astrophysics of galaxies and galaxy clusters; objects in our solar system; and time-varying astrophysical phenomena. This ambitious new instrument suite, initially comprising three 0.5 m small-aperture telescopes and one 6 m large aperture telescope, is designed using a common combination of new technologies and new implementations to realize an observatory significantly more capable than the previous generation. In this paper, we present the pre-deployment performance of the first mid-frequency "optics tube" which will be fielded on the large aperture telescope with sensitivity to the 90 and 150 GHz spectral bands. This optics tube contains lenses, filters, detectors, and readout components, all of which operate at cryogenic temperatures. It is one of seven that form the core of the large aperture telescope receiver in its initial deployment. We describe this optics tube, including details of comprehensive testing methods, new techniques for beam and passband characterization, and its measured performance. The performance metrics include beams, optical efficiency, passbands, and forecasts for the on-sky performance of the system. We forecast a sensitivity that exceeds the requirements of the large aperture telescope with greater than 30% margin in each spectral band, and predict that the instrument will realize diffraction-limited performance and the expected detector passbands. △ Less

Submitted 10 May, 2024; originally announced May 2024.

arXiv:2405.05550 [pdf, other]

The Simons Observatory: Design, integration, and testing of the small aperture telescopes

Authors: Nicholas Galitzki, Tran Tsan, Jake Spisak, Michael Randall, Max Silva-Feaver, Joseph Seibert, Jacob Lashner, Shunsuke Adachi, Sean M. Adkins, Thomas Alford, Kam Arnold, Peter C. Ashton, Jason E. Austermann, Carlo Baccigalupi, Andrew Bazarko, James A. Beall, Sanah Bhimani, Bryce Bixler, Gabriele Coppi, Lance Corbett, Kevin D. Crowley, Kevin T. Crowley, Samuel Day-Weiss, Simon Dicker, Peter N. Dow , et al. (55 additional authors not shown)

Abstract: The Simons Observatory (SO) is a cosmic microwave background (CMB) survey experiment that includes small-aperture telescopes (SATs) observing from an altitude of 5,200 m in the Atacama Desert in Chile. The SO SATs will cover six spectral bands between 27 and 280 GHz to search for primordial B-modes to a sensitivity of $σ(r)=0.002$, with quantified systematic errors well below this value. Each SAT… ▽ More The Simons Observatory (SO) is a cosmic microwave background (CMB) survey experiment that includes small-aperture telescopes (SATs) observing from an altitude of 5,200 m in the Atacama Desert in Chile. The SO SATs will cover six spectral bands between 27 and 280 GHz to search for primordial B-modes to a sensitivity of $σ(r)=0.002$, with quantified systematic errors well below this value. Each SAT is a self-contained cryogenic telescope with a 35$^\circ$ field of view, 42 cm diameter optical aperture, 40 K half-wave plate, 1 K refractive optics, and $<0.1$ K focal plane that holds $>12,000$ TES detectors. We describe the nominal design of the SATs and present details about the integration and testing for one operating at 93 and 145 GHz. △ Less

Submitted 10 May, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

arXiv:2404.19246 [pdf]

Logistic Map Pseudo Random Number Generator in FPGA

Authors: Mateo Jalen Andrew Calderon, Lee Jun Lei Lucas, Syarifuddin Azhar Bin Rosli, Stephanie See Hui Ying, Jarell Lim En Yu, Maoyang Xiang, T. Hui Teo

Abstract: This project develops a pseudo-random number generator (PRNG) using the logistic map, implemented in Verilog HDL on an FPGA and processes its output through a Central Limit Theorem (CLT) function to achieve a Gaussian distribution. The system integrates additional FPGA modules for real-time interaction and visualisation, including a clock generator, UART interface, XADC, and a 7-segment display dr… ▽ More This project develops a pseudo-random number generator (PRNG) using the logistic map, implemented in Verilog HDL on an FPGA and processes its output through a Central Limit Theorem (CLT) function to achieve a Gaussian distribution. The system integrates additional FPGA modules for real-time interaction and visualisation, including a clock generator, UART interface, XADC, and a 7-segment display driver. These components facilitate the direct display of PRNG values on the FPGA and the transmission of data to a laptop for histogram analysis, verifying the Gaussian nature of the output. This approach demonstrates the practical application of chaotic systems for generating Gaussian-distributed pseudo-random numbers in digital hardware, highlighting the logistic map's potential in PRNG design. △ Less

Submitted 30 April, 2024; originally announced April 2024.

Comments: 10 pages, 6 figures

arXiv:2403.18225 [pdf, other]

The Simons Observatory: Production-level Fabrication of the Mid- and Ultra-High-Frequency Wafers

Authors: Shannon M. Duff, Jason Austermann, James A. Beall, David P. Daniel, Johannes Hubmayr, Greg C. Jaehnig, Bradley R. Johnson, Dante Jones, Michael J. Link, Tammy J. Lucas, Rita F. Sonka, Suzanne T. Staggs, Joel Ullom, Yuhan Wang

Abstract: The Simons Observatory (SO) is a cosmic microwave background instrumentation suite in the Atacama Desert of Chile. More than 65,000 polarization-sensitive transition-edge sensor (TES) bolometers will be fielded in the frequency range spanning 27 to 280 GHz, with three separate dichroic designs. The mid-frequency 90/150 GHz and ultra-high-frequency 220/280 GHz detector arrays, fabricated at NIST, a… ▽ More The Simons Observatory (SO) is a cosmic microwave background instrumentation suite in the Atacama Desert of Chile. More than 65,000 polarization-sensitive transition-edge sensor (TES) bolometers will be fielded in the frequency range spanning 27 to 280 GHz, with three separate dichroic designs. The mid-frequency 90/150 GHz and ultra-high-frequency 220/280 GHz detector arrays, fabricated at NIST, account for 39 of 49 total detector modules and implement the feedhorn-fed orthomode transducer (OMT)-coupled TES bolometer architecture. A robust production-level fabrication framework for these detector arrays and the monolithic DC/RF routing wafers has been developed, which includes single device prototyping, process monitoring techniques, in-process metrology, and cryogenic measurements of critical film properties. Application of this framework has resulted in timely delivery of nearly 100 total superconducting focal plane components to SO with 88% of detector wafers meeting nominal criteria for integration into a detector module: a channel yield > 95% and Tc in the targeted range. △ Less

Submitted 26 March, 2024; originally announced March 2024.

Comments: 10 pages, 6 figures. Proceedings of the 20th International Conference on Low Temperature Detectors (LTD20). Submitted to JLTP

arXiv:2403.15385 [pdf, other]

LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis

Authors: Kevin Xie, Jonathan Lorraine, Tianshi Cao, Jun Gao, James Lucas, Antonio Torralba, Sanja Fidler, Xiaohui Zeng

Abstract: Recent text-to-3D generation approaches produce impressive 3D results but require time-consuming optimization that can take up to an hour per prompt. Amortized methods like ATT3D optimize multiple prompts simultaneously to improve efficiency, enabling fast text-to-3D synthesis. However, they cannot capture high-frequency geometry and texture details and struggle to scale to large prompt sets, so t… ▽ More Recent text-to-3D generation approaches produce impressive 3D results but require time-consuming optimization that can take up to an hour per prompt. Amortized methods like ATT3D optimize multiple prompts simultaneously to improve efficiency, enabling fast text-to-3D synthesis. However, they cannot capture high-frequency geometry and texture details and struggle to scale to large prompt sets, so they generalize poorly. We introduce LATTE3D, addressing these limitations to achieve fast, high-quality generation on a significantly larger prompt set. Key to our method is 1) building a scalable architecture and 2) leveraging 3D data during optimization through 3D-aware diffusion priors, shape regularization, and model initialization to achieve robustness to diverse and complex training prompts. LATTE3D amortizes both neural field and textured surface generation to produce highly detailed textured meshes in a single forward pass. LATTE3D generates 3D objects in 400ms, and can be further enhanced with fast test-time optimization. △ Less

Submitted 22 March, 2024; originally announced March 2024.

Comments: See the project website at https://research.nvidia.com/labs/toronto-ai/LATTE3D/

MSC Class: 68T45 ACM Class: I.2.6; I.2.7; I.3.6; I.3.7

arXiv:2402.07483 [pdf, other]

T-RAG: Lessons from the LLM Trenches

Authors: Masoomali Fatehkia, Ji Kim Lucas, Sanjay Chawla

Abstract: Large Language Models (LLM) have shown remarkable language capabilities fueling attempts to integrate them into applications across a wide range of domains. An important application area is question answering over private enterprise documents where the main considerations are data security, which necessitates applications that can be deployed on-prem, limited computational resources and the need f… ▽ More Large Language Models (LLM) have shown remarkable language capabilities fueling attempts to integrate them into applications across a wide range of domains. An important application area is question answering over private enterprise documents where the main considerations are data security, which necessitates applications that can be deployed on-prem, limited computational resources and the need for a robust application that correctly responds to queries. Retrieval-Augmented Generation (RAG) has emerged as the most prominent framework for building LLM-based applications. While building a RAG is relatively straightforward, making it robust and a reliable application requires extensive customization and relatively deep knowledge of the application domain. We share our experiences building and deploying an LLM application for question answering over private organizational documents. Our application combines the use of RAG with a finetuned open-source LLM. Additionally, our system, which we call Tree-RAG (T-RAG), uses a tree structure to represent entity hierarchies within the organization. This is used to generate a textual description to augment the context when responding to user queries pertaining to entities within the organization's hierarchy. Our evaluations, including a Needle in a Haystack test, show that this combination performs better than a simple RAG or finetuning implementation. Finally, we share some lessons learned based on our experiences building an LLM application for real-world use. △ Less

Submitted 6 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

Comments: Added Needle in a Haystack analysis for T-RAG

arXiv:2401.07867 [pdf, other]

Authorship Obfuscation in Multilingual Machine-Generated Text Detection

Authors: Dominik Macko, Robert Moro, Adaku Uchendu, Ivan Srba, Jason Samuel Lucas, Michiharu Yamashita, Nafis Irtiza Tripto, Dongwon Lee, Jakub Simko, Maria Bielikova

Abstract: High-quality text generation capability of recent Large Language Models (LLMs) causes concerns about their misuse (e.g., in massive generation/spread of disinformation). Machine-generated text (MGT) detection is important to cope with such threats. However, it is susceptible to authorship obfuscation (AO) methods, such as paraphrasing, which can cause MGTs to evade detection. So far, this was eval… ▽ More High-quality text generation capability of recent Large Language Models (LLMs) causes concerns about their misuse (e.g., in massive generation/spread of disinformation). Machine-generated text (MGT) detection is important to cope with such threats. However, it is susceptible to authorship obfuscation (AO) methods, such as paraphrasing, which can cause MGTs to evade detection. So far, this was evaluated only in monolingual settings. Thus, the susceptibility of recently proposed multilingual detectors is still unknown. We fill this gap by comprehensively benchmarking the performance of 10 well-known AO methods, attacking 37 MGT detection methods against MGTs in 11 languages (i.e., 10 $\times$ 37 $\times$ 11 = 4,070 combinations). We also evaluate the effect of data augmentation on adversarial robustness using obfuscated texts. The results indicate that all tested AO methods can cause evasion of automated detection in all tested languages, where homoglyph attacks are especially successful. However, some of the AO methods severely damaged the text, making it no longer readable or easily recognizable by humans (e.g., changed language, weird characters). △ Less

Submitted 18 June, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

arXiv:2312.09192 [pdf, ps, other]

A symplectic approach to Schrödinger equations in the infinite-dimensional unbounded setting

Authors: Javier de Lucas, Julia Lange, Xavier Rivas

Abstract: By using the theory of analytic vectors and manifolds modelled on normed spaces, we provide a rigorous symplectic differential geometric approach to $t$-dependent Schrödinger equations on separable (possibly infinite-dimensional) Hilbert spaces determined by unbounded $t$-dependent self-adjoint Hamiltonians satisfying a technical condition. As an application, the Marsden--Weinstein reduction proce… ▽ More By using the theory of analytic vectors and manifolds modelled on normed spaces, we provide a rigorous symplectic differential geometric approach to $t$-dependent Schrödinger equations on separable (possibly infinite-dimensional) Hilbert spaces determined by unbounded $t$-dependent self-adjoint Hamiltonians satisfying a technical condition. As an application, the Marsden--Weinstein reduction procedure is employed to map above-mentioned $t$-dependent Schrödinger equations onto their projective spaces. Other applications of physical and mathematical relevance are also analysed. △ Less

Submitted 14 December, 2023; originally announced December 2023.

Comments: 27 pages

MSC Class: 34A26; 34A34 (primary) 17B66; 53Z05 (secondary)

arXiv:2312.05238 [pdf, ps, other]

Quasi-rectifiable Lie algebras for partial differential equations

Authors: A. M. Grundland, J. de Lucas

Abstract: We introduce families of quasi-rectifiable vector fields and study their geometric and algebraic aspects. Then, we analyse their applications to systems of partial differential equations. Our results explain, in a simpler manner, previous findings about hydrodynamic-type equations. Facts concerning families of quasi-rectifiable vector fields, their relation to Hamiltonian systems, and practical pr… ▽ More We introduce families of quasi-rectifiable vector fields and study their geometric and algebraic aspects. Then, we analyse their applications to systems of partial differential equations. Our results explain, in a simpler manner, previous findings about hydrodynamic-type equations. Facts concerning families of quasi-rectifiable vector fields, their relation to Hamiltonian systems, and practical procedures for studying such families are developed. We introduce and analyse quasi-rectifiable Lie algebras, which are motivated by geometric and practical reasons. We classify different types of quasi-rectifiable Lie algebras, e.g. indecomposable ones up to dimension five. New methods for solving systems of hydrodynamic-type equations are established to illustrate our results. In particular, we study hydrodynamic-type systems admitting $k$-wave solutions through quasi-rectifiable Lie algebras of vector fields. We develop techniques for obtaining the submanifolds related to quasi-rectifiable Lie algebras of vector fields and systems of partial differential equations admitting a nonlinear superposition rule: the PDE Lie systems. △ Less

Submitted 8 April, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

Comments: 45 pages. Improved terminology. Typos and minor issues corrected

MSC Class: 35Q53 (primary); 35A30; 35Q58; 53A05 (secondary)

arXiv:2312.04501 [pdf, other]

Graph Metanetworks for Processing Diverse Neural Architectures

Authors: Derek Lim, Haggai Maron, Marc T. Law, Jonathan Lorraine, James Lucas

Abstract: Neural networks efficiently encode learned information within their parameters. Consequently, many tasks can be unified by treating neural networks themselves as input data. When doing so, recent studies demonstrated the importance of accounting for the symmetries and geometry of parameter spaces. However, those works developed architectures tailored to specific networks such as MLPs and CNNs with… ▽ More Neural networks efficiently encode learned information within their parameters. Consequently, many tasks can be unified by treating neural networks themselves as input data. When doing so, recent studies demonstrated the importance of accounting for the symmetries and geometry of parameter spaces. However, those works developed architectures tailored to specific networks such as MLPs and CNNs without normalization layers, and generalizing such architectures to other types of networks can be challenging. In this work, we overcome these challenges by building new metanetworks - neural networks that take weights from other neural networks as input. Put simply, we carefully build graphs representing the input neural networks and process the graphs using graph neural networks. Our approach, Graph Metanetworks (GMNs), generalizes to neural architectures where competing methods struggle, such as multi-head attention layers, normalization layers, convolutional layers, ResNet blocks, and group-equivariant linear layers. We prove that GMNs are expressive and equivariant to parameter permutation symmetries that leave the input neural network functions unchanged. We validate the effectiveness of our method on several metanetwork tasks over diverse neural network architectures. △ Less

Submitted 29 December, 2023; v1 submitted 7 December, 2023; originally announced December 2023.

Comments: 29 pages. v2 updated experimental results and details

arXiv:2311.15035 [pdf, ps, other]

An energy-momentum method for ordinary differential equations with an underlying $k$-polysymplectic manifold

Authors: Leonardo Colombo, Javier de Lucas, Xavier Rivas, Bartosz M. Zawora

Abstract: This work presents a comprehensive review of the $k$-polysymplectic Marsden-Weinstein reduction theory, rectifying prior errors and inaccuracies in the literature while introducing novel findings. It also emphasises the genuine practical significance of seemingly minor technical details. On this basis, we introduce a novel $k$-polysymplectic energy-momentum method, new related stability analysis t… ▽ More This work presents a comprehensive review of the $k$-polysymplectic Marsden-Weinstein reduction theory, rectifying prior errors and inaccuracies in the literature while introducing novel findings. It also emphasises the genuine practical significance of seemingly minor technical details. On this basis, we introduce a novel $k$-polysymplectic energy-momentum method, new related stability analysis techniques, and apply them to Hamiltonian systems of ordinary differential equations relative to a $k$-polysymplectic manifold. We provide detailed examples of both physical and mathematical significance, including the study of complex Schwarz equations related to the Schwarz derivative, a series of isotropic oscillators, integrable Hamiltonian systems, quantum oscillators with dissipation, affine systems of differential equations, and polynomial dynamical systems. △ Less

Submitted 25 November, 2023; originally announced November 2023.

Comments: 40 pages

MSC Class: 34A26; 34D20; 37J39 (primary); 53B50; 53C15 (secondary)

arXiv:2311.08427 [pdf, other]

Towards a Transportable Causal Network Model Based on Observational Healthcare Data

Authors: Alice Bernasconi, Alessio Zanga, Peter J. F. Lucas, Marco Scutari, Fabio Stella

Abstract: Over the last decades, many prognostic models based on artificial intelligence techniques have been used to provide detailed predictions in healthcare. Unfortunately, the real-world observational data used to train and validate these models are almost always affected by biases that can strongly impact the outcomes validity: two examples are values missing not-at-random and selection bias. Addressi… ▽ More Over the last decades, many prognostic models based on artificial intelligence techniques have been used to provide detailed predictions in healthcare. Unfortunately, the real-world observational data used to train and validate these models are almost always affected by biases that can strongly impact the outcomes validity: two examples are values missing not-at-random and selection bias. Addressing them is a key element in achieving transportability and in studying the causal relationships that are critical in clinical decision making, going beyond simpler statistical approaches based on probabilistic association. In this context, we propose a novel approach that combines selection diagrams, missingness graphs, causal discovery and prior knowledge into a single graphical model to estimate the cardiovascular risk of adolescent and young females who survived breast cancer. We learn this model from data comprising two different cohorts of patients. The resulting causal network model is validated by expert clinicians in terms of risk assessment, accuracy and explainability, and provides a prognostic model that outperforms competing machine learning methods. △ Less

Submitted 20 November, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

arXiv:2311.05583 [pdf, other]

doi 10.21203/rs.3.rs-3547073/v1

The Simons Observatory: Large-Scale Characterization of 90/150 GHz TES Detector Modules

Authors: Daniel Dutcher, Shannon M. Duff, John C. Groh, Erin Healy, Johannes Hubmayr, Bradley R. Johnson, Dante Jones, Ben Keller, Lawrence T. Lin, Michael J. Link, Tammy J. Lucas, Samuel Morgan, Yudai Seino, Rita F. Sonka, Suzanne T. Staggs, Yuhan Wang, Kaiwen Zheng

Abstract: The Simons Observatory (SO) is a cosmic microwave background instrumentation suite being deployed in the Atacama Desert in northern Chile. The telescopes within SO use three types of dichroic transition-edge sensor (TES) detector arrays, with the 90 and 150 GHz Mid-Frequency (MF) arrays containing 65% of the approximately 68,000 detectors in the first phase of SO. All of the 26 required MF detecto… ▽ More The Simons Observatory (SO) is a cosmic microwave background instrumentation suite being deployed in the Atacama Desert in northern Chile. The telescopes within SO use three types of dichroic transition-edge sensor (TES) detector arrays, with the 90 and 150 GHz Mid-Frequency (MF) arrays containing 65% of the approximately 68,000 detectors in the first phase of SO. All of the 26 required MF detector arrays have now been fabricated, packaged into detector modules, and tested in laboratory cryostats. Across all modules, we find an average operable detector yield of 84% and median saturation powers of (2.8, 8.0) pW with interquartile ranges of (1, 2) pW at (90, 150) GHz, respectively, falling within their targeted ranges. We measure TES normal resistances and superconducting transition temperatures on each detector wafer to be uniform within 3%, with overall central values of 7.5 mohm and 165 mK, respectively. Results on time constants, optical efficiency, and noise performance are also presented and are consistent with achieving instrument sensitivity forecasts. △ Less

Submitted 29 January, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

Comments: 8 pages, 3 figures. Proceedings of the 20th International Conference on Low Temperature Detectors (LTD20). Accepted to JLTP

arXiv:2310.15515 [pdf, other]

Fighting Fire with Fire: The Dual Role of LLMs in Crafting and Detecting Elusive Disinformation

Authors: Jason Lucas, Adaku Uchendu, Michiharu Yamashita, Jooyoung Lee, Shaurya Rohatgi, Dongwon Lee

Abstract: Recent ubiquity and disruptive impacts of large language models (LLMs) have raised concerns about their potential to be misused (.i.e, generating large-scale harmful and misleading content). To combat this emerging risk of LLMs, we propose a novel "Fighting Fire with Fire" (F3) strategy that harnesses modern LLMs' generative and emergent reasoning capabilities to counter human-written and LLM-gene… ▽ More Recent ubiquity and disruptive impacts of large language models (LLMs) have raised concerns about their potential to be misused (.i.e, generating large-scale harmful and misleading content). To combat this emerging risk of LLMs, we propose a novel "Fighting Fire with Fire" (F3) strategy that harnesses modern LLMs' generative and emergent reasoning capabilities to counter human-written and LLM-generated disinformation. First, we leverage GPT-3.5-turbo to synthesize authentic and deceptive LLM-generated content through paraphrase-based and perturbation-based prefix-style prompts, respectively. Second, we apply zero-shot in-context semantic reasoning techniques with cloze-style prompts to discern genuine from deceptive posts and news articles. In our extensive experiments, we observe GPT-3.5-turbo's zero-shot superiority for both in-distribution and out-of-distribution datasets, where GPT-3.5-turbo consistently achieved accuracy at 68-72%, unlike the decline observed in previous customized and fine-tuned disinformation detectors. Our codebase and dataset are available at https://github.com/mickeymst/F3. △ Less

Submitted 24 October, 2023; originally announced October 2023.

Comments: Accepted at EMNLP 2023

arXiv:2310.13606 [pdf, other]

doi 10.18653/v1/2023.emnlp-main.616

MULTITuDE: Large-Scale Multilingual Machine-Generated Text Detection Benchmark

Authors: Dominik Macko, Robert Moro, Adaku Uchendu, Jason Samuel Lucas, Michiharu Yamashita, Matúš Pikuliak, Ivan Srba, Thai Le, Dongwon Lee, Jakub Simko, Maria Bielikova

Abstract: There is a lack of research into capabilities of recent LLMs to generate convincing text in languages other than English and into performance of detectors of machine-generated text in multilingual settings. This is also reflected in the available benchmarks which lack authentic texts in languages other than English and predominantly cover older generators. To fill this gap, we introduce MULTITuDE,… ▽ More There is a lack of research into capabilities of recent LLMs to generate convincing text in languages other than English and into performance of detectors of machine-generated text in multilingual settings. This is also reflected in the available benchmarks which lack authentic texts in languages other than English and predominantly cover older generators. To fill this gap, we introduce MULTITuDE, a novel benchmarking dataset for multilingual machine-generated text detection comprising of 74,081 authentic and machine-generated texts in 11 languages (ar, ca, cs, de, en, es, nl, pt, ru, uk, and zh) generated by 8 multilingual LLMs. Using this benchmark, we compare the performance of zero-shot (statistical and black-box) and fine-tuned detectors. Considering the multilinguality, we evaluate 1) how these detectors generalize to unseen languages (linguistically similar as well as dissimilar) and unseen LLMs and 2) whether the detectors improve their performance when trained on multiple languages. △ Less

Submitted 20 October, 2023; originally announced October 2023.

Journal ref: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing

arXiv:2308.00820 [pdf, other]

Geometry preserving numerical methods for physical systems with finite-dimensional Lie algebras

Authors: L. Blanco, F. Jiménez Alburquerque, J. de Lucas, C. Sardón

Abstract: We propose a geometric integrator to numerically approximate the flow of Lie systems. The key is a novel procedure that integrates the Lie system on a Lie group intrinsically associated with a Lie system on a general manifold via a Lie group action, and then generates the discrete solution of the Lie system on the manifold via a solution of the Lie system on the Lie group. One major result from th… ▽ More We propose a geometric integrator to numerically approximate the flow of Lie systems. The key is a novel procedure that integrates the Lie system on a Lie group intrinsically associated with a Lie system on a general manifold via a Lie group action, and then generates the discrete solution of the Lie system on the manifold via a solution of the Lie system on the Lie group. One major result from the integration of a Lie system on a Lie group is that one is able to solve all associated Lie systems on manifolds at the same time, and that Lie systems on Lie groups can be described through first-order systems of linear homogeneous ordinary differential equations (ODEs) in normal form. This brings a lot of advantages, since solving a linear system of ODEs involves less numerical cost. Specifically, we use two families of numerical schemes on the Lie group, which are designed to preserve its geometrical structure: the first one based on the Magnus expansion, whereas the second is based on Runge-Kutta-Munthe-Kaas (RKMK) methods. Moreover, since the aforementioned action relates the Lie group and the manifold where the Lie system evolves, the resulting integrator preserves any geometric structure of the latter. We compare both methods for Lie systems with geometric invariants, particularly a class on Lie systems on curved spaces. We also illustrate the superiority of our method for describing long-term behavior and for differential equations admitting solutions whose geometric features depends heavily on initial conditions. As already mentioned, our milestone is to show that the method we propose preserves all the geometric invariants very faithfully, in comparison with nongeometric numerical methods. △ Less

Submitted 2 December, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

Comments: New theoretical remarks and applications added. Presentation improved. arXiv admin note: text overlap with arXiv:2204.00046

MSC Class: 34A26; 53A70 (primary) 37M15; 49M25 (secondary)

arXiv:2307.06232 [pdf, ps, other]

Hamiltonian stochastic Lie systems and applications

Authors: J. de Lucas, X. Rivas, M. Zajac

Abstract: This paper provides a practical approach to stochastic Lie systems, i.e. stochastic differential equations whose general solutions can be written as a function depending only on a generic family of particular solutions and some constants, so as to emphasise their applications. We correct the known stochastic Lie theorem characterising stochastic Lie systems, proving that, contrary to previous clai… ▽ More This paper provides a practical approach to stochastic Lie systems, i.e. stochastic differential equations whose general solutions can be written as a function depending only on a generic family of particular solutions and some constants, so as to emphasise their applications. We correct the known stochastic Lie theorem characterising stochastic Lie systems, proving that, contrary to previous claims, it satisfies the Malliavin's principle. Meanwhile, we show that stochastic Lie systems admit new stochastic features in the Ito approach. New generalisations of stochastic Lie systems, like the so-called stochastic foliated Lie systems, are devised. Subsequently, we focus on stochastic (foliated) Lie systems that can be studied as Hamiltonian systems using different types of differential geometric structures. We study their stability properties and we devise the basics of an energy-momentum method. A stochastic Poisson coalgebra method is developed to derive superposition rules for Hamiltonian stochastic Lie systems. Applications of our results are found in coronavirus stochastic models, stochastic Lotka-Volterra systems, stochastic SIS models of different types, etc. Our results improve previous approaches by using stochastic differential equations instead of deterministic models designed to grasp some of their stochastic features. △ Less

Submitted 12 July, 2023; originally announced July 2023.

Comments: 24 pages

MSC Class: 60H10; 34A26 (Primary) 37N25; 53Z05. (Secondary)

arXiv:2306.08556 [pdf, ps, other]

On Darboux theorems for geometric structures induced by closed forms

Authors: Xavier Gràcia, Javier de Lucas, Xavier Rivas, Narciso Román-Roy

Abstract: This work reviews the classical Darboux theorem for symplectic, presymplectic, and cosymplectic manifolds (which are used to describe regular and singular mechanical systems), and certain cases of multisymplectic manifolds, and extends it in new ways to k-symplectic and k-cosymplectic manifolds (all these structures appear in the geometric formulation of first-order classical field theories). More… ▽ More This work reviews the classical Darboux theorem for symplectic, presymplectic, and cosymplectic manifolds (which are used to describe regular and singular mechanical systems), and certain cases of multisymplectic manifolds, and extends it in new ways to k-symplectic and k-cosymplectic manifolds (all these structures appear in the geometric formulation of first-order classical field theories). Moreover, we discuss the existence of Darboux theorems for classes of precosymplectic, k-presymplectic, k-precosymplectic, and premultisymplectic manifolds, which are the geometrical structures underlying some kinds of singular field theories. Approaches to Darboux theorems based on flat connections associated with geometric structures are given, while new results on polarisations for (k-)(pre)(co)symplectic structures arise. △ Less

Submitted 6 July, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

Comments: improved and extended proofs. 33 pp

MSC Class: 53C15; 53C12; 53D05; 53C10

arXiv:2306.07349 [pdf, other]

ATT3D: Amortized Text-to-3D Object Synthesis

Authors: Jonathan Lorraine, Kevin Xie, Xiaohui Zeng, Chen-Hsuan Lin, Towaki Takikawa, Nicholas Sharp, Tsung-Yi Lin, Ming-Yu Liu, Sanja Fidler, James Lucas

Abstract: Text-to-3D modelling has seen exciting progress by combining generative text-to-image models with image-to-3D methods like Neural Radiance Fields. DreamFusion recently achieved high-quality results but requires a lengthy, per-prompt optimization to create 3D objects. To address this, we amortize optimization over text prompts by training on many prompts simultaneously with a unified model, instead… ▽ More Text-to-3D modelling has seen exciting progress by combining generative text-to-image models with image-to-3D methods like Neural Radiance Fields. DreamFusion recently achieved high-quality results but requires a lengthy, per-prompt optimization to create 3D objects. To address this, we amortize optimization over text prompts by training on many prompts simultaneously with a unified model, instead of separately. With this, we share computation across a prompt set, training in less time than per-prompt optimization. Our framework - Amortized text-to-3D (ATT3D) - enables knowledge-sharing between prompts to generalize to unseen setups and smooth interpolations between text for novel assets and simple animations. △ Less

Submitted 6 June, 2023; originally announced June 2023.

Comments: 22 pages, 20 figures

MSC Class: 68T45 ACM Class: I.2.6; I.2.7; I.3.6; I.3.7

arXiv:2306.03399 [pdf, other]

Scalable telomere-to-telomere assembly for diploid and polyploid genomes with double graph

Authors: Haoyu Cheng, Mobin Asri, Julian Lucas, Sergey Koren, Heng Li

Abstract: Despite recent advances in the length and the accuracy of long-read data, building haplotype-resolved genome assemblies from telomere to telomere still requires considerable computational resources. In this study, we present an efficient de novo assembly algorithm that combines multiple sequencing technologies to scale up population-wide telomere-to-telomere assemblies. By utilizing twenty-two hum… ▽ More Despite recent advances in the length and the accuracy of long-read data, building haplotype-resolved genome assemblies from telomere to telomere still requires considerable computational resources. In this study, we present an efficient de novo assembly algorithm that combines multiple sequencing technologies to scale up population-wide telomere-to-telomere assemblies. By utilizing twenty-two human and two plant genomes, we demonstrate that our algorithm is around an order of magnitude cheaper than existing methods, while producing better diploid and haploid assemblies. Notably, our algorithm is the only feasible solution to the haplotype-resolved assembly of polyploid genomes. △ Less

Submitted 6 June, 2023; originally announced June 2023.

Comments: 14 pages, 4 fuhires

arXiv:2305.10050 [pdf, other]

The Impact of Missing Data on Causal Discovery: A Multicentric Clinical Study

Authors: Alessio Zanga, Alice Bernasconi, Peter J. F. Lucas, Hanny Pijnenborg, Casper Reijnen, Marco Scutari, Fabio Stella

Abstract: Causal inference for testing clinical hypotheses from observational data presents many difficulties because the underlying data-generating model and the associated causal graph are not usually available. Furthermore, observational data may contain missing values, which impact the recovery of the causal graph by causal discovery algorithms: a crucial issue often ignored in clinical studies. In this… ▽ More Causal inference for testing clinical hypotheses from observational data presents many difficulties because the underlying data-generating model and the associated causal graph are not usually available. Furthermore, observational data may contain missing values, which impact the recovery of the causal graph by causal discovery algorithms: a crucial issue often ignored in clinical studies. In this work, we use data from a multi-centric study on endometrial cancer to analyze the impact of different missingness mechanisms on the recovered causal graph. This is achieved by extending state-of-the-art causal discovery algorithms to exploit expert knowledge without sacrificing theoretical soundness. We validate the recovered graph with expert physicians, showing that our approach finds clinically-relevant solutions. Finally, we discuss the goodness of fit of our graph and its consistency from a clinical decision-making perspective using graphical separation to validate causal pathways. △ Less

Submitted 3 November, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

arXiv:2305.10041 [pdf, other]

Risk Assessment of Lymph Node Metastases in Endometrial Cancer Patients: A Causal Approach

Authors: Alessio Zanga, Alice Bernasconi, Peter J. F. Lucas, Hanny Pijnenborg, Casper Reijnen, Marco Scutari, Fabio Stella

Abstract: Assessing the pre-operative risk of lymph node metastases in endometrial cancer patients is a complex and challenging task. In principle, machine learning and deep learning models are flexible and expressive enough to capture the dynamics of clinical risk assessment. However, in this setting we are limited to observational data with quality issues, missing values, small sample size and high dimens… ▽ More Assessing the pre-operative risk of lymph node metastases in endometrial cancer patients is a complex and challenging task. In principle, machine learning and deep learning models are flexible and expressive enough to capture the dynamics of clinical risk assessment. However, in this setting we are limited to observational data with quality issues, missing values, small sample size and high dimensionality: we cannot reliably learn such models from limited observational data with these sources of bias. Instead, we choose to learn a causal Bayesian network to mitigate the issues above and to leverage the prior knowledge on endometrial cancer available from clinicians and physicians. We introduce a causal discovery algorithm for causal Bayesian networks based on bootstrap resampling, as opposed to the single imputation used in related works. Moreover, we include a context variable to evaluate whether selection bias results in learning spurious associations. Finally, we discuss the strengths and limitations of our findings in light of the presence of missing data that may be missing-not-at-random, which is common in real-world clinical settings. △ Less

Submitted 17 May, 2023; originally announced May 2023.

arXiv:2303.02754 [pdf, other]

doi 10.1021/acsapm.3c00015

On the use of organic semiconductors as handles for optical tweezers experiments: trapping and manipulating polyaniline (PANI) microparticles

Authors: Kairon M. Oliveira, Tiago A. Moura, Janaisa L. C. Lucas, Alvaro V. N. C. Teixeira, Marcio S. Rocha, Joaquim B. S. Mendes

Abstract: Here we propose the use of the organic semiconductor polyaniline (PANI) for the preparation of spherical-shaped microparticles to serve as handles in optical tweezers (OT) experiments. The stable trapping and manipulation of PANI beads was demonstrated for the first time, using a Gaussian ($TEM_{00}$) beam optical tweezers. The trap stiffness was characterized for various different parameters such… ▽ More Here we propose the use of the organic semiconductor polyaniline (PANI) for the preparation of spherical-shaped microparticles to serve as handles in optical tweezers (OT) experiments. The stable trapping and manipulation of PANI beads was demonstrated for the first time, using a Gaussian ($TEM_{00}$) beam optical tweezers. The trap stiffness was characterized for various different parameters such as the bead radius, the laser power and the distance between the bead and the coverslip of the sample chamber, attesting the viability of using such material for optical manipulation. Since the effective optical properties of PANI can be modulated by the synthesis process, new related applications are also proposed. The results of the present work therefore open the door for using semiconductor polymeric materials in OT applications. △ Less

Submitted 5 March, 2023; originally announced March 2023.

Comments: 10 pages and 5 figures

arXiv:2303.01253 [pdf, other]

Implementing engrams from a machine learning perspective: matching for prediction

Authors: Jesus Marco de Lucas

Abstract: Despite evidence for the existence of engrams as memory support structures in our brains, there is no consensus framework in neuroscience as to what their physical implementation might be. Here we propose how we might design a computer system to implement engrams using neural networks, with the main aim of exploring new ideas using machine learning techniques, guided by challenges in neuroscience.… ▽ More Despite evidence for the existence of engrams as memory support structures in our brains, there is no consensus framework in neuroscience as to what their physical implementation might be. Here we propose how we might design a computer system to implement engrams using neural networks, with the main aim of exploring new ideas using machine learning techniques, guided by challenges in neuroscience. Building on autoencoders, we propose latent neural spaces as indexes for storing and retrieving information in a compressed format. We consider this technique as a first step towards predictive learning: autoencoders are designed to compare reconstructed information with the original information received, providing a kind of predictive ability, which is an attractive evolutionary argument. We then consider how different states in latent neural spaces corresponding to different types of sensory input could be linked by synchronous activation, providing the basis for a sparse implementation of memory using concept neurons. Finally, we list some of the challenges and questions that link neuroscience and data science and that could have implications for both fields, and conclude that a more interdisciplinary approach is needed, as many scientists have already suggested. △ Less

Submitted 1 March, 2023; originally announced March 2023.

Comments: 7 pages, 1 figure

ACM Class: I.2.0

arXiv:2302.09037 [pdf, ps, other]

doi 10.1016/j.geomphys.2023.104899

On k-polycosymplectic Marsden-Weinstein reductions

Authors: J. de Lucas, X. Rivas, S. Vilariño, B. M. Zawora

Abstract: We review and slightly improve the known k-polysymplectic Marsden--Weinstein reduction theory by removing some technical conditions on k-polysymplectic momentum maps by developing a theory of affine Lie group actions for k-polysymplectic momentum maps, removing the necessity of their co-adjoint equivariance. Then, we focus on the analysis of a particular case of k-polysymplectic manifolds, the so-… ▽ More We review and slightly improve the known k-polysymplectic Marsden--Weinstein reduction theory by removing some technical conditions on k-polysymplectic momentum maps by developing a theory of affine Lie group actions for k-polysymplectic momentum maps, removing the necessity of their co-adjoint equivariance. Then, we focus on the analysis of a particular case of k-polysymplectic manifolds, the so-called fibred ones, and we study their k-polysymplectic Marsden--Weinstein reductions. Previous results allow us to devise a k-polycosymplectic Marsden--Weinstein reduction theory, which represents one of our main results. Our findings are applied to study coupled vibrating strings and, more generally, k-polycosymplectic Hamiltonian systems with field symmetries. We show that k-polycosymplectic geometry can be understood as a particular type of k-polysymplectic geometry. Finally, a k-cosymplectic to l-cosymplectic geometric reduction theory is presented, which reduces, geometrically, the space-time variables in a k-cosymplectic framework. An application of this latter result to a vibrating membrane with symmetries is given. △ Less

Submitted 5 June, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

Comments: 49 pages. Revised version. Added a reduction procedure of the space-time coordinates

MSC Class: 53C15; 53Z05; 70H33 (primary) 35A30; 35B06 (secondary)

Journal ref: J. Geom. Phys. 191, 104899 (2023)

arXiv:2302.05827 [pdf, ps, other]

Cosymplectic geometry, reductions, and energy-momentum methods with applications

Authors: J. de Lucas, A. Maskalaniec, B. M. Zawora

Abstract: Classical energy-momentum methods study the existence and stability properties of solutions of $t$-dependent Hamilton equations on symplectic manifolds whose evolution is given by their Hamiltonian Lie symmetries. The points of such solutions are called relative equilibrium points. This work devises a new cosymplectic energy-momentum method providing a new and more general framework to study $t$-d… ▽ More Classical energy-momentum methods study the existence and stability properties of solutions of $t$-dependent Hamilton equations on symplectic manifolds whose evolution is given by their Hamiltonian Lie symmetries. The points of such solutions are called relative equilibrium points. This work devises a new cosymplectic energy-momentum method providing a new and more general framework to study $t$-dependent Hamilton equations. In fact, cosymplectic geometry allows for using more types of distinguished Lie symmetries (given by Hamiltonian, gradient, or evolution vector fields), relative equilibrium points, and reduction methods, than symplectic techniques. To make our work more self-contained and to fill some gaps in the literature, a review of the cosymplectic formalism and the cosymplectic Marsden-Weinstein reduction is included. Known and new types of relative equilibrium points are characterised and studied. Our methods remove technical conditions used in previous energy-momentum methods, like the ${\rm Ad}^*$-equivariance of momentum maps. Eigenfunctions of $t$-dependent Schrödinger equations are interpreted in terms of relative equilibrium points in cosymplectic manifolds. A new cosymplectic-to-symplectic reduction is developed and a new associated type of relative equilibrium points, the so-called gradient relative equilibrium points, are introduced and applied to study the Lagrange points and Hill radii of a restricted circular three-body system by means of a not Hamiltonian Lie symmetry of the system. △ Less

Submitted 23 November, 2023; v1 submitted 11 February, 2023; originally announced February 2023.

Comments: 41 pages, 1 figure. A new cosymplectic-to-symplectic reduction theory added. Several other minor theoretical improvements included

MSC Class: 34A26; 37J39 (Primary) 34A05; 70H05; 70H14 (Secondary)

arXiv:2302.04832 [pdf, other]

Bridging the Sim2Real gap with CARE: Supervised Detection Adaptation with Conditional Alignment and Reweighting

Authors: Viraj Prabhu, David Acuna, Andrew Liao, Rafid Mahmood, Marc T. Law, Judy Hoffman, Sanja Fidler, James Lucas

Abstract: Sim2Real domain adaptation (DA) research focuses on the constrained setting of adapting from a labeled synthetic source domain to an unlabeled or sparsely labeled real target domain. However, for high-stakes applications (e.g. autonomous driving), it is common to have a modest amount of human-labeled real data in addition to plentiful auto-labeled source data (e.g. from a driving simulator). We st… ▽ More Sim2Real domain adaptation (DA) research focuses on the constrained setting of adapting from a labeled synthetic source domain to an unlabeled or sparsely labeled real target domain. However, for high-stakes applications (e.g. autonomous driving), it is common to have a modest amount of human-labeled real data in addition to plentiful auto-labeled source data (e.g. from a driving simulator). We study this setting of supervised sim2real DA applied to 2D object detection. We propose Domain Translation via Conditional Alignment and Reweighting (CARE) a novel algorithm that systematically exploits target labels to explicitly close the sim2real appearance and content gaps. We present an analytical justification of our algorithm and demonstrate strong gains over competing methods on standard benchmarks. △ Less

Submitted 9 February, 2023; originally announced February 2023.

arXiv:2210.01964 [pdf, other]

The Calibration Generalization Gap

Authors: A. Michael Carrell, Neil Mallinar, James Lucas, Preetum Nakkiran

Abstract: Calibration is a fundamental property of a good predictive model: it requires that the model predicts correctly in proportion to its confidence. Modern neural networks, however, provide no strong guarantees on their calibration -- and can be either poorly calibrated or well-calibrated depending on the setting. It is currently unclear which factors contribute to good calibration (architecture, data… ▽ More Calibration is a fundamental property of a good predictive model: it requires that the model predicts correctly in proportion to its confidence. Modern neural networks, however, provide no strong guarantees on their calibration -- and can be either poorly calibrated or well-calibrated depending on the setting. It is currently unclear which factors contribute to good calibration (architecture, data augmentation, overparameterization, etc), though various claims exist in the literature. We propose a systematic way to study the calibration error: by decomposing it into (1) calibration error on the train set, and (2) the calibration generalization gap. This mirrors the fundamental decomposition of generalization. We then investigate each of these terms, and give empirical evidence that (1) DNNs are typically always calibrated on their train set, and (2) the calibration generalization gap is upper-bounded by the standard generalization gap. Taken together, this implies that models with small generalization gap (|Test Error - Train Error|) are well-calibrated. This perspective unifies many results in the literature, and suggests that interventions which reduce the generalization gap (such as adding data, using heavy augmentation, or smaller model size) also improve calibration. We thus hope our initial study lays the groundwork for a more systematic and comprehensive understanding of the relation between calibration, generalization, and optimization. △ Less

Submitted 6 October, 2022; v1 submitted 4 October, 2022; originally announced October 2022.

Comments: Appeared at ICML 2022 Workshop on Distribution-Free Uncertainty Quantification

arXiv:2210.01234 [pdf, other]

Optimizing Data Collection for Machine Learning

Authors: Rafid Mahmood, James Lucas, Jose M. Alvarez, Sanja Fidler, Marc T. Law

Abstract: Modern deep learning systems require huge data sets to achieve impressive performance, but there is little guidance on how much or what kind of data to collect. Over-collecting data incurs unnecessary present costs, while under-collecting may incur future costs and delay workflows. We propose a new paradigm for modeling the data collection workflow as a formal optimal data collection problem that… ▽ More Modern deep learning systems require huge data sets to achieve impressive performance, but there is little guidance on how much or what kind of data to collect. Over-collecting data incurs unnecessary present costs, while under-collecting may incur future costs and delay workflows. We propose a new paradigm for modeling the data collection workflow as a formal optimal data collection problem that allows designers to specify performance targets, collection costs, a time horizon, and penalties for failing to meet the targets. Additionally, this formulation generalizes to tasks requiring multiple data sources, such as labeled and unlabeled data used in semi-supervised learning. To solve our problem, we develop Learn-Optimize-Collect (LOC), which minimizes expected future collection costs. Finally, we numerically compare our framework to the conventional baseline of estimating data requirements by extrapolating from neural scaling laws. We significantly reduce the risks of failing to meet desired performance targets on several classification, segmentation, and detection tasks, while maintaining low total collection costs. △ Less

Submitted 3 October, 2022; originally announced October 2022.

Comments: Accepted to NeurIPS 2022

arXiv:2207.08935 [pdf, ps, other]

doi 10.1134/S1560354722050045

More on superintegrable models on spaces of constant curvature

Authors: Cezary Gonera, Joanna Gonera, Javier de Lucas, Wioletta Szczesek, Bartosz Zawora

Abstract: A known general class of superintegrable systems on 2D spaces of constant curvature can be defined by potentials separating in (geodesic) polar coordinates. The radial parts of these potentials correspond either to an isotropic harmonic oscillator or a generalised Kepler potential. The angular components, on the contrary, are given implicitly by a transcendental, in general, equation. In the prese… ▽ More A known general class of superintegrable systems on 2D spaces of constant curvature can be defined by potentials separating in (geodesic) polar coordinates. The radial parts of these potentials correspond either to an isotropic harmonic oscillator or a generalised Kepler potential. The angular components, on the contrary, are given implicitly by a transcendental, in general, equation. In the present note, devoted to the previously less studied models with the radial potential of the generalised Kepler type, a new two-parameter family of relevant angular potentials is constructed in terms of elementary functions. For an appropriate choice of parameters, the family reduces to an asymmetric spherical Higgs oscillator. △ Less

Submitted 18 July, 2022; originally announced July 2022.

Comments: 18 pages

MSC Class: 37J35; 70H06

arXiv:2207.04038 [pdf, other]

doi 10.1088/1751-8121/ace0e7

Contact Lie systems

Authors: Javier de Lucas, Xavier Rivas

Abstract: We define and analyse the properties of contact Lie systems, namely systems of first-order differential equations describing the integral curves of a $t$-dependent vector field taking values in a finite-dimensional Lie algebra of Hamiltonian vector fields relative to a contact structure. As a particular example, we study families of conservative contact Lie systems. Liouville theorems, contact red… ▽ More We define and analyse the properties of contact Lie systems, namely systems of first-order differential equations describing the integral curves of a $t$-dependent vector field taking values in a finite-dimensional Lie algebra of Hamiltonian vector fields relative to a contact structure. As a particular example, we study families of conservative contact Lie systems. Liouville theorems, contact reductions, and Gromov non-squeezing theorems are developed and applied to contact Lie systems. Our results are illustrated by examples with relevant physical and mathematical applications, e.g. Schwarz equations, Brockett systems, etcetera. △ Less

Submitted 25 October, 2022; v1 submitted 8 July, 2022; originally announced July 2022.

Comments: 29 pp, 4 figures. New version of the manuscript with Sections 4, 5.4, and 6 added. Many new results included and typos corrected

MSC Class: 37J55; 53D10; 53Z05; 34A26; 34A05; 34A34; 17B66; 22E70

arXiv:2207.01725 [pdf, other]

How Much More Data Do I Need? Estimating Requirements for Downstream Tasks

Authors: Rafid Mahmood, James Lucas, David Acuna, Daiqing Li, Jonah Philion, Jose M. Alvarez, Zhiding Yu, Sanja Fidler, Marc T. Law

Abstract: Given a small training data set and a learning algorithm, how much more data is necessary to reach a target validation or test performance? This question is of critical importance in applications such as autonomous driving or medical imaging where collecting data is expensive and time-consuming. Overestimating or underestimating data requirements incurs substantial costs that could be avoided with… ▽ More Given a small training data set and a learning algorithm, how much more data is necessary to reach a target validation or test performance? This question is of critical importance in applications such as autonomous driving or medical imaging where collecting data is expensive and time-consuming. Overestimating or underestimating data requirements incurs substantial costs that could be avoided with an adequate budget. Prior work on neural scaling laws suggest that the power-law function can fit the validation performance curve and extrapolate it to larger data set sizes. We find that this does not immediately translate to the more difficult downstream task of estimating the required data set size to meet a target performance. In this work, we consider a broad class of computer vision tasks and systematically investigate a family of functions that generalize the power-law function to allow for better estimation of data requirements. Finally, we show that incorporating a tuned correction factor and collecting over multiple rounds significantly improves the performance of the data estimators. Using our guidelines, practitioners can accurately estimate data requirements of machine learning systems to gain savings in both development time and data acquisition costs. △ Less

Submitted 13 July, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

Comments: Accepted to CVPR 2022

arXiv:2204.05869 [pdf, other]

doi 10.1117/12.2561743

Assembly development for the Simons Observatory focal plane readout module

Authors: Erin Healy, Aamir M. Ali, Kam Arnold, Jason E. Austermann, James A. Beall, Sarah Marie Bruno, Steve K. Choi, Jake Connors, Nicholas F. Cothar, Bradley Dober, Shannon M. Duff, Nicholas Galitzki, Gene Hilton, Shuay-Pwu Patty Ho, Johannes Hubmayr, Bradley R. Johnson, Yaqiong Li, Michael J. Link, Tammy J. Lucas, Heather McCarrick, Michael D. Niemack, Maximiliano Silva-Feaver, Rita F. Sonka, Suzanne Staggs, Eve M. Vavagiakis , et al. (6 additional authors not shown)

Abstract: The Simons Observatory (SO) is a suite of instruments sensitive to temperature and polarization of the cosmic microwave background (CMB) to be located at Cerro Toco in the Atacama Desert in Chile. Five telescopes, one large aperture telescope and four small aperture telescopes, will host roughly 70,000 highly multiplexed transition edge sensor (TES) detectors operated at 100 mK. Each SO focal plan… ▽ More The Simons Observatory (SO) is a suite of instruments sensitive to temperature and polarization of the cosmic microwave background (CMB) to be located at Cerro Toco in the Atacama Desert in Chile. Five telescopes, one large aperture telescope and four small aperture telescopes, will host roughly 70,000 highly multiplexed transition edge sensor (TES) detectors operated at 100 mK. Each SO focal plane module (UFM) couples 1,764 TESes to microwave resonators in a microwave multiplexing (uMux) readout circuit. Before detector integration, the 100 mK uMux components are packaged into multiplexing modules (UMMs), which are independently validated to ensure they meet SO performance specifications. Here we present the assembly developments of these UMM readout packages for mid frequency (90/150 GHz) and ultra high frequency (220/280 GHz) UFMs. △ Less

Submitted 25 July, 2022; v1 submitted 12 April, 2022; originally announced April 2022.

Journal ref: Proc. SPIE 11453, Millimeter, Submillimeter, and Far-Infrared Detectors and Instrumentation for Astronomy X, 1145317 (2020)

arXiv:2204.00954 [pdf, ps, other]

doi 10.1140/epjp/s13360-023-03883-9

Quantum quasi-Lie systems: properties and applications

Authors: J. F. Cariñena, J. de Lucas, C. Sardón

Abstract: A Lie system is a non-autonomous system of ordinary differential equations describing the integral curves of a $t$-dependent vector field taking values in a finite-dimensional Lie algebra of vector fields. Lie systems have been generalised in the literature to deal with $t$-dependent Schrödinger equations determined by a particular class of $t$-dependent Hamiltonian operators, the quantum Lie syst… ▽ More A Lie system is a non-autonomous system of ordinary differential equations describing the integral curves of a $t$-dependent vector field taking values in a finite-dimensional Lie algebra of vector fields. Lie systems have been generalised in the literature to deal with $t$-dependent Schrödinger equations determined by a particular class of $t$-dependent Hamiltonian operators, the quantum Lie systems, and other differential equations through the so-called quasi-Lie schemes. This work extends quasi-Lie schemes and quantum Lie systems to cope with $t$-dependent Schrödinger equations associated with the here called quantum quasi-Lie systems. To illustrate our methods, we propose and study a quantum analogue of the classical nonlinear oscillator searched by Perelomov and we analyse a quantum one-dimensional fluid in a trapping potential along with quantum $t$-dependent Smorodinsky--Winternitz oscillators. △ Less

Submitted 2 April, 2022; originally announced April 2022.

MSC Class: 46N50; 34A36 (primary); 35Q40; 47D03; 58Z05 (secondary)

Journal ref: Eur. Phys. J. Plus 138, 339 (2023)

arXiv:2204.00046 [pdf, ps, other]

doi 10.3390/sym15061285

Geometric numerical methods for Lie systems and their application in optimal control

Authors: L. Blanco, F. Jiménez, J. de Lucas, C. Sardón

Abstract: A Lie system is a non-autonomous system of first-order ordinary differential equations whose general solution can be written via an autonomous function, a so-called (nonlinear) superposition rule of a finite number of particular solutions and some parameters to be related to initial conditions. Even if the superposition rules for some Lie systems are known, the explicit analytic expression of thei… ▽ More A Lie system is a non-autonomous system of first-order ordinary differential equations whose general solution can be written via an autonomous function, a so-called (nonlinear) superposition rule of a finite number of particular solutions and some parameters to be related to initial conditions. Even if the superposition rules for some Lie systems are known, the explicit analytic expression of their solutions frequently is not. This is why this article focuses on a novel geometric attempt to integrate Lie systems analytically and numerically. We focus on two families of methods: those based on Magnus expansions and the Runge-Kutta-Munthe-Kaas method, which are here adapted to the geometric properties of Lie systems. To illustrate the accuracy of our techniques we propose examples based on the SL$(n,\mathbb{R})$ Lie group, which plays a very relevant role in mechanics. In particular, we depict an optimal control problem for a vehicle with quadratic cost function. Particular numerical solutions of the studied examples are given. △ Less

Submitted 20 June, 2023; v1 submitted 31 March, 2022; originally announced April 2022.

Comments: 32 pages. 11 figures. Slightly improved version to appear published

MSC Class: 34A26; 53A70 (primary) 37M15; 49M25 (secondary)

Journal ref: Symmetry 15(6), 1285 (2023)

arXiv:2203.15122 [pdf, ps, other]

Multiple Riemann wave solutions of the general form of quasilinear hyperbolic systems

Authors: A. M. Grundland, J. de Lucas

Abstract: The objective of this paper is to construct geometrically Riemann $k$-wave solutions of the general form of first-order quasilinear hyperbolic systems of partial differential equations. To this end, we adapt and combine elements of two approaches to the construction of Riemann $k$-waves, namely the symmetry reduction method and the generalized method of characteristics. We formulate a geometrical… ▽ More The objective of this paper is to construct geometrically Riemann $k$-wave solutions of the general form of first-order quasilinear hyperbolic systems of partial differential equations. To this end, we adapt and combine elements of two approaches to the construction of Riemann $k$-waves, namely the symmetry reduction method and the generalized method of characteristics. We formulate a geometrical setting for the general form of the $k$-wave problem and discuss in detail the conditions for the existence of $k$-wave solutions. An auxiliary result concerning the Frobenius theorem is established. We use it to obtain formulae describing the $k$-wave solutions in closed form. Our theoretical considerations are illustrated by examples of hydrodynamic type systems including the Brownian motion equation. △ Less

Submitted 28 March, 2022; originally announced March 2022.

Comments: 32 pages. No figures

MSC Class: 35Q53 (primary); 35A30; 35Q58; 53A05 (secondary)

arXiv:2202.13748 [pdf, other]

doi 10.1088/1751-8121/ac78ab

Reduction and reconstruction of multisymplectic Lie systems

Authors: Javier de Lucas, Xavier Gràcia, Xavier Rivas, Narciso Román-Roy, Silvia Vilariño

Abstract: A Lie system is a non-autonomous system of first-order ordinary differential equations describing the integral curves of a non-autonomous vector field taking values in a finite-dimensional real Lie algebra of vector fields, a so-called Vessiot--Guldberg Lie algebra. In this work, multisymplectic structures are applied to the study of the reduction of Lie systems through their Lie symmetries. By us… ▽ More A Lie system is a non-autonomous system of first-order ordinary differential equations describing the integral curves of a non-autonomous vector field taking values in a finite-dimensional real Lie algebra of vector fields, a so-called Vessiot--Guldberg Lie algebra. In this work, multisymplectic structures are applied to the study of the reduction of Lie systems through their Lie symmetries. By using a momentum map, we perform a reduction and reconstruction procedure of multisymplectic Lie systems, which allows us to solve the original problem by analysing several simpler multisymplectic Lie systems. Conversely, we study how reduced multisymplectic Lie systems allow us to retrieve the form of the multisymplectic Lie system that gave rise to them. Our results are illustrated with examples occurring in physics, mathematics, and control theory. △ Less

Submitted 14 June, 2022; v1 submitted 28 February, 2022; originally announced February 2022.

Comments: 29 pages, no figures. A new multisymplectic reconstruction theorem and several applications thereof added. Several parts have been simplified and the presentation has been improved

MSC Class: 34A26; 34A05; 34A34 (primary); 17B66; 22E70 (secondary)

Journal ref: J. Phys. A: Math. Theor. 55(29):295204, 2022

arXiv:2202.03651 [pdf, other]

Causal Scene BERT: Improving object detection by searching for challenging groups of data

Authors: Cinjon Resnick, Or Litany, Amlan Kar, Karsten Kreis, James Lucas, Kyunghyun Cho, Sanja Fidler

Abstract: Modern computer vision applications rely on learning-based perception modules parameterized with neural networks for tasks like object detection. These modules frequently have low expected error overall but high error on atypical groups of data due to biases inherent in the training process. In building autonomous vehicles (AV), this problem is an especially important challenge because their perce… ▽ More Modern computer vision applications rely on learning-based perception modules parameterized with neural networks for tasks like object detection. These modules frequently have low expected error overall but high error on atypical groups of data due to biases inherent in the training process. In building autonomous vehicles (AV), this problem is an especially important challenge because their perception modules are crucial to the overall system performance. After identifying failures in AV, a human team will comb through the associated data to group perception failures that share common causes. More data from these groups is then collected and annotated before retraining the model to fix the issue. In other words, error groups are found and addressed in hindsight. Our main contribution is a pseudo-automatic method to discover such groups in foresight by performing causal interventions on simulated scenes. To keep our interventions on the data manifold, we utilize masked language models. We verify that the prioritized groups found via intervention are challenging for the object detector and show that retraining with data collected from these groups helps inordinately compared to adding more IID data. We also plan to release software to run interventions in simulated scenes, which we hope will benefit the causality community. △ Less

Submitted 21 April, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

Comments: In submission at JMLR; 0xe5110eA3B5014cd9a585Dc76c74Ee509F504Be14

arXiv:2111.06928 [pdf, other]

Generalized Nested Rollout Policy Adaptation with Dynamic Bias for Vehicle Routing

Authors: Julien Sentuc, Tristan Cazenave, Jean-Yves Lucas

Abstract: In this paper we present an extension of the Nested Rollout Policy Adaptation algorithm (NRPA), namely the Generalized Nested Rollout Policy Adaptation (GNRPA), as well as its use for solving some instances of the Vehicle Routing Problem. We detail some results obtained on the Solomon instances set which is a conventional benchmark for the Vehicle Routing Problem (VRP). We show that on all instanc… ▽ More In this paper we present an extension of the Nested Rollout Policy Adaptation algorithm (NRPA), namely the Generalized Nested Rollout Policy Adaptation (GNRPA), as well as its use for solving some instances of the Vehicle Routing Problem. We detail some results obtained on the Solomon instances set which is a conventional benchmark for the Vehicle Routing Problem (VRP). We show that on all instances, GNRPA performs better than NRPA. On some instances, it performs better than the Google OR Tool module dedicated to VRP. △ Less

Submitted 29 December, 2021; v1 submitted 12 November, 2021; originally announced November 2021.

arXiv:2109.05117 [pdf]

Cosmic Ray Induced Mass-Independent Oxygen Isotope Exchange: A Novel Mechanism for Producing $^{16}$O depletions in the Early Solar System

Authors: G. Dominguez, J. Lucas, L. Tafla, M. C. Liu, K. McKeegan

Abstract: A fundamental puzzle of our solar system's formation is understanding why the terrestrial bodies including the planets,comets,and asteroids are depleted in $^{16}$O compared to the Sun. The most favored mechanism,the selective photodissociation of CO gas to produce $^{16}$O depleted water,requires finely tuned mixing timescales to transport $^{16}$O depleted water from the cold outer solar system… ▽ More A fundamental puzzle of our solar system's formation is understanding why the terrestrial bodies including the planets,comets,and asteroids are depleted in $^{16}$O compared to the Sun. The most favored mechanism,the selective photodissociation of CO gas to produce $^{16}$O depleted water,requires finely tuned mixing timescales to transport $^{16}$O depleted water from the cold outer solar system to exchange isotopically with dust grains to produce the $^{16}$O depleted planetary bodies observed today. Here we show that energetic particle irradiation of SiO$_2$ (and Al$_2$O$_3$) makes them susceptible to anomalous isotope exchange with H$_2$O ice at temperatures as low as 10 K. The observed magnitude of the anomalous isotope exchange (D$^{17}$O) is sufficient to generate the $^{16}$O depletion characteristic of the terrestrial bodies in the solar system. We calculated the cosmic-ray exposure times needed to produce the observed $^{16}$O depletions in silicate (SiO2) dust in the interstellar medium and early solar system and find that radiation damage induced oxygen isotope exchange could have rapidly (~10-100 yrs) depleted dust grains of $^{16}$O during the Sun's T-Tauri phase. Our model explains whythe oldest and most refractory minerals found in the solar system, the anhydrous Calcium with Aluminum Inclusions (CAIs),are generally $^{16}$O enriched compared to chondrules and the bulk terrestrial solids and provides a mechanism for producing $^{16}$O depleted grains very early in the solar system's history. Our findings have broad implications for the distribution of oxygen isotopes in the solar system, the interstellar medium, the formation of the planets and its building blocks as well as the nature of mass-independent isotope effects. △ Less

Submitted 10 September, 2021; originally announced September 2021.

arXiv:2106.14797 [pdf, other]

doi 10.3847/1538-4357/ac2232

The Simons Observatory microwave SQUID multiplexing detector module design

Authors: Heather McCarrick, Erin Healy, Zeeshan Ahmed, Kam Arnold, Zachary Atkins, Jason E. Austermann, Tanay Bhandarkar, Jim A. Beall, Sarah Marie Bruno, Steve K. Choi, Jake Connors, Nicholas F. Cothard, Kevin D. Crowley, Simon Dicker, Bradley Dober, Cody J. Duell, Shannon M. Duff, Daniel Dutcher, Josef C. Frisch, Nicholas Galitzki, Megan B. Gralla, Jon E. Gudmundsson, Shawn W. Henderson, Gene C. Hilton, Shuay-Pwu Patty Ho , et al. (34 additional authors not shown)

Abstract: Advances in cosmic microwave background (CMB) science depend on increasing the number of sensitive detectors observing the sky. New instruments deploy large arrays of superconducting transition-edge sensor (TES) bolometers tiled densely into ever larger focal planes. High multiplexing factors reduce the thermal loading on the cryogenic receivers and simplify their design. We present the design of… ▽ More Advances in cosmic microwave background (CMB) science depend on increasing the number of sensitive detectors observing the sky. New instruments deploy large arrays of superconducting transition-edge sensor (TES) bolometers tiled densely into ever larger focal planes. High multiplexing factors reduce the thermal loading on the cryogenic receivers and simplify their design. We present the design of focal-plane modules with an order of magnitude higher multiplexing factor than has previously been achieved with TES bolometers. We focus on the novel cold readout component, which employs microwave SQUID multiplexing ($μ$mux). Simons Observatory will use 49 modules containing 60,000 bolometers to make exquisitely sensitive measurements of the CMB. We validate the focal-plane module design, presenting measurements of the readout component with and without a prototype detector array of 1728 polarization-sensitive bolometers coupled to feedhorns. The readout component achieves a $95\%$ yield and a 910 multiplexing factor. The median white noise of each readout channel is 65 $\mathrm{pA/\sqrt{Hz}}$. This impacts the projected SO mapping speed by $< 8\%$, which is less than is assumed in the sensitivity projections. The results validate the full functionality of the module. We discuss the measured performance in the context of SO science requirements, which are exceeded. △ Less

Submitted 16 September, 2021; v1 submitted 28 June, 2021; originally announced June 2021.

Comments: Accepted to The Astrophysical Journal

Journal ref: 2021 ApJ 922 38

arXiv:2104.11044 [pdf, other]

Analyzing Monotonic Linear Interpolation in Neural Network Loss Landscapes

Authors: James Lucas, Juhan Bae, Michael R. Zhang, Stanislav Fort, Richard Zemel, Roger Grosse

Abstract: Linear interpolation between initial neural network parameters and converged parameters after training with stochastic gradient descent (SGD) typically leads to a monotonic decrease in the training objective. This Monotonic Linear Interpolation (MLI) property, first observed by Goodfellow et al. (2014) persists in spite of the non-convex objectives and highly non-linear training dynamics of neural… ▽ More Linear interpolation between initial neural network parameters and converged parameters after training with stochastic gradient descent (SGD) typically leads to a monotonic decrease in the training objective. This Monotonic Linear Interpolation (MLI) property, first observed by Goodfellow et al. (2014) persists in spite of the non-convex objectives and highly non-linear training dynamics of neural networks. Extending this work, we evaluate several hypotheses for this property that, to our knowledge, have not yet been explored. Using tools from differential geometry, we draw connections between the interpolated paths in function space and the monotonicity of the network - providing sufficient conditions for the MLI property under mean squared error. While the MLI property holds under various settings (e.g. network architectures and learning problems), we show in practice that networks violating the MLI property can be produced systematically, by encouraging the weights to move far from initialization. The MLI property raises important questions about the loss landscape geometry of neural networks and highlights the need to further study their global properties. △ Less

Submitted 23 April, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

Comments: 15 pages in main paper, 4 pages of references, 24 pages in appendix. 29 figures in total

arXiv:2104.09511 [pdf, other]

doi 10.3847/2515-5172/abf9ab

The Simons Observatory: the Large Aperture Telescope (LAT)

Authors: Zhilei Xu, Shunsuke Adachi, Peter Ade, J. A. Beall, Tanay Bhandarkar, J. Richard Bond, Grace E. Chesmore, Yuji Chinone, Steve K. Choi, Jake A. Connors, Gabriele Coppi, Nicholas F. Cothard, Kevin D. Crowley, Mark Devlin, Simon Dicker, Bradley Dober, Shannon M. Duff, Nicholas Galitzki, Patricio A. Gallardo, Joseph E. Golec, Jon E. Gudmundsson, Saianeesh K. Haridas, Kathleen Harrington, Carlos Hervias-Caimapo, Shuay-Pwu Patty Ho , et al. (35 additional authors not shown)

Abstract: The Simons Observatory (SO) is a Cosmic Microwave Background (CMB) experiment to observe the microwave sky in six frequency bands from 30GHz to 290GHz. The Observatory -- at $\sim$5200m altitude -- comprises three Small Aperture Telescopes (SATs) and one Large Aperture Telescope (LAT) at the Atacama Desert, Chile. This research note describes the design and current status of the LAT along with its… ▽ More The Simons Observatory (SO) is a Cosmic Microwave Background (CMB) experiment to observe the microwave sky in six frequency bands from 30GHz to 290GHz. The Observatory -- at $\sim$5200m altitude -- comprises three Small Aperture Telescopes (SATs) and one Large Aperture Telescope (LAT) at the Atacama Desert, Chile. This research note describes the design and current status of the LAT along with its future timeline. △ Less

Submitted 29 April, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

Comments: 4 pages, 1 figure

Journal ref: Research Notes AAS, 5, 100 (2021)

arXiv:2103.02747 [pdf, other]

doi 10.3847/1538-4365/ac0db7

The Simons Observatory Large Aperture Telescope Receiver

Authors: Ningfeng Zhu, Tanay Bhandarkar, Gabriele Coppi, Anna M. Kofman, John L. Orlowski-Scherer, Zhilei Xu, Shunsuke Adachi, Peter Ade, Simone Aiola, Jason Austermann, Andrew O. Bazarko, James A. Beall, Sanah Bhimani, J. Richard Bond, Grace E. Chesmore, Steve K. Choi, Jake Connors, Nicholas F. Cothard, Mark Devlin, Simon Dicker, Bradley Dober, Cody J. Duell, Shannon M. Duff, Rolando Dünner, Giulio Fabbian , et al. (46 additional authors not shown)

Abstract: The Simons Observatory (SO) Large Aperture Telescope Receiver (LATR) will be coupled to the Large Aperture Telescope located at an elevation of 5,200 m on Cerro Toco in Chile. The resulting instrument will produce arcminute-resolution millimeter-wave maps of half the sky with unprecedented precision. The LATR is the largest cryogenic millimeter-wave camera built to date with a diameter of 2.4 m an… ▽ More The Simons Observatory (SO) Large Aperture Telescope Receiver (LATR) will be coupled to the Large Aperture Telescope located at an elevation of 5,200 m on Cerro Toco in Chile. The resulting instrument will produce arcminute-resolution millimeter-wave maps of half the sky with unprecedented precision. The LATR is the largest cryogenic millimeter-wave camera built to date with a diameter of 2.4 m and a length of 2.6 m. It cools 1200 kg of material to 4 K and 200 kg to 100 mk, the operating temperature of the bolometric detectors with bands centered around 27, 39, 93, 145, 225, and 280 GHz. Ultimately, the LATR will accommodate 13 40 cm diameter optics tubes, each with three detector wafers and a total of 62,000 detectors. The LATR design must simultaneously maintain the optical alignment of the system, control stray light, provide cryogenic isolation, limit thermal gradients, and minimize the time to cool the system from room temperature to 100 mK. The interplay between these competing factors poses unique challenges. We discuss the trade studies involved with the design, the final optimization, the construction, and ultimate performance of the system. △ Less

Submitted 3 March, 2021; originally announced March 2021.

arXiv:2102.05969 [pdf, ps, other]

doi 10.3390/sym13030465

Darboux families and the classification of real four-dimensional indecomposable coboundary Lie bialgebras

Authors: J. de Lucas, D. Wysocki

Abstract: This work introduces a new concept, the so-called Darboux family, which is employed to determine, to analyse geometrically, and to classify up to Lie algebra automorphisms, in a relatively easy manner, coboundary Liebialgebras on real four-dimensional indecomposable Lie algebras. The Darboux family notion can be consideredas a generalisation of the Darboux polynomial for a vector field. The classi… ▽ More This work introduces a new concept, the so-called Darboux family, which is employed to determine, to analyse geometrically, and to classify up to Lie algebra automorphisms, in a relatively easy manner, coboundary Liebialgebras on real four-dimensional indecomposable Lie algebras. The Darboux family notion can be consideredas a generalisation of the Darboux polynomial for a vector field. The classification of $r$-matrices and solutions to classical Yang-Baxter equations for real four-dimensional indecomposable Lie algebras is also given in detail. Our methods can further be applied to general, even higher-dimensional, Lie algebras. As a byproduct, a method to obtain matrix representations of certain Lie algebras with a non-trivial center is developed. △ Less

Submitted 11 February, 2021; originally announced February 2021.

Comments: 41 pages

Journal ref: Symmetry, 13(3), 465 (2021)

arXiv:2101.00616 [pdf, ps, other]

doi 10.1088/1751-8121/abf1db

Poisson-Hopf deformations of Lie-Hamilton systems revisited: deformed superposition rules and applications to the oscillator algebra

Authors: Angel Ballesteros, Rutwig Campoamor-Stursberg, Eduardo Fernandez-Saiz, Francisco J. Herranz, Javier de Lucas

Abstract: The formalism for Poisson-Hopf (PH) deformations of Lie-Hamilton systems is refined in one of its crucial points concerning applications, namely the obtention of effective and computationally feasible PH deformed superposition rules for prolonged PH deformations of Lie-Hamilton systems. The two new notions here proposed are a generalization of the standard superposition rules and the concept of di… ▽ More The formalism for Poisson-Hopf (PH) deformations of Lie-Hamilton systems is refined in one of its crucial points concerning applications, namely the obtention of effective and computationally feasible PH deformed superposition rules for prolonged PH deformations of Lie-Hamilton systems. The two new notions here proposed are a generalization of the standard superposition rules and the concept of diagonal prolongations for Lie systems, which are consistently recovered under the non-deformed limit. Using a technique from superintegrability theory, we obtain a maximal number of functionally independent constants of the motion for a generic prolonged PH deformation of a Lie-Hamilton system, from which a simplified deformed superposition rule can be derived. As an application, explicit deformed superposition rules for prolonged PH deformations of Lie-Hamilton systems based on the oscillator Lie algebra ${h}_4$ are computed. Moreover, by making use that the main structural properties of the book subalgebra ${b}_2$ of ${h}_4$ are preserved under the PH deformation, we consider prolonged PH deformations based on ${b}_2$ as restrictions of those for ${h}_4$-Lie-Hamilton systems, thus allowing the study of prolonged PH deformations of the complex Bernoulli equations, for which both the constants of the motion and the deformed superposition rules are explicitly presented. △ Less

Submitted 27 March, 2021; v1 submitted 3 January, 2021; originally announced January 2021.

Comments: 30 pages. A new subsection 4.2 on twist maps and canonical transformations has been added

MSC Class: 16T05; 17B66; 34A26

Journal ref: J. Phys. A: Math. Theor. 54 (2021) 205202

Showing 1–50 of 134 results for author: Lucas, J