-
Implementing engrams from a machine learning perspective: the relevance of a latent space
Authors:
J Marco de Lucas
Abstract:
In our previous work, we proposed that engrams in the brain could be biologically implemented as autoencoders over recurrent neural networks. These autoencoders would comprise basic excitatory/inhibitory motifs, with credit assignment deriving from a simple homeostatic criterion. This brief note examines the relevance of the latent space in these autoencoders. We consider the relationship between…
▽ More
In our previous work, we proposed that engrams in the brain could be biologically implemented as autoencoders over recurrent neural networks. These autoencoders would comprise basic excitatory/inhibitory motifs, with credit assignment deriving from a simple homeostatic criterion. This brief note examines the relevance of the latent space in these autoencoders. We consider the relationship between the dimensionality of these autoencoders and the complexity of the information being encoded. We discuss how observed differences between species in their connectome could be linked to their cognitive capacities. Finally, we link this analysis with a basic but often overlooked fact: human cognition is likely limited by our own brain structure. However, this limitation does not apply to machine learning systems, and we should be aware of the need to learn how to exploit this augmented vision of the nature.
△ Less
Submitted 23 July, 2024;
originally announced July 2024.
-
Improving Hyperparameter Optimization with Checkpointed Model Weights
Authors:
Nikhil Mehta,
Jonathan Lorraine,
Steve Masson,
Ramanathan Arunachalam,
Zaid Pervaiz Bhat,
James Lucas,
Arun George Zachariah
Abstract:
When training deep learning models, the performance depends largely on the selected hyperparameters. However, hyperparameter optimization (HPO) is often one of the most expensive parts of model design. Classical HPO methods treat this as a black-box optimization problem. However, gray-box HPO methods, which incorporate more information about the setup, have emerged as a promising direction for mor…
▽ More
When training deep learning models, the performance depends largely on the selected hyperparameters. However, hyperparameter optimization (HPO) is often one of the most expensive parts of model design. Classical HPO methods treat this as a black-box optimization problem. However, gray-box HPO methods, which incorporate more information about the setup, have emerged as a promising direction for more efficient optimization. For example, using intermediate loss evaluations to terminate bad selections. In this work, we propose an HPO method for neural networks using logged checkpoints of the trained weights to guide future hyperparameter selections. Our method, Forecasting Model Search (FMS), embeds weights into a Gaussian process deep kernel surrogate model, using a permutation-invariant graph metanetwork to be data-efficient with the logged network weights. To facilitate reproducibility and further research, we open-source our code at https://github.com/NVlabs/forecasting-model-search.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Implementing engrams from a machine learning perspective: XOR as a basic motif
Authors:
Jesus Marco de Lucas,
Maria Peña Fernandez,
Lara Lloret Iglesias
Abstract:
We have previously presented the idea of how complex multimodal information could be represented in our brains in a compressed form, following mechanisms similar to those employed in machine learning tools, like autoencoders. In this short comment note we reflect, mainly with a didactical purpose, upon the basic question for a biological implementation: what could be the mechanism working as a los…
▽ More
We have previously presented the idea of how complex multimodal information could be represented in our brains in a compressed form, following mechanisms similar to those employed in machine learning tools, like autoencoders. In this short comment note we reflect, mainly with a didactical purpose, upon the basic question for a biological implementation: what could be the mechanism working as a loss function, and how it could be connected to a neuronal network providing the required feedback to build a simple training configuration. We present our initial ideas based on a basic motif that implements an XOR switch, using few excitatory and inhibitory neurons. Such motif is guided by a principle of homeostasis, and it implements a loss function that could provide feedback to other neuronal structures, establishing a control system. We analyse the presence of this XOR motif in the connectome of C.Elegans, and indicate the relationship with the well-known lateral inhibition motif. We then explore how to build a basic biological neuronal structure with learning capacity integrating this XOR motif. Guided by the computational analogy, we show an initial example that indicates the feasibility of this approach, applied to learning binary sequences, like it is the case for simple melodies. In summary, we provide didactical examples exploring the parallelism between biological and computational learning mechanisms, identifying basic motifs and training procedures, and how an engram encoding a melody could be built using a simple recurrent network involving both excitatory and inhibitory neurons.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
The Simons Observatory: Studies of Detector Yield and Readout Noise From the First Large-Scale Deployment of Microwave Multiplexing at the Large Aperture Telescope
Authors:
Thomas P. Satterthwaite,
Zeeshan Ahmed,
Kyuyoung Bae,
Mark Devlin,
Simon Dicker,
Shannon M. Duff,
Daniel Dutcher,
Saianeesh K. Haridas,
Shawn W. Henderson,
Johannes Hubmayr,
Bradley R. Johnson,
Anna Kofman,
Jack Lashner,
Michael J. Link,
Tammy J. Lucas,
Alex Manduca,
Michael D. Niemack,
John Orlowski-Scherer,
Tristan Pinsonneault-Marotte,
Max Silva-Feaver,
Suzanne Staggs,
Eve M. Vavagiakis,
Yuhan Wang,
Kaiwen Zheng
Abstract:
The Simons Observatory is a new ground-based cosmic microwave background experiment, which is currently being commissioned in Chile's Atacama Desert. During its survey, the observatory's small aperture telescopes will map 10% of the sky in bands centered at frequencies ranging from 27 to 280 GHz to constrain cosmic inflation models, and its large aperture telescope will map 40% of the sky in the s…
▽ More
The Simons Observatory is a new ground-based cosmic microwave background experiment, which is currently being commissioned in Chile's Atacama Desert. During its survey, the observatory's small aperture telescopes will map 10% of the sky in bands centered at frequencies ranging from 27 to 280 GHz to constrain cosmic inflation models, and its large aperture telescope will map 40% of the sky in the same bands to constrain cosmological parameters and use weak lensing to study large-scale structure. To achieve these science goals, the Simons Observatory is deploying these telescopes' receivers with 60,000 state-of-the-art superconducting transition-edge sensor bolometers for its first five year survey. Reading out this unprecedented number of cryogenic sensors, however, required the development of a novel readout system. The SMuRF electronics were developed to enable high-density readout of superconducting sensors using cryogenic microwave SQUID multiplexing technology. The commissioning of the SMuRF systems at the Simons Observatory is the largest deployment to date of microwave multiplexing technology for transition-edge sensors. In this paper, we show that a significant fraction of the systems deployed so far to the Simons Observatory's large aperture telescope meet baseline specifications for detector yield and readout noise in this early phase of commissioning.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Simons Observatory: Pre-deployment Performance of a Large Aperture Telescope Optics Tube in the 90 and 150 GHz Spectral Bands
Authors:
Carlos E. Sierra,
Kathleen Harrington,
Shreya Sutariya,
Thomas Alford,
Anna M. Kofman,
Grace E. Chesmore,
Jason E. Austermann,
Andrew Bazarko,
James A. Beall,
Tanay Bhandarkar,
Mark J. Devlin,
Simon R. Dicker,
Peter N. Dow,
Shannon M. Duff,
Daniel Dutcher,
Nicholas Galitzki,
Joseph E. Golec,
John C. Groh,
Jon E. Gudmundsson,
Saianeesh K. Haridas,
Erin Healy,
Johannes Hubmayr,
Jeffrey Iuliano,
Bradley R. Johnson,
Claire S. Lessler
, et al. (20 additional authors not shown)
Abstract:
The Simons Observatory will map the temperature and polarization over half of the sky, at millimeter wavelengths in six spectral bands from the Atacama Desert in Chile. These data will provide new insights into the genesis, content, and history of our Universe; the astrophysics of galaxies and galaxy clusters; objects in our solar system; and time-varying astrophysical phenomena. This ambitious ne…
▽ More
The Simons Observatory will map the temperature and polarization over half of the sky, at millimeter wavelengths in six spectral bands from the Atacama Desert in Chile. These data will provide new insights into the genesis, content, and history of our Universe; the astrophysics of galaxies and galaxy clusters; objects in our solar system; and time-varying astrophysical phenomena. This ambitious new instrument suite, initially comprising three 0.5 m small-aperture telescopes and one 6 m large aperture telescope, is designed using a common combination of new technologies and new implementations to realize an observatory significantly more capable than the previous generation. In this paper, we present the pre-deployment performance of the first mid-frequency "optics tube" which will be fielded on the large aperture telescope with sensitivity to the 90 and 150 GHz spectral bands. This optics tube contains lenses, filters, detectors, and readout components, all of which operate at cryogenic temperatures. It is one of seven that form the core of the large aperture telescope receiver in its initial deployment. We describe this optics tube, including details of comprehensive testing methods, new techniques for beam and passband characterization, and its measured performance. The performance metrics include beams, optical efficiency, passbands, and forecasts for the on-sky performance of the system. We forecast a sensitivity that exceeds the requirements of the large aperture telescope with greater than 30% margin in each spectral band, and predict that the instrument will realize diffraction-limited performance and the expected detector passbands.
△ Less
Submitted 10 May, 2024;
originally announced May 2024.
-
The Simons Observatory: Design, integration, and testing of the small aperture telescopes
Authors:
Nicholas Galitzki,
Tran Tsan,
Jake Spisak,
Michael Randall,
Max Silva-Feaver,
Joseph Seibert,
Jacob Lashner,
Shunsuke Adachi,
Sean M. Adkins,
Thomas Alford,
Kam Arnold,
Peter C. Ashton,
Jason E. Austermann,
Carlo Baccigalupi,
Andrew Bazarko,
James A. Beall,
Sanah Bhimani,
Bryce Bixler,
Gabriele Coppi,
Lance Corbett,
Kevin D. Crowley,
Kevin T. Crowley,
Samuel Day-Weiss,
Simon Dicker,
Peter N. Dow
, et al. (55 additional authors not shown)
Abstract:
The Simons Observatory (SO) is a cosmic microwave background (CMB) survey experiment that includes small-aperture telescopes (SATs) observing from an altitude of 5,200 m in the Atacama Desert in Chile. The SO SATs will cover six spectral bands between 27 and 280 GHz to search for primordial B-modes to a sensitivity of $σ(r)=0.002$, with quantified systematic errors well below this value. Each SAT…
▽ More
The Simons Observatory (SO) is a cosmic microwave background (CMB) survey experiment that includes small-aperture telescopes (SATs) observing from an altitude of 5,200 m in the Atacama Desert in Chile. The SO SATs will cover six spectral bands between 27 and 280 GHz to search for primordial B-modes to a sensitivity of $σ(r)=0.002$, with quantified systematic errors well below this value. Each SAT is a self-contained cryogenic telescope with a 35$^\circ$ field of view, 42 cm diameter optical aperture, 40 K half-wave plate, 1 K refractive optics, and $<0.1$ K focal plane that holds $>12,000$ TES detectors. We describe the nominal design of the SATs and present details about the integration and testing for one operating at 93 and 145 GHz.
△ Less
Submitted 10 May, 2024; v1 submitted 9 May, 2024;
originally announced May 2024.
-
Logistic Map Pseudo Random Number Generator in FPGA
Authors:
Mateo Jalen Andrew Calderon,
Lee Jun Lei Lucas,
Syarifuddin Azhar Bin Rosli,
Stephanie See Hui Ying,
Jarell Lim En Yu,
Maoyang Xiang,
T. Hui Teo
Abstract:
This project develops a pseudo-random number generator (PRNG) using the logistic map, implemented in Verilog HDL on an FPGA and processes its output through a Central Limit Theorem (CLT) function to achieve a Gaussian distribution. The system integrates additional FPGA modules for real-time interaction and visualisation, including a clock generator, UART interface, XADC, and a 7-segment display dr…
▽ More
This project develops a pseudo-random number generator (PRNG) using the logistic map, implemented in Verilog HDL on an FPGA and processes its output through a Central Limit Theorem (CLT) function to achieve a Gaussian distribution. The system integrates additional FPGA modules for real-time interaction and visualisation, including a clock generator, UART interface, XADC, and a 7-segment display driver. These components facilitate the direct display of PRNG values on the FPGA and the transmission of data to a laptop for histogram analysis, verifying the Gaussian nature of the output. This approach demonstrates the practical application of chaotic systems for generating Gaussian-distributed pseudo-random numbers in digital hardware, highlighting the logistic map's potential in PRNG design.
△ Less
Submitted 30 April, 2024;
originally announced April 2024.
-
The Simons Observatory: Production-level Fabrication of the Mid- and Ultra-High-Frequency Wafers
Authors:
Shannon M. Duff,
Jason Austermann,
James A. Beall,
David P. Daniel,
Johannes Hubmayr,
Greg C. Jaehnig,
Bradley R. Johnson,
Dante Jones,
Michael J. Link,
Tammy J. Lucas,
Rita F. Sonka,
Suzanne T. Staggs,
Joel Ullom,
Yuhan Wang
Abstract:
The Simons Observatory (SO) is a cosmic microwave background instrumentation suite in the Atacama Desert of Chile. More than 65,000 polarization-sensitive transition-edge sensor (TES) bolometers will be fielded in the frequency range spanning 27 to 280 GHz, with three separate dichroic designs. The mid-frequency 90/150 GHz and ultra-high-frequency 220/280 GHz detector arrays, fabricated at NIST, a…
▽ More
The Simons Observatory (SO) is a cosmic microwave background instrumentation suite in the Atacama Desert of Chile. More than 65,000 polarization-sensitive transition-edge sensor (TES) bolometers will be fielded in the frequency range spanning 27 to 280 GHz, with three separate dichroic designs. The mid-frequency 90/150 GHz and ultra-high-frequency 220/280 GHz detector arrays, fabricated at NIST, account for 39 of 49 total detector modules and implement the feedhorn-fed orthomode transducer (OMT)-coupled TES bolometer architecture. A robust production-level fabrication framework for these detector arrays and the monolithic DC/RF routing wafers has been developed, which includes single device prototyping, process monitoring techniques, in-process metrology, and cryogenic measurements of critical film properties. Application of this framework has resulted in timely delivery of nearly 100 total superconducting focal plane components to SO with 88% of detector wafers meeting nominal criteria for integration into a detector module: a channel yield > 95% and Tc in the targeted range.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis
Authors:
Kevin Xie,
Jonathan Lorraine,
Tianshi Cao,
Jun Gao,
James Lucas,
Antonio Torralba,
Sanja Fidler,
Xiaohui Zeng
Abstract:
Recent text-to-3D generation approaches produce impressive 3D results but require time-consuming optimization that can take up to an hour per prompt. Amortized methods like ATT3D optimize multiple prompts simultaneously to improve efficiency, enabling fast text-to-3D synthesis. However, they cannot capture high-frequency geometry and texture details and struggle to scale to large prompt sets, so t…
▽ More
Recent text-to-3D generation approaches produce impressive 3D results but require time-consuming optimization that can take up to an hour per prompt. Amortized methods like ATT3D optimize multiple prompts simultaneously to improve efficiency, enabling fast text-to-3D synthesis. However, they cannot capture high-frequency geometry and texture details and struggle to scale to large prompt sets, so they generalize poorly. We introduce LATTE3D, addressing these limitations to achieve fast, high-quality generation on a significantly larger prompt set. Key to our method is 1) building a scalable architecture and 2) leveraging 3D data during optimization through 3D-aware diffusion priors, shape regularization, and model initialization to achieve robustness to diverse and complex training prompts. LATTE3D amortizes both neural field and textured surface generation to produce highly detailed textured meshes in a single forward pass. LATTE3D generates 3D objects in 400ms, and can be further enhanced with fast test-time optimization.
△ Less
Submitted 22 March, 2024;
originally announced March 2024.
-
T-RAG: Lessons from the LLM Trenches
Authors:
Masoomali Fatehkia,
Ji Kim Lucas,
Sanjay Chawla
Abstract:
Large Language Models (LLM) have shown remarkable language capabilities fueling attempts to integrate them into applications across a wide range of domains. An important application area is question answering over private enterprise documents where the main considerations are data security, which necessitates applications that can be deployed on-prem, limited computational resources and the need f…
▽ More
Large Language Models (LLM) have shown remarkable language capabilities fueling attempts to integrate them into applications across a wide range of domains. An important application area is question answering over private enterprise documents where the main considerations are data security, which necessitates applications that can be deployed on-prem, limited computational resources and the need for a robust application that correctly responds to queries. Retrieval-Augmented Generation (RAG) has emerged as the most prominent framework for building LLM-based applications. While building a RAG is relatively straightforward, making it robust and a reliable application requires extensive customization and relatively deep knowledge of the application domain. We share our experiences building and deploying an LLM application for question answering over private organizational documents. Our application combines the use of RAG with a finetuned open-source LLM. Additionally, our system, which we call Tree-RAG (T-RAG), uses a tree structure to represent entity hierarchies within the organization. This is used to generate a textual description to augment the context when responding to user queries pertaining to entities within the organization's hierarchy. Our evaluations, including a Needle in a Haystack test, show that this combination performs better than a simple RAG or finetuning implementation. Finally, we share some lessons learned based on our experiences building an LLM application for real-world use.
△ Less
Submitted 6 June, 2024; v1 submitted 12 February, 2024;
originally announced February 2024.
-
Authorship Obfuscation in Multilingual Machine-Generated Text Detection
Authors:
Dominik Macko,
Robert Moro,
Adaku Uchendu,
Ivan Srba,
Jason Samuel Lucas,
Michiharu Yamashita,
Nafis Irtiza Tripto,
Dongwon Lee,
Jakub Simko,
Maria Bielikova
Abstract:
High-quality text generation capability of recent Large Language Models (LLMs) causes concerns about their misuse (e.g., in massive generation/spread of disinformation). Machine-generated text (MGT) detection is important to cope with such threats. However, it is susceptible to authorship obfuscation (AO) methods, such as paraphrasing, which can cause MGTs to evade detection. So far, this was eval…
▽ More
High-quality text generation capability of recent Large Language Models (LLMs) causes concerns about their misuse (e.g., in massive generation/spread of disinformation). Machine-generated text (MGT) detection is important to cope with such threats. However, it is susceptible to authorship obfuscation (AO) methods, such as paraphrasing, which can cause MGTs to evade detection. So far, this was evaluated only in monolingual settings. Thus, the susceptibility of recently proposed multilingual detectors is still unknown. We fill this gap by comprehensively benchmarking the performance of 10 well-known AO methods, attacking 37 MGT detection methods against MGTs in 11 languages (i.e., 10 $\times$ 37 $\times$ 11 = 4,070 combinations). We also evaluate the effect of data augmentation on adversarial robustness using obfuscated texts. The results indicate that all tested AO methods can cause evasion of automated detection in all tested languages, where homoglyph attacks are especially successful. However, some of the AO methods severely damaged the text, making it no longer readable or easily recognizable by humans (e.g., changed language, weird characters).
△ Less
Submitted 18 June, 2024; v1 submitted 15 January, 2024;
originally announced January 2024.
-
A symplectic approach to Schrödinger equations in the infinite-dimensional unbounded setting
Authors:
Javier de Lucas,
Julia Lange,
Xavier Rivas
Abstract:
By using the theory of analytic vectors and manifolds modelled on normed spaces, we provide a rigorous symplectic differential geometric approach to $t$-dependent Schrödinger equations on separable (possibly infinite-dimensional) Hilbert spaces determined by unbounded $t$-dependent self-adjoint Hamiltonians satisfying a technical condition. As an application, the Marsden--Weinstein reduction proce…
▽ More
By using the theory of analytic vectors and manifolds modelled on normed spaces, we provide a rigorous symplectic differential geometric approach to $t$-dependent Schrödinger equations on separable (possibly infinite-dimensional) Hilbert spaces determined by unbounded $t$-dependent self-adjoint Hamiltonians satisfying a technical condition. As an application, the Marsden--Weinstein reduction procedure is employed to map above-mentioned $t$-dependent Schrödinger equations onto their projective spaces. Other applications of physical and mathematical relevance are also analysed.
△ Less
Submitted 14 December, 2023;
originally announced December 2023.
-
Quasi-rectifiable Lie algebras for partial differential equations
Authors:
A. M. Grundland,
J. de Lucas
Abstract:
We introduce families of quasi-rectifiable vector fields and study their geometric and algebraic aspects. Then, we analyse their applications to systems of partial differential equations. Our results explain, in a simpler manner, previous findings about hydrodynamic-type equations. Facts concerning families of quasi-rectifiable vector fields, their relation to Hamiltonian systems, and practical pr…
▽ More
We introduce families of quasi-rectifiable vector fields and study their geometric and algebraic aspects. Then, we analyse their applications to systems of partial differential equations. Our results explain, in a simpler manner, previous findings about hydrodynamic-type equations. Facts concerning families of quasi-rectifiable vector fields, their relation to Hamiltonian systems, and practical procedures for studying such families are developed. We introduce and analyse quasi-rectifiable Lie algebras, which are motivated by geometric and practical reasons. We classify different types of quasi-rectifiable Lie algebras, e.g. indecomposable ones up to dimension five. New methods for solving systems of hydrodynamic-type equations are established to illustrate our results. In particular, we study hydrodynamic-type systems admitting $k$-wave solutions through quasi-rectifiable Lie algebras of vector fields. We develop techniques for obtaining the submanifolds related to quasi-rectifiable Lie algebras of vector fields and systems of partial differential equations admitting a nonlinear superposition rule: the PDE Lie systems.
△ Less
Submitted 8 April, 2024; v1 submitted 8 December, 2023;
originally announced December 2023.
-
Graph Metanetworks for Processing Diverse Neural Architectures
Authors:
Derek Lim,
Haggai Maron,
Marc T. Law,
Jonathan Lorraine,
James Lucas
Abstract:
Neural networks efficiently encode learned information within their parameters. Consequently, many tasks can be unified by treating neural networks themselves as input data. When doing so, recent studies demonstrated the importance of accounting for the symmetries and geometry of parameter spaces. However, those works developed architectures tailored to specific networks such as MLPs and CNNs with…
▽ More
Neural networks efficiently encode learned information within their parameters. Consequently, many tasks can be unified by treating neural networks themselves as input data. When doing so, recent studies demonstrated the importance of accounting for the symmetries and geometry of parameter spaces. However, those works developed architectures tailored to specific networks such as MLPs and CNNs without normalization layers, and generalizing such architectures to other types of networks can be challenging. In this work, we overcome these challenges by building new metanetworks - neural networks that take weights from other neural networks as input. Put simply, we carefully build graphs representing the input neural networks and process the graphs using graph neural networks. Our approach, Graph Metanetworks (GMNs), generalizes to neural architectures where competing methods struggle, such as multi-head attention layers, normalization layers, convolutional layers, ResNet blocks, and group-equivariant linear layers. We prove that GMNs are expressive and equivariant to parameter permutation symmetries that leave the input neural network functions unchanged. We validate the effectiveness of our method on several metanetwork tasks over diverse neural network architectures.
△ Less
Submitted 29 December, 2023; v1 submitted 7 December, 2023;
originally announced December 2023.
-
An energy-momentum method for ordinary differential equations with an underlying $k$-polysymplectic manifold
Authors:
Leonardo Colombo,
Javier de Lucas,
Xavier Rivas,
Bartosz M. Zawora
Abstract:
This work presents a comprehensive review of the $k$-polysymplectic Marsden-Weinstein reduction theory, rectifying prior errors and inaccuracies in the literature while introducing novel findings. It also emphasises the genuine practical significance of seemingly minor technical details. On this basis, we introduce a novel $k$-polysymplectic energy-momentum method, new related stability analysis t…
▽ More
This work presents a comprehensive review of the $k$-polysymplectic Marsden-Weinstein reduction theory, rectifying prior errors and inaccuracies in the literature while introducing novel findings. It also emphasises the genuine practical significance of seemingly minor technical details. On this basis, we introduce a novel $k$-polysymplectic energy-momentum method, new related stability analysis techniques, and apply them to Hamiltonian systems of ordinary differential equations relative to a $k$-polysymplectic manifold. We provide detailed examples of both physical and mathematical significance, including the study of complex Schwarz equations related to the Schwarz derivative, a series of isotropic oscillators, integrable Hamiltonian systems, quantum oscillators with dissipation, affine systems of differential equations, and polynomial dynamical systems.
△ Less
Submitted 25 November, 2023;
originally announced November 2023.
-
Towards a Transportable Causal Network Model Based on Observational Healthcare Data
Authors:
Alice Bernasconi,
Alessio Zanga,
Peter J. F. Lucas,
Marco Scutari,
Fabio Stella
Abstract:
Over the last decades, many prognostic models based on artificial intelligence techniques have been used to provide detailed predictions in healthcare. Unfortunately, the real-world observational data used to train and validate these models are almost always affected by biases that can strongly impact the outcomes validity: two examples are values missing not-at-random and selection bias. Addressi…
▽ More
Over the last decades, many prognostic models based on artificial intelligence techniques have been used to provide detailed predictions in healthcare. Unfortunately, the real-world observational data used to train and validate these models are almost always affected by biases that can strongly impact the outcomes validity: two examples are values missing not-at-random and selection bias. Addressing them is a key element in achieving transportability and in studying the causal relationships that are critical in clinical decision making, going beyond simpler statistical approaches based on probabilistic association.
In this context, we propose a novel approach that combines selection diagrams, missingness graphs, causal discovery and prior knowledge into a single graphical model to estimate the cardiovascular risk of adolescent and young females who survived breast cancer. We learn this model from data comprising two different cohorts of patients. The resulting causal network model is validated by expert clinicians in terms of risk assessment, accuracy and explainability, and provides a prognostic model that outperforms competing machine learning methods.
△ Less
Submitted 20 November, 2023; v1 submitted 13 November, 2023;
originally announced November 2023.
-
The Simons Observatory: Large-Scale Characterization of 90/150 GHz TES Detector Modules
Authors:
Daniel Dutcher,
Shannon M. Duff,
John C. Groh,
Erin Healy,
Johannes Hubmayr,
Bradley R. Johnson,
Dante Jones,
Ben Keller,
Lawrence T. Lin,
Michael J. Link,
Tammy J. Lucas,
Samuel Morgan,
Yudai Seino,
Rita F. Sonka,
Suzanne T. Staggs,
Yuhan Wang,
Kaiwen Zheng
Abstract:
The Simons Observatory (SO) is a cosmic microwave background instrumentation suite being deployed in the Atacama Desert in northern Chile. The telescopes within SO use three types of dichroic transition-edge sensor (TES) detector arrays, with the 90 and 150 GHz Mid-Frequency (MF) arrays containing 65% of the approximately 68,000 detectors in the first phase of SO. All of the 26 required MF detecto…
▽ More
The Simons Observatory (SO) is a cosmic microwave background instrumentation suite being deployed in the Atacama Desert in northern Chile. The telescopes within SO use three types of dichroic transition-edge sensor (TES) detector arrays, with the 90 and 150 GHz Mid-Frequency (MF) arrays containing 65% of the approximately 68,000 detectors in the first phase of SO. All of the 26 required MF detector arrays have now been fabricated, packaged into detector modules, and tested in laboratory cryostats. Across all modules, we find an average operable detector yield of 84% and median saturation powers of (2.8, 8.0) pW with interquartile ranges of (1, 2) pW at (90, 150) GHz, respectively, falling within their targeted ranges. We measure TES normal resistances and superconducting transition temperatures on each detector wafer to be uniform within 3%, with overall central values of 7.5 mohm and 165 mK, respectively. Results on time constants, optical efficiency, and noise performance are also presented and are consistent with achieving instrument sensitivity forecasts.
△ Less
Submitted 29 January, 2024; v1 submitted 9 November, 2023;
originally announced November 2023.
-
Fighting Fire with Fire: The Dual Role of LLMs in Crafting and Detecting Elusive Disinformation
Authors:
Jason Lucas,
Adaku Uchendu,
Michiharu Yamashita,
Jooyoung Lee,
Shaurya Rohatgi,
Dongwon Lee
Abstract:
Recent ubiquity and disruptive impacts of large language models (LLMs) have raised concerns about their potential to be misused (.i.e, generating large-scale harmful and misleading content). To combat this emerging risk of LLMs, we propose a novel "Fighting Fire with Fire" (F3) strategy that harnesses modern LLMs' generative and emergent reasoning capabilities to counter human-written and LLM-gene…
▽ More
Recent ubiquity and disruptive impacts of large language models (LLMs) have raised concerns about their potential to be misused (.i.e, generating large-scale harmful and misleading content). To combat this emerging risk of LLMs, we propose a novel "Fighting Fire with Fire" (F3) strategy that harnesses modern LLMs' generative and emergent reasoning capabilities to counter human-written and LLM-generated disinformation. First, we leverage GPT-3.5-turbo to synthesize authentic and deceptive LLM-generated content through paraphrase-based and perturbation-based prefix-style prompts, respectively. Second, we apply zero-shot in-context semantic reasoning techniques with cloze-style prompts to discern genuine from deceptive posts and news articles. In our extensive experiments, we observe GPT-3.5-turbo's zero-shot superiority for both in-distribution and out-of-distribution datasets, where GPT-3.5-turbo consistently achieved accuracy at 68-72%, unlike the decline observed in previous customized and fine-tuned disinformation detectors. Our codebase and dataset are available at https://github.com/mickeymst/F3.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
MULTITuDE: Large-Scale Multilingual Machine-Generated Text Detection Benchmark
Authors:
Dominik Macko,
Robert Moro,
Adaku Uchendu,
Jason Samuel Lucas,
Michiharu Yamashita,
Matúš Pikuliak,
Ivan Srba,
Thai Le,
Dongwon Lee,
Jakub Simko,
Maria Bielikova
Abstract:
There is a lack of research into capabilities of recent LLMs to generate convincing text in languages other than English and into performance of detectors of machine-generated text in multilingual settings. This is also reflected in the available benchmarks which lack authentic texts in languages other than English and predominantly cover older generators. To fill this gap, we introduce MULTITuDE,…
▽ More
There is a lack of research into capabilities of recent LLMs to generate convincing text in languages other than English and into performance of detectors of machine-generated text in multilingual settings. This is also reflected in the available benchmarks which lack authentic texts in languages other than English and predominantly cover older generators. To fill this gap, we introduce MULTITuDE, a novel benchmarking dataset for multilingual machine-generated text detection comprising of 74,081 authentic and machine-generated texts in 11 languages (ar, ca, cs, de, en, es, nl, pt, ru, uk, and zh) generated by 8 multilingual LLMs. Using this benchmark, we compare the performance of zero-shot (statistical and black-box) and fine-tuned detectors. Considering the multilinguality, we evaluate 1) how these detectors generalize to unseen languages (linguistically similar as well as dissimilar) and unseen LLMs and 2) whether the detectors improve their performance when trained on multiple languages.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
Geometry preserving numerical methods for physical systems with finite-dimensional Lie algebras
Authors:
L. Blanco,
F. Jiménez Alburquerque,
J. de Lucas,
C. Sardón
Abstract:
We propose a geometric integrator to numerically approximate the flow of Lie systems. The key is a novel procedure that integrates the Lie system on a Lie group intrinsically associated with a Lie system on a general manifold via a Lie group action, and then generates the discrete solution of the Lie system on the manifold via a solution of the Lie system on the Lie group. One major result from th…
▽ More
We propose a geometric integrator to numerically approximate the flow of Lie systems. The key is a novel procedure that integrates the Lie system on a Lie group intrinsically associated with a Lie system on a general manifold via a Lie group action, and then generates the discrete solution of the Lie system on the manifold via a solution of the Lie system on the Lie group. One major result from the integration of a Lie system on a Lie group is that one is able to solve all associated Lie systems on manifolds at the same time, and that Lie systems on Lie groups can be described through first-order systems of linear homogeneous ordinary differential equations (ODEs) in normal form. This brings a lot of advantages, since solving a linear system of ODEs involves less numerical cost. Specifically, we use two families of numerical schemes on the Lie group, which are designed to preserve its geometrical structure: the first one based on the Magnus expansion, whereas the second is based on Runge-Kutta-Munthe-Kaas (RKMK) methods. Moreover, since the aforementioned action relates the Lie group and the manifold where the Lie system evolves, the resulting integrator preserves any geometric structure of the latter. We compare both methods for Lie systems with geometric invariants, particularly a class on Lie systems on curved spaces. We also illustrate the superiority of our method for describing long-term behavior and for differential equations admitting solutions whose geometric features depends heavily on initial conditions. As already mentioned, our milestone is to show that the method we propose preserves all the geometric invariants very faithfully, in comparison with nongeometric numerical methods.
△ Less
Submitted 2 December, 2023; v1 submitted 1 August, 2023;
originally announced August 2023.
-
Hamiltonian stochastic Lie systems and applications
Authors:
J. de Lucas,
X. Rivas,
M. Zajac
Abstract:
This paper provides a practical approach to stochastic Lie systems, i.e. stochastic differential equations whose general solutions can be written as a function depending only on a generic family of particular solutions and some constants, so as to emphasise their applications. We correct the known stochastic Lie theorem characterising stochastic Lie systems, proving that, contrary to previous clai…
▽ More
This paper provides a practical approach to stochastic Lie systems, i.e. stochastic differential equations whose general solutions can be written as a function depending only on a generic family of particular solutions and some constants, so as to emphasise their applications. We correct the known stochastic Lie theorem characterising stochastic Lie systems, proving that, contrary to previous claims, it satisfies the Malliavin's principle. Meanwhile, we show that stochastic Lie systems admit new stochastic features in the Ito approach. New generalisations of stochastic Lie systems, like the so-called stochastic foliated Lie systems, are devised. Subsequently, we focus on stochastic (foliated) Lie systems that can be studied as Hamiltonian systems using different types of differential geometric structures. We study their stability properties and we devise the basics of an energy-momentum method. A stochastic Poisson coalgebra method is developed to derive superposition rules for Hamiltonian stochastic Lie systems. Applications of our results are found in coronavirus stochastic models, stochastic Lotka-Volterra systems, stochastic SIS models of different types, etc. Our results improve previous approaches by using stochastic differential equations instead of deterministic models designed to grasp some of their stochastic features.
△ Less
Submitted 12 July, 2023;
originally announced July 2023.
-
On Darboux theorems for geometric structures induced by closed forms
Authors:
Xavier Gràcia,
Javier de Lucas,
Xavier Rivas,
Narciso Román-Roy
Abstract:
This work reviews the classical Darboux theorem for symplectic, presymplectic, and cosymplectic manifolds (which are used to describe regular and singular mechanical systems), and certain cases of multisymplectic manifolds, and extends it in new ways to k-symplectic and k-cosymplectic manifolds (all these structures appear in the geometric formulation of first-order classical field theories). More…
▽ More
This work reviews the classical Darboux theorem for symplectic, presymplectic, and cosymplectic manifolds (which are used to describe regular and singular mechanical systems), and certain cases of multisymplectic manifolds, and extends it in new ways to k-symplectic and k-cosymplectic manifolds (all these structures appear in the geometric formulation of first-order classical field theories). Moreover, we discuss the existence of Darboux theorems for classes of precosymplectic, k-presymplectic, k-precosymplectic, and premultisymplectic manifolds, which are the geometrical structures underlying some kinds of singular field theories. Approaches to Darboux theorems based on flat connections associated with geometric structures are given, while new results on polarisations for (k-)(pre)(co)symplectic structures arise.
△ Less
Submitted 6 July, 2023; v1 submitted 14 June, 2023;
originally announced June 2023.
-
ATT3D: Amortized Text-to-3D Object Synthesis
Authors:
Jonathan Lorraine,
Kevin Xie,
Xiaohui Zeng,
Chen-Hsuan Lin,
Towaki Takikawa,
Nicholas Sharp,
Tsung-Yi Lin,
Ming-Yu Liu,
Sanja Fidler,
James Lucas
Abstract:
Text-to-3D modelling has seen exciting progress by combining generative text-to-image models with image-to-3D methods like Neural Radiance Fields. DreamFusion recently achieved high-quality results but requires a lengthy, per-prompt optimization to create 3D objects. To address this, we amortize optimization over text prompts by training on many prompts simultaneously with a unified model, instead…
▽ More
Text-to-3D modelling has seen exciting progress by combining generative text-to-image models with image-to-3D methods like Neural Radiance Fields. DreamFusion recently achieved high-quality results but requires a lengthy, per-prompt optimization to create 3D objects. To address this, we amortize optimization over text prompts by training on many prompts simultaneously with a unified model, instead of separately. With this, we share computation across a prompt set, training in less time than per-prompt optimization. Our framework - Amortized text-to-3D (ATT3D) - enables knowledge-sharing between prompts to generalize to unseen setups and smooth interpolations between text for novel assets and simple animations.
△ Less
Submitted 6 June, 2023;
originally announced June 2023.
-
Scalable telomere-to-telomere assembly for diploid and polyploid genomes with double graph
Authors:
Haoyu Cheng,
Mobin Asri,
Julian Lucas,
Sergey Koren,
Heng Li
Abstract:
Despite recent advances in the length and the accuracy of long-read data, building haplotype-resolved genome assemblies from telomere to telomere still requires considerable computational resources. In this study, we present an efficient de novo assembly algorithm that combines multiple sequencing technologies to scale up population-wide telomere-to-telomere assemblies. By utilizing twenty-two hum…
▽ More
Despite recent advances in the length and the accuracy of long-read data, building haplotype-resolved genome assemblies from telomere to telomere still requires considerable computational resources. In this study, we present an efficient de novo assembly algorithm that combines multiple sequencing technologies to scale up population-wide telomere-to-telomere assemblies. By utilizing twenty-two human and two plant genomes, we demonstrate that our algorithm is around an order of magnitude cheaper than existing methods, while producing better diploid and haploid assemblies. Notably, our algorithm is the only feasible solution to the haplotype-resolved assembly of polyploid genomes.
△ Less
Submitted 6 June, 2023;
originally announced June 2023.
-
The Impact of Missing Data on Causal Discovery: A Multicentric Clinical Study
Authors:
Alessio Zanga,
Alice Bernasconi,
Peter J. F. Lucas,
Hanny Pijnenborg,
Casper Reijnen,
Marco Scutari,
Fabio Stella
Abstract:
Causal inference for testing clinical hypotheses from observational data presents many difficulties because the underlying data-generating model and the associated causal graph are not usually available. Furthermore, observational data may contain missing values, which impact the recovery of the causal graph by causal discovery algorithms: a crucial issue often ignored in clinical studies. In this…
▽ More
Causal inference for testing clinical hypotheses from observational data presents many difficulties because the underlying data-generating model and the associated causal graph are not usually available. Furthermore, observational data may contain missing values, which impact the recovery of the causal graph by causal discovery algorithms: a crucial issue often ignored in clinical studies. In this work, we use data from a multi-centric study on endometrial cancer to analyze the impact of different missingness mechanisms on the recovered causal graph. This is achieved by extending state-of-the-art causal discovery algorithms to exploit expert knowledge without sacrificing theoretical soundness. We validate the recovered graph with expert physicians, showing that our approach finds clinically-relevant solutions. Finally, we discuss the goodness of fit of our graph and its consistency from a clinical decision-making perspective using graphical separation to validate causal pathways.
△ Less
Submitted 3 November, 2023; v1 submitted 17 May, 2023;
originally announced May 2023.
-
Risk Assessment of Lymph Node Metastases in Endometrial Cancer Patients: A Causal Approach
Authors:
Alessio Zanga,
Alice Bernasconi,
Peter J. F. Lucas,
Hanny Pijnenborg,
Casper Reijnen,
Marco Scutari,
Fabio Stella
Abstract:
Assessing the pre-operative risk of lymph node metastases in endometrial cancer patients is a complex and challenging task. In principle, machine learning and deep learning models are flexible and expressive enough to capture the dynamics of clinical risk assessment. However, in this setting we are limited to observational data with quality issues, missing values, small sample size and high dimens…
▽ More
Assessing the pre-operative risk of lymph node metastases in endometrial cancer patients is a complex and challenging task. In principle, machine learning and deep learning models are flexible and expressive enough to capture the dynamics of clinical risk assessment. However, in this setting we are limited to observational data with quality issues, missing values, small sample size and high dimensionality: we cannot reliably learn such models from limited observational data with these sources of bias. Instead, we choose to learn a causal Bayesian network to mitigate the issues above and to leverage the prior knowledge on endometrial cancer available from clinicians and physicians. We introduce a causal discovery algorithm for causal Bayesian networks based on bootstrap resampling, as opposed to the single imputation used in related works. Moreover, we include a context variable to evaluate whether selection bias results in learning spurious associations. Finally, we discuss the strengths and limitations of our findings in light of the presence of missing data that may be missing-not-at-random, which is common in real-world clinical settings.
△ Less
Submitted 17 May, 2023;
originally announced May 2023.
-
On the use of organic semiconductors as handles for optical tweezers experiments: trapping and manipulating polyaniline (PANI) microparticles
Authors:
Kairon M. Oliveira,
Tiago A. Moura,
Janaisa L. C. Lucas,
Alvaro V. N. C. Teixeira,
Marcio S. Rocha,
Joaquim B. S. Mendes
Abstract:
Here we propose the use of the organic semiconductor polyaniline (PANI) for the preparation of spherical-shaped microparticles to serve as handles in optical tweezers (OT) experiments. The stable trapping and manipulation of PANI beads was demonstrated for the first time, using a Gaussian ($TEM_{00}$) beam optical tweezers. The trap stiffness was characterized for various different parameters such…
▽ More
Here we propose the use of the organic semiconductor polyaniline (PANI) for the preparation of spherical-shaped microparticles to serve as handles in optical tweezers (OT) experiments. The stable trapping and manipulation of PANI beads was demonstrated for the first time, using a Gaussian ($TEM_{00}$) beam optical tweezers. The trap stiffness was characterized for various different parameters such as the bead radius, the laser power and the distance between the bead and the coverslip of the sample chamber, attesting the viability of using such material for optical manipulation. Since the effective optical properties of PANI can be modulated by the synthesis process, new related applications are also proposed. The results of the present work therefore open the door for using semiconductor polymeric materials in OT applications.
△ Less
Submitted 5 March, 2023;
originally announced March 2023.
-
Implementing engrams from a machine learning perspective: matching for prediction
Authors:
Jesus Marco de Lucas
Abstract:
Despite evidence for the existence of engrams as memory support structures in our brains, there is no consensus framework in neuroscience as to what their physical implementation might be. Here we propose how we might design a computer system to implement engrams using neural networks, with the main aim of exploring new ideas using machine learning techniques, guided by challenges in neuroscience.…
▽ More
Despite evidence for the existence of engrams as memory support structures in our brains, there is no consensus framework in neuroscience as to what their physical implementation might be. Here we propose how we might design a computer system to implement engrams using neural networks, with the main aim of exploring new ideas using machine learning techniques, guided by challenges in neuroscience. Building on autoencoders, we propose latent neural spaces as indexes for storing and retrieving information in a compressed format. We consider this technique as a first step towards predictive learning: autoencoders are designed to compare reconstructed information with the original information received, providing a kind of predictive ability, which is an attractive evolutionary argument. We then consider how different states in latent neural spaces corresponding to different types of sensory input could be linked by synchronous activation, providing the basis for a sparse implementation of memory using concept neurons. Finally, we list some of the challenges and questions that link neuroscience and data science and that could have implications for both fields, and conclude that a more interdisciplinary approach is needed, as many scientists have already suggested.
△ Less
Submitted 1 March, 2023;
originally announced March 2023.
-
On k-polycosymplectic Marsden-Weinstein reductions
Authors:
J. de Lucas,
X. Rivas,
S. Vilariño,
B. M. Zawora
Abstract:
We review and slightly improve the known k-polysymplectic Marsden--Weinstein reduction theory by removing some technical conditions on k-polysymplectic momentum maps by developing a theory of affine Lie group actions for k-polysymplectic momentum maps, removing the necessity of their co-adjoint equivariance. Then, we focus on the analysis of a particular case of k-polysymplectic manifolds, the so-…
▽ More
We review and slightly improve the known k-polysymplectic Marsden--Weinstein reduction theory by removing some technical conditions on k-polysymplectic momentum maps by developing a theory of affine Lie group actions for k-polysymplectic momentum maps, removing the necessity of their co-adjoint equivariance. Then, we focus on the analysis of a particular case of k-polysymplectic manifolds, the so-called fibred ones, and we study their k-polysymplectic Marsden--Weinstein reductions. Previous results allow us to devise a k-polycosymplectic Marsden--Weinstein reduction theory, which represents one of our main results. Our findings are applied to study coupled vibrating strings and, more generally, k-polycosymplectic Hamiltonian systems with field symmetries. We show that k-polycosymplectic geometry can be understood as a particular type of k-polysymplectic geometry. Finally, a k-cosymplectic to l-cosymplectic geometric reduction theory is presented, which reduces, geometrically, the space-time variables in a k-cosymplectic framework. An application of this latter result to a vibrating membrane with symmetries is given.
△ Less
Submitted 5 June, 2023; v1 submitted 17 February, 2023;
originally announced February 2023.
-
Cosymplectic geometry, reductions, and energy-momentum methods with applications
Authors:
J. de Lucas,
A. Maskalaniec,
B. M. Zawora
Abstract:
Classical energy-momentum methods study the existence and stability properties of solutions of $t$-dependent Hamilton equations on symplectic manifolds whose evolution is given by their Hamiltonian Lie symmetries. The points of such solutions are called relative equilibrium points. This work devises a new cosymplectic energy-momentum method providing a new and more general framework to study $t$-d…
▽ More
Classical energy-momentum methods study the existence and stability properties of solutions of $t$-dependent Hamilton equations on symplectic manifolds whose evolution is given by their Hamiltonian Lie symmetries. The points of such solutions are called relative equilibrium points. This work devises a new cosymplectic energy-momentum method providing a new and more general framework to study $t$-dependent Hamilton equations. In fact, cosymplectic geometry allows for using more types of distinguished Lie symmetries (given by Hamiltonian, gradient, or evolution vector fields), relative equilibrium points, and reduction methods, than symplectic techniques. To make our work more self-contained and to fill some gaps in the literature, a review of the cosymplectic formalism and the cosymplectic Marsden-Weinstein reduction is included. Known and new types of relative equilibrium points are characterised and studied. Our methods remove technical conditions used in previous energy-momentum methods, like the ${\rm Ad}^*$-equivariance of momentum maps. Eigenfunctions of $t$-dependent Schrödinger equations are interpreted in terms of relative equilibrium points in cosymplectic manifolds. A new cosymplectic-to-symplectic reduction is developed and a new associated type of relative equilibrium points, the so-called gradient relative equilibrium points, are introduced and applied to study the Lagrange points and Hill radii of a restricted circular three-body system by means of a not Hamiltonian Lie symmetry of the system.
△ Less
Submitted 23 November, 2023; v1 submitted 11 February, 2023;
originally announced February 2023.
-
Bridging the Sim2Real gap with CARE: Supervised Detection Adaptation with Conditional Alignment and Reweighting
Authors:
Viraj Prabhu,
David Acuna,
Andrew Liao,
Rafid Mahmood,
Marc T. Law,
Judy Hoffman,
Sanja Fidler,
James Lucas
Abstract:
Sim2Real domain adaptation (DA) research focuses on the constrained setting of adapting from a labeled synthetic source domain to an unlabeled or sparsely labeled real target domain. However, for high-stakes applications (e.g. autonomous driving), it is common to have a modest amount of human-labeled real data in addition to plentiful auto-labeled source data (e.g. from a driving simulator). We st…
▽ More
Sim2Real domain adaptation (DA) research focuses on the constrained setting of adapting from a labeled synthetic source domain to an unlabeled or sparsely labeled real target domain. However, for high-stakes applications (e.g. autonomous driving), it is common to have a modest amount of human-labeled real data in addition to plentiful auto-labeled source data (e.g. from a driving simulator). We study this setting of supervised sim2real DA applied to 2D object detection. We propose Domain Translation via Conditional Alignment and Reweighting (CARE) a novel algorithm that systematically exploits target labels to explicitly close the sim2real appearance and content gaps. We present an analytical justification of our algorithm and demonstrate strong gains over competing methods on standard benchmarks.
△ Less
Submitted 9 February, 2023;
originally announced February 2023.
-
The Calibration Generalization Gap
Authors:
A. Michael Carrell,
Neil Mallinar,
James Lucas,
Preetum Nakkiran
Abstract:
Calibration is a fundamental property of a good predictive model: it requires that the model predicts correctly in proportion to its confidence. Modern neural networks, however, provide no strong guarantees on their calibration -- and can be either poorly calibrated or well-calibrated depending on the setting. It is currently unclear which factors contribute to good calibration (architecture, data…
▽ More
Calibration is a fundamental property of a good predictive model: it requires that the model predicts correctly in proportion to its confidence. Modern neural networks, however, provide no strong guarantees on their calibration -- and can be either poorly calibrated or well-calibrated depending on the setting. It is currently unclear which factors contribute to good calibration (architecture, data augmentation, overparameterization, etc), though various claims exist in the literature.
We propose a systematic way to study the calibration error: by decomposing it into (1) calibration error on the train set, and (2) the calibration generalization gap. This mirrors the fundamental decomposition of generalization. We then investigate each of these terms, and give empirical evidence that (1) DNNs are typically always calibrated on their train set, and (2) the calibration generalization gap is upper-bounded by the standard generalization gap. Taken together, this implies that models with small generalization gap (|Test Error - Train Error|) are well-calibrated. This perspective unifies many results in the literature, and suggests that interventions which reduce the generalization gap (such as adding data, using heavy augmentation, or smaller model size) also improve calibration. We thus hope our initial study lays the groundwork for a more systematic and comprehensive understanding of the relation between calibration, generalization, and optimization.
△ Less
Submitted 6 October, 2022; v1 submitted 4 October, 2022;
originally announced October 2022.
-
Optimizing Data Collection for Machine Learning
Authors:
Rafid Mahmood,
James Lucas,
Jose M. Alvarez,
Sanja Fidler,
Marc T. Law
Abstract:
Modern deep learning systems require huge data sets to achieve impressive performance, but there is little guidance on how much or what kind of data to collect. Over-collecting data incurs unnecessary present costs, while under-collecting may incur future costs and delay workflows. We propose a new paradigm for modeling the data collection workflow as a formal optimal data collection problem that…
▽ More
Modern deep learning systems require huge data sets to achieve impressive performance, but there is little guidance on how much or what kind of data to collect. Over-collecting data incurs unnecessary present costs, while under-collecting may incur future costs and delay workflows. We propose a new paradigm for modeling the data collection workflow as a formal optimal data collection problem that allows designers to specify performance targets, collection costs, a time horizon, and penalties for failing to meet the targets. Additionally, this formulation generalizes to tasks requiring multiple data sources, such as labeled and unlabeled data used in semi-supervised learning. To solve our problem, we develop Learn-Optimize-Collect (LOC), which minimizes expected future collection costs. Finally, we numerically compare our framework to the conventional baseline of estimating data requirements by extrapolating from neural scaling laws. We significantly reduce the risks of failing to meet desired performance targets on several classification, segmentation, and detection tasks, while maintaining low total collection costs.
△ Less
Submitted 3 October, 2022;
originally announced October 2022.
-
More on superintegrable models on spaces of constant curvature
Authors:
Cezary Gonera,
Joanna Gonera,
Javier de Lucas,
Wioletta Szczesek,
Bartosz Zawora
Abstract:
A known general class of superintegrable systems on 2D spaces of constant curvature can be defined by potentials separating in (geodesic) polar coordinates. The radial parts of these potentials correspond either to an isotropic harmonic oscillator or a generalised Kepler potential. The angular components, on the contrary, are given implicitly by a transcendental, in general, equation. In the prese…
▽ More
A known general class of superintegrable systems on 2D spaces of constant curvature can be defined by potentials separating in (geodesic) polar coordinates. The radial parts of these potentials correspond either to an isotropic harmonic oscillator or a generalised Kepler potential. The angular components, on the contrary, are given implicitly by a transcendental, in general, equation. In the present note, devoted to the previously less studied models with the radial potential of the generalised Kepler type, a new two-parameter family of relevant angular potentials is constructed in terms of elementary functions. For an appropriate choice of parameters, the family reduces to an asymmetric spherical Higgs oscillator.
△ Less
Submitted 18 July, 2022;
originally announced July 2022.
-
Contact Lie systems
Authors:
Javier de Lucas,
Xavier Rivas
Abstract:
We define and analyse the properties of contact Lie systems, namely systems of first-order differential equations describing the integral curves of a $t$-dependent vector field taking values in a finite-dimensional Lie algebra of Hamiltonian vector fields relative to a contact structure. As a particular example, we study families of conservative contact Lie systems. Liouville theorems, contact red…
▽ More
We define and analyse the properties of contact Lie systems, namely systems of first-order differential equations describing the integral curves of a $t$-dependent vector field taking values in a finite-dimensional Lie algebra of Hamiltonian vector fields relative to a contact structure. As a particular example, we study families of conservative contact Lie systems. Liouville theorems, contact reductions, and Gromov non-squeezing theorems are developed and applied to contact Lie systems. Our results are illustrated by examples with relevant physical and mathematical applications, e.g. Schwarz equations, Brockett systems, etcetera.
△ Less
Submitted 25 October, 2022; v1 submitted 8 July, 2022;
originally announced July 2022.
-
How Much More Data Do I Need? Estimating Requirements for Downstream Tasks
Authors:
Rafid Mahmood,
James Lucas,
David Acuna,
Daiqing Li,
Jonah Philion,
Jose M. Alvarez,
Zhiding Yu,
Sanja Fidler,
Marc T. Law
Abstract:
Given a small training data set and a learning algorithm, how much more data is necessary to reach a target validation or test performance? This question is of critical importance in applications such as autonomous driving or medical imaging where collecting data is expensive and time-consuming. Overestimating or underestimating data requirements incurs substantial costs that could be avoided with…
▽ More
Given a small training data set and a learning algorithm, how much more data is necessary to reach a target validation or test performance? This question is of critical importance in applications such as autonomous driving or medical imaging where collecting data is expensive and time-consuming. Overestimating or underestimating data requirements incurs substantial costs that could be avoided with an adequate budget. Prior work on neural scaling laws suggest that the power-law function can fit the validation performance curve and extrapolate it to larger data set sizes. We find that this does not immediately translate to the more difficult downstream task of estimating the required data set size to meet a target performance. In this work, we consider a broad class of computer vision tasks and systematically investigate a family of functions that generalize the power-law function to allow for better estimation of data requirements. Finally, we show that incorporating a tuned correction factor and collecting over multiple rounds significantly improves the performance of the data estimators. Using our guidelines, practitioners can accurately estimate data requirements of machine learning systems to gain savings in both development time and data acquisition costs.
△ Less
Submitted 13 July, 2022; v1 submitted 4 July, 2022;
originally announced July 2022.
-
Assembly development for the Simons Observatory focal plane readout module
Authors:
Erin Healy,
Aamir M. Ali,
Kam Arnold,
Jason E. Austermann,
James A. Beall,
Sarah Marie Bruno,
Steve K. Choi,
Jake Connors,
Nicholas F. Cothar,
Bradley Dober,
Shannon M. Duff,
Nicholas Galitzki,
Gene Hilton,
Shuay-Pwu Patty Ho,
Johannes Hubmayr,
Bradley R. Johnson,
Yaqiong Li,
Michael J. Link,
Tammy J. Lucas,
Heather McCarrick,
Michael D. Niemack,
Maximiliano Silva-Feaver,
Rita F. Sonka,
Suzanne Staggs,
Eve M. Vavagiakis
, et al. (6 additional authors not shown)
Abstract:
The Simons Observatory (SO) is a suite of instruments sensitive to temperature and polarization of the cosmic microwave background (CMB) to be located at Cerro Toco in the Atacama Desert in Chile. Five telescopes, one large aperture telescope and four small aperture telescopes, will host roughly 70,000 highly multiplexed transition edge sensor (TES) detectors operated at 100 mK. Each SO focal plan…
▽ More
The Simons Observatory (SO) is a suite of instruments sensitive to temperature and polarization of the cosmic microwave background (CMB) to be located at Cerro Toco in the Atacama Desert in Chile. Five telescopes, one large aperture telescope and four small aperture telescopes, will host roughly 70,000 highly multiplexed transition edge sensor (TES) detectors operated at 100 mK. Each SO focal plane module (UFM) couples 1,764 TESes to microwave resonators in a microwave multiplexing (uMux) readout circuit. Before detector integration, the 100 mK uMux components are packaged into multiplexing modules (UMMs), which are independently validated to ensure they meet SO performance specifications. Here we present the assembly developments of these UMM readout packages for mid frequency (90/150 GHz) and ultra high frequency (220/280 GHz) UFMs.
△ Less
Submitted 25 July, 2022; v1 submitted 12 April, 2022;
originally announced April 2022.
-
Quantum quasi-Lie systems: properties and applications
Authors:
J. F. Cariñena,
J. de Lucas,
C. Sardón
Abstract:
A Lie system is a non-autonomous system of ordinary differential equations describing the integral curves of a $t$-dependent vector field taking values in a finite-dimensional Lie algebra of vector fields. Lie systems have been generalised in the literature to deal with $t$-dependent Schrödinger equations determined by a particular class of $t$-dependent Hamiltonian operators, the quantum Lie syst…
▽ More
A Lie system is a non-autonomous system of ordinary differential equations describing the integral curves of a $t$-dependent vector field taking values in a finite-dimensional Lie algebra of vector fields. Lie systems have been generalised in the literature to deal with $t$-dependent Schrödinger equations determined by a particular class of $t$-dependent Hamiltonian operators, the quantum Lie systems, and other differential equations through the so-called quasi-Lie schemes. This work extends quasi-Lie schemes and quantum Lie systems to cope with $t$-dependent Schrödinger equations associated with the here called quantum quasi-Lie systems. To illustrate our methods, we propose and study a quantum analogue of the classical nonlinear oscillator searched by Perelomov and we analyse a quantum one-dimensional fluid in a trapping potential along with quantum $t$-dependent Smorodinsky--Winternitz oscillators.
△ Less
Submitted 2 April, 2022;
originally announced April 2022.
-
Geometric numerical methods for Lie systems and their application in optimal control
Authors:
L. Blanco,
F. Jiménez,
J. de Lucas,
C. Sardón
Abstract:
A Lie system is a non-autonomous system of first-order ordinary differential equations whose general solution can be written via an autonomous function, a so-called (nonlinear) superposition rule of a finite number of particular solutions and some parameters to be related to initial conditions. Even if the superposition rules for some Lie systems are known, the explicit analytic expression of thei…
▽ More
A Lie system is a non-autonomous system of first-order ordinary differential equations whose general solution can be written via an autonomous function, a so-called (nonlinear) superposition rule of a finite number of particular solutions and some parameters to be related to initial conditions. Even if the superposition rules for some Lie systems are known, the explicit analytic expression of their solutions frequently is not. This is why this article focuses on a novel geometric attempt to integrate Lie systems analytically and numerically. We focus on two families of methods: those based on Magnus expansions and the Runge-Kutta-Munthe-Kaas method, which are here adapted to the geometric properties of Lie systems. To illustrate the accuracy of our techniques we propose examples based on the SL$(n,\mathbb{R})$ Lie group, which plays a very relevant role in mechanics. In particular, we depict an optimal control problem for a vehicle with quadratic cost function. Particular numerical solutions of the studied examples are given.
△ Less
Submitted 20 June, 2023; v1 submitted 31 March, 2022;
originally announced April 2022.
-
Multiple Riemann wave solutions of the general form of quasilinear hyperbolic systems
Authors:
A. M. Grundland,
J. de Lucas
Abstract:
The objective of this paper is to construct geometrically Riemann $k$-wave solutions of the general form of first-order quasilinear hyperbolic systems of partial differential equations. To this end, we adapt and combine elements of two approaches to the construction of Riemann $k$-waves, namely the symmetry reduction method and the generalized method of characteristics. We formulate a geometrical…
▽ More
The objective of this paper is to construct geometrically Riemann $k$-wave solutions of the general form of first-order quasilinear hyperbolic systems of partial differential equations. To this end, we adapt and combine elements of two approaches to the construction of Riemann $k$-waves, namely the symmetry reduction method and the generalized method of characteristics. We formulate a geometrical setting for the general form of the $k$-wave problem and discuss in detail the conditions for the existence of $k$-wave solutions. An auxiliary result concerning the Frobenius theorem is established. We use it to obtain formulae describing the $k$-wave solutions in closed form. Our theoretical considerations are illustrated by examples of hydrodynamic type systems including the Brownian motion equation.
△ Less
Submitted 28 March, 2022;
originally announced March 2022.
-
Reduction and reconstruction of multisymplectic Lie systems
Authors:
Javier de Lucas,
Xavier Gràcia,
Xavier Rivas,
Narciso Román-Roy,
Silvia Vilariño
Abstract:
A Lie system is a non-autonomous system of first-order ordinary differential equations describing the integral curves of a non-autonomous vector field taking values in a finite-dimensional real Lie algebra of vector fields, a so-called Vessiot--Guldberg Lie algebra. In this work, multisymplectic structures are applied to the study of the reduction of Lie systems through their Lie symmetries. By us…
▽ More
A Lie system is a non-autonomous system of first-order ordinary differential equations describing the integral curves of a non-autonomous vector field taking values in a finite-dimensional real Lie algebra of vector fields, a so-called Vessiot--Guldberg Lie algebra. In this work, multisymplectic structures are applied to the study of the reduction of Lie systems through their Lie symmetries. By using a momentum map, we perform a reduction and reconstruction procedure of multisymplectic Lie systems, which allows us to solve the original problem by analysing several simpler multisymplectic Lie systems. Conversely, we study how reduced multisymplectic Lie systems allow us to retrieve the form of the multisymplectic Lie system that gave rise to them. Our results are illustrated with examples occurring in physics, mathematics, and control theory.
△ Less
Submitted 14 June, 2022; v1 submitted 28 February, 2022;
originally announced February 2022.
-
Causal Scene BERT: Improving object detection by searching for challenging groups of data
Authors:
Cinjon Resnick,
Or Litany,
Amlan Kar,
Karsten Kreis,
James Lucas,
Kyunghyun Cho,
Sanja Fidler
Abstract:
Modern computer vision applications rely on learning-based perception modules parameterized with neural networks for tasks like object detection. These modules frequently have low expected error overall but high error on atypical groups of data due to biases inherent in the training process. In building autonomous vehicles (AV), this problem is an especially important challenge because their perce…
▽ More
Modern computer vision applications rely on learning-based perception modules parameterized with neural networks for tasks like object detection. These modules frequently have low expected error overall but high error on atypical groups of data due to biases inherent in the training process. In building autonomous vehicles (AV), this problem is an especially important challenge because their perception modules are crucial to the overall system performance. After identifying failures in AV, a human team will comb through the associated data to group perception failures that share common causes. More data from these groups is then collected and annotated before retraining the model to fix the issue. In other words, error groups are found and addressed in hindsight. Our main contribution is a pseudo-automatic method to discover such groups in foresight by performing causal interventions on simulated scenes. To keep our interventions on the data manifold, we utilize masked language models. We verify that the prioritized groups found via intervention are challenging for the object detector and show that retraining with data collected from these groups helps inordinately compared to adding more IID data. We also plan to release software to run interventions in simulated scenes, which we hope will benefit the causality community.
△ Less
Submitted 21 April, 2022; v1 submitted 8 February, 2022;
originally announced February 2022.
-
Generalized Nested Rollout Policy Adaptation with Dynamic Bias for Vehicle Routing
Authors:
Julien Sentuc,
Tristan Cazenave,
Jean-Yves Lucas
Abstract:
In this paper we present an extension of the Nested Rollout Policy Adaptation algorithm (NRPA), namely the Generalized Nested Rollout Policy Adaptation (GNRPA), as well as its use for solving some instances of the Vehicle Routing Problem. We detail some results obtained on the Solomon instances set which is a conventional benchmark for the Vehicle Routing Problem (VRP). We show that on all instanc…
▽ More
In this paper we present an extension of the Nested Rollout Policy Adaptation algorithm (NRPA), namely the Generalized Nested Rollout Policy Adaptation (GNRPA), as well as its use for solving some instances of the Vehicle Routing Problem. We detail some results obtained on the Solomon instances set which is a conventional benchmark for the Vehicle Routing Problem (VRP). We show that on all instances, GNRPA performs better than NRPA. On some instances, it performs better than the Google OR Tool module dedicated to VRP.
△ Less
Submitted 29 December, 2021; v1 submitted 12 November, 2021;
originally announced November 2021.
-
Cosmic Ray Induced Mass-Independent Oxygen Isotope Exchange: A Novel Mechanism for Producing $^{16}$O depletions in the Early Solar System
Authors:
G. Dominguez,
J. Lucas,
L. Tafla,
M. C. Liu,
K. McKeegan
Abstract:
A fundamental puzzle of our solar system's formation is understanding why the terrestrial bodies including the planets,comets,and asteroids are depleted in $^{16}$O compared to the Sun. The most favored mechanism,the selective photodissociation of CO gas to produce $^{16}$O depleted water,requires finely tuned mixing timescales to transport $^{16}$O depleted water from the cold outer solar system…
▽ More
A fundamental puzzle of our solar system's formation is understanding why the terrestrial bodies including the planets,comets,and asteroids are depleted in $^{16}$O compared to the Sun. The most favored mechanism,the selective photodissociation of CO gas to produce $^{16}$O depleted water,requires finely tuned mixing timescales to transport $^{16}$O depleted water from the cold outer solar system to exchange isotopically with dust grains to produce the $^{16}$O depleted planetary bodies observed today. Here we show that energetic particle irradiation of SiO$_2$ (and Al$_2$O$_3$) makes them susceptible to anomalous isotope exchange with H$_2$O ice at temperatures as low as 10 K. The observed magnitude of the anomalous isotope exchange (D$^{17}$O) is sufficient to generate the $^{16}$O depletion characteristic of the terrestrial bodies in the solar system. We calculated the cosmic-ray exposure times needed to produce the observed $^{16}$O depletions in silicate (SiO2) dust in the interstellar medium and early solar system and find that radiation damage induced oxygen isotope exchange could have rapidly (~10-100 yrs) depleted dust grains of $^{16}$O during the Sun's T-Tauri phase. Our model explains whythe oldest and most refractory minerals found in the solar system, the anhydrous Calcium with Aluminum Inclusions (CAIs),are generally $^{16}$O enriched compared to chondrules and the bulk terrestrial solids and provides a mechanism for producing $^{16}$O depleted grains very early in the solar system's history. Our findings have broad implications for the distribution of oxygen isotopes in the solar system, the interstellar medium, the formation of the planets and its building blocks as well as the nature of mass-independent isotope effects.
△ Less
Submitted 10 September, 2021;
originally announced September 2021.
-
The Simons Observatory microwave SQUID multiplexing detector module design
Authors:
Heather McCarrick,
Erin Healy,
Zeeshan Ahmed,
Kam Arnold,
Zachary Atkins,
Jason E. Austermann,
Tanay Bhandarkar,
Jim A. Beall,
Sarah Marie Bruno,
Steve K. Choi,
Jake Connors,
Nicholas F. Cothard,
Kevin D. Crowley,
Simon Dicker,
Bradley Dober,
Cody J. Duell,
Shannon M. Duff,
Daniel Dutcher,
Josef C. Frisch,
Nicholas Galitzki,
Megan B. Gralla,
Jon E. Gudmundsson,
Shawn W. Henderson,
Gene C. Hilton,
Shuay-Pwu Patty Ho
, et al. (34 additional authors not shown)
Abstract:
Advances in cosmic microwave background (CMB) science depend on increasing the number of sensitive detectors observing the sky. New instruments deploy large arrays of superconducting transition-edge sensor (TES) bolometers tiled densely into ever larger focal planes. High multiplexing factors reduce the thermal loading on the cryogenic receivers and simplify their design. We present the design of…
▽ More
Advances in cosmic microwave background (CMB) science depend on increasing the number of sensitive detectors observing the sky. New instruments deploy large arrays of superconducting transition-edge sensor (TES) bolometers tiled densely into ever larger focal planes. High multiplexing factors reduce the thermal loading on the cryogenic receivers and simplify their design. We present the design of focal-plane modules with an order of magnitude higher multiplexing factor than has previously been achieved with TES bolometers. We focus on the novel cold readout component, which employs microwave SQUID multiplexing ($μ$mux). Simons Observatory will use 49 modules containing 60,000 bolometers to make exquisitely sensitive measurements of the CMB. We validate the focal-plane module design, presenting measurements of the readout component with and without a prototype detector array of 1728 polarization-sensitive bolometers coupled to feedhorns. The readout component achieves a $95\%$ yield and a 910 multiplexing factor. The median white noise of each readout channel is 65 $\mathrm{pA/\sqrt{Hz}}$. This impacts the projected SO mapping speed by $< 8\%$, which is less than is assumed in the sensitivity projections. The results validate the full functionality of the module. We discuss the measured performance in the context of SO science requirements, which are exceeded.
△ Less
Submitted 16 September, 2021; v1 submitted 28 June, 2021;
originally announced June 2021.
-
Analyzing Monotonic Linear Interpolation in Neural Network Loss Landscapes
Authors:
James Lucas,
Juhan Bae,
Michael R. Zhang,
Stanislav Fort,
Richard Zemel,
Roger Grosse
Abstract:
Linear interpolation between initial neural network parameters and converged parameters after training with stochastic gradient descent (SGD) typically leads to a monotonic decrease in the training objective. This Monotonic Linear Interpolation (MLI) property, first observed by Goodfellow et al. (2014) persists in spite of the non-convex objectives and highly non-linear training dynamics of neural…
▽ More
Linear interpolation between initial neural network parameters and converged parameters after training with stochastic gradient descent (SGD) typically leads to a monotonic decrease in the training objective. This Monotonic Linear Interpolation (MLI) property, first observed by Goodfellow et al. (2014) persists in spite of the non-convex objectives and highly non-linear training dynamics of neural networks. Extending this work, we evaluate several hypotheses for this property that, to our knowledge, have not yet been explored. Using tools from differential geometry, we draw connections between the interpolated paths in function space and the monotonicity of the network - providing sufficient conditions for the MLI property under mean squared error. While the MLI property holds under various settings (e.g. network architectures and learning problems), we show in practice that networks violating the MLI property can be produced systematically, by encouraging the weights to move far from initialization. The MLI property raises important questions about the loss landscape geometry of neural networks and highlights the need to further study their global properties.
△ Less
Submitted 23 April, 2021; v1 submitted 22 April, 2021;
originally announced April 2021.
-
The Simons Observatory: the Large Aperture Telescope (LAT)
Authors:
Zhilei Xu,
Shunsuke Adachi,
Peter Ade,
J. A. Beall,
Tanay Bhandarkar,
J. Richard Bond,
Grace E. Chesmore,
Yuji Chinone,
Steve K. Choi,
Jake A. Connors,
Gabriele Coppi,
Nicholas F. Cothard,
Kevin D. Crowley,
Mark Devlin,
Simon Dicker,
Bradley Dober,
Shannon M. Duff,
Nicholas Galitzki,
Patricio A. Gallardo,
Joseph E. Golec,
Jon E. Gudmundsson,
Saianeesh K. Haridas,
Kathleen Harrington,
Carlos Hervias-Caimapo,
Shuay-Pwu Patty Ho
, et al. (35 additional authors not shown)
Abstract:
The Simons Observatory (SO) is a Cosmic Microwave Background (CMB) experiment to observe the microwave sky in six frequency bands from 30GHz to 290GHz. The Observatory -- at $\sim$5200m altitude -- comprises three Small Aperture Telescopes (SATs) and one Large Aperture Telescope (LAT) at the Atacama Desert, Chile. This research note describes the design and current status of the LAT along with its…
▽ More
The Simons Observatory (SO) is a Cosmic Microwave Background (CMB) experiment to observe the microwave sky in six frequency bands from 30GHz to 290GHz. The Observatory -- at $\sim$5200m altitude -- comprises three Small Aperture Telescopes (SATs) and one Large Aperture Telescope (LAT) at the Atacama Desert, Chile. This research note describes the design and current status of the LAT along with its future timeline.
△ Less
Submitted 29 April, 2021; v1 submitted 19 April, 2021;
originally announced April 2021.
-
The Simons Observatory Large Aperture Telescope Receiver
Authors:
Ningfeng Zhu,
Tanay Bhandarkar,
Gabriele Coppi,
Anna M. Kofman,
John L. Orlowski-Scherer,
Zhilei Xu,
Shunsuke Adachi,
Peter Ade,
Simone Aiola,
Jason Austermann,
Andrew O. Bazarko,
James A. Beall,
Sanah Bhimani,
J. Richard Bond,
Grace E. Chesmore,
Steve K. Choi,
Jake Connors,
Nicholas F. Cothard,
Mark Devlin,
Simon Dicker,
Bradley Dober,
Cody J. Duell,
Shannon M. Duff,
Rolando Dünner,
Giulio Fabbian
, et al. (46 additional authors not shown)
Abstract:
The Simons Observatory (SO) Large Aperture Telescope Receiver (LATR) will be coupled to the Large Aperture Telescope located at an elevation of 5,200 m on Cerro Toco in Chile. The resulting instrument will produce arcminute-resolution millimeter-wave maps of half the sky with unprecedented precision. The LATR is the largest cryogenic millimeter-wave camera built to date with a diameter of 2.4 m an…
▽ More
The Simons Observatory (SO) Large Aperture Telescope Receiver (LATR) will be coupled to the Large Aperture Telescope located at an elevation of 5,200 m on Cerro Toco in Chile. The resulting instrument will produce arcminute-resolution millimeter-wave maps of half the sky with unprecedented precision. The LATR is the largest cryogenic millimeter-wave camera built to date with a diameter of 2.4 m and a length of 2.6 m. It cools 1200 kg of material to 4 K and 200 kg to 100 mk, the operating temperature of the bolometric detectors with bands centered around 27, 39, 93, 145, 225, and 280 GHz. Ultimately, the LATR will accommodate 13 40 cm diameter optics tubes, each with three detector wafers and a total of 62,000 detectors. The LATR design must simultaneously maintain the optical alignment of the system, control stray light, provide cryogenic isolation, limit thermal gradients, and minimize the time to cool the system from room temperature to 100 mK. The interplay between these competing factors poses unique challenges. We discuss the trade studies involved with the design, the final optimization, the construction, and ultimate performance of the system.
△ Less
Submitted 3 March, 2021;
originally announced March 2021.
-
Darboux families and the classification of real four-dimensional indecomposable coboundary Lie bialgebras
Authors:
J. de Lucas,
D. Wysocki
Abstract:
This work introduces a new concept, the so-called Darboux family, which is employed to determine, to analyse geometrically, and to classify up to Lie algebra automorphisms, in a relatively easy manner, coboundary Liebialgebras on real four-dimensional indecomposable Lie algebras. The Darboux family notion can be consideredas a generalisation of the Darboux polynomial for a vector field. The classi…
▽ More
This work introduces a new concept, the so-called Darboux family, which is employed to determine, to analyse geometrically, and to classify up to Lie algebra automorphisms, in a relatively easy manner, coboundary Liebialgebras on real four-dimensional indecomposable Lie algebras. The Darboux family notion can be consideredas a generalisation of the Darboux polynomial for a vector field. The classification of $r$-matrices and solutions to classical Yang-Baxter equations for real four-dimensional indecomposable Lie algebras is also given in detail. Our methods can further be applied to general, even higher-dimensional, Lie algebras. As a byproduct, a method to obtain matrix representations of certain Lie algebras with a non-trivial center is developed.
△ Less
Submitted 11 February, 2021;
originally announced February 2021.
-
Poisson-Hopf deformations of Lie-Hamilton systems revisited: deformed superposition rules and applications to the oscillator algebra
Authors:
Angel Ballesteros,
Rutwig Campoamor-Stursberg,
Eduardo Fernandez-Saiz,
Francisco J. Herranz,
Javier de Lucas
Abstract:
The formalism for Poisson-Hopf (PH) deformations of Lie-Hamilton systems is refined in one of its crucial points concerning applications, namely the obtention of effective and computationally feasible PH deformed superposition rules for prolonged PH deformations of Lie-Hamilton systems. The two new notions here proposed are a generalization of the standard superposition rules and the concept of di…
▽ More
The formalism for Poisson-Hopf (PH) deformations of Lie-Hamilton systems is refined in one of its crucial points concerning applications, namely the obtention of effective and computationally feasible PH deformed superposition rules for prolonged PH deformations of Lie-Hamilton systems. The two new notions here proposed are a generalization of the standard superposition rules and the concept of diagonal prolongations for Lie systems, which are consistently recovered under the non-deformed limit. Using a technique from superintegrability theory, we obtain a maximal number of functionally independent constants of the motion for a generic prolonged PH deformation of a Lie-Hamilton system, from which a simplified deformed superposition rule can be derived. As an application, explicit deformed superposition rules for prolonged PH deformations of Lie-Hamilton systems based on the oscillator Lie algebra ${h}_4$ are computed. Moreover, by making use that the main structural properties of the book subalgebra ${b}_2$ of ${h}_4$ are preserved under the PH deformation, we consider prolonged PH deformations based on ${b}_2$ as restrictions of those for ${h}_4$-Lie-Hamilton systems, thus allowing the study of prolonged PH deformations of the complex Bernoulli equations, for which both the constants of the motion and the deformed superposition rules are explicitly presented.
△ Less
Submitted 27 March, 2021; v1 submitted 3 January, 2021;
originally announced January 2021.