Search | arXiv e-print repository

Imagen 3

Authors: Imagen-Team-Google, :, Jason Baldridge, Jakob Bauer, Mukul Bhutani, Nicole Brichtova, Andrew Bunner, Kelvin Chan, Yichang Chen, Sander Dieleman, Yuqing Du, Zach Eaton-Rosen, Hongliang Fei, Nando de Freitas, Yilin Gao, Evgeny Gladchenko, Sergio Gómez Colmenarejo, Mandy Guo, Alex Haig, Will Hawkins, Hexiang Hu, Huilian Huang, Tobenna Peter Igwe, Christos Kaplanis, Siavash Khodadadeh , et al. (227 additional authors not shown)

Abstract: We introduce Imagen 3, a latent diffusion model that generates high quality images from text prompts. We describe our quality and responsibility evaluations. Imagen 3 is preferred over other state-of-the-art (SOTA) models at the time of evaluation. In addition, we discuss issues around safety and representation, as well as methods we used to minimize the potential harm of our models. We introduce Imagen 3, a latent diffusion model that generates high quality images from text prompts. We describe our quality and responsibility evaluations. Imagen 3 is preferred over other state-of-the-art (SOTA) models at the time of evaluation. In addition, we discuss issues around safety and representation, as well as methods we used to minimize the potential harm of our models. △ Less

Submitted 13 August, 2024; originally announced August 2024.

arXiv:2406.11757 [pdf, other]

STAR: SocioTechnical Approach to Red Teaming Language Models

Authors: Laura Weidinger, John Mellor, Bernat Guillen Pegueroles, Nahema Marchal, Ravin Kumar, Kristian Lum, Canfer Akbulut, Mark Diaz, Stevie Bergman, Mikel Rodriguez, Verena Rieser, William Isaac

Abstract: This research introduces STAR, a sociotechnical framework that improves on current best practices for red teaming safety of large language models. STAR makes two key contributions: it enhances steerability by generating parameterised instructions for human red teamers, leading to improved coverage of the risk surface. Parameterised instructions also provide more detailed insights into model failur… ▽ More This research introduces STAR, a sociotechnical framework that improves on current best practices for red teaming safety of large language models. STAR makes two key contributions: it enhances steerability by generating parameterised instructions for human red teamers, leading to improved coverage of the risk surface. Parameterised instructions also provide more detailed insights into model failures at no increased cost. Second, STAR improves signal quality by matching demographics to assess harms for specific groups, resulting in more sensitive annotations. STAR further employs a novel step of arbitration to leverage diverse viewpoints and improve label reliability, treating disagreement not as noise but as a valuable contribution to signal quality. △ Less

Submitted 6 August, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

Comments: 8 pages, 5 figures, 5 pages appendix. * denotes equal contribution

arXiv:2403.16901 [pdf, other]

Hyperpixels: Pixel Filter Arrays of Multivariate Optical Elements for Optimized Spectral Imaging

Authors: Calum Williams, Richard Cousins, Christopher J. Mellor, Sarah E. Bohndiek, George S. D. Gordon

Abstract: We introduce the concept of `hyperpixels' in which each element of a pixel filter array (suitable for CMOS image sensor integration) has a spectral transmission tailored to a target spectral component expected in application-specific scenes. These are analogous to arrays of multivariate optical elements that could be used for sensing specific analytes. Spectral tailoring is achieved by engineering… ▽ More We introduce the concept of `hyperpixels' in which each element of a pixel filter array (suitable for CMOS image sensor integration) has a spectral transmission tailored to a target spectral component expected in application-specific scenes. These are analogous to arrays of multivariate optical elements that could be used for sensing specific analytes. Spectral tailoring is achieved by engineering the heights of multiple sub-pixel Fabry-Perot resonators that cover each pixel area. We first present a design approach for hyperpixels, based on a matched filter concept and, as an exemplar, design a set of 4 hyperpixels tailored to optimally discriminate between 4 spectral reflectance targets. Next, we fabricate repeating 2x2 pixel filter arrays of these designs, alongside repeating 2x2 arrays of an optimal bandpass filters, perform both spectral and imaging characterization. Experimentally measured hyperpixel transmission spectra show a 2.4x reduction in unmixing matrix condition number (p=0.031) compared to the optimal band-pass set. Imaging experiments using the filter arrays with a monochrome sensor achieve a 3.47x reduction in unmixing matrix condition number (p=0.020) compared to the optimal band-pass set. This demonstrates the utility of the hyperpixel approach and shows its superiority even over the optimal bandpass case. We expect that with further improvements in design and fabrication processes increased performance may be obtained. Because the hyperpixels are straightforward to customize, fabricate and can be placed atop monochrome sensors, this approach is highly versatile and could be adapted to a wide range of real-time imaging applications which are limited by low SNR including micro-endoscopy, capsule endoscopy, industrial inspection and machine vision. △ Less

Submitted 25 March, 2024; originally announced March 2024.

arXiv:2401.14551 [pdf, other]

Single- and multi-layer micro-scale diffractive lens fabrication for fiber imaging probes with versatile depth-of-field

Authors: Fei He, Rafael Fuentes-Dominguez, Richard Cousins, Christopher J. Mellor, Jennifer K. Barton, George S. D. Gordon

Abstract: Hair-thin optical fiber endoscopes have opened up new paradigms for advanced imaging applications in vivo. In certain applications, such as optical coherence tomography (OCT), light-shaping structures may be required on fiber facets to generate needle-like Bessel beams with large depth-of-field, while in others shorter depths of field with high lateral resolutions are preferable. In this paper, we… ▽ More Hair-thin optical fiber endoscopes have opened up new paradigms for advanced imaging applications in vivo. In certain applications, such as optical coherence tomography (OCT), light-shaping structures may be required on fiber facets to generate needle-like Bessel beams with large depth-of-field, while in others shorter depths of field with high lateral resolutions are preferable. In this paper, we demonstrate a novel method to fabricate light-shaping structures on optical fibres, achieved via bonding encapsulated planar diffractive lenses onto fiber facets. Diffractive metallic structures have the advantages of being simple to design, fabricate and transfer, and our encapsulation approach is scalable to multi-layer stacks. As a demonstration, we design and transfer a Fresnel zone plate and a diffractive axicon onto fiber facets, and show that the latter device generates a needle-like Bessel beam with 350 mu m focal depth. We also evaluate the imaging performance of both devices and show that the axicon fiber is able to maintain focussed images of a USAF resolution target over a 150 mu m distance. Finally, we fabricate a two-layer stack of Fresnel zone plates on a fiber and characterise the modified beam profile and demonstrate good imaging performance. We anticipate our fabrication approach could enable multi-functional complex optical structures (e.g. using plasmonics, polarization control) to be integrated onto fibers for ultra-thin advanced imaging and sensing. △ Less

Submitted 25 January, 2024; originally announced January 2024.

arXiv:2312.11805 [pdf, other]

Gemini: A Family of Highly Capable Multimodal Models

Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks - notably being the first model to achieve human-expert performance on the well-studied exam benchmark MMLU, and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of the Gemini family in cross-modal reasoning and language understanding will enable a wide variety of use cases. We discuss our approach toward post-training and deploying Gemini models responsibly to users through services including Gemini, Gemini Advanced, Google AI Studio, and Cloud Vertex AI. △ Less

Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

arXiv:2310.16057 [pdf]

Influenza Hospitalisations in England during the 2022/23 Season: do different data sources drive divergence in modelled waves? A comparison of surveillance and administrative data

Authors: Jonathon Mellor, Rachel Christie, James Guilder, Robert S Paton, Suzanne Elgohari, Conall Watson, Sarah Deeny, Thomas Ward

Abstract: Accurate and representative data is vital for precisely reporting the impact of influenza in healthcare systems. Northern hemisphere winter 2022/23 experienced the most substantial influenza wave since the COVID-19 pandemic began in 2020. Simultaneously, new data streams become available within health services because of the pandemic. Comparing these data, surveillance and administrative, supports… ▽ More Accurate and representative data is vital for precisely reporting the impact of influenza in healthcare systems. Northern hemisphere winter 2022/23 experienced the most substantial influenza wave since the COVID-19 pandemic began in 2020. Simultaneously, new data streams become available within health services because of the pandemic. Comparing these data, surveillance and administrative, supports the accurate monitoring of population level disease trends. We analysed admissions rates per capita from four different collection mechanisms covering National Health Service hospital Trusts in England over the winter 2022/23 wave. We adjust for difference in reporting and extracted key epidemic characteristics including the maximum admission rate, peak timing, cumulative season admissions and growth rates by fitting generalised additive models at national and regional levels. By modelling the admission rates per capita across surveillance and administrative data systems we show that different data measuring the epidemic produce different estimates of key quantities. Nationally and in most regions the data correspond well for the maximum admission rate, date of peak and growth rate, however, in subnational analysis discrepancies in estimates arose, particularly for the cumulative admission rate. This research shows that the choice of data used to measure seasonal influenza epidemics can influence analysis substantially at sub-national levels. For the admission rate per capita there is comparability in the sentinel surveillance approach (which has other important functions), rapid situational reports, operational databases and time lagged administrative data giving assurance in their combined value. Utilising multiple sources of data aids understanding of the impact of seasonal influenza epidemics in the population. △ Less

Submitted 16 October, 2023; originally announced October 2023.

arXiv:2306.05762 [pdf]

Real-time COVID-19 hospital admissions forecasting with leading indicators and ensemble methods in England

Authors: Jonathon Mellor, Rachel Christie, Robert S Paton, Rhianna Leslie, Maria Tang, Martyn Fyles, Sarah Deeny, Thomas Ward, Christopher E Overton

Abstract: Hospitalisations from COVID-19 with Omicron sub-lineages have put a sustained pressure on the English healthcare system. Understanding the expected healthcare demand enables more effective and timely planning from public health. We collect syndromic surveillance sources, which include online search data, NHS 111 telephonic and online triages. Incorporating this data we explore generalised additive… ▽ More Hospitalisations from COVID-19 with Omicron sub-lineages have put a sustained pressure on the English healthcare system. Understanding the expected healthcare demand enables more effective and timely planning from public health. We collect syndromic surveillance sources, which include online search data, NHS 111 telephonic and online triages. Incorporating this data we explore generalised additive models, generalised linear mixed-models, penalised generalised linear models and model ensemble methods to forecast over a two-week forecast horizon at an NHS Trust level. Furthermore, we showcase how model combinations improve forecast scoring through a mean ensemble, weighted ensemble, and ensemble by regression. Validated over multiple Omicron waves, at different spatial scales, we show that leading indicators can improve performance of forecasting models, particularly at epidemic changepoints. Using a variety of scoring rules, we show that ensemble approaches outperformed all individual models, providing higher performance at a 21-day window than the corresponding individual models at 14-days. We introduce a modelling structure used by public health officials in England in 2022 to inform NHS healthcare strategy and policy decision making. This paper explores the significance of ensemble methods to improve forecasting performance and how novel syndromic surveillance can be practically applied in epidemic forecasting. △ Less

Submitted 16 August, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

Comments: arXiv admin note: substantial text overlap with arXiv:2303.12037

arXiv:2305.09952 [pdf]

doi 10.1038/s41598-023-50502-9

Cathodoluminescence spectroscopy of monolayer hexagonal boron nitride

Authors: K. Shima, T. S. Cheng, C. J. Mellor, P. H. Beton, C. Elias, P. Valvin, B. Gil, G. Cassabois, S. V. Novikov, S. F. Chichibu

Abstract: Cathodoluminescence (CL) spectroscopy is a powerful technique for studying emission properties of optoelectronic materials because CL is free from excitable bandgap limits and from ambiguous signals due to simple light scattering and resonant Raman scattering potentially involved in the photoluminescence (PL) spectra. However, direct CL measurements of atomically thin two-dimensional materials, su… ▽ More Cathodoluminescence (CL) spectroscopy is a powerful technique for studying emission properties of optoelectronic materials because CL is free from excitable bandgap limits and from ambiguous signals due to simple light scattering and resonant Raman scattering potentially involved in the photoluminescence (PL) spectra. However, direct CL measurements of atomically thin two-dimensional materials, such as transition metal dichalcogenides and hexagonal boron nitride (hBN), have been difficult due to the small excitation volume that interacts with high-energy electron beams (e-beams). Herein, distinct CL signals from a monolayer hBN, namely mBN, epitaxial film grown on a highly oriented pyrolytic graphite substrate are shown by using a home-made CL system capable of large-area and surface-sensitive excitation by an e-beam. The spatially resolved CL spectra at 13 K exhibited a predominant 5.5-eV emission band, which has been ascribed to originate from multilayered aggregates of hBN, markedly at thicker areas formed on the step edges of the substrate. Conversely, a faint peak at 6.04 eV was routinely observed from atomically flat areas. Since the energy agreed with the PL peak of 6.05 eV at 10 K that has been assigned as being due to the recombination of phonon-assisted direct excitons of mBN by Elias et al. [Nat. Commun. 10, 2639 (2019)], the CL peak at 6.04 eV is attributed to originate from the mBN epilayer. The CL results support the transition from indirect bandgap in bulk hBN to direct bandgap in mBN, in analogy with molybdenum disulfide. The results also encourage to elucidate emission properties of other low-dimensional materials with reduced excitation volumes by using the present CL configuration. △ Less

Submitted 17 May, 2023; originally announced May 2023.

Comments: 7 pages, 3 figures

arXiv:2303.12037 [pdf]

Understanding the leading indicators of hospital admissions from COVID-19 across successive waves in the UK

Authors: Jonathon Mellor, Christopher E Overton, Martyn Fyles, Liam Chawner, James Baxter, Tarrion Baird, Thomas Ward

Abstract: Following the UK Government's Living with COVID-19 Strategy and the end of universal testing, hospital admissions are an increasingly important measure of COVID-19 pandemic pressure. Understanding leading indicators of admissions at National Health Service (NHS) Trust, regional and national geographies help health services plan capacity needs and prepare for ongoing pressures. We explored the spat… ▽ More Following the UK Government's Living with COVID-19 Strategy and the end of universal testing, hospital admissions are an increasingly important measure of COVID-19 pandemic pressure. Understanding leading indicators of admissions at National Health Service (NHS) Trust, regional and national geographies help health services plan capacity needs and prepare for ongoing pressures. We explored the spatio-temporal relationships of leading indicators of hospital pressure across successive waves of SARS-CoV-2 incidence in England. This includes an analysis of internet search volume values from Google Trends, NHS triage calls and online queries, the NHS COVID-19 App, lateral flow devices and the ZOE App. Data sources were analysed for their feasibility as leading indicators using linear and non-linear methods; granger causality, cross correlations and dynamic time warping at fine spatial scales. Consistent temporal and spatial relationships were found for some of the leading indicators assessed across resurgent waves of COVID-19. Google Trends and NHS queries consistently led admissions in over 70% of Trusts, with lead times ranging from 5-20 days, whereas an inconsistent relationship was found for the ZOE app, NHS COVID-19 App, and rapid testing, that diminished with granularity, showing limited autocorrelation of leads between -7 to 7 days. This work shows that novel syndromic surveillance data has utility for understanding the expected hospital burden at fine spatial scales. The analysis shows at low level geographies that some surveillance sources can predict hospital admissions, though care must be taken in relying on the lead times and consistency between waves. △ Less

Submitted 16 August, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

arXiv:2302.11904 [pdf]

Forecasting influenza hospital admissions within English sub-regions using hierarchical generalised additive models

Authors: Jonathon Mellor, Rachel Christie, Christopher E Overton, Robert S Paton, Rhianna Leslie, Maria Tang, Sarah Deeny, Thomas Ward

Abstract: Background: Seasonal influenza causes a substantial burden on healthcare services over the winter period when these systems are already under pressure. Policies during the COVID-19 pandemic supressed the transmission of season influenza, making the timing and magnitude of a potential resurgence difficult to predict. Methods: We developed a hierarchical generalised additive model (GAM) for the sh… ▽ More Background: Seasonal influenza causes a substantial burden on healthcare services over the winter period when these systems are already under pressure. Policies during the COVID-19 pandemic supressed the transmission of season influenza, making the timing and magnitude of a potential resurgence difficult to predict. Methods: We developed a hierarchical generalised additive model (GAM) for the short-term forecasting of hospital admissions with a positive test for the influenza virus sub-regionally across England. The model incorporates a multi-level structure of spatio-temporal splines, weekly seasonality, and spatial correlation. Using multiple performance metrics including interval score, coverage, bias, and median absolute error, the predictive performance is evaluated for the 2022/23 seasonal wave. Performance is measured against an autoregressive integrated moving average (ARIMA) time series model. Results: The GAM method outperformed the ARIMA model across scoring rules at both high and low-level geographies, and across the different phases of the epidemic wave including the turning point. The performance of the GAM with a 14-day forecast horizon was comparable in error to the ARIMA at 7 days. The performance of the GAM is found to be most sensitive to the flexibility of the smoothing function that measures the national epidemic trend. Interpretation: This study introduces a novel approach to short-term forecasting of hospital admissions with influenza using hierarchical, spatial, and temporal components. The model is data-driven and practical to deploy using information realistically available at time of prediction, addressing key limitations of epidemic forecasting approaches. This model was used across the winter for healthcare operational planning by the UK Health Security Agency and the National Health Service in England. △ Less

Submitted 23 February, 2023; originally announced February 2023.

arXiv:2212.08571 [pdf, other]

Statistical Design and Analysis for Robust Machine Learning: A Case Study from COVID-19

Authors: Davide Pigoli, Kieran Baker, Jobie Budd, Lorraine Butler, Harry Coppock, Sabrina Egglestone, Steven G. Gilmour, Chris Holmes, David Hurley, Radka Jersakova, Ivan Kiskin, Vasiliki Koutra, Jonathon Mellor, George Nicholson, Joe Packham, Selina Patel, Richard Payne, Stephen J. Roberts, Björn W. Schuller, Ana Tendero-Cañadas, Tracey Thornley, Alexander Titcomb

Abstract: Since early in the coronavirus disease 2019 (COVID-19) pandemic, there has been interest in using artificial intelligence methods to predict COVID-19 infection status based on vocal audio signals, for example cough recordings. However, existing studies have limitations in terms of data collection and of the assessment of the performances of the proposed predictive models. This paper rigorously ass… ▽ More Since early in the coronavirus disease 2019 (COVID-19) pandemic, there has been interest in using artificial intelligence methods to predict COVID-19 infection status based on vocal audio signals, for example cough recordings. However, existing studies have limitations in terms of data collection and of the assessment of the performances of the proposed predictive models. This paper rigorously assesses state-of-the-art machine learning techniques used to predict COVID-19 infection status based on vocal audio signals, using a dataset collected by the UK Health Security Agency. This dataset includes acoustic recordings and extensive study participant meta-data. We provide guidelines on testing the performance of methods to classify COVID-19 infection status based on acoustic features and we discuss how these can be extended more generally to the development and assessment of predictive methods based on public health datasets. △ Less

Submitted 27 February, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

arXiv:2212.08570 [pdf, other]

Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers

Authors: Harry Coppock, George Nicholson, Ivan Kiskin, Vasiliki Koutra, Kieran Baker, Jobie Budd, Richard Payne, Emma Karoune, David Hurley, Alexander Titcomb, Sabrina Egglestone, Ana Tendero Cañadas, Lorraine Butler, Radka Jersakova, Jonathon Mellor, Selina Patel, Tracey Thornley, Peter Diggle, Sylvia Richardson, Josef Packham, Björn W. Schuller, Davide Pigoli, Steven Gilmour, Stephen Roberts, Chris Holmes

Abstract: Recent work has reported that AI classifiers trained on audio recordings can accurately predict severe acute respiratory syndrome coronavirus 2 (SARSCoV2) infection status. Here, we undertake a large scale study of audio-based deep learning classifiers, as part of the UK governments pandemic response. We collect and analyse a dataset of audio recordings from 67,842 individuals with linked metadata… ▽ More Recent work has reported that AI classifiers trained on audio recordings can accurately predict severe acute respiratory syndrome coronavirus 2 (SARSCoV2) infection status. Here, we undertake a large scale study of audio-based deep learning classifiers, as part of the UK governments pandemic response. We collect and analyse a dataset of audio recordings from 67,842 individuals with linked metadata, including reverse transcription polymerase chain reaction (PCR) test outcomes, of whom 23,514 tested positive for SARS CoV 2. Subjects were recruited via the UK governments National Health Service Test-and-Trace programme and the REal-time Assessment of Community Transmission (REACT) randomised surveillance survey. In an unadjusted analysis of our dataset AI classifiers predict SARS-CoV-2 infection status with high accuracy (Receiver Operating Characteristic Area Under the Curve (ROCAUC) 0.846 [0.838, 0.854]) consistent with the findings of previous studies. However, after matching on measured confounders, such as age, gender, and self reported symptoms, our classifiers performance is much weaker (ROC-AUC 0.619 [0.594, 0.644]). Upon quantifying the utility of audio based classifiers in practical settings, we find them to be outperformed by simple predictive scores based on user reported symptoms. △ Less

Submitted 2 March, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

arXiv:2212.07738 [pdf]

A large-scale and PCR-referenced vocal audio dataset for COVID-19

Authors: Jobie Budd, Kieran Baker, Emma Karoune, Harry Coppock, Selina Patel, Ana Tendero Cañadas, Alexander Titcomb, Richard Payne, David Hurley, Sabrina Egglestone, Lorraine Butler, Jonathon Mellor, George Nicholson, Ivan Kiskin, Vasiliki Koutra, Radka Jersakova, Rachel A. McKendry, Peter Diggle, Sylvia Richardson, Björn W. Schuller, Steven Gilmour, Davide Pigoli, Stephen Roberts, Josef Packham, Tracey Thornley , et al. (1 additional authors not shown)

Abstract: The UK COVID-19 Vocal Audio Dataset is designed for the training and evaluation of machine learning models that classify SARS-CoV-2 infection status or associated respiratory symptoms using vocal audio. The UK Health Security Agency recruited voluntary participants through the national Test and Trace programme and the REACT-1 survey in England from March 2021 to March 2022, during dominant transmi… ▽ More The UK COVID-19 Vocal Audio Dataset is designed for the training and evaluation of machine learning models that classify SARS-CoV-2 infection status or associated respiratory symptoms using vocal audio. The UK Health Security Agency recruited voluntary participants through the national Test and Trace programme and the REACT-1 survey in England from March 2021 to March 2022, during dominant transmission of the Alpha and Delta SARS-CoV-2 variants and some Omicron variant sublineages. Audio recordings of volitional coughs, exhalations, and speech were collected in the 'Speak up to help beat coronavirus' digital survey alongside demographic, self-reported symptom and respiratory condition data, and linked to SARS-CoV-2 test results. The UK COVID-19 Vocal Audio Dataset represents the largest collection of SARS-CoV-2 PCR-referenced audio recordings to date. PCR results were linked to 70,794 of 72,999 participants and 24,155 of 25,776 positive cases. Respiratory symptoms were reported by 45.62% of participants. This dataset has additional potential uses for bioacoustics research, with 11.30% participants reporting asthma, and 27.20% with linked influenza PCR test results. △ Less

Submitted 3 November, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

Comments: 39 pages, 4 figures

arXiv:2209.14375 [pdf, other]

Improving alignment of dialogue agents via targeted human judgements

Authors: Amelia Glaese, Nat McAleese, Maja Trębacz, John Aslanides, Vlad Firoiu, Timo Ewalds, Maribeth Rauh, Laura Weidinger, Martin Chadwick, Phoebe Thacker, Lucy Campbell-Gillingham, Jonathan Uesato, Po-Sen Huang, Ramona Comanescu, Fan Yang, Abigail See, Sumanth Dathathri, Rory Greig, Charlie Chen, Doug Fritz, Jaume Sanchez Elias, Richard Green, Soňa Mokrá, Nicholas Fernando, Boxi Wu , et al. (9 additional authors not shown)

Abstract: We present Sparrow, an information-seeking dialogue agent trained to be more helpful, correct, and harmless compared to prompted language model baselines. We use reinforcement learning from human feedback to train our models with two new additions to help human raters judge agent behaviour. First, to make our agent more helpful and harmless, we break down the requirements for good dialogue into na… ▽ More We present Sparrow, an information-seeking dialogue agent trained to be more helpful, correct, and harmless compared to prompted language model baselines. We use reinforcement learning from human feedback to train our models with two new additions to help human raters judge agent behaviour. First, to make our agent more helpful and harmless, we break down the requirements for good dialogue into natural language rules the agent should follow, and ask raters about each rule separately. We demonstrate that this breakdown enables us to collect more targeted human judgements of agent behaviour and allows for more efficient rule-conditional reward models. Second, our agent provides evidence from sources supporting factual claims when collecting preference judgements over model statements. For factual questions, evidence provided by Sparrow supports the sampled response 78% of the time. Sparrow is preferred more often than baselines while being more resilient to adversarial probing by humans, violating our rules only 8% of the time when probed. Finally, we conduct extensive analyses showing that though our model learns to follow our rules it can exhibit distributional biases. △ Less

Submitted 28 September, 2022; originally announced September 2022.

arXiv:2206.11769 [pdf, other]

Single-phase deep learning in cortico-cortical networks

Authors: Will Greedy, Heng Wei Zhu, Joseph Pemberton, Jack Mellor, Rui Ponte Costa

Abstract: The error-backpropagation (backprop) algorithm remains the most common solution to the credit assignment problem in artificial neural networks. In neuroscience, it is unclear whether the brain could adopt a similar strategy to correctly modify its synapses. Recent models have attempted to bridge this gap while being consistent with a range of experimental observations. However, these models are ei… ▽ More The error-backpropagation (backprop) algorithm remains the most common solution to the credit assignment problem in artificial neural networks. In neuroscience, it is unclear whether the brain could adopt a similar strategy to correctly modify its synapses. Recent models have attempted to bridge this gap while being consistent with a range of experimental observations. However, these models are either unable to effectively backpropagate error signals across multiple layers or require a multi-phase learning process, neither of which are reminiscent of learning in the brain. Here, we introduce a new model, Bursting Cortico-Cortical Networks (BurstCCN), which solves these issues by integrating known properties of cortical networks namely bursting activity, short-term plasticity (STP) and dendrite-targeting interneurons. BurstCCN relies on burst multiplexing via connection-type-specific STP to propagate backprop-like error signals within deep cortical networks. These error signals are encoded at distal dendrites and induce burst-dependent plasticity as a result of excitatory-inhibitory top-down inputs. First, we demonstrate that our model can effectively backpropagate errors through multiple layers using a single-phase learning process. Next, we show both empirically and analytically that learning in our model approximates backprop-derived gradients. Finally, we demonstrate that our model is capable of learning complex image classification tasks (MNIST and CIFAR-10). Overall, our results suggest that cortical features across sub-cellular, cellular, microcircuit and systems levels jointly underlie single-phase efficient deep learning in the brain. △ Less

Submitted 24 October, 2022; v1 submitted 23 June, 2022; originally announced June 2022.

Comments: Accepted to 36th Conference on Neural Information Processing Systems (NeurIPS 2022). 22 pages, 9 figures, 5 tables

arXiv:2206.08325 [pdf, ps, other]

Characteristics of Harmful Text: Towards Rigorous Benchmarking of Language Models

Authors: Maribeth Rauh, John Mellor, Jonathan Uesato, Po-Sen Huang, Johannes Welbl, Laura Weidinger, Sumanth Dathathri, Amelia Glaese, Geoffrey Irving, Iason Gabriel, William Isaac, Lisa Anne Hendricks

Abstract: Large language models produce human-like text that drive a growing number of applications. However, recent literature and, increasingly, real world observations, have demonstrated that these models can generate language that is toxic, biased, untruthful or otherwise harmful. Though work to evaluate language model harms is under way, translating foresight about which harms may arise into rigorous b… ▽ More Large language models produce human-like text that drive a growing number of applications. However, recent literature and, increasingly, real world observations, have demonstrated that these models can generate language that is toxic, biased, untruthful or otherwise harmful. Though work to evaluate language model harms is under way, translating foresight about which harms may arise into rigorous benchmarks is not straightforward. To facilitate this translation, we outline six ways of characterizing harmful text which merit explicit consideration when designing new benchmarks. We then use these characteristics as a lens to identify trends and gaps in existing benchmarks. Finally, we apply them in a case study of the Perspective API, a toxicity classifier that is widely used in harm benchmarks. Our characteristics provide one piece of the bridge that translates between foresight and effective evaluation. △ Less

Submitted 28 October, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

Comments: Accepted to NeurIPS 2022 Datasets and Benchmarks Track; 10 pages plus appendix

arXiv:2112.11446 [pdf, other]

Scaling Language Models: Methods, Analysis & Insights from Training Gopher

Authors: Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, Francis Song, John Aslanides, Sarah Henderson, Roman Ring, Susannah Young, Eliza Rutherford, Tom Hennigan, Jacob Menick, Albin Cassirer, Richard Powell, George van den Driessche, Lisa Anne Hendricks, Maribeth Rauh, Po-Sen Huang, Amelia Glaese, Johannes Welbl, Sumanth Dathathri, Saffron Huang, Jonathan Uesato, John Mellor , et al. (55 additional authors not shown)

Abstract: Language modelling provides a step towards intelligent communication systems by harnessing large repositories of written human knowledge to better predict and understand the world. In this paper, we present an analysis of Transformer-based language model performance across a wide range of model scales -- from models with tens of millions of parameters up to a 280 billion parameter model called Gop… ▽ More Language modelling provides a step towards intelligent communication systems by harnessing large repositories of written human knowledge to better predict and understand the world. In this paper, we present an analysis of Transformer-based language model performance across a wide range of model scales -- from models with tens of millions of parameters up to a 280 billion parameter model called Gopher. These models are evaluated on 152 diverse tasks, achieving state-of-the-art performance across the majority. Gains from scale are largest in areas such as reading comprehension, fact-checking, and the identification of toxic language, but logical and mathematical reasoning see less benefit. We provide a holistic analysis of the training dataset and model's behaviour, covering the intersection of model scale with bias and toxicity. Finally we discuss the application of language models to AI safety and the mitigation of downstream harms. △ Less

Submitted 21 January, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

Comments: 120 pages

arXiv:2112.04359 [pdf, other]

Ethical and social risks of harm from Language Models

Authors: Laura Weidinger, John Mellor, Maribeth Rauh, Conor Griffin, Jonathan Uesato, Po-Sen Huang, Myra Cheng, Mia Glaese, Borja Balle, Atoosa Kasirzadeh, Zac Kenton, Sasha Brown, Will Hawkins, Tom Stepleton, Courtney Biles, Abeba Birhane, Julia Haas, Laura Rimell, Lisa Anne Hendricks, William Isaac, Sean Legassick, Geoffrey Irving, Iason Gabriel

Abstract: This paper aims to help structure the risk landscape associated with large-scale Language Models (LMs). In order to foster advances in responsible innovation, an in-depth understanding of the potential risks posed by these models is needed. A wide range of established and anticipated risks are analysed in detail, drawing on multidisciplinary expertise and literature from computer science, linguist… ▽ More This paper aims to help structure the risk landscape associated with large-scale Language Models (LMs). In order to foster advances in responsible innovation, an in-depth understanding of the potential risks posed by these models is needed. A wide range of established and anticipated risks are analysed in detail, drawing on multidisciplinary expertise and literature from computer science, linguistics, and social sciences. We outline six specific risk areas: I. Discrimination, Exclusion and Toxicity, II. Information Hazards, III. Misinformation Harms, V. Malicious Uses, V. Human-Computer Interaction Harms, VI. Automation, Access, and Environmental Harms. The first area concerns the perpetuation of stereotypes, unfair discrimination, exclusionary norms, toxic language, and lower performance by social group for LMs. The second focuses on risks from private data leaks or LMs correctly inferring sensitive information. The third addresses risks arising from poor, false or misleading information including in sensitive domains, and knock-on risks such as the erosion of trust in shared information. The fourth considers risks from actors who try to use LMs to cause harm. The fifth focuses on risks specific to LLMs used to underpin conversational agents that interact with human users, including unsafe use, manipulation or deception. The sixth discusses the risk of environmental harm, job automation, and other challenges that may have a disparate effect on different social groups or communities. In total, we review 21 risks in-depth. We discuss the points of origin of different risks and point to potential mitigation approaches. Lastly, we discuss organisational responsibilities in implementing mitigations, and the role of collaboration and participation. We highlight directions for further research, particularly on expanding the toolkit for assessing and evaluating the outlined risks in LMs. △ Less

Submitted 8 December, 2021; originally announced December 2021.

arXiv:2109.07445 [pdf, other]

Challenges in Detoxifying Language Models

Authors: Johannes Welbl, Amelia Glaese, Jonathan Uesato, Sumanth Dathathri, John Mellor, Lisa Anne Hendricks, Kirsty Anderson, Pushmeet Kohli, Ben Coppin, Po-Sen Huang

Abstract: Large language models (LM) generate remarkably fluent text and can be efficiently adapted across NLP tasks. Measuring and guaranteeing the quality of generated text in terms of safety is imperative for deploying LMs in the real world; to this end, prior work often relies on automatic evaluation of LM toxicity. We critically discuss this approach, evaluate several toxicity mitigation strategies wit… ▽ More Large language models (LM) generate remarkably fluent text and can be efficiently adapted across NLP tasks. Measuring and guaranteeing the quality of generated text in terms of safety is imperative for deploying LMs in the real world; to this end, prior work often relies on automatic evaluation of LM toxicity. We critically discuss this approach, evaluate several toxicity mitigation strategies with respect to both automatic and human evaluation, and analyze consequences of toxicity mitigation in terms of model bias and LM quality. We demonstrate that while basic intervention strategies can effectively optimize previously established automatic metrics on the RealToxicityPrompts dataset, this comes at the cost of reduced LM coverage for both texts about, and dialects of, marginalized groups. Additionally, we find that human raters often disagree with high automatic toxicity scores after strong toxicity reduction interventions -- highlighting further the nuances involved in careful evaluation of LM toxicity. △ Less

Submitted 15 September, 2021; originally announced September 2021.

Comments: 23 pages, 6 figures, published in Findings of EMNLP 2021

ACM Class: I.2.6; I.2.7

arXiv:2107.07950 [pdf, other]

doi 10.1088/2053-1583/ac0d9c

Band gap measurements of monolayer h-BN and insights into carbon-related point defects

Authors: Ricardo Javier Peña Román, Fábio J R Costa Costa, Alberto Zobelli, Christine Elias, Pierre Valvin, Guillaume Cassabois, Bernard Gil, Alex Summerfield, Tin S Cheng, Christopher J Mellor, Peter H Beton, Sergei V Novikov, Luiz F Zagonel

Abstract: Being a flexible wide band gap semiconductor, hexagonal boron nitride (h-BN) has great potential for technological applications like efficient deep ultraviolet light sources, building block for two-dimensional heterostructures and room temperature single photon emitters in the ultraviolet and visible spectral range. To enable such applications, it is mandatory to reach a better understanding of th… ▽ More Being a flexible wide band gap semiconductor, hexagonal boron nitride (h-BN) has great potential for technological applications like efficient deep ultraviolet light sources, building block for two-dimensional heterostructures and room temperature single photon emitters in the ultraviolet and visible spectral range. To enable such applications, it is mandatory to reach a better understanding of the electronic and optical properties of h-BN and the impact of various structural defects. Despite the large efforts in the last years, aspects such as the electronic band gap value, the exciton binding energy and the effect of point defects remained elusive, particularly when considering a single monolayer. Here, we directly measured the density of states of a single monolayer of h-BN epitaxially grown on highly oriented pyrolytic graphite, by performing low temperature scanning tunneling microscopy (STM) and spectroscopy (STS). The observed h-BN electronic band gap on defect-free regions is $(6.8\pm0.2)$ eV. Using optical spectroscopy to obtain the h-BN optical band gap, the exciton binding energy is determined as being of $(0.7\pm0.2)$ eV. In addition, the locally excited cathodoluminescence and photoluminescence show complex spectra that are typically associated to intragap states related to carbon defects. Moreover, in some regions of the monolayer h-BN we identify, using STM, point defects which have intragap electronic levels around 2.0 eV below the Fermi level. △ Less

Submitted 16 July, 2021; originally announced July 2021.

Comments: 50 Pages, 8 Figures, 100+ references

Journal ref: 2D Material 8 044001 (2021)

arXiv:2102.00529 [pdf, other]

Decoupling the Role of Data, Attention, and Losses in Multimodal Transformers

Authors: Lisa Anne Hendricks, John Mellor, Rosalia Schneider, Jean-Baptiste Alayrac, Aida Nematzadeh

Abstract: Recently multimodal transformer models have gained popularity because their performance on language and vision tasks suggest they learn rich visual-linguistic representations. Focusing on zero-shot image retrieval tasks, we study three important factors which can impact the quality of learned representations: pretraining data, the attention mechanism, and loss functions. By pretraining models on s… ▽ More Recently multimodal transformer models have gained popularity because their performance on language and vision tasks suggest they learn rich visual-linguistic representations. Focusing on zero-shot image retrieval tasks, we study three important factors which can impact the quality of learned representations: pretraining data, the attention mechanism, and loss functions. By pretraining models on six datasets, we observe that dataset noise and language similarity to our downstream task are important indicators of model performance. Through architectural analysis, we learn that models with a multimodal attention mechanism can outperform deeper models with modality specific attention mechanisms. Finally, we show that successful contrastive losses used in the self-supervised learning literature do not yield similar performance gains when used in multimodal transformers △ Less

Submitted 31 January, 2021; originally announced February 2021.

Comments: pre-print of MIT Press Publication version

arXiv:2006.04647 [pdf, other]

Neural Architecture Search without Training

Authors: Joseph Mellor, Jack Turner, Amos Storkey, Elliot J. Crowley

Abstract: The time and effort involved in hand-designing deep neural networks is immense. This has prompted the development of Neural Architecture Search (NAS) techniques to automate this design. However, NAS algorithms tend to be slow and expensive; they need to train vast numbers of candidate networks to inform the search process. This could be alleviated if we could partially predict a network's trained… ▽ More The time and effort involved in hand-designing deep neural networks is immense. This has prompted the development of Neural Architecture Search (NAS) techniques to automate this design. However, NAS algorithms tend to be slow and expensive; they need to train vast numbers of candidate networks to inform the search process. This could be alleviated if we could partially predict a network's trained accuracy from its initial state. In this work, we examine the overlap of activations between datapoints in untrained networks and motivate how this can give a measure which is usefully indicative of a network's trained performance. We incorporate this measure into a simple algorithm that allows us to search for powerful networks without any training in a matter of seconds on a single GPU, and verify its effectiveness on NAS-Bench-101, NAS-Bench-201, NATS-Bench, and Network Design Spaces. Our approach can be readily combined with more expensive search methods; we examine a simple adaptation of regularised evolutionary search. Code for reproducing our experiments is available at https://github.com/BayesWatch/nas-without-training. △ Less

Submitted 11 June, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

Comments: Accepted at ICML 2021 for a long presentation

arXiv:2003.00949 [pdf]

doi 10.1038/s41563-020-00850-y

Identifying Carbon as the Source of Visible Single Photon Emission from Hexagonal Boron Nitride

Authors: Noah Mendelson, Dipankar Chugh, Jeffrey R. Reimers, Tin S. Cheng, Andreas Gottscholl, Hu Long, Christopher J. Mellor, Alex Zettl, Vladimir Dyakonov, Peter H. Beton, Sergei V. Novikov, Chennupati Jagadish, Hark Hoe Tan, Michael J. Ford, Milos Toth, Carlo Bradac, Igor Aharonovich

Abstract: Single photon emitters (SPEs) in hexagonal boron nitride (hBN) have garnered significant attention over the last few years due to their superior optical properties. However, despite the vast range of experimental results and theoretical calculations, the defect structure responsible for the observed emission has remained elusive. Here, by controlling the incorporation of impurities into hBN and by… ▽ More Single photon emitters (SPEs) in hexagonal boron nitride (hBN) have garnered significant attention over the last few years due to their superior optical properties. However, despite the vast range of experimental results and theoretical calculations, the defect structure responsible for the observed emission has remained elusive. Here, by controlling the incorporation of impurities into hBN and by comparing various synthesis methods, we provide direct evidence that the visible SPEs are carbon related. Room temperature optically detected magnetic resonance (ODMR) is demonstrated on ensembles of these defects. We also perform ion implantation experiments and confirm that only carbon implantation creates SPEs in the visible spectral range. Computational analysis of hundreds of potential carbon-based defect transitions suggest that the emission results from the negatively charged VBCN- defect, which experiences long-range out-of-plane deformations and is environmentally sensitive. Our results resolve a long-standing debate about the origin of single emitters at the visible range in hBN and will be key to deterministic engineering of these defects for quantum photonic devices. △ Less

Submitted 20 April, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

arXiv:2001.06105 [pdf, other]

Better Boosting with Bandits for Online Learning

Authors: Nikolaos Nikolaou, Joseph Mellor, Nikunj C. Oza, Gavin Brown

Abstract: Probability estimates generated by boosting ensembles are poorly calibrated because of the margin maximization nature of the algorithm. The outputs of the ensemble need to be properly calibrated before they can be used as probability estimates. In this work, we demonstrate that online boosting is also prone to producing distorted probability estimates. In batch learning, calibration is achieved by… ▽ More Probability estimates generated by boosting ensembles are poorly calibrated because of the margin maximization nature of the algorithm. The outputs of the ensemble need to be properly calibrated before they can be used as probability estimates. In this work, we demonstrate that online boosting is also prone to producing distorted probability estimates. In batch learning, calibration is achieved by reserving part of the training data for training the calibrator function. In the online setting, a decision needs to be made on each round: shall the new example(s) be used to update the parameters of the ensemble or those of the calibrator. We proceed to resolve this decision with the aid of bandit optimization algorithms. We demonstrate superior performance to uncalibrated and naively-calibrated on-line boosting ensembles in terms of probability estimation. Our proposed mechanism can be easily adapted to other tasks(e.g. cost-sensitive classification) and is robust to the choice of hyperparameters of both the calibrator and the ensemble. △ Less

Submitted 16 January, 2020; originally announced January 2020.

Comments: 44 pages, 6 figures

arXiv:1910.01007 [pdf, other]

Unsupervised Doodling and Painting with Improved SPIRAL

Authors: John F. J. Mellor, Eunbyung Park, Yaroslav Ganin, Igor Babuschkin, Tejas Kulkarni, Dan Rosenbaum, Andy Ballard, Theophane Weber, Oriol Vinyals, S. M. Ali Eslami

Abstract: We investigate using reinforcement learning agents as generative models of images (extending arXiv:1804.01118). A generative agent controls a simulated painting environment, and is trained with rewards provided by a discriminator network simultaneously trained to assess the realism of the agent's samples, either unconditional or reconstructions. Compared to prior work, we make a number of improvem… ▽ More We investigate using reinforcement learning agents as generative models of images (extending arXiv:1804.01118). A generative agent controls a simulated painting environment, and is trained with rewards provided by a discriminator network simultaneously trained to assess the realism of the agent's samples, either unconditional or reconstructions. Compared to prior work, we make a number of improvements to the architectures of the agents and discriminators that lead to intriguing and at times surprising results. We find that when sufficiently constrained, generative agents can learn to produce images with a degree of visual abstraction, despite having only ever seen real photographs (no human brush strokes). And given enough time with the painting environment, they can produce images with considerable realism. These results show that, under the right circumstances, some aspects of human drawing can emerge from simulated embodiment, without the need for external supervision, imitation or social cues. Finally, we note the framework's potential for use in creative applications. △ Less

Submitted 2 October, 2019; originally announced October 2019.

Comments: See https://learning-to-paint.github.io for an interactive version of this paper, with videos

ACM Class: I.2; I.4

arXiv:1803.00316 [pdf, other]

The K-Nearest Neighbour UCB algorithm for multi-armed bandits with covariates

Authors: Henry WJ Reeve, Joe Mellor, Gavin Brown

Abstract: In this paper we propose and explore the k-Nearest Neighbour UCB algorithm for multi-armed bandits with covariates. We focus on a setting where the covariates are supported on a metric space of low intrinsic dimension, such as a manifold embedded within a high dimensional ambient feature space. The algorithm is conceptually simple and straightforward to implement. The k-Nearest Neighbour UCB algor… ▽ More In this paper we propose and explore the k-Nearest Neighbour UCB algorithm for multi-armed bandits with covariates. We focus on a setting where the covariates are supported on a metric space of low intrinsic dimension, such as a manifold embedded within a high dimensional ambient feature space. The algorithm is conceptually simple and straightforward to implement. The k-Nearest Neighbour UCB algorithm does not require prior knowledge of the either the intrinsic dimension of the marginal distribution or the time horizon. We prove a regret bound for the k-Nearest Neighbour UCB algorithm which is minimax optimal up to logarithmic factors. In particular, the algorithm automatically takes advantage of both low intrinsic dimensionality of the marginal distribution over the covariates and low noise in the data, expressed as a margin condition. In addition, focusing on the case of bounded rewards, we give corresponding regret bounds for the k-Nearest Neighbour KL-UCB algorithm, which is an analogue of the KL-UCB algorithm adapted to the setting of multi-armed bandits with covariates. Finally, we present empirical results which demonstrate the ability of both the k-Nearest Neighbour UCB and k-Nearest Neighbour KL-UCB to take advantage of situations where the data is supported on an unknown sub-manifold of a high-dimensional feature space. △ Less

Submitted 1 March, 2018; originally announced March 2018.

Comments: To be presented at ALT 2018

Journal ref: Algorithmic Learning Theory 2018

arXiv:1401.6222 [pdf]

doi 10.1088/0953-2048/27/8/085015

Amplification of electromagnetic waves excited by a chain of propagating magnetic vortices in YBaCuO Josephson-junction arrays at 77K and above

Authors: Boris Chesca, Daniel John, Christopher J. Mellor

Abstract: When a soliton propagates in a discrete lattice it excites small-amplitude linear waves in its wake. In a dc current-biased Josephson-junction (JJ) array these manifest as electromagnetic (EM) waves excited by a (magnetic field induced) chain of propagating magnetic vortices. When the vortex velocity and the phase velocity of one of the excited EM waves match, phase-locking occurs. This produces r… ▽ More When a soliton propagates in a discrete lattice it excites small-amplitude linear waves in its wake. In a dc current-biased Josephson-junction (JJ) array these manifest as electromagnetic (EM) waves excited by a (magnetic field induced) chain of propagating magnetic vortices. When the vortex velocity and the phase velocity of one of the excited EM waves match, phase-locking occurs. This produces resonant steps in the current-voltage characteristics where amplification of EM radiation occurs. We report the first observation of phase-locking-induced amplification of EM radiation at 77K and above in JJ arrays made of high temperature superconductors. △ Less

Submitted 23 January, 2014; originally announced January 2014.

Comments: 19 pages, 5 Figures

Journal ref: Supercond. Sci. Technol. Vol.27, Page 085015, 2014

arXiv:1302.3721 [pdf, other]

Thompson Sampling in Switching Environments with Bayesian Online Change Point Detection

Authors: Joseph Mellor, Jonathan Shapiro

Abstract: Thompson Sampling has recently been shown to be optimal in the Bernoulli Multi-Armed Bandit setting[Kaufmann et al., 2012]. This bandit problem assumes stationary distributions for the rewards. It is often unrealistic to model the real world as a stationary distribution. In this paper we derive and evaluate algorithms using Thompson Sampling for a Switching Multi-Armed Bandit Problem. We propose a… ▽ More Thompson Sampling has recently been shown to be optimal in the Bernoulli Multi-Armed Bandit setting[Kaufmann et al., 2012]. This bandit problem assumes stationary distributions for the rewards. It is often unrealistic to model the real world as a stationary distribution. In this paper we derive and evaluate algorithms using Thompson Sampling for a Switching Multi-Armed Bandit Problem. We propose a Thompson Sampling strategy equipped with a Bayesian change point mechanism to tackle this problem. We develop algorithms for a variety of cases with constant switching rate: when switching occurs all arms change (Global Switching), switching occurs independently for each arm (Per-Arm Switching), when the switching rate is known and when it must be inferred from data. This leads to a family of algorithms we collectively term Change-Point Thompson Sampling (CTS). We show empirical results of the algorithm in 4 artificial environments, and 2 derived from real world data; news click-through[Yahoo!, 2011] and foreign exchange data[Dukascopy, 2012], comparing them to some other bandit algorithms. In real world data CTS is the most effective. △ Less

Submitted 15 February, 2013; originally announced February 2013.

Comments: A version will appear in the Sixteenth international conference on Artificial Intelligence and Statistics (AIStats 2013)

arXiv:1208.5672 [pdf, other]

doi 10.1063/1.4824012

Increased surface flashover voltage in microfabricated devices

Authors: R. C. Sterling, M. D. Hughes, C. J. Mellor, W. K. Hensinger

Abstract: With the demand for improved performance in microfabricated devices, the necessity to apply greater electric fields and voltages becomes evident. When operating in vacuum, the voltage is typically limited by surface flashover forming along the surface of a dielectric. By modifying the fabrication process we have discovered it is possible to more than double the flashover voltage. Our finding has s… ▽ More With the demand for improved performance in microfabricated devices, the necessity to apply greater electric fields and voltages becomes evident. When operating in vacuum, the voltage is typically limited by surface flashover forming along the surface of a dielectric. By modifying the fabrication process we have discovered it is possible to more than double the flashover voltage. Our finding has significant impact on the realization of next-generation micro- and nano-fabricated devices and for the fabrication of on-chip ion trap arrays for the realization of scalable ion quantum technology. △ Less

Submitted 25 April, 2014; v1 submitted 28 August, 2012; originally announced August 2012.

Journal ref: Appl. Phys. Lett. 103, 143504 (2013)

arXiv:1204.4487 [pdf, ps, other]

doi 10.1088/1367-2630/14/11/113040

Nonlinear modal coupling in a high-stress doubly-clamped nanomechanical resonator

Authors: K. J. Lulla, R. B. Cousins, A. Venkatesan, M. J. Patton, A. D. Armour, C. J. Mellor, J. R. Owers-Bradley

Abstract: We present results from a study of the nonlinear intermodal coupling between different flexural vibrational modes of a single high-stress, doubly-clamped silicon nitride nanomechanical beam. The measurements were carried out at 100 mK and the beam was actuated using the magnetomotive technique. We observed the nonlinear behavior of the modes individually and also measured the coupling between them… ▽ More We present results from a study of the nonlinear intermodal coupling between different flexural vibrational modes of a single high-stress, doubly-clamped silicon nitride nanomechanical beam. The measurements were carried out at 100 mK and the beam was actuated using the magnetomotive technique. We observed the nonlinear behavior of the modes individually and also measured the coupling between them by driving the beam at multiple frequencies. We demonstrate that the different modes of the resonator are coupled to each other by the displacement induced tension in the beam, which also leads to the well known Duffing nonlinearity in doubly-clamped beams. △ Less

Submitted 19 April, 2012; originally announced April 2012.

Comments: 15 pages, 7 figures

Journal ref: New Journal of Physics 14 113040 (2012)

arXiv:1202.3315 [pdf, ps, other]

doi 10.1088/1367-2630/14/8/083015

Real-space imaging of quantum Hall effect edge strips

Authors: M. E. Suddards, A. Baumgartner, M. Henini, C. J. Mellor

Abstract: We use dynamic scanning capacitance microscopy (DSCM) to image compressible and incompressible strips at the edge of a Hall bar in a two-dimensional electron gas (2DEG) in the quantum Hall effect (QHE) regime. This method gives access to the complex local conductance, Gts, between a sharp metallic tip scanned across the sample surface and ground, comprising the complex sample conductance. Near int… ▽ More We use dynamic scanning capacitance microscopy (DSCM) to image compressible and incompressible strips at the edge of a Hall bar in a two-dimensional electron gas (2DEG) in the quantum Hall effect (QHE) regime. This method gives access to the complex local conductance, Gts, between a sharp metallic tip scanned across the sample surface and ground, comprising the complex sample conductance. Near integer filling factors we observe a bright stripe along the sample edge in the imaginary part of Gts. The simultaneously recorded real part exhibits a sharp peak at the boundary between the sample interior and the stripe observed in the imaginary part. The features are periodic in the inverse magnetic field and consistent with compressible and incompressible strips forming at the sample edge. For currents larger than the critical current of the QHE break-down the stripes vanish sharply and a homogeneous signal is recovered, similar to zero magnetic field. Our experiments directly illustrate the formation and a variety of properties of the conceptually important QHE edge states at the physical edge of a 2DEG. △ Less

Submitted 15 February, 2012; originally announced February 2012.

Comments: 7 pages

Journal ref: New J. Phys. 14, 083015 (2012)

arXiv:1008.1788 [pdf, ps, other]

doi 10.1103/PhysRevB.83.081305

Spin polarization of (Ga,Mn)As measured by Andreev Spectroscopy: The role of spin-active scattering

Authors: S. Piano, R. Grein, C. J. Mellor, K. Vyborny, R. Campion, M. Wang, M. Eschrig, B. L. Gallagher

Abstract: We investigate the spin-polarization of the ferromagnetic semiconductor (Ga,Mn)As by point contact Andreev reflection spectroscopy. The conductance spectra are analyzed using a recent theoretical model that accounts for momentum- and spin-dependent scattering at the interface. This allows us to fit the data without resorting, as in the case of the standard spin-dependent Blonder-Tinkham-Klapwijk (… ▽ More We investigate the spin-polarization of the ferromagnetic semiconductor (Ga,Mn)As by point contact Andreev reflection spectroscopy. The conductance spectra are analyzed using a recent theoretical model that accounts for momentum- and spin-dependent scattering at the interface. This allows us to fit the data without resorting, as in the case of the standard spin-dependent Blonder-Tinkham-Klapwijk (BTK) model, to an effective temperature or a statistical distribution of superconducting gaps. We find a transport polarization PC{\approx}57%, in considerably better agreement with the k{\cdot}p kinetic-exchange model of (Ga,Mn)As, than the significantly larger estimates inferred from the BTK model. The temperature dependence of the conductance spectra is fully analyzed. △ Less

Submitted 18 February, 2011; v1 submitted 10 August, 2010; originally announced August 2010.

Comments: 4 pages, 3 figures

Journal ref: Phys. Rev. B 83, 081305(R) (2011)

arXiv:0912.1281 [pdf, ps, other]

doi 10.1103/PhysRevB.81.073410

Dissipation due to tunneling two-level systems in gold nanomechanical resonators

Authors: A. Venkatesan, K. J. Lulla, M. J. Patton, A. D. Armour, C. J. Mellor, J. R. Owers-Bradley

Abstract: We present measurements of the dissipation and frequency shift in nanomechanical gold resonators at temperatures down to 10 mK. The resonators were fabricated as doubly-clamped beams above a GaAs substrate and actuated magnetomotively. Measurements on beams with frequencies 7.95 MHz and 3.87 MHz revealed that from 30 mK to 500 mK the dissipation increases with temperature as $T^{0.5}$, with satu… ▽ More We present measurements of the dissipation and frequency shift in nanomechanical gold resonators at temperatures down to 10 mK. The resonators were fabricated as doubly-clamped beams above a GaAs substrate and actuated magnetomotively. Measurements on beams with frequencies 7.95 MHz and 3.87 MHz revealed that from 30 mK to 500 mK the dissipation increases with temperature as $T^{0.5}$, with saturation occurring at higher temperatures. The relative frequency shift of the resonators increases logarithmically with temperature up to at least 400 mK. Similarities with the behavior of bulk amorphous solids suggest that the dissipation in our resonators is dominated by two-level systems. △ Less

Submitted 7 December, 2009; originally announced December 2009.

arXiv:0812.4146 [pdf, other]

doi 10.1063/1.3069289

Low-temperature and high magnetic field dynamic scanning capacitance microscope

Authors: A. Baumgartner, M. E. Suddards, C. J. Mellor

Abstract: We demonstrate a dynamic scanning capacitance microscope (DSCM) that operates at large bandwidths, cryogenic temperatures and high magnetic fields. The setup is based on a non-contact atomic force microscope (AFM) with a quartz tuning fork sensor with non-optical excitation and read-out for topography, force and dissipation measurements. The metallic AFM tip forms part of an rf resonator with a… ▽ More We demonstrate a dynamic scanning capacitance microscope (DSCM) that operates at large bandwidths, cryogenic temperatures and high magnetic fields. The setup is based on a non-contact atomic force microscope (AFM) with a quartz tuning fork sensor with non-optical excitation and read-out for topography, force and dissipation measurements. The metallic AFM tip forms part of an rf resonator with a transmission characteristics modulated by the sample properties and the tip-sample capacitance. The tip motion gives rise to a modulation of the capacitance at the frequency of the AFM sensor and its harmonics, which can be recorded simultaneously with the AFM data. We use an intuitive model to describe and analyze the resonator transmission and show that for most experimental conditions it is proportional to the complex tip-sample conductance, which depends on both the tip-sample capacitance and the sample resistivity. We demonstrate the performance of the DSCM on metal disks buried under a polymer layer and we discuss images recorded on a two-dimensional electron gas in the quantum Hall effect regime, i.e. at cryogenic temperatures and high magnetic fields, where we directly image the formation of compressible stripes at the physical edge of the sample. △ Less

Submitted 22 December, 2008; originally announced December 2008.

Journal ref: Rev.Sci.Instrum.80:013704,2009

arXiv:0710.4636 [pdf]

Why Systems-on-Chip Needs More UML like a Hole in the Head

Authors: Stephen J. Mellor, John R. Wolfe, Campbell Mccausland

Abstract: Let's be clear from the outset: SoC can most certainly make use of UML; SoC just doesn't need more UML, or even all of it. The advent of model mappings, coupled with marks that indicate which mapping rule to apply, enable a major simplification of the use of UML in SoC. Let's be clear from the outset: SoC can most certainly make use of UML; SoC just doesn't need more UML, or even all of it. The advent of model mappings, coupled with marks that indicate which mapping rule to apply, enable a major simplification of the use of UML in SoC. △ Less

Submitted 25 October, 2007; originally announced October 2007.

Comments: Submitted on behalf of EDAA (http://www.edaa.com/)

Journal ref: Dans Design, Automation and Test in Europe - DATE'05, Munich : Allemagne (2005)

arXiv:cond-mat/0703614 [pdf, ps, other]

Anisotropic magnetic field dependence of many-body enhanced electron tunnelling through a quantum dot

Authors: E. E. Vdovin, Yu. N. Khanin, O. Makarovsky, A. Patane, L. Eaves, M. Henini, C. J. Mellor, K. A. Benedict, R. Airey

Abstract: We investigate the effect of an applied magnetic field on resonant tunneling of electrons through the bound states of self-assembled InAs quantum dots (QDs) embedded within an (AlGa)As tunnel barrier. At low temperatures (no more than 2 K), a magnetic field B applied either parallel or perpendicular to the direction of current flow causes a significant enhancement of the tunnel current. For the… ▽ More We investigate the effect of an applied magnetic field on resonant tunneling of electrons through the bound states of self-assembled InAs quantum dots (QDs) embedded within an (AlGa)As tunnel barrier. At low temperatures (no more than 2 K), a magnetic field B applied either parallel or perpendicular to the direction of current flow causes a significant enhancement of the tunnel current. For the latter field configuration, we observe a strong angular anisotropy of the enhanced current when B is rotated in the plane of the quantum dot layer. We attribute this behavior to the effect of the lowered symmetry of the QD eigenfunctions on the electron-electron interaction. △ Less

Submitted 23 March, 2007; originally announced March 2007.

Comments: Revtex4, 6 pages, 6 figures

arXiv:cond-mat/0511375 [pdf, ps, other]

doi 10.1016/j.physe.2006.03.142

1/f noise in a dilute GaAs two-dimensional hole system in the insulating phase

Authors: G. Deville, R. Leturcq, D. L'Hote, R. Tourbot, C. J. Mellor, M. Henini

Abstract: We have measured the resistance and the 1/f resistance noise of a two-dimensional low density hole system in a high mobility GaAs quantum well at low temperature. At densities lower than the metal-insulator transition one, the temperature dependence of the resistance is either power-like or simply activated. The noise decreases when the temperature or the density increase. These results contradi… ▽ More We have measured the resistance and the 1/f resistance noise of a two-dimensional low density hole system in a high mobility GaAs quantum well at low temperature. At densities lower than the metal-insulator transition one, the temperature dependence of the resistance is either power-like or simply activated. The noise decreases when the temperature or the density increase. These results contradict the standard description of independent particles in the strong localization regime. On the contrary, they agree with the percolation picture suggested by higher density results. The physical nature of the system could be a mixture of a conducting and an insulating phase. We compare our results with those of composite thin films. △ Less

Submitted 15 November, 2005; originally announced November 2005.

Comments: 4 pages, 3 figures; to appear in Physica E (EP2DS-16 proceedings)

arXiv:cond-mat/0510142 [pdf]

doi 10.1063/1.2036717

1/f Noise In Low Density Two-Dimensional Hole Systems In GaAs

Authors: G. Deville, R. Leturcq, D. L'Hote, R. Tourbot, C. J. Mellor, M. Henini

Abstract: Two-dimensional electron or hole systems in semiconductors offer the unique opportunity to investigate the physics of strongly interacting fermions. We have measured the 1/f resistance noise of two-dimensional hole systems in high mobility GaAs quantum wells, at densities below that of the metal-insulator transition (MIT) at zero magnetic field. Two techniques voltage and current fluctuations we… ▽ More Two-dimensional electron or hole systems in semiconductors offer the unique opportunity to investigate the physics of strongly interacting fermions. We have measured the 1/f resistance noise of two-dimensional hole systems in high mobility GaAs quantum wells, at densities below that of the metal-insulator transition (MIT) at zero magnetic field. Two techniques voltage and current fluctuations were used. The normalized noise power SR/R2 increases strongly when the hole density or the temperature are decreased. The temperature dependence is steeper at the lowest densities. This contradicts the predictions of the modulation approach in the strong localization hopping transport regime. The hypothesis of a second order phase transition or percolation transition at a density below that of the MIT is thus reinforced. △ Less

Submitted 6 October, 2005; originally announced October 2005.

Comments: PDF, 4 pages, 2 figures, in: Proceedings of the 18th International Conference on Noise and Fluctuations (ICNF2005), September 19-23, 2005, Salamanca, Spain

Report number: SPEC-S05/043 [http://www-drecam.cea.fr/spec/articles/S05/043]

Journal ref: AIP Conference Proceedings 780, 139-142 (2005)

arXiv:cond-mat/0412084 [pdf]

doi 10.1117/12.546505

Resistance noise scaling in a 2D system in GaAs

Authors: R. Leturcq, G. Deville, D. L'Hote, R. Tourbot, C. J. Mellor, M. Henini

Abstract: The 1/f resistance noise of a two-dimensional (2D) hole system in a high mobility GaAs quantum well has been measured on both sides of the 2D metal-insulator transition (MIT) at zero magnetic field (B=0), and deep in the insulating regime. The two measurement methods used are described: I or V fixed, and measurement of resp. V or I fluctuations. The normalized noise magnitude SR/R^2 increases st… ▽ More The 1/f resistance noise of a two-dimensional (2D) hole system in a high mobility GaAs quantum well has been measured on both sides of the 2D metal-insulator transition (MIT) at zero magnetic field (B=0), and deep in the insulating regime. The two measurement methods used are described: I or V fixed, and measurement of resp. V or I fluctuations. The normalized noise magnitude SR/R^2 increases strongly when the hole density is decreased, and its temperature (T) dependence goes from a slight increase with T at the largest densities, to a strong decrease at low density. We find that the noise magnitude scales with the resistance, SR /R^2 ~ R^2.4. Such a scaling is expected for a second order phase transition or a percolation transition. The possible presence of such a transition is investigated by studying the dependence of the conductivity as a function of the density. This dependence is consistent with a critical behavior close to a critical density p* lower than the usual MIT critical density pc. △ Less

Submitted 3 December, 2004; originally announced December 2004.

Comments: 13 pages, 8 figures, Proceedings of SPIE: Fluctuations and noise in materials, D. Popovic, M.B. Weissman, Z.A. Racz Eds., Vol. 5469, pp. 101-113, Mspalomas, Spain, 2004

arXiv:cond-mat/0301222 [pdf, ps, other]

doi 10.1103/PhysRevLett.90.076402

Resistance Noise Scaling in a Dilute Two-Dimensional Hole System in GaAs

Authors: R. Leturcq, D. L'Hote, R. Tourbot, C. J. Mellor, M. Henini

Abstract: We have measured the resistance noise of a two-dimensional (2D)hole system in a high mobility GaAs quantum well, around the 2D metal-insulator transition (MIT) at zero magnetic field. The normalized noise power $S_R/R^2$ increases strongly when the hole density p_s is decreased, increases slightly with temperature (T) at the largest densities, and decreases strongly with T at low p_s. The noise… ▽ More We have measured the resistance noise of a two-dimensional (2D)hole system in a high mobility GaAs quantum well, around the 2D metal-insulator transition (MIT) at zero magnetic field. The normalized noise power $S_R/R^2$ increases strongly when the hole density p_s is decreased, increases slightly with temperature (T) at the largest densities, and decreases strongly with T at low p_s. The noise scales with the resistance, $S_R/R^2 \sim R^{2.4}$, as for a second order phase transition such as a percolation transition. The p_s dependence of the conductivity is consistent with a critical behavior for such a transition, near a density p* which is lower than the observed MIT critical density p_c. △ Less

Submitted 13 February, 2003; v1 submitted 14 January, 2003; originally announced January 2003.

Comments: 4 pages, 4 figures, to be published in Phys. Rev. Lett

arXiv:cond-mat/9907485 [pdf, ps, other]

doi 10.1103/PhysRevB.60.10984

A theory of phonon spectroscopy in the fractional quantum Hall regime

Authors: Keith A. Benedict, R. K. Hills, C. J. Mellor

Abstract: We describe a theoretical framework for the interpretation of time-resolved phonon absorption experiments carried out in the fractional quantum Hall regime of a magnetically quantized two-dimensional electron system (2des). The only phonons which can be absorbed at low temperature are those whose energies exceed the magnetoroton gap predicted by Girvin, MacDonald and Platzman. The rate of energy… ▽ More We describe a theoretical framework for the interpretation of time-resolved phonon absorption experiments carried out in the fractional quantum Hall regime of a magnetically quantized two-dimensional electron system (2des). The only phonons which can be absorbed at low temperature are those whose energies exceed the magnetoroton gap predicted by Girvin, MacDonald and Platzman. The rate of energy transfer from the phonons to the electron liquid is entirely controlled by the creation of these collective excitations. Using simple isotropic approximations for the phonon propagation and electron-phonon coupling we obtain analytic results for the regime in which the electron temperature and the characteristic temperature of the phonons are much less than the gap and identify the way in which the dispersion curve of the magnetorotons could be extracted from time and angle resolved experiments. △ Less

Submitted 30 July, 1999; originally announced July 1999.

Comments: To appear in Physical Review B, 4 figures

Showing 1–41 of 41 results for author: Mellor, J