-
Prompt Selection Matters: Enhancing Text Annotations for Social Sciences with Large Language Models
Authors:
Louis Abraham,
Charles Arnal,
Antoine Marie
Abstract:
Large Language Models have recently been applied to text annotation tasks from social sciences, equalling or surpassing the performance of human workers at a fraction of the cost. However, no inquiry has yet been made on the impact of prompt selection on labelling accuracy. In this study, we show that performance greatly varies between prompts, and we apply the method of automatic prompt optimizat…
▽ More
Large Language Models have recently been applied to text annotation tasks from social sciences, equalling or surpassing the performance of human workers at a fraction of the cost. However, no inquiry has yet been made on the impact of prompt selection on labelling accuracy. In this study, we show that performance greatly varies between prompts, and we apply the method of automatic prompt optimization to systematically craft high quality prompts. We also provide the community with a simple, browser-based implementation of the method at https://prompt-ultra.github.io/ .
△ Less
Submitted 15 July, 2024;
originally announced July 2024.
-
Building Trust in AI-Driven Decision Making for Cyber-Physical Systems (CPS): A Comprehensive Review
Authors:
Rahul Umesh Mhapsekar,
Muhammad Iftikhar Umrani,
Malik Faizan,
Omer Ali,
Lizy Abraham
Abstract:
Recent advancements in technology have led to the emergence of Cyber-Physical Systems (CPS), which seamlessly integrate the cyber and physical domains in various sectors such as agriculture, autonomous systems, and healthcare. This integration presents opportunities for enhanced efficiency and automation through the utilization of artificial intelligence (AI) and machine learning (ML). However, th…
▽ More
Recent advancements in technology have led to the emergence of Cyber-Physical Systems (CPS), which seamlessly integrate the cyber and physical domains in various sectors such as agriculture, autonomous systems, and healthcare. This integration presents opportunities for enhanced efficiency and automation through the utilization of artificial intelligence (AI) and machine learning (ML). However, the complexity of CPS brings forth challenges related to transparency, bias, and trust in AI-enabled decision-making processes. This research explores the significance of AI and ML in enabling CPS in these domains and addresses the challenges associated with interpreting and trusting AI systems within CPS. Specifically, the role of explainable AI (XAI) in enhancing trustworthiness and reliability in AI-enabled decision-making processes is discussed. Key challenges such as transparency, security, and privacy are identified, along with the necessity of building trust through transparency, accountability, and ethical considerations.
△ Less
Submitted 10 May, 2024;
originally announced May 2024.
-
Energy-Efficient Uncertainty-Aware Biomass Composition Prediction at the Edge
Authors:
Muhammad Zawish,
Paul Albert,
Flavio Esposito,
Steven Davy,
Lizy Abraham
Abstract:
Clover fixates nitrogen from the atmosphere to the ground, making grass-clover mixtures highly desirable to reduce external nitrogen fertilization. Herbage containing clover additionally promotes higher food intake, resulting in higher milk production. Herbage probing however remains largely unused as it requires a time-intensive manual laboratory analysis. Without this information, farmers are un…
▽ More
Clover fixates nitrogen from the atmosphere to the ground, making grass-clover mixtures highly desirable to reduce external nitrogen fertilization. Herbage containing clover additionally promotes higher food intake, resulting in higher milk production. Herbage probing however remains largely unused as it requires a time-intensive manual laboratory analysis. Without this information, farmers are unable to perform localized clover sowing or take targeted fertilization decisions. Deep learning algorithms have been proposed with the goal to estimate the dry biomass composition from images of the grass directly in the fields. The energy-intensive nature of deep learning however limits deployment to practical edge devices such as smartphones. This paper proposes to fill this gap by applying filter pruning to reduce the energy requirement of existing deep learning solutions. We report that although pruned networks are accurate on controlled, high-quality images of the grass, they struggle to generalize to real-world smartphone images that are blurry or taken from challenging angles. We address this challenge by training filter-pruned models using a variance attenuation loss so they can predict the uncertainty of their predictions. When the uncertainty exceeds a threshold, we re-infer using a more accurate unpruned model. This hybrid approach allows us to reduce energy consumption while retaining a high accuracy. We evaluate our algorithm on two datasets: the GrassClover and the Irish clover using an NVIDIA Jetson Nano edge device. We find that we reduce energy reduction with respect to state-of-the-art solutions by 50% on average with only 4% accuracy loss.
△ Less
Submitted 17 April, 2024;
originally announced April 2024.
-
Automated Detection of Galactic Rings from SDSS Images
Authors:
Linn Abraham,
Sheelu Abraham,
Ajit K. Kembhavi,
N. S. Philip,
A. K. Aniyan,
Sudhanshu Barway,
Harish Kumar
Abstract:
Morphological features in galaxies, like spiral arms, bars, rings, tidal tails etc. carry information about their structure, origin and evolution. It is therefore important to catalogue and study such features and to correlate them with other basic galaxy properties the environment in which the galaxies are located and their interactions with other galaxies. Surveys such as SDSS, Pan-STARRS, HSC-S…
▽ More
Morphological features in galaxies, like spiral arms, bars, rings, tidal tails etc. carry information about their structure, origin and evolution. It is therefore important to catalogue and study such features and to correlate them with other basic galaxy properties the environment in which the galaxies are located and their interactions with other galaxies. Surveys such as SDSS, Pan-STARRS, HSC-SSP have made available very large samples of galaxies for gainful morphological studies. The availability of galaxy images and catalogues will increase manifold with future surveys like LSST. The volume of present and future data is so large that traditional methods, which involve expert astronomers identifying morphological features through visual inspection, are no longer sufficient. It is therefore necessary to use AI based techniques like machine learning and deep learning for finding morphological structures quickly and efficiently. We report in this study the application of deep learning for finding ring like structures in galaxy images from the Sloan Digital Sky Survey (SDSS) data release DR18. We use a catalogue by Buta (2017) of ringed galaxies from the SDSS to train the network reaching good accuracy and recall, and generate a catalogue of 29420 galaxies of which 9805 have ring like structures with prediction confidence exceeding 90 percent. Using a catalogue of barred galaxy images identified by Abraham et. al. (2018) using deep learning techniques, we identify a set of 2087 galaxies with bars as well as rings. The catalogues should be very useful in understanding the origin of these important morphological structures.
△ Less
Submitted 5 April, 2024;
originally announced April 2024.
-
A Game of Competition for Risk
Authors:
Louis Abraham
Abstract:
In this study, we present models where participants strategically select their risk levels and earn corresponding rewards, mirroring real-world competition across various sectors. Our analysis starts with a normal form game involving two players in a continuous action space, confirming the existence and uniqueness of a Nash equilibrium and providing an analytical solution. We then extend this anal…
▽ More
In this study, we present models where participants strategically select their risk levels and earn corresponding rewards, mirroring real-world competition across various sectors. Our analysis starts with a normal form game involving two players in a continuous action space, confirming the existence and uniqueness of a Nash equilibrium and providing an analytical solution. We then extend this analysis to multi-player scenarios, introducing a new numerical algorithm for its calculation. A key novelty of our work lies in using regret minimization algorithms to solve continuous games through discretization. This groundbreaking approach enables us to incorporate additional real-world factors like market frictions and risk correlations among firms. We also experimentally validate that the Nash equilibrium in our model also serves as a correlated equilibrium. Our findings illuminate how market frictions and risk correlations affect strategic risk-taking. We also explore how policy measures can impact risk-taking and its associated rewards, with our model providing broader applicability than the Diamond-Dybvig framework. We make our methodology and open-source code available at https://github.com/louisabraham/cfrgame
Finally, we contribute methodologically by advocating the use of algorithms in economics, shifting focus from finite games to games with continuous action sets. Our study provides a solid framework for analyzing strategic interactions in continuous action games, emphasizing the importance of market frictions, risk correlations, and policy measures in strategic risk-taking dynamics.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
Lightning-induced chemistry on tidally-locked Earth-like exoplanets
Authors:
Marrick Braam,
Paul I. Palmer,
Leen Decin,
Robert J. Ridgway,
Maria Zamyatina,
Nathan J. Mayne,
Denis E. Sergeev,
N. Luke Abraham
Abstract:
Determining the habitability and interpreting atmospheric spectra of exoplanets requires understanding their atmospheric physics and chemistry. We use a 3-D Coupled Climate-Chemistry Model, the Met Office Unified Model with the UK Chemistry and Aerosols framework, to study the emergence of lightning and its chemical impact on tidally-locked Earth-like exoplanets. We simulate the atmosphere of Prox…
▽ More
Determining the habitability and interpreting atmospheric spectra of exoplanets requires understanding their atmospheric physics and chemistry. We use a 3-D Coupled Climate-Chemistry Model, the Met Office Unified Model with the UK Chemistry and Aerosols framework, to study the emergence of lightning and its chemical impact on tidally-locked Earth-like exoplanets. We simulate the atmosphere of Proxima Centauri b orbiting in the Habitable Zone of its M-dwarf star, but the results apply to similar M-dwarf orbiting planets. Our chemical network includes the Chapman ozone reactions and hydrogen oxide (HO$_{\mathrm{x}}$=H+OH+HO$_2$) and nitrogen oxide (NO$_{\mathrm{x}}$=NO+NO$_2$) catalytic cycles. We find that photochemistry driven by stellar radiation (177-850 nm) supports a global ozone layer between 20-50 km. We parameterise lightning flashes as a function of cloud-top height and the resulting production of nitric oxide (NO) from the thermal decomposition of N$_2$ and O$_2$. Rapid dayside convection over and around the substellar point results in lightning flash rates of up to 0.16 flashes km$^{-2}$yr$^{-1}$, enriching the dayside atmosphere below altitudes of 20 km in NO$_{\mathrm{x}}$. Changes in dayside ozone are determined mainly by UV irradiance and the HO$_{\mathrm{x}}$ catalytic cycle. ~45% of the planetary dayside surface remains at habitable temperatures (T$_{\mathrm{surf}}$>273.15 K) and the ozone layer reduces surface UV radiation levels to 15%. Dayside-nightside thermal gradients result in strong winds that subsequently advect NO$_{\mathrm{x}}$ towards the nightside, where the absence of photochemistry allows NO$_{\mathrm{x}}$ chemistry to involve reservoir species. Our study also emphasizes the need for accurate UV stellar spectra to understand the atmospheric chemistry of exoplanets.
△ Less
Submitted 26 September, 2022;
originally announced September 2022.
-
Complexity-Driven CNN Compression for Resource-constrained Edge AI
Authors:
Muhammad Zawish,
Steven Davy,
Lizy Abraham
Abstract:
Recent advances in Artificial Intelligence (AI) on the Internet of Things (IoT)-enabled network edge has realized edge intelligence in several applications such as smart agriculture, smart hospitals, and smart factories by enabling low-latency and computational efficiency. However, deploying state-of-the-art Convolutional Neural Networks (CNNs) such as VGG-16 and ResNets on resource-constrained ed…
▽ More
Recent advances in Artificial Intelligence (AI) on the Internet of Things (IoT)-enabled network edge has realized edge intelligence in several applications such as smart agriculture, smart hospitals, and smart factories by enabling low-latency and computational efficiency. However, deploying state-of-the-art Convolutional Neural Networks (CNNs) such as VGG-16 and ResNets on resource-constrained edge devices is practically infeasible due to their large number of parameters and floating-point operations (FLOPs). Thus, the concept of network pruning as a type of model compression is gaining attention for accelerating CNNs on low-power devices. State-of-the-art pruning approaches, either structured or unstructured do not consider the different underlying nature of complexities being exhibited by convolutional layers and follow a training-pruning-retraining pipeline, which results in additional computational overhead. In this work, we propose a novel and computationally efficient pruning pipeline by exploiting the inherent layer-level complexities of CNNs. Unlike typical methods, our proposed complexity-driven algorithm selects a particular layer for filter-pruning based on its contribution to overall network complexity. We follow a procedure that directly trains the pruned model and avoids the computationally complex ranking and fine-tuning steps. Moreover, we define three modes of pruning, namely parameter-aware (PA), FLOPs-aware (FA), and memory-aware (MA), to introduce versatile compression of CNNs. Our results show the competitive performance of our approach in terms of accuracy and acceleration. Lastly, we present a trade-off between different resources and accuracy which can be helpful for developers in making the right decisions in resource-constrained IoT environments.
△ Less
Submitted 26 August, 2022;
originally announced August 2022.
-
FastCPH: Efficient Survival Analysis for Neural Networks
Authors:
Xuelin Yang,
Louis Abraham,
Sejin Kim,
Petr Smirnov,
Feng Ruan,
Benjamin Haibe-Kains,
Robert Tibshirani
Abstract:
The Cox proportional hazards model is a canonical method in survival analysis for prediction of the life expectancy of a patient given clinical or genetic covariates -- it is a linear model in its original form. In recent years, several methods have been proposed to generalize the Cox model to neural networks, but none of these are both numerically correct and computationally efficient. We propose…
▽ More
The Cox proportional hazards model is a canonical method in survival analysis for prediction of the life expectancy of a patient given clinical or genetic covariates -- it is a linear model in its original form. In recent years, several methods have been proposed to generalize the Cox model to neural networks, but none of these are both numerically correct and computationally efficient. We propose FastCPH, a new method that runs in linear time and supports both the standard Breslow and Efron methods for tied events. We also demonstrate the performance of FastCPH combined with LassoNet, a neural network that provides interpretability through feature sparsity, on survival datasets. The final procedure is efficient, selects useful covariates and outperforms existing CoxPH approaches.
△ Less
Submitted 20 August, 2022;
originally announced August 2022.
-
Neural interval-censored survival regression with feature selection
Authors:
Carlos García Meixide,
Marcos Matabuena,
Louis Abraham,
Michael R. Kosorok
Abstract:
Survival analysis is a fundamental area of focus in biomedical research, particularly in the context of personalized medicine. This prominence is due to the increasing prevalence of large and high-dimensional datasets, such as omics and medical image data. However, the literature on non-linear regression algorithms and variable selection techniques for interval-censoring is either limited or non-e…
▽ More
Survival analysis is a fundamental area of focus in biomedical research, particularly in the context of personalized medicine. This prominence is due to the increasing prevalence of large and high-dimensional datasets, such as omics and medical image data. However, the literature on non-linear regression algorithms and variable selection techniques for interval-censoring is either limited or non-existent, particularly in the context of neural networks. Our objective is to introduce a novel predictive framework tailored for interval-censored regression tasks, rooted in Accelerated Failure Time (AFT) models. Our strategy comprises two key components: i) a variable selection phase leveraging recent advances on sparse neural network architectures, ii) a regression model targeting prediction of the interval-censored response. To assess the performance of our novel algorithm, we conducted a comprehensive evaluation through both numerical experiments and real-world applications that encompass scenarios related to diabetes and physical activity. Our results outperform traditional AFT algorithms, particularly in scenarios featuring non-linear relationships.
△ Less
Submitted 22 August, 2024; v1 submitted 14 June, 2022;
originally announced June 2022.
-
Small Number of Communities in Twitter Keyword Networks
Authors:
Linda Abraham,
Anthony Bonato,
Alexander Nazareth
Abstract:
We investigate networks formed by keywords in tweets and study their community structure. Based on datasets of tweets mined from over seven hundred political figures in the U.S. and Canada, we hypothesize that such Twitter keyword networks exhibit a small number of communities. Our results are further reinforced by considering via so-called pseudo-tweets generated randomly and using AI-based langu…
▽ More
We investigate networks formed by keywords in tweets and study their community structure. Based on datasets of tweets mined from over seven hundred political figures in the U.S. and Canada, we hypothesize that such Twitter keyword networks exhibit a small number of communities. Our results are further reinforced by considering via so-called pseudo-tweets generated randomly and using AI-based language generation software. We speculate as to the possible origins of the small community hypothesis and further attempts at validating it.
△ Less
Submitted 30 August, 2021;
originally announced August 2021.
-
Competition analysis on the over-the-counter credit default swap market
Authors:
Louis Abraham
Abstract:
We study two questions related to competition on the OTC CDS market using data collected as part of the EMIR regulation.
First, we study the competition between central counterparties through collateral requirements. We present models that successfully estimate the initial margin requirements. However, our estimations are not precise enough to use them as input to a predictive model for CCP choi…
▽ More
We study two questions related to competition on the OTC CDS market using data collected as part of the EMIR regulation.
First, we study the competition between central counterparties through collateral requirements. We present models that successfully estimate the initial margin requirements. However, our estimations are not precise enough to use them as input to a predictive model for CCP choice by counterparties in the OTC market.
Second, we model counterpart choice on the interdealer market using a novel semi-supervised predictive task. We present our methodology as part of the literature on model interpretability before arguing for the use of conditional entropy as the metric of interest to derive knowledge from data through a model-agnostic approach. In particular, we justify the use of deep neural networks to measure conditional entropy on real-world datasets. We create the $\textit{Razor entropy}$ using the framework of algorithmic information theory and derive an explicit formula that is identical to our semi-supervised training objective. Finally, we borrow concepts from game theory to define $\textit{top-k Shapley values}$. This novel method of payoff distribution satisfies most of the properties of Shapley values, and is of particular interest when the value function is monotone submodular. Unlike classical Shapley values, top-k Shapley values can be computed in quadratic time of the number of features instead of exponential. We implement our methodology and report the results on our particular task of counterpart choice.
Finally, we present an improvement to the $\textit{node2vec}$ algorithm that could for example be used to further study intermediation. We show that the neighbor sampling used in the generation of biased walks can be performed in logarithmic time with a quasilinear time pre-computation, unlike the current implementations that do not scale well.
△ Less
Submitted 3 December, 2020;
originally announced December 2020.
-
Bloom Origami Assays: Practical Group Testing
Authors:
Louis Abraham,
Gary Becigneul,
Benjamin Coleman,
Bernhard Scholkopf,
Anshumali Shrivastava,
Alexander Smola
Abstract:
We study the problem usually referred to as group testing in the context of COVID-19. Given n samples collected from patients, how should we select and test mixtures of samples to maximize information and minimize the number of tests? Group testing is a well-studied problem with several appealing solutions, but recent biological studies impose practical constraints for COVID-19 that are incompatib…
▽ More
We study the problem usually referred to as group testing in the context of COVID-19. Given n samples collected from patients, how should we select and test mixtures of samples to maximize information and minimize the number of tests? Group testing is a well-studied problem with several appealing solutions, but recent biological studies impose practical constraints for COVID-19 that are incompatible with traditional methods. Furthermore, existing methods use unnecessarily restrictive solutions, which were devised for settings with more memory and compute constraints than the problem at hand. This results in poor utility. In the new setting, we obtain strong solutions for small values of n using evolutionary strategies. We then develop a new method combining Bloom filters with belief propagation to scale to larger values of n (more than 100) with good empirical results. We also present a more accurate decoding algorithm that is tailored for specific COVID-19 settings. This work demonstrates the practical gap between dedicated algorithms and well-known generic solutions. Our efforts results in a new and practical multiplex method yielding strong empirical performance without mixing more than a chosen number of patients into the same probe. Finally, we briefly discuss adaptive methods, casting them into the framework of adaptive sub-modularity.
△ Less
Submitted 21 July, 2020;
originally announced August 2020.
-
Crackovid: Optimizing Group Testing
Authors:
Louis Abraham,
Gary Bécigneul,
Bernhard Schölkopf
Abstract:
We study the problem usually referred to as group testing in the context of COVID-19. Given $n$ samples taken from patients, how should we select mixtures of samples to be tested, so as to maximize information and minimize the number of tests? We consider both adaptive and non-adaptive strategies, and take a Bayesian approach with a prior both for infection of patients and test errors. We start by…
▽ More
We study the problem usually referred to as group testing in the context of COVID-19. Given $n$ samples taken from patients, how should we select mixtures of samples to be tested, so as to maximize information and minimize the number of tests? We consider both adaptive and non-adaptive strategies, and take a Bayesian approach with a prior both for infection of patients and test errors. We start by proposing a mathematically principled objective, grounded in information theory. We then optimize non-adaptive optimization strategies using genetic algorithms, and leverage the mathematical framework of adaptive sub-modularity to obtain theoretical guarantees for the greedy-adaptive method.
△ Less
Submitted 13 May, 2020;
originally announced May 2020.
-
Implications of three-dimensional chemical transport in hot Jupiter atmospheres: results from a consistently coupled chemistry-radiation-hydrodynamics model
Authors:
Benjamin Drummond,
Eric Hebrard,
Nathan J. Mayne,
Olivia Venot,
Robert J. Ridgway,
Quentin Changeat,
Shang-min Tsai,
James Manners,
Pascal Tremblin,
Nathan Luke Abraham,
David Sing,
Krisztian Kohary
Abstract:
We present results from a set of simulations using a fully coupled three-dimensional (3D) chemistry-radiation-hydrodynamics model and investigate the effect of transport of chemical species by the large-scale atmospheric flow in hot Jupiter atmospheres. We couple a flexible chemical kinetics scheme to the Met Office Unified Model which enables the study of the interaction of chemistry, radiative t…
▽ More
We present results from a set of simulations using a fully coupled three-dimensional (3D) chemistry-radiation-hydrodynamics model and investigate the effect of transport of chemical species by the large-scale atmospheric flow in hot Jupiter atmospheres. We couple a flexible chemical kinetics scheme to the Met Office Unified Model which enables the study of the interaction of chemistry, radiative transfer and fluid dynamics. We use a newly-released "reduced" chemical network comprising 30 chemical species that has been specifically developed for application in 3D atmosphere models. We simulate the atmospheres of the well-studied hot Jupiters HD~209458b and HD~189733b which both have dayside--nightside temperature contrasts of several hundred Kelvin and superrotating equatorial jets. We find qualitatively quite different chemical structures between the two planets, particularly for methane (CH$_4$), when advection of chemical species is included. Our results show that consideration of 3D chemical transport is vital in understanding the chemical composition of hot Jupiter atmospheres. 3D mixing leads to significant changes in the abundances of absorbing gas-phase species compared with what would be expected by assuming local chemical equilibrium, or from models including 1D - and even 2D - chemical mixing. We find that CH$_4$, carbon dioxide (CO$_2$) and ammonia (NH$_3$) are particularly interesting as 3D mixing of these species leads to prominent signatures of out-of-equilibrium chemistry in the transmission and emission spectra, detectable with near-future instruments.
△ Less
Submitted 14 April, 2020; v1 submitted 30 January, 2020;
originally announced January 2020.
-
Ozone chemistry on tidally locked M dwarf planets
Authors:
Jack S. Yates,
Paul I. Palmer,
James Manners,
Ian Boutle,
Krisztian Kohary,
Nathan Mayne,
Luke Abraham
Abstract:
We use the Met Office Unified Model to explore the potential of a tidally locked M dwarf planet, nominally Proxima Centauri b irradiated by a quiescent version of its host star, to sustain an atmospheric ozone layer. We assume a slab ocean surface layer, and an Earth-like atmosphere of nitrogen and oxygen with trace amounts of ozone and water vapour. We describe ozone chemistry using the Chapman m…
▽ More
We use the Met Office Unified Model to explore the potential of a tidally locked M dwarf planet, nominally Proxima Centauri b irradiated by a quiescent version of its host star, to sustain an atmospheric ozone layer. We assume a slab ocean surface layer, and an Earth-like atmosphere of nitrogen and oxygen with trace amounts of ozone and water vapour. We describe ozone chemistry using the Chapman mechanism and the hydrogen oxide (HO$_x$, describing the sum of OH and HO$_2$) catalytic cycle. We find that Proxima Centauri radiates with sufficient UV energy to initialize the Chapman mechanism. The result is a thin but stable ozone layer that peaks at 0.75 parts per million at 25 km. The quasi-stationary distribution of atmospheric ozone is determined by photolysis driven by incoming stellar radiation and by atmospheric transport. Ozone mole fractions are smallest in the lowest 15 km of the atmosphere at the sub-stellar point and largest in the nightside gyres. Above 15 km the ozone distribution is dominated by an equatorial jet stream that circumnavigates the planet. The nightside ozone distribution is dominated by two cyclonic Rossby gyres that result in localized ozone hotspots. On the dayside the atmospheric lifetime is determined by the HO$_x$ catalytic cycle and deposition to the surface, with nightside lifetimes due to chemistry much longer than timescales associated with atmospheric transport. Surface UV values peak at the substellar point with values of 0.01 W/m$^2$, shielded by the overlying atmospheric ozone layer but more importantly by water vapour clouds.
△ Less
Submitted 18 December, 2019;
originally announced December 2019.
-
The other side of the Coin: Risks of the Libra Blockchain
Authors:
Louis Abraham,
Dominique Guégan
Abstract:
Libra was presented as a cryptocurrency on June 18, 2019 by Facebook. On the same day, Facebook announced plans for Calibra, a subsidiary in charge of the development of an electronic wallet and financial services. In view of the primary risk of sovereignty posed by the creation of Libra, regulators and Central Banks quickly took very clear positions against the project and expressed a lot of ques…
▽ More
Libra was presented as a cryptocurrency on June 18, 2019 by Facebook. On the same day, Facebook announced plans for Calibra, a subsidiary in charge of the development of an electronic wallet and financial services. In view of the primary risk of sovereignty posed by the creation of Libra, regulators and Central Banks quickly took very clear positions against the project and expressed a lot of questions focusing on regulation aspects and national sovereignty.
The purpose of this paper is to provide a holistic analysis of the project encompassing several aspects of its implementation and the issues it raises. We address a set of questions that are part of the cryptocurrency environment and blockchain technology that support the Libra project. We describe the governance of the project based on two levels, one for the Association and the other for the Libra Blockchain. We identify the main risks considering at the same time political, financial, economic, technological and ethical risks. We emphasize the difficulty to regulate such a project as it will depend on several countries whose legislations are very different. Finally, the future of this kind of projects is discussed through the emergence of Central Bank Digital Currencies.
△ Less
Submitted 24 January, 2020; v1 submitted 17 October, 2019;
originally announced October 2019.
-
LassoNet: A Neural Network with Feature Sparsity
Authors:
Ismael Lemhadri,
Feng Ruan,
Louis Abraham,
Robert Tibshirani
Abstract:
Much work has been done recently to make neural networks more interpretable, and one obvious approach is to arrange for the network to use only a subset of the available features. In linear models, Lasso (or $\ell_1$-regularized) regression assigns zero weights to the most irrelevant or redundant features, and is widely used in data science. However the Lasso only applies to linear models. Here we…
▽ More
Much work has been done recently to make neural networks more interpretable, and one obvious approach is to arrange for the network to use only a subset of the available features. In linear models, Lasso (or $\ell_1$-regularized) regression assigns zero weights to the most irrelevant or redundant features, and is widely used in data science. However the Lasso only applies to linear models. Here we introduce LassoNet, a neural network framework with global feature selection. Our approach enforces a hierarchy: specifically a feature can participate in a hidden unit only if its linear representative is active. Unlike other approaches to feature selection for neural nets, our method uses a modified objective function with constraints, and so integrates feature selection with the parameter learning directly. As a result, it delivers an entire regularization path of solutions with a range of feature sparsity. On systematic experiments, LassoNet significantly outperforms state-of-the-art methods for feature selection and regression. The LassoNet method uses projected proximal gradient descent, and generalizes directly to deep networks. It can be implemented by adding just a few lines of code to a standard neural network.
△ Less
Submitted 16 June, 2021; v1 submitted 29 July, 2019;
originally announced July 2019.
-
SAT solving techniques: a bibliography
Authors:
Louis Abraham
Abstract:
We present a selective bibliography about efficient SAT solving, focused on optimizations for the CDCL-based algorithms.
We present a selective bibliography about efficient SAT solving, focused on optimizations for the CDCL-based algorithms.
△ Less
Submitted 23 April, 2018; v1 submitted 10 February, 2018;
originally announced February 2018.
-
Estimacion de carga muscular mediante imagenes
Authors:
Leandro Abraham,
Facundo Bromberg,
Raymundo Forradellas
Abstract:
Un problema de gran interes en disciplinas como la ocupacional, ergonomica y deportiva, es la medicion de variables biomecanicas involucradas en el movimiento humano (como las fuerzas musculares internas y torque de articulaciones). Actualmente este problema se resuelve en un proceso de dos pasos. Primero capturando datos con dispositivos poco prácticos, intrusivos y costosos. Luego estos datos so…
▽ More
Un problema de gran interes en disciplinas como la ocupacional, ergonomica y deportiva, es la medicion de variables biomecanicas involucradas en el movimiento humano (como las fuerzas musculares internas y torque de articulaciones). Actualmente este problema se resuelve en un proceso de dos pasos. Primero capturando datos con dispositivos poco prácticos, intrusivos y costosos. Luego estos datos son usados como entrada en modelos complejos para obtener las variables biomecanicas como salida. El presente trabajo representa una alternativa automatizada, no intrusiva y economica al primer paso, proponiendo la captura de estos datos a traves de imagenes. En trabajos futuros la idea es automatizar todo el proceso de calculo de esas variables. En este trabajo elegimos un caso particular de medicion de variables biomecanicas: el problema de estimar el nivel discreto de carga muscular que estan ejerciendo los musculos de un brazo. Para estimar a partir de imagenes estaticas del brazo ejerciendo la fuerza de sostener la carga, el nivel de la misma, realizamos un proceso de clasificacion. Nuestro enfoque utiliza Support Vector Machines para clasificacion, combinada con una etapa de pre-procesamiento que extrae caracterısticas visuales utilizando variadas tecnicas (Bag of Keypoints, Local Binary Patterns, Histogramas de Color, Momentos de Contornos) En los mejores casos (Local Binary Patterns y Momentos de Contornos) obtenemos medidas de performance en la clasificacion (Precision, Recall, F-Measure y Accuracy) superiores al 90 %.
△ Less
Submitted 2 June, 2016; v1 submitted 9 May, 2016;
originally announced May 2016.
-
On the Design and Implementation of Structured P2P VPNs
Authors:
David Isaac Wolinsky,
Linton Abraham,
Kyungyong Lee,
Yonggang Liu,
Jiangyan Xu,
P. Oscar Boykin,
Renato Figueiredo
Abstract:
Centralized Virtual Private Networks (VPNs) when used in distributed systems have performance constraints as all traffic must traverse through a central server. In recent years, there has been a paradigm shift towards the use of P2P in VPNs to alleviate pressure placed upon the central server by allowing participants to communicate directly with each other, relegating the server to handling sess…
▽ More
Centralized Virtual Private Networks (VPNs) when used in distributed systems have performance constraints as all traffic must traverse through a central server. In recent years, there has been a paradigm shift towards the use of P2P in VPNs to alleviate pressure placed upon the central server by allowing participants to communicate directly with each other, relegating the server to handling session management and supporting NAT traversal using relays when necessary. Another, less common, approach uses unstructured P2P systems to remove all centralization from the VPN. These approaches currently lack the depth in security options provided by other VPN solutions, and their scalability constraints have not been well studied.
In this paper, we propose and implement a novel VPN architecture, which uses a structured P2P system for peer discovery, session management, NAT traversal, and autonomic relay selection and a central server as a partially-automated public key infrastructure (PKI) via a user-friendly web interface. Our model also provides the first design and implementation of a P2P VPN with full tunneling support, whereby all non-P2P based Internet traffic routes through a trusted third party and does so in a way that is more secure than existing full tunnel techniques. To verify our model, we evaluate our reference implementation by comparing it quantitatively to other VPN technologies focusing on latency, bandwidth, and memory usage. We also discuss some of our experiences with developing, maintaining, and deploying a P2P VPN.
△ Less
Submitted 14 January, 2010;
originally announced January 2010.
-
An Improved Real--Space Genetic Algorithm for Crystal Structure and Polymorph Prediction
Authors:
N. L. Abraham,
M. I. J. Probert
Abstract:
Existing Genetic Algorithms for crystal structure and polymorph prediction can suffer from stagnation during evolution, with a consequent loss of efficiency and accuracy. An improved Genetic Algorithm (GA) is introduced herein which penalizes similar structures and so enhances structural diversity in the population at each generation. This is shown to improve the quality of results found for the…
▽ More
Existing Genetic Algorithms for crystal structure and polymorph prediction can suffer from stagnation during evolution, with a consequent loss of efficiency and accuracy. An improved Genetic Algorithm (GA) is introduced herein which penalizes similar structures and so enhances structural diversity in the population at each generation. This is shown to improve the quality of results found for the theoretical prediction of simple model crystal structures. In particular, this method is demonstrated to find three new zero--temperature phases of the Dzugutov potential that have not been previously reported.
△ Less
Submitted 9 May, 2008;
originally announced May 2008.
-
A Periodic Genetic Algorithm with Real-Space Representation for Crystal Structure and Polymorph Prediction
Authors:
N. L. Abraham,
M. I. J. Probert
Abstract:
A novel Genetic Algorithm is described that is suitable for determining the global minimum energy configurations of crystal structures and which can also be used as a polymorph search technique. This algorithm requires no prior assumptions about unit cell size, shape or symmetry, nor about the ionic configuration within the unit cell. This therefore enables true ab initio crystal structure and p…
▽ More
A novel Genetic Algorithm is described that is suitable for determining the global minimum energy configurations of crystal structures and which can also be used as a polymorph search technique. This algorithm requires no prior assumptions about unit cell size, shape or symmetry, nor about the ionic configuration within the unit cell. This therefore enables true ab initio crystal structure and polymorph prediction. Our new algorithm uses a real-space representation of the population members, and makes use of a novel periodic cut for the crossover operation. Results on large Lennard-Jones systems with FCC- and HCP-commensurate cells show robust convergence to the bulk structure from a random initial assignment and an ability to successfully discriminate between competing low enthalpy configurations. Results from an ab initio carbon polymorph search show the spontaneous emergence of both Lonsdaleite and graphite like structures.
△ Less
Submitted 2 May, 2006;
originally announced May 2006.