-
Red Team Redemption: A Structured Comparison of Open-Source Tools for Adversary Emulation
Authors:
Max Landauer,
Klaus Mayer,
Florian Skopik,
Markus Wurzenberger,
Manuel Kern
Abstract:
Red teams simulate adversaries and conduct sophisticated attacks against defenders without informing them about used tactics in advance. These interactive cyber exercises are highly beneficial to assess and improve the security posture of organizations, detect vulnerabilities, and train employees. Unfortunately, they are also time-consuming and expensive, which often limits their scale or prevents…
▽ More
Red teams simulate adversaries and conduct sophisticated attacks against defenders without informing them about used tactics in advance. These interactive cyber exercises are highly beneficial to assess and improve the security posture of organizations, detect vulnerabilities, and train employees. Unfortunately, they are also time-consuming and expensive, which often limits their scale or prevents them entirely. To address this situation, adversary emulation tools partially automate attacker behavior and enable fast, continuous, and repeatable security testing even when involved personnel lacks red teaming experience. Currently, a wide range of tools designed for specific use-cases and requirements exist. To obtain an overview of these solutions, we conduct a review and structured comparison of nine open-source adversary emulation tools. To this end, we assemble a questionnaire with 80 questions addressing relevant aspects, including setup, support, documentation, usability, and technical features. In addition, we conduct a user study with domain experts to investigate the importance of these aspects for distinct user roles. Based on the evaluation and user feedback, we rank the tools and find MITRE Caldera, Metasploit, and Atomic Red Team on top.
△ Less
Submitted 28 August, 2024;
originally announced August 2024.
-
Wireless Information and Energy Transfer in the Era of 6G Communications
Authors:
Constantinos Psomas,
Konstantinos Ntougias,
Nikita Shanin,
Dongfang Xu,
Kenneth MacSporran Mayer,
Nguyen Minh Tran,
Laura Cottatellucci,
Kae Won Choi,
Dong In Kim,
Robert Schober,
Ioannis Krikidis
Abstract:
Wireless information and energy transfer (WIET) represents an emerging paradigm which employs controllable transmission of radio-frequency signals for the dual purpose of data communication and wireless charging. As such, WIET is widely regarded as an enabler of envisioned 6G use cases that rely on energy-sustainable Internet-of-Things (IoT) networks, such as smart cities and smart grids. Meeting…
▽ More
Wireless information and energy transfer (WIET) represents an emerging paradigm which employs controllable transmission of radio-frequency signals for the dual purpose of data communication and wireless charging. As such, WIET is widely regarded as an enabler of envisioned 6G use cases that rely on energy-sustainable Internet-of-Things (IoT) networks, such as smart cities and smart grids. Meeting the quality-of-service demands of WIET, in terms of both data transfer and power delivery, requires effective co-design of the information and energy signals. In this article, we present the main principles and design aspects of WIET, focusing on its integration in 6G networks. First, we discuss how conventional communication notions such as resource allocation and waveform design need to be revisited in the context of WIET. Next, we consider various candidate 6G technologies that can boost WIET efficiency, namely, holographic multiple-input multiple-output, near-field beamforming, terahertz communication, intelligent reflecting surfaces (IRSs), and reconfigurable (fluid) antenna arrays. We introduce respective WIET design methods, analyze the promising performance gains of these WIET systems, and discuss challenges, open issues, and future research directions. Finally, a near-field energy beamforming scheme and a power-based IRS beamforming algorithm are experimentally validated using a wireless energy transfer testbed. The vision of WIET in communication systems has been gaining momentum in recent years, with constant progress with respect to theoretical but also practical aspects. The comprehensive overview of the state of the art of WIET presented in this paper highlights the potentials of WIET systems as well as their overall benefits in 6G networks.
△ Less
Submitted 16 May, 2024; v1 submitted 29 April, 2024;
originally announced April 2024.
-
A Uniformly Random Solution to Algorithmic Redistricting
Authors:
Jin-Yi Cai,
Jacob Kruse,
Kenneth Mayer,
Daniel P. Szabo
Abstract:
The process of drawing electoral district boundaries is known as political redistricting. Within this context, gerrymandering is the practice of drawing these boundaries such that they unfairly favor a particular political party, often leading to unequal representation and skewed electoral outcomes. One of the few ways to detect gerrymandering is by algorithmically sampling redistricting plans. Pr…
▽ More
The process of drawing electoral district boundaries is known as political redistricting. Within this context, gerrymandering is the practice of drawing these boundaries such that they unfairly favor a particular political party, often leading to unequal representation and skewed electoral outcomes. One of the few ways to detect gerrymandering is by algorithmically sampling redistricting plans. Previous methods mainly focus on sampling from some neighborhood of ``realistic' districting plans, rather than a uniform sample of the entire space. We present a deterministic subexponential time algorithm to uniformly sample from the space of all possible $ k $-partitions of a bounded degree planar graph, and with this construct a sample of the entire space of redistricting plans. We also give a way to restrict this sample space to plans that match certain compactness and population constraints at the cost of added complexity. The algorithm runs in $ 2^{O(\sqrt{n}\log n)} $ time, although we only give a heuristic implementation. Our method generalizes an algorithm to count self-avoiding walks on a square to count paths that split general planar graphs into $ k $ regions, and uses this to sample from the space of all $ k $-partitions of a planar graph.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
Joint Transmit Signal and Beamforming Design for Integrated Sensing and Power Transfer Systems
Authors:
Kenneth MacSporran Mayer,
Nikita Shanin,
Zhenlong You,
Sebastian Lotter,
Stefan Brückner,
Martin Vossiek,
Laura Cottatellucci,
Robert Schober
Abstract:
Integrating different functionalities, conventionally implemented as dedicated systems, into a single platform allows utilising the available resources more efficiently. We consider an integrated sensing and power transfer (ISAPT) system and propose the joint optimisation of the rectangular pulse-shaped transmit signal and the beamforming vector to combine sensing and wireless power transfer (WPT)…
▽ More
Integrating different functionalities, conventionally implemented as dedicated systems, into a single platform allows utilising the available resources more efficiently. We consider an integrated sensing and power transfer (ISAPT) system and propose the joint optimisation of the rectangular pulse-shaped transmit signal and the beamforming vector to combine sensing and wireless power transfer (WPT) functionalities efficiently. In contrast to prior works, we adopt an accurate non-linear circuit-based energy harvesting (EH) model. We formulate and solve a non-convex optimisation problem for a general number of EH receivers to maximise a weighted sum of the average harvested powers at the EH receivers while ensuring the received echo signal reflected by a sensing target (ST) has sufficient power for estimating the range to the ST with a prescribed accuracy within the considered coverage region. The average harvested power is shown to monotonically increase with the pulse duration when the average transmit power budget is sufficiently large. We discuss the trade-off between sensing performance and power transfer for the considered ISAPT system. The proposed approach significantly outperforms a heuristic baseline scheme based on a linear EH model, which linearly combines energy beamforming with the beamsteering vector in the direction to the ST as its transmit strategy.
△ Less
Submitted 20 January, 2024; v1 submitted 8 November, 2023;
originally announced November 2023.
-
On the Computational Complexities of Complex-valued Neural Networks
Authors:
Kayol Soares Mayer,
Jonathan Aguiar Soares,
Ariadne Arrais Cruz,
Dalton Soares Arantes
Abstract:
Complex-valued neural networks (CVNNs) are nonlinear filters used in the digital signal processing of complex-domain data. Compared with real-valued neural networks~(RVNNs), CVNNs can directly handle complex-valued input and output signals due to their complex domain parameters and activation functions. With the trend toward low-power systems, computational complexity analysis has become essential…
▽ More
Complex-valued neural networks (CVNNs) are nonlinear filters used in the digital signal processing of complex-domain data. Compared with real-valued neural networks~(RVNNs), CVNNs can directly handle complex-valued input and output signals due to their complex domain parameters and activation functions. With the trend toward low-power systems, computational complexity analysis has become essential for measuring an algorithm's power consumption. Therefore, this paper presents both the quantitative and asymptotic computational complexities of CVNNs. This is a crucial tool in deciding which algorithm to implement. The mathematical operations are described in terms of the number of real-valued multiplications, as these are the most demanding operations. To determine which CVNN can be implemented in a low-power system, quantitative computational complexities can be used to accurately estimate the number of floating-point operations. We have also investigated the computational complexities of CVNNs discussed in some studies presented in the literature.
△ Less
Submitted 19 October, 2023;
originally announced October 2023.
-
Bringing Spatial Interaction Measures into Multi-Criteria Assessment of Redistricting Plans Using Interactive Web Mapping
Authors:
Jacob Kruse,
Song Gao,
Yuhan Ji,
Daniel P. Szabo,
Kenneth Mayer
Abstract:
Redistricting is the process by which electoral district boundaries are drawn, and a common normative assumption in this process is that districts should be drawn so as to capture coherent communities of interest (COIs). While states rely on various proxies for community illustration, such as compactness metrics and municipal split counts, to guide redistricting, recent legal challenges and schola…
▽ More
Redistricting is the process by which electoral district boundaries are drawn, and a common normative assumption in this process is that districts should be drawn so as to capture coherent communities of interest (COIs). While states rely on various proxies for community illustration, such as compactness metrics and municipal split counts, to guide redistricting, recent legal challenges and scholarly works have shown the failings of such proxy measures and the difficulty of balancing multiple criteria in district plan creation. To address these issues, we propose the use of spatial interaction communities to directly quantify the degree to which districts capture the underlying COIs. Using large-scale human mobility flow data, we condense spatial interaction community capture for a set of districts into a single number, the interaction ratio (IR), which can be used for redistricting plan evaluation. To compare the IR to traditional redistricting criteria (compactness and fairness), and to explore the range of IR values found in valid districting plans, we employ a Markov chain-based regionalization algorithm (ReCom) to produce ensembles of valid plans, and calculate the degree to which they capture spatial interaction communities. Furthermore, we propose two methods for biasing the ReCom algorithm towards different IR values. We perform a multi-criteria assessment of the space of valid maps, and present the results in an interactive web map. The experiments on Wisconsin congressional districting plans demonstrate the effectiveness of our methods for biasing sampling towards higher or lower IR values. Furthermore, the analysis of the districts produced with these methods suggests that districts with higher IR and compactness values tend to produce district plans that are more proportional with regards to seats allocated to each of the two major parties.
△ Less
Submitted 23 September, 2023;
originally announced September 2023.
-
CVNN-based Channel Estimation and Equalization in OFDM Systems Without Cyclic Prefix
Authors:
Heitor dos Santos Sousa,
Jonathan Aguiar Soares,
Kayol Soares Mayer,
Dalton Soares Arantes
Abstract:
In modern communication systems operating with Orthogonal Frequency-Division Multiplexing (OFDM), channel estimation requires minimal complexity with one-tap equalizers. However, this depends on cyclic prefixes, which must be sufficiently large to cover the channel impulse response. Conversely, the use of cyclic prefix (CP) decreases the useful information that can be conveyed in an OFDM frame, th…
▽ More
In modern communication systems operating with Orthogonal Frequency-Division Multiplexing (OFDM), channel estimation requires minimal complexity with one-tap equalizers. However, this depends on cyclic prefixes, which must be sufficiently large to cover the channel impulse response. Conversely, the use of cyclic prefix (CP) decreases the useful information that can be conveyed in an OFDM frame, thereby degrading the spectral efficiency of the system. In this context, we study the impact of CPs on channel estimation with complex-valued neural networks (CVNNs). We show that the phase-transmittance radial basis function neural network offers superior results, in terms of required energy per bit, compared to classical minimum mean-squared error and least squares algorithms in scenarios without CP.
△ Less
Submitted 25 August, 2023;
originally announced August 2023.
-
Information Rate-Harvested Power Tradeoff in THz SWIPT Systems Employing Resonant Tunnelling Diode-based EH Circuits
Authors:
Nikita Shanin,
Simone Clochiatti,
Kenneth M. Mayer,
Laura Cottatellucci,
Nils Weimann,
Robert Schober
Abstract:
In this paper, we study THz simultaneous wireless information and power transfer (SWIPT) systems. Since coherent information detection is challenging at THz frequencies and Schottky diodes may not be efficient for THz energy harvesting (EH) and information detection, we employ unipolar amplitude shift keying (ASK) modulation at the transmitter (TX) and a resonant tunnelling diode (RTD)-based EH ci…
▽ More
In this paper, we study THz simultaneous wireless information and power transfer (SWIPT) systems. Since coherent information detection is challenging at THz frequencies and Schottky diodes may not be efficient for THz energy harvesting (EH) and information detection, we employ unipolar amplitude shift keying (ASK) modulation at the transmitter (TX) and a resonant tunnelling diode (RTD)-based EH circuit at the receiver (RX) to extract both information and power from the RX signal. We model the dependence of the instantaneous output power at the RX on the instantaneous received power by a non-linear piecewise function, whose parameters are adjusted to fit circuit simulation results. To determine the rate-power tradeoff in THz SWIPT systems, we derive the distribution of the TX signal that maximizes the mutual information between the TX and RX signals subject to constraints on the required average harvested power at the RX and the peak signal amplitude at the TX. Since the computational complexity of maximizing the mutual information may be too high for real-time THz SWIPT systems, for high and low required average harvested powers, we also obtain the suboptimal input signal distribution that maximizes the achievable information rate numerically and in closed form, respectively. Furthermore, based on the obtained results, we propose a suboptimal closed-form TX distribution which also achieves a desired harvested power at the RX. Our simulation results show that a lower reverse current flow and a higher breakdown voltage of the employed RTD are preferable when the input signal power at the RX is low and high, respectively. Finally, we demonstrate that for low and high received signal powers, the rate-power tradeoff of THz SWIPT systems is determined by the peak amplitude of the TX signal and the maximum instantaneous harvested power, respectively.
△ Less
Submitted 25 July, 2024; v1 submitted 12 July, 2023;
originally announced July 2023.
-
Optimal Transmit Antenna Deployment and Power Allocation for Wireless Power Supply in an Indoor Space
Authors:
Kenneth M. Mayer,
Laura Cottatellucci,
Robert Schober
Abstract:
As Internet of Things (IoT) devices proliferate, sustainable methods for powering them are becoming indispensable. The wireless provision of power enables battery-free operation and is crucial for complying with weight and size restrictions. For the energy harvesting (EH) components of these devices to be small, a high operating frequency is necessary. In conjunction with a large transmit antenna,…
▽ More
As Internet of Things (IoT) devices proliferate, sustainable methods for powering them are becoming indispensable. The wireless provision of power enables battery-free operation and is crucial for complying with weight and size restrictions. For the energy harvesting (EH) components of these devices to be small, a high operating frequency is necessary. In conjunction with a large transmit antenna, the receivers may be located in the radiating near-field (Fresnel) region, e.g., in indoor scenarios. In this paper, we propose a wireless power transfer (WPT) system ensuring reliable supply of power to an arbitrary number of mobile, low-power, and single-antenna receivers, whose locations in a three-dimensional cuboid room are unknown. A max-min optimisation problem is formulated to determine the optimal transmit power distribution. We rigorously prove that the optimal transmit power distribution's support has a lower dimensionality than its domain and thus, the employment of a continuous aperture antenna, utilised in Holographic MIMO (HMIMO), is unnecessary in the context of the considered WPT problem. Indeed, deploying a discrete transmit antenna architecture, i.e., a transmit antenna array, is sufficient and our proposed solution provides the optimal transmit antenna deployment and power allocation. Moreover, for a one-dimensional transmit antenna architecture, a finite number of transmit antennas is proven to be optimal. The proposed optimal solution is validated through computer simulations. Our simulation results indicate that the optimal transmit antenna architecture requires a finite number of transmit antennas and depends on the geometry of the environment and the dimensionality of the transmit antenna array.
△ Less
Submitted 1 June, 2024; v1 submitted 5 July, 2023;
originally announced July 2023.
-
Points for Energy Renovation (PointER): A LiDAR-Derived Point Cloud Dataset of One Million English Buildings Linked to Energy Characteristics
Authors:
Sebastian Krapf,
Kevin Mayer,
Martin Fischer
Abstract:
Rapid renovation of Europe's inefficient buildings is required to reduce climate change. However, analyzing and evaluating buildings at scale is challenging because every building is unique. In current practice, the energy performance of buildings is assessed during on-site visits, which are slow, costly, and local. This paper presents a building point cloud dataset that promotes a data-driven, la…
▽ More
Rapid renovation of Europe's inefficient buildings is required to reduce climate change. However, analyzing and evaluating buildings at scale is challenging because every building is unique. In current practice, the energy performance of buildings is assessed during on-site visits, which are slow, costly, and local. This paper presents a building point cloud dataset that promotes a data-driven, large-scale understanding of the 3D representation of buildings and their energy characteristics. We generate building point clouds by intersecting building footprints with geo-referenced LiDAR data and link them with attributes from UK's energy performance database via the Unique Property Reference Number (UPRN). To achieve a representative sample, we select one million buildings from a range of rural and urban regions across England, of which half a million are linked to energy characteristics. Building point clouds in new regions can be generated with the open-source code published alongside the paper. The dataset enables novel research in building energy modeling and can be easily expanded to other research fields by adding building features via the UPRN or geo-location.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
Resonant Tunneling Diode-Based THz SWIPT for Microscopic 6G IoT Devices
Authors:
Nikita Shanin,
Simone Clochiatti,
Kenneth M. Mayer,
Laura Cottatellucci,
Nils Weimann,
Robert Schober
Abstract:
In this paper, we study terahertz (THz) simultaneous wireless information and power transfer (SWIPT) for future micro-scale 6G Internet-of-Things (IoT) networks. Since Schottky diodes are not efficient for THz energy harvesting (EH), we propose resonant tunneling diodes (RTDs) for EH at the IoT receiver (RX). As the electrical properties of RTDs are different from those of Schottky diodes, we deve…
▽ More
In this paper, we study terahertz (THz) simultaneous wireless information and power transfer (SWIPT) for future micro-scale 6G Internet-of-Things (IoT) networks. Since Schottky diodes are not efficient for THz energy harvesting (EH), we propose resonant tunneling diodes (RTDs) for EH at the IoT receiver (RX). As the electrical properties of RTDs are different from those of Schottky diodes, we develop a novel closed-form EH model for RTD-based RXs. In particular, we model the dependency of the instantaneous RX output power on the instantaneous received power by a non-linear piecewise function, whose parameters are adjusted to fit circuit simulation results. Furthermore, since coherent information detection is challenging at THz frequencies, we employ unipolar amplitude shift keying (ASK) modulation at the transmitter (TX) and utilize the RTD-based EH circuit at the RX to extract both information and energy from the received signal. We formulate an optimization problem to maximize the mutual information between the TX and RX signals subject to constraints on the peak amplitude of the transmitted signal and the required average harvested power at the RX. Moreover, we determine a feasibility condition for the formulated problem and, for high and low required average harvested powers, we derive the achievable information rate numerically and in closed form, respectively. Our simulation results highlight a tradeoff between the information rate and the average harvested power. Finally, we show that this tradeoff is determined by the peak amplitude of the transmitted signal and the maximum instantaneous harvested power for low and high received signal powers, respectively.
△ Less
Submitted 15 August, 2023; v1 submitted 5 May, 2023;
originally announced May 2023.
-
GPT-4 Technical Report
Authors:
OpenAI,
Josh Achiam,
Steven Adler,
Sandhini Agarwal,
Lama Ahmad,
Ilge Akkaya,
Florencia Leoni Aleman,
Diogo Almeida,
Janko Altenschmidt,
Sam Altman,
Shyamal Anadkat,
Red Avila,
Igor Babuschkin,
Suchir Balaji,
Valerie Balcom,
Paul Baltescu,
Haiming Bao,
Mohammad Bavarian,
Jeff Belgum,
Irwan Bello,
Jake Berdine,
Gabriel Bernadett-Shapiro,
Christopher Berner,
Lenny Bogdonoff,
Oleg Boiko
, et al. (256 additional authors not shown)
Abstract:
We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo…
▽ More
We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based model pre-trained to predict the next token in a document. The post-training alignment process results in improved performance on measures of factuality and adherence to desired behavior. A core component of this project was developing infrastructure and optimization methods that behave predictably across a wide range of scales. This allowed us to accurately predict some aspects of GPT-4's performance based on models trained with no more than 1/1,000th the compute of GPT-4.
△ Less
Submitted 4 March, 2024; v1 submitted 15 March, 2023;
originally announced March 2023.
-
Optimal Antenna Placement for Two-Antenna Near-Field Wireless Power Transfer
Authors:
Kenneth MacSporran Mayer,
Laura Cottatellucci,
Robert Schober
Abstract:
Current trends in communication system design precipitate a change in the operating regime from the traditional far-field to the radiating near-field (Fresnel) region. We investigate the optimal transmit antenna placement for a multiple-input single-output (MISO) wireless power transfer (WPT) system designed for a three-dimensional cuboid room under line-of-sight (LoS) conditions in the Fresnel re…
▽ More
Current trends in communication system design precipitate a change in the operating regime from the traditional far-field to the radiating near-field (Fresnel) region. We investigate the optimal transmit antenna placement for a multiple-input single-output (MISO) wireless power transfer (WPT) system designed for a three-dimensional cuboid room under line-of-sight (LoS) conditions in the Fresnel region. We formulate an optimisation problem for maximising the received power at the worst possible receiver location by considering the spherical nature of the electromagnetic (EM) wavefronts in the Fresnel region while assuming perfect knowledge of the channel at the transmitter. For the case of two transmit antennas, we derive a closed-form expression for the optimal positioning of the antennas which is purely determined by the geometry of the environment. If the room contains locations where the far-field approximation holds, the proposed positioning is shown to reduce to the far-field solution. The analytical solution is validated through simulation. Furthermore, the maximum received power at the locations yielding the worst performance is quantified and the power gain over the optimal far-field solution is presented. For the considered cuboid environment, we show that a distributed antenna system is optimal in the Fresnel region, whereas a co-located antenna architecture is ideal for the far-field.
△ Less
Submitted 31 October, 2022;
originally announced October 2022.
-
PCA-based Channel Estimation for MIMO Communications
Authors:
Jonathan Aguiar Soares,
Kayol Soares Mayer,
Pedro Benevenuto Valadares,
Dalton Soares Arantes
Abstract:
In multiple-input multiple-output communications, channel estimation is paramount to keep base stations and users on track. This paper proposes a novel PCA-based-principal component analysis-channel estimation approach for MIMO orthogonal frequency division multiplexing systems. The channel frequency response is firstly estimated with the least squares method, and then PCA is used to filter only t…
▽ More
In multiple-input multiple-output communications, channel estimation is paramount to keep base stations and users on track. This paper proposes a novel PCA-based-principal component analysis-channel estimation approach for MIMO orthogonal frequency division multiplexing systems. The channel frequency response is firstly estimated with the least squares method, and then PCA is used to filter only the higher singular components of the channel impulse response, which is then converted back to the frequency domain. The proposed approach is compared with the MMSE, the minimum mean square error estimation, in terms of bit error rate versus Eb/N0.
△ Less
Submitted 27 September, 2022;
originally announced September 2022.
-
Estimating building energy efficiency from street view imagery, aerial imagery, and land surface temperature data
Authors:
Kevin Mayer,
Lukas Haas,
Tianyuan Huang,
Juan Bernabé-Moreno,
Ram Rajagopal,
Martin Fischer
Abstract:
Current methods to determine the energy efficiency of buildings require on-site visits of certified energy auditors which makes the process slow, costly, and geographically incomplete. To accelerate the identification of promising retrofit targets on a large scale, we propose to estimate building energy efficiency from widely available and remotely sensed data sources only, namely street view, aer…
▽ More
Current methods to determine the energy efficiency of buildings require on-site visits of certified energy auditors which makes the process slow, costly, and geographically incomplete. To accelerate the identification of promising retrofit targets on a large scale, we propose to estimate building energy efficiency from widely available and remotely sensed data sources only, namely street view, aerial view, footprint, and satellite-borne land surface temperature (LST) data. After collecting data for almost 40,000 buildings in the United Kingdom, we combine these data sources by training multiple end-to-end deep learning models with the objective to classify buildings as energy efficient (EU rating A-D) or inefficient (EU rating E-G). After evaluating the trained models quantitatively as well as qualitatively, we extend our analysis by studying the predictive power of each data source in an ablation study. We find that the end-to-end deep learning model trained on all four data sources achieves a macro-averaged F1 score of 64.64% and outperforms the k-NN and SVM-based baseline models by 14.13 to 12.02 percentage points, respectively. Thus, this work shows the potential and complementary nature of remotely sensed data in predicting energy efficiency and opens up new opportunities for future work to integrate additional data sources.
△ Less
Submitted 24 August, 2022; v1 submitted 5 June, 2022;
originally announced June 2022.
-
Evaluating Large Language Models Trained on Code
Authors:
Mark Chen,
Jerry Tworek,
Heewoo Jun,
Qiming Yuan,
Henrique Ponde de Oliveira Pinto,
Jared Kaplan,
Harri Edwards,
Yuri Burda,
Nicholas Joseph,
Greg Brockman,
Alex Ray,
Raul Puri,
Gretchen Krueger,
Michael Petrov,
Heidy Khlaaf,
Girish Sastry,
Pamela Mishkin,
Brooke Chan,
Scott Gray,
Nick Ryder,
Mikhail Pavlov,
Alethea Power,
Lukasz Kaiser,
Mohammad Bavarian,
Clemens Winter
, et al. (33 additional authors not shown)
Abstract:
We introduce Codex, a GPT language model fine-tuned on publicly available code from GitHub, and study its Python code-writing capabilities. A distinct production version of Codex powers GitHub Copilot. On HumanEval, a new evaluation set we release to measure functional correctness for synthesizing programs from docstrings, our model solves 28.8% of the problems, while GPT-3 solves 0% and GPT-J sol…
▽ More
We introduce Codex, a GPT language model fine-tuned on publicly available code from GitHub, and study its Python code-writing capabilities. A distinct production version of Codex powers GitHub Copilot. On HumanEval, a new evaluation set we release to measure functional correctness for synthesizing programs from docstrings, our model solves 28.8% of the problems, while GPT-3 solves 0% and GPT-J solves 11.4%. Furthermore, we find that repeated sampling from the model is a surprisingly effective strategy for producing working solutions to difficult prompts. Using this method, we solve 70.2% of our problems with 100 samples per problem. Careful investigation of our model reveals its limitations, including difficulty with docstrings describing long chains of operations and with binding operations to variables. Finally, we discuss the potential broader impacts of deploying powerful code generation technologies, covering safety, security, and economics.
△ Less
Submitted 14 July, 2021; v1 submitted 7 July, 2021;
originally announced July 2021.
-
An Enriched Automated PV Registry: Combining Image Recognition and 3D Building Data
Authors:
Benjamin Rausch,
Kevin Mayer,
Marie-Louise Arlt,
Gunther Gust,
Philipp Staudt,
Christof Weinhardt,
Dirk Neumann,
Ram Rajagopal
Abstract:
While photovoltaic (PV) systems are installed at an unprecedented rate, reliable information on an installation level remains scarce. As a result, automatically created PV registries are a timely contribution to optimize grid planning and operations. This paper demonstrates how aerial imagery and three-dimensional building data can be combined to create an address-level PV registry, specifying are…
▽ More
While photovoltaic (PV) systems are installed at an unprecedented rate, reliable information on an installation level remains scarce. As a result, automatically created PV registries are a timely contribution to optimize grid planning and operations. This paper demonstrates how aerial imagery and three-dimensional building data can be combined to create an address-level PV registry, specifying area, tilt, and orientation angles. We demonstrate the benefits of this approach for PV capacity estimation. In addition, this work presents, for the first time, a comparison between automated and officially-created PV registries. Our results indicate that our enriched automated registry proves to be useful to validate, update, and complement official registries.
△ Less
Submitted 7 December, 2020;
originally announced December 2020.