Zum Hauptinhalt springen

Showing 1–15 of 15 results for author: Palmer, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.07745  [pdf, other

    cs.LG

    Deep Reinforcement Learning for Autonomous Cyber Operations: A Survey

    Authors: Gregory Palmer, Chris Parry, Daniel J. B. Harrold, Chris Willis

    Abstract: The rapid increase in the number of cyber-attacks in recent years raises the need for principled methods for defending networks against malicious actors. Deep reinforcement learning (DRL) has emerged as a promising approach for mitigating these attacks. However, while DRL has shown much potential for cyber-defence, numerous challenges must be overcome before DRL can be applied to autonomous cyber-… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 60 pages, 14 figures, 3 tables

  2. arXiv:2211.01704  [pdf

    eess.SP cs.LG cs.SD eess.AS

    Cutting Through the Noise: An Empirical Comparison of Psychoacoustic and Envelope-based Features for Machinery Fault Detection

    Authors: Peter Wißbrock, Yvonne Richter, David Pelkmann, Zhao Ren, Gregory Palmer

    Abstract: Acoustic-based fault detection has a high potential to monitor the health condition of mechanical parts. However, the background noise of an industrial environment may negatively influence the performance of fault detection. Limited attention has been paid to improving the robustness of fault detection against industrial environmental noise. Therefore, we present the Lenze production background-no… ▽ More

    Submitted 13 March, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: the final published version at ICASSP 2023 include small additional content as well as some minor revisions

  3. arXiv:2210.17227  [pdf, other

    cs.NI math.OC

    Modelling M/M/R-JSQ-PS sojourn time distribution for Ultra-Reliable Low Latency Communication services

    Authors: Geraint I. Palmer, Jorge Martín-Pérez

    Abstract: The future Internet promises to support time-sensitive services that require ultra low latencies and reliabilities of 99.99%. Recent advances in cellular and WiFi connections enhance the network to meet high reliability and ultra low latencies. However, the aforementioned services require that the server processing time ensures low latencies with high reliability, otherwise the end-to-end performa… ▽ More

    Submitted 22 December, 2022; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: 14 Pages, 10 figures, submitted to Elsevier European Journal of Operational Research

  4. arXiv:2207.04515  [pdf, other

    cs.AI cs.SE

    Developing an AI-enabled IIoT platform -- Lessons learned from early use case validation

    Authors: Holger Eichelberger, Gregory Palmer, Svenja Reimer, Tat Trong Vu, Hieu Do, Sofiane Laridi, Alexander Weber, Claudia Niederée, Thomas Hildebrandt

    Abstract: For a broader adoption of AI in industrial production, adequate infrastructure capabilities are crucial. This includes easing the integration of AI with industrial devices, support for distributed deployment, monitoring, and consistent system configuration. Existing IIoT platforms still lack required capabilities to flexibly integrate reusable AI services and relevant standards such as Asset Admin… ▽ More

    Submitted 10 July, 2022; originally announced July 2022.

    Comments: 16 pages, 5 figures

  5. arXiv:2207.03352  [pdf, other

    q-fin.TR cs.AI cs.LG

    Market Making with Scaled Beta Policies

    Authors: Joseph Jerome, Gregory Palmer, Rahul Savani

    Abstract: This paper introduces a new representation for the actions of a market maker in an order-driven market. This representation uses scaled beta distributions, and generalises three approaches taken in the artificial intelligence for market making literature: single price-level selection, ladder strategies and "market making at the touch". Ladder strategies place uniform volume across an interval of c… ▽ More

    Submitted 27 September, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

  6. arXiv:2205.09550  [pdf, other

    cs.LG

    Data Valuation for Offline Reinforcement Learning

    Authors: Amir Abolfazli, Gregory Palmer, Daniel Kudenko

    Abstract: The success of deep reinforcement learning (DRL) hinges on the availability of training data, which is typically obtained via a large number of environment interactions. In many real-world scenarios, costs and risks are associated with gathering these data. The field of offline reinforcement learning addresses these issues through outsourcing the collection of data to a domain expert or a carefull… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: 9 pages, 3 figures, 2 tables

  7. arXiv:2205.02827  [pdf, other

    cs.LG

    Identifying Cause-and-Effect Relationships of Manufacturing Errors using Sequence-to-Sequence Learning

    Authors: Jeff Reimer, Yandong Wang, Sofiane Laridi, Juergen Urdich, Sören Wilmsmeier, Gregory Palmer

    Abstract: In car-body production the pre-formed sheet metal parts of the body are assembled on fully-automated production lines. The body passes through multiple stations in succession, and is processed according to the order requirements. The timely completion of orders depends on the individual station-based operations concluding within their scheduled cycle times. If an error occurs in one station, it ca… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

    Comments: 11 pages, 5 figures, 2 tables

  8. arXiv:2203.04696  [pdf, other

    cs.SD cs.LG eess.AS

    Robust Federated Learning Against Adversarial Attacks for Speech Emotion Recognition

    Authors: Yi Chang, Sofiane Laridi, Zhao Ren, Gregory Palmer, Björn W. Schuller, Marco Fisichella

    Abstract: Due to the development of machine learning and speech processing, speech emotion recognition has been a popular research topic in recent years. However, the speech data cannot be protected when it is uploaded and processed on servers in the internet-of-things applications of speech emotion recognition. Furthermore, deep neural networks have proven to be vulnerable to human-indistinguishable advers… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: 11 pages, 6 figures, 3 tables

  9. arXiv:2007.04611  [pdf, other

    cs.CY cs.LG

    A deep learning approach to identify unhealthy advertisements in street view images

    Authors: Gregory Palmer, Mark Green, Emma Boyland, Yales Stefano Rios Vasconcelos, Rahul Savani, Alex Singleton

    Abstract: While outdoor advertisements are common features within towns and cities, they may reinforce social inequalities in health. Vulnerable populations in deprived areas may have greater exposure to fast food, gambling and alcohol advertisements encouraging their consumption. Understanding who is exposed and evaluating potential policy restrictions requires a substantial manual data collection effort.… ▽ More

    Submitted 7 February, 2021; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: 13 pages, 5 figures, 3 table. To appear in Nature Scientific Reports

  10. arXiv:2002.09406  [pdf, other

    cs.CV

    The Automated Inspection of Opaque Liquid Vaccines

    Authors: Gregory Palmer, Benjamin Schnieders, Rahul Savani, Karl Tuyls, Joscha-David Fossel, Harry Flore

    Abstract: In the pharmaceutical industry the screening of opaque vaccines containing suspensions is currently a manual task carried out by trained human visual inspectors. We show that deep learning can be used to effectively automate this process. A moving contrast is required to distinguish anomalies from other particles, reflections and dust resting on a vial's surface. We train 3D-ConvNets to predict th… ▽ More

    Submitted 21 February, 2020; originally announced February 2020.

    Comments: 8 pages, 5 Figures, 3 Tables, ECAI 2020 Conference Proceedings

  11. arXiv:1903.00683  [pdf, other

    cs.RO

    Fully Convolutional One-Shot Object Segmentation for Industrial Robotics

    Authors: Benjamin Schnieders, Shan Luo, Gregory Palmer, Karl Tuyls

    Abstract: The ability to identify and localize new objects robustly and effectively is vital for robotic grasping and manipulation in warehouses or smart factories. Deep convolutional neural networks (DCNNs) have achieved the state-of-the-art performance on established image datasets for object detection and segmentation. However, applying DCNNs in dynamic industrial scenarios, e.g., warehouses and autonomo… ▽ More

    Submitted 2 March, 2019; originally announced March 2019.

    Comments: International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2019), 9 pages

  12. arXiv:1809.05096  [pdf, other

    cs.MA cs.AI cs.LG

    Negative Update Intervals in Deep Multi-Agent Reinforcement Learning

    Authors: Gregory Palmer, Rahul Savani, Karl Tuyls

    Abstract: In Multi-Agent Reinforcement Learning (MA-RL), independent cooperative learners must overcome a number of pathologies to learn optimal joint policies. Addressing one pathology often leaves approaches vulnerable towards others. For instance, hysteretic Q-learning addresses miscoordination while leaving agents vulnerable towards misleading stochastic rewards. Other methods, such as leniency, have pr… ▽ More

    Submitted 7 May, 2019; v1 submitted 13 September, 2018; originally announced September 2018.

    Comments: 11 Pages, 6 Figures, AAMAS2019 Conference Proceedings

  13. arXiv:1710.03561  [pdf, other

    cs.PF

    Ciw: An open source discrete event simulation library

    Authors: Geraint I. Palmer, Vincent A. Knight, Paul R. Harper, Asyl L. Hawa

    Abstract: This paper introduces Ciw, an open source library for conducting discrete event simulations that has been developed in Python. The strengths of the library are illustrated in terms of best practice and reproducibility for computational research. An analysis of Ciw's performance and comparison to several alternative discrete event simulation frameworks is presented.

    Submitted 27 September, 2017; originally announced October 2017.

  14. arXiv:1707.04402  [pdf, other

    cs.MA cs.AI cs.LG

    Lenient Multi-Agent Deep Reinforcement Learning

    Authors: Gregory Palmer, Karl Tuyls, Daan Bloembergen, Rahul Savani

    Abstract: Much of the success of single agent deep reinforcement learning (DRL) in recent years can be attributed to the use of experience replay memories (ERM), which allow Deep Q-Networks (DQNs) to be trained efficiently through sampling stored state transitions. However, care is required when using ERMs for multi-agent deep reinforcement learning (MA-DRL), as stored transitions can become outdated becaus… ▽ More

    Submitted 27 February, 2018; v1 submitted 14 July, 2017; originally announced July 2017.

    Comments: 9 pages, 6 figures, AAMAS2018 Conference Proceedings

  15. arXiv:1604.00896  [pdf, other

    cs.GT physics.soc-ph

    An open reproducible framework for the study of the iterated prisoner's dilemma

    Authors: Vincent Knight, Owen Campbell, Marc Harper, Karol Langner, James Campbell, Thomas Campbell, Alex Carney, Martin Chorley, Cameron Davidson-Pilon, Kristian Glass, Nikoleta Glynatsi, Tomáš Ehrlich, Martin Jones, Georgios Koutsovoulos, Holly Tibble, Müller Jochen, Geraint Palmer, Piotr Petunov, Paul Slavin, Timothy Standen, Luis Visintini, Karl Molden

    Abstract: The Axelrod library is an open source Python package that allows for reproducible game theoretic research into the Iterated Prisoner's Dilemma. This area of research began in the 1980s but suffers from a lack of documentation and test code. The goal of the library is to provide such a resource, with facilities for the design of new strategies and interactions between them, as well as conducting to… ▽ More

    Submitted 20 December, 2016; v1 submitted 4 April, 2016; originally announced April 2016.

    Comments: 11 pages, Journal of Open Research Software 4.1 (2016)