Showing 1–50 of 69 results for author: Herrera, F

Searching in archive cs.
  1. arXiv:2407.14210  [pdf, other]

    cs.LG cs.AI

    Fair Overlap Number of Balls (Fair-ONB): A Data-Morphology-based Undersampling Method for Bias Reduction

    Authors: José Daniel Pascual-Triana, Alberto Fernández, Paulo Novais, Francisco Herrera

    Abstract: Given the magnitude of data generation currently, both in quantity and speed, the use of machine learning is increasingly important. When data include protected features that might give rise to discrimination, special care must be taken. Data quality is critical in these cases, as biases in training data can be reflected in classification models. This has devastating consequences and fails to comp…

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: 16 pages, 9 tables, 10 figures

  2. arXiv:2407.08745  [pdf, other]

    cs.NE cs.AI

    Evolutionary Computation for the Design and Enrichment of General-Purpose Artificial Intelligence Systems: Survey and Prospects

    Authors: Javier Poyatos, Javier Del Ser, Salvador Garcia, Hisao Ishibuchi, Daniel Molina, Isaac Triguero, Bing Xue, Xin Yao, Francisco Herrera

    Abstract: In Artificial Intelligence, there is an increasing demand for adaptive models capable of dealing with a diverse spectrum of learning tasks, surpassing the limitations of systems devised to cope with a single task. The recent emergence of General-Purpose Artificial Intelligence Systems (GPAIS) poses model configuration and adaptability challenges at far greater complexity scales than the optimal de…

    Submitted 3 June, 2024; originally announced July 2024.

  3. arXiv:2406.11772  [pdf, other]

    cs.CV cs.AI

    Deep Learning methodology for the identification of wood species using high-resolution macroscopic images

    Authors: David Herrera-Poyatos, Andrés Herrera-Poyatos, Rosana Montes, Paloma de Palacios, Luis G. Esteban, Alberto García Iruela, Francisco García Fernández, Francisco Herrera

    Abstract: Significant advancements in the field of wood species identification are needed worldwide to support sustainable timber trade. In this work we contribute to automate the identification of wood species via high-resolution macroscopic images of timber. The main challenge of this problem is that fine-grained patterns in timber are crucial in order to accurately identify wood species, and these patter…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 17 pages and 6 figures

    ACM Class: I.2.1; I.2.10

  4. arXiv:2405.12326  [pdf, other]

    cs.LG

    Overlap Number of Balls Model-Agnostic CounterFactuals (ONB-MACF): A Data-Morphology-based Counterfactual Generation Method for Trustworthy Artificial Intelligence

    Authors: José Daniel Pascual-Triana, Alberto Fernández, Javier Del Ser, Francisco Herrera

    Abstract: Explainable Artificial Intelligence (XAI) is a pivotal research domain aimed at understanding the operational mechanisms of AI systems, particularly those considered "black boxes" due to their complex, opaque nature. XAI seeks to make these AI systems more understandable and trustworthy, providing insight into their decision-making processes. By producing clear and comprehensible explanations, X…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 21 pages, 6 figures. Submitted to Information Sciences

  5. arXiv:2404.06127  [pdf]

    cs.CR cs.AI

    FLEX: FLEXible Federated Learning Framework

    Authors: Francisco Herrera, Daniel Jiménez-López, Alberto Argente-Garrido, Nuria Rodríguez-Barroso, Cristina Zuheros, Ignacio Aguilera-Martos, Beatriz Bello, Mario García-Márquez, M. Victoria Luzón

    Abstract: In the realm of Artificial Intelligence (AI), the need for privacy and security in data processing has become paramount. As AI applications continue to expand, the collection and handling of sensitive data raise concerns about individual privacy protection. Federated Learning (FL) emerges as a promising solution to address these challenges by enabling decentralized model training on local devices,…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Submitted to Information Fusion

  6. arXiv:2404.02611  [pdf, other]

    cs.AI

    SHIELD: A regularization technique for eXplainable Artificial Intelligence

    Authors: Iván Sevillano-García, Julián Luengo, Francisco Herrera

    Abstract: As Artificial Intelligence systems become integral across domains, the demand for explainability grows. While the effort by the scientific community is focused on obtaining a better explanation for the model, it is important not to ignore the potential of this explanation process to improve training as well. While existing efforts primarily focus on generating and evaluating explanations for black…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 18 pages, 8 figures

    ACM Class: I.2.6

  7. arXiv:2404.02510  [pdf, other]

    cs.LG cs.AI

    An Interpretable Client Decision Tree Aggregation process for Federated Learning

    Authors: Alberto Argente-Garrido, Cristina Zuheros, M. Victoria Luzón, Francisco Herrera

    Abstract: Trustworthy Artificial Intelligence solutions are essential in today's data-driven applications, prioritizing principles such as robustness, safety, transparency, explainability, and privacy among others. This has led to the emergence of Federated Learning as a solution for privacy and distributed machine learning. While decision trees, as self-explanatory models, are ideal for collaborative model…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Submitted to Information Science Journal

  8. arXiv:2403.15587  [pdf, other]

    cs.AI

    Large language models for crowd decision making based on prompt design strategies using ChatGPT: models, analysis and challenges

    Authors: Cristina Zuheros, David Herrera-Poyatos, Rosana Montes, Francisco Herrera

    Abstract: Social Media and Internet have the potential to be exploited as a source of opinion to enrich Decision Making solutions. Crowd Decision Making (CDM) is a methodology able to infer opinions and decisions from plain texts, such as reviews published in social media platforms, by means of Sentiment Analysis. Currently, the emergence and potential of Large Language Models (LLMs) lead us to explore new…

    Submitted 22 March, 2024; originally announced March 2024.

  9. Teranga Go!: Carpooling Collaborative Consumption Community with multi-criteria hesitant fuzzy linguistic term set opinions to build confidence and trust

    Authors: Rosana Montes, Ana M. Sanchez, Pedro Villar, Francisco Herrera

    Abstract: Classic Delphi and Fuzzy Delphi methods are used to test content validity of data collection tools such as questionnaires. Fuzzy Delphi takes the opinion issued by judges from a linguistic perspective reducing ambiguity in opinions by using fuzzy numbers. We propose an extension named 2-Tuple Fuzzy Linguistic Delphi method to deal with scenarios in which judges show different expertise degrees b…

    Submitted 7 February, 2024; originally announced March 2024.

    Comments: project at https://github.com/rosanamontes/teranga.go. arXiv admin note: substantial text overlap with arXiv:2402.01775

    Journal ref: Applied Soft Computing 67, 2018, Pages 941-952

  10. Design and consensus content validity of the questionnaire for b-learning education: A 2-Tuple Fuzzy Linguistic Delphi based Decision Support Tool

    Authors: Rosana Montes, Cristina Zuheros, Jeovani M. Morales, Noe Zermeño, Jerónimo Duran, Francisco Herrera

    Abstract: Classic Delphi and Fuzzy Delphi methods are used to test content validity of data collection tools such as questionnaires. Fuzzy Delphi takes the opinion issued by judges from a linguistic perspective reducing ambiguity in opinions by using fuzzy numbers. We propose an extension named 2-Tuple Fuzzy Linguistic Delphi method to deal with scenarios in which judges show different expertise degrees by…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 47 pages, 7 figures

    Journal ref: Open Access Volume 147 November 2023 Article number 110755

  11. Explainable Artificial Intelligence (XAI) 2.0: A Manifesto of Open Challenges and Interdisciplinary Research Directions

    Authors: Luca Longo, Mario Brcic, Federico Cabitza, Jaesik Choi, Roberto Confalonieri, Javier Del Ser, Riccardo Guidotti, Yoichi Hayashi, Francisco Herrera, Andreas Holzinger, Richard Jiang, Hassan Khosravi, Freddy Lecue, Gianclaudio Malgieri, Andrés Páez, Wojciech Samek, Johannes Schneider, Timo Speith, Simone Stumpf

    Abstract: As systems based on opaque Artificial Intelligence (AI) continue to flourish in diverse real-world applications, understanding these black box models has become paramount. In response, Explainable AI (XAI) has emerged as a field of research with practical and ethical benefits across various domains. This paper not only highlights the advancements in XAI and its application in real-world scenarios…

    Submitted 30 October, 2023; originally announced October 2023.

    ACM Class: F.2.0; H.1.2; I.2; I.2.6; K.4; K.5

    Journal ref: Information Fusion 2024

  12. General Purpose Artificial Intelligence Systems (GPAIS): Properties, Definition, Taxonomy, Societal Implications and Responsible Governance

    Authors: Isaac Triguero, Daniel Molina, Javier Poyatos, Javier Del Ser, Francisco Herrera

    Abstract: Most applications of Artificial Intelligence (AI) are designed for a confined and specific task. However, there are many scenarios that call for a more general AI, capable of solving a wide array of tasks without being specifically designed for them. The term General-Purpose Artificial Intelligence Systems (GPAIS) has been defined to refer to these AI systems. To date, the possibility of an Artifi…

    Submitted 3 November, 2023; v1 submitted 26 July, 2023; originally announced July 2023.

    Journal ref: Information Fusion, Volume 103, March 2024, 102135

  13. arXiv:2305.02231  [pdf, other]

    cs.CY cs.AI cs.LG

    Connecting the Dots in Trustworthy Artificial Intelligence: From AI Principles, Ethics, and Key Requirements to Responsible AI Systems and Regulation

    Authors: Natalia Díaz-Rodríguez, Javier Del Ser, Mark Coeckelbergh, Marcos López de Prado, Enrique Herrera-Viedma, Francisco Herrera

    Abstract: Trustworthy Artificial Intelligence (AI) is based on seven technical requirements sustained over three main pillars that should be met throughout the system's entire life cycle: it should be (1) lawful, (2) ethical, and (3) robust, both from a technical and a social perspective. However, attaining truly trustworthy AI concerns a wider vision that comprises the trustworthiness of all processes and…

    Submitted 12 June, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: 30 pages, 5 figures, under second review

    MSC Class: 68T01 ACM Class: I.2; K.4; K.5

  14. Multiobjective Evolutionary Pruning of Deep Neural Networks with Transfer Learning for improving their Performance and Robustness

    Authors: Javier Poyatos, Daniel Molina, Aitor Martínez, Javier Del Ser, Francisco Herrera

    Abstract: Evolutionary Computation algorithms have been used to solve optimization problems in relation with architectural, hyper-parameter or training configuration, forging the field known today as Neural Architecture Search. These algorithms have been combined with other techniques such as the pruning of Neural Networks, which reduces the complexity of the network, and the Transfer Learning, which lets t…

    Submitted 5 February, 2024; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: 28 pages, 11 figures

    ACM Class: I.2; I.4

    Journal ref: Applied Soft Computing, 147 (2023), 110757

  15. arXiv:2211.06154  [pdf, other]

    cs.AI

    REVEL Framework to measure Local Linear Explanations for black-box models: Deep Learning Image Classification case of study

    Authors: Iván Sevillano-García, Julián Luengo-Martín, Francisco Herrera

    Abstract: Explainable artificial intelligence is proposed to provide explanations for reasoning performed by an Artificial Intelligence. There is no consensus on how to evaluate the quality of these explanations, since even the definition of explanation itself is not clear in the literature. In particular, for the widely known Local Linear Explanations, there are qualitative proposals for the evaluation of…

    Submitted 11 November, 2022; originally announced November 2022.

  16. arXiv:2209.02048  [pdf, other]

    eess.IV cs.CV cs.LG

    Fuzzy Attention Neural Network to Tackle Discontinuity in Airway Segmentation

    Authors: Yang Nan, Javier Del Ser, Zeyu Tang, Peng Tang, Xiaodan Xing, Yingying Fang, Francisco Herrera, Witold Pedrycz, Simon Walsh, Guang Yang

    Abstract: Airway segmentation is crucial for the examination, diagnosis, and prognosis of lung diseases, while its manual delineation is unduly burdensome. To alleviate this time-consuming and potentially subjective manual procedure, researchers have proposed methods to automatically segment airways from computerized tomography (CT) images. However, some small-sized airway branches (e.g., bronchus and termi…

    Submitted 9 September, 2022; v1 submitted 5 September, 2022; originally announced September 2022.

    Comments: 12 pages, 5 figures, Submitted to IEEE TNNLS

  17. arXiv:2206.03179  [pdf, other]

    cs.NE cs.AI

    TSFEDL: A Python Library for Time Series Spatio-Temporal Feature Extraction and Prediction using Deep Learning (with Appendices on Detailed Network Architectures and Experimental Cases of Study)

    Authors: Ignacio Aguilera-Martos, Ángel M. García-Vico, Julián Luengo, Sergio Damas, Francisco J. Melero, José Javier Valle-Alonso, Francisco Herrera

    Abstract: The combination of convolutional and recurrent neural networks is a promising framework that allows the extraction of high-quality spatio-temporal features together with its temporal dependencies, which is key for time series prediction problems such as forecasting, classification or anomaly detection, amongst others. In this paper, the TSFEDL library is introduced. It compiles 20 state-of-the-art…

    Submitted 8 June, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: 26 pages, 33 figures

  18. arXiv:2205.10232  [pdf, other]

    cs.LG cs.AI

    Exploring the Trade-off between Plausibility, Change Intensity and Adversarial Power in Counterfactual Explanations using Multi-objective Optimization

    Authors: Javier Del Ser, Alejandro Barredo-Arrieta, Natalia Díaz-Rodríguez, Francisco Herrera, Andreas Holzinger

    Abstract: There is a broad consensus on the importance of deep learning models in tasks involving complex data. Often, an adequate understanding of these models is required when focusing on the transparency of decisions in human-critical applications. Besides other explainability techniques, trustworthiness can be achieved by using counterfactuals, like the way a human becomes familiar with an unknown proce…

    Submitted 20 May, 2022; originally announced May 2022.

    Comments: 52 pages, 14 figures, under review

  19. arXiv:2204.11832  [pdf]

    cs.LG physics.chem-ph physics.optics

    Machine learning identification of organic compounds using visible light

    Authors: Thulasi Bikku, Rubén A. Fritz, Yamil J. Colón, Felipe Herrera

    Abstract: Identifying chemical compounds is essential in several areas of science and engineering. Laser-based techniques are promising for autonomous compound detection because the optical response of materials encodes enough electronic and vibrational information for remote chemical identification. This has been exploited using the fingerprint region of infrared absorption spectra, which involves a dense…

    Submitted 13 June, 2022; v1 submitted 6 April, 2022; originally announced April 2022.

    Comments: 18 pages, 7 figures. Open database and python code. Version adds comparison with Raman classifiers (Table 1)

    Journal ref: J. Phys. Chem. A 127, 2407, 2023

  20. Handling Imbalanced Classification Problems With Support Vector Machines via Evolutionary Bilevel Optimization

    Authors: Alejandro Rosales-Pérez, Salvador García, Francisco Herrera

    Abstract: Support vector machines (SVMs) are popular learning algorithms to deal with binary classification problems. They traditionally assume equal misclassification costs for each class; however, real-world problems may have an uneven class distribution. This article introduces EBCS-SVM: evolutionary bilevel cost-sensitive SVMs. EBCS-SVM handles imbalanced classification problems by simultaneously learni…

    Submitted 21 April, 2022; originally announced April 2022.

    Comments: Copyright 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

    MSC Class: 62H30 ACM Class: I.2

    Journal ref: IEEE Transactions on Cybernetics, Early Access, April 13, 2022

  21. EvoPruneDeepTL: An Evolutionary Pruning Model for Transfer Learning based Deep Neural Networks

    Authors: Javier Poyatos, Daniel Molina, Aritz D. Martinez, Javier Del Ser, Francisco Herrera

    Abstract: In recent years, Deep Learning models have shown a great performance in complex optimization problems. They generally require large training datasets, which is a limitation in most practical cases. Transfer learning allows importing the first layers of a pre-trained architecture and connecting them to fully-connected layers to adapt them to a new problem. Consequently, the configuration of the the…

    Submitted 5 February, 2024; v1 submitted 8 February, 2022; originally announced February 2022.

    MSC Class: 68 ACM Class: I.2; I.4

    Journal ref: Neural Networks, 158, (2023), 59-82

  22. Survey on Federated Learning Threats: concepts, taxonomy on attacks and defences, experimental study and challenges

    Authors: Nuria Rodríguez-Barroso, Daniel Jiménez López, M. Victoria Luzón, Francisco Herrera, Eugenio Martínez-Cámara

    Abstract: Federated learning is a machine learning paradigm that emerges as a solution to the privacy-preservation demands in artificial intelligence. As machine learning, federated learning is threatened by adversarial attacks against the integrity of the learning model and the privacy of data via a distributed approach to tackle local and global learning. This weak point is exacerbated by the inaccessibil…

    Submitted 20 January, 2022; originally announced January 2022.

    Journal ref: Information Fusion (2022)

  23. arXiv:2201.06505  [pdf]

    cs.AI cs.CV

    Data Harmonisation for Information Fusion in Digital Healthcare: A State-of-the-Art Systematic Review, Meta-Analysis and Future Research Directions

    Authors: Yang Nan, Javier Del Ser, Simon Walsh, Carola Schönlieb, Michael Roberts, Ian Selby, Kit Howard, John Owen, Jon Neville, Julien Guiot, Benoit Ernst, Ana Pastor, Angel Alberich-Bayarri, Marion I. Menzel, Sean Walsh, Wim Vos, Nina Flerin, Jean-Paul Charbonnier, Eva van Rikxoort, Avishek Chatterjee, Henry Woodruff, Philippe Lambin, Leonor Cerdá-Alberich, Luis Martí-Bonmatí, Francisco Herrera, et al. (1 additional author not shown)

    Abstract: Removing the bias and variance of multicentre data has always been a challenge in large scale digital healthcare studies, which requires the ability to integrate clinical features extracted from data acquired by different scanners and protocols to improve stability and robustness. Previous studies have described various computational approaches to fuse single modality multicentre datasets. However…

    Submitted 17 January, 2022; originally announced January 2022.

    Comments: 54 pages, 14 figures, accepted by the Information Fusion journal

  24. Reducing Data Complexity using Autoencoders with Class-informed Loss Functions

    Authors: David Charte, Francisco Charte, Francisco Herrera

    Abstract: Available data in machine learning applications is becoming increasingly complex, due to higher dimensionality and difficult classes. There exists a wide variety of approaches to measuring complexity of labeled data, according to class overlap, separability or boundary shapes, as well as group morphology. Many techniques can transform the data in order to find better features, but few focus on spe…

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: This paper has been accepted for publication by IEEE Transactions on Pattern Analysis and Machine Intelligence

    MSC Class: 68T07 ACM Class: I.2.6; I.5.1

  25. arXiv:2109.03748  [pdf, other]

    cs.LG cs.NE

    A robust approach for deep neural networks in presence of label noise: relabelling and filtering instances during training

    Authors: Anabel Gómez-Ríos, Julián Luengo, Francisco Herrera

    Abstract: Deep learning has outperformed other machine learning algorithms in a variety of tasks, and as a result, it is widely used. However, like other machine learning algorithms, deep learning, and convolutional neural networks (CNNs) in particular, perform worse when the data sets present label noise. Therefore, it is important to develop algorithms that help the training of deep networks and their gen…

    Submitted 18 July, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: 24 pages, 5 figures

  26. Anomaly Detection in Predictive Maintenance: A New Evaluation Framework for Temporal Unsupervised Anomaly Detection Algorithms

    Authors: Jacinto Carrasco, Irina Markova, David López, Ignacio Aguilera, Diego García, Marta García-Barzana, Manuel Arias-Rodil, Julián Luengo, Francisco Herrera

    Abstract: The research in anomaly detection lacks a unified definition of what represents an anomalous instance. Discrepancies in the nature itself of an anomaly lead to multiple paradigms of algorithms design and experimentation. Predictive maintenance is a special case, where the anomaly represents a failure that must be prevented. Related time-series research as outlier and novelty detection or time-seri…

    Submitted 2 September, 2021; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: 25 pages, 9 figures, 5 tables

    ACM Class: J.2

  27. arXiv:2105.11844  [pdf, other]

    cs.CV cs.AI cs.LG

    CI-dataset and DetDSCI methodology for detecting too small and too large critical infrastructures in satellite images: Airports and electrical substations as case study

    Authors: Francisco Pérez-Hernández, José Rodríguez-Ortega, Yassir Benhammou, Francisco Herrera, Siham Tabik

    Abstract: The detection of critical infrastructures in large territories represented by aerial and satellite images is of high importance in several fields such as in security, anomaly detection, land use planning and land use change detection. However, the detection of such infrastructures is complex as they have highly variable shapes and sizes, i.e., some infrastructures, such as electrical substations,…

    Submitted 21 September, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

  28. arXiv:2104.11914  [pdf, other]

    cs.LG cs.AI cs.CV cs.SC

    EXplainable Neural-Symbolic Learning (X-NeSyL) methodology to fuse deep learning representations with expert knowledge graphs: the MonuMAI cultural heritage use case

    Authors: Natalia Díaz-Rodríguez, Alberto Lamas, Jules Sanchez, Gianni Franchi, Ivan Donadello, Siham Tabik, David Filliat, Policarpo Cruz, Rosana Montes, Francisco Herrera

    Abstract: The latest Deep Learning (DL) models for detection and classification have achieved an unprecedented performance over classical machine learning algorithms. However, DL models are black-box methods hard to debug, interpret, and certify. DL alone cannot provide explanations that can be validated by a non technical audience. In contrast, symbolic AI systems that convert concepts into rules or symbol…

    Submitted 13 October, 2021; v1 submitted 24 April, 2021; originally announced April 2021.

  29. arXiv:2104.11653  [pdf, other]

    cs.CV

    MULTICAST: MULTI Confirmation-level Alarm SysTem based on CNN and LSTM to mitigate false alarms for handgun detection in video-surveillance

    Authors: Roberto Olmos, Siham Tabik, Francisco Perez-Hernandez, Alberto Lamas, Francisco Herrera

    Abstract: Despite the constant advances in computer vision, integrating modern single-image detectors in real-time handgun alarm systems in video-surveillance is still debatable. Using such detectors still implies a high number of false alarms and false negatives. In this context, most existent studies select one of the latest single-image detectors and train it on a better dataset or use some pre-processin…

    Submitted 3 May, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

  30. arXiv:2010.03917  [pdf, other]

    cs.NE

    AT-MFCGA: An Adaptive Transfer-guided Multifactorial Cellular Genetic Algorithm for Evolutionary Multitasking

    Authors: Eneko Osaba, Javier Del Ser, Aritz D. Martinez, Jesus L. Lobo, Francisco Herrera

    Abstract: Transfer Optimization is an incipient research area dedicated to solving multiple optimization tasks simultaneously. Among the different approaches that can address this problem effectively, Evolutionary Multitasking resorts to concepts from Evolutionary Computation to solve multiple problems within a single search process. In this paper we introduce a novel adaptive metaheuristic algorithm to dea…

    Submitted 3 May, 2021; v1 submitted 8 October, 2020; originally announced October 2020.

    Comments: 31 pages, 4 figures, paper accepted for being published in Information Sciences journal

  31. arXiv:2009.09677  [pdf, other]

    cs.LG stat.ML

    CURIE: A Cellular Automaton for Concept Drift Detection

    Authors: Jesus L. Lobo, Javier Del Ser, Eneko Osaba, Albert Bifet, Francisco Herrera

    Abstract: Data stream mining extracts information from large quantities of data flowing fast and continuously (data streams). They are usually affected by changes in the data distribution, giving rise to a phenomenon referred to as concept drift. Thus, learning models must detect and adapt to such changes, so as to exhibit a good predictive performance after a drift has occurred. In this regard, the develop…

    Submitted 21 September, 2020; originally announced September 2020.

  32. arXiv:2008.03620  [pdf, other]

    cs.NE cs.AI cs.MA

    Lights and Shadows in Evolutionary Deep Learning: Taxonomy, Critical Methodological Analysis, Cases of Study, Learned Lessons, Recommendations and Challenges

    Authors: Aritz D. Martinez, Javier Del Ser, Esther Villar-Rodriguez, Eneko Osaba, Javier Poyatos, Siham Tabik, Daniel Molina, Francisco Herrera

    Abstract: Much has been said about the fusion of bio-inspired optimization algorithms and Deep Learning models for several purposes: from the discovery of network topologies and hyper-parametric configurations with improved performance for a given task, to the optimization of the model's parameters as a replacement for gradient-based solvers. Indeed, the literature is rich in proposals showcasing the applic…

    Submitted 8 August, 2020; originally announced August 2020.

    Comments: 64 pages, 18 figures, under review for its consideration in Information Fusion journal

  33. arXiv:2008.01499  [pdf]

    cs.AI

    Distributed Linguistic Representations in Decision Making: Taxonomy, Key Elements and Applications, and Challenges in Data Science and Explainable Artificial Intelligence

    Authors: Yuzhu Wu, Zhen Zhang, Gang Kou, Hengjie Zhang, Xiangrui Chao, Cong-Cong Li, Yucheng Dong, Francisco Herrera

    Abstract: Distributed linguistic representations are powerful tools for modelling the uncertainty and complexity of preference information in linguistic decision making. To provide a comprehensive perspective on the development of distributed linguistic representations in decision making, we present the taxonomy of existing distributed linguistic representations. Then, we review the key elements of distribu…

    Submitted 7 August, 2020; v1 submitted 4 August, 2020; originally announced August 2020.

    Comments: 37 pages

  34. arXiv:2008.00032  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    Sentiment Analysis based Multi-person Multi-criteria Decision Making Methodology using Natural Language Processing and Deep Learning for Smarter Decision Aid. Case study of restaurant choice using TripAdvisor reviews

    Authors: Cristina Zuheros, Eugenio Martínez-Cámara, Enrique Herrera-Viedma, Francisco Herrera

    Abstract: Decision making models are constrained by taking the expert evaluations with pre-defined numerical or linguistic terms. We claim that the use of sentiment analysis will allow decision making models to consider expert evaluations in natural language. Accordingly, we propose the Sentiment Analysis based Multi-person Multi-criteria Decision Making (SA-MpMcDM) methodology for smarter decision aid, whi…

    Submitted 14 October, 2020; v1 submitted 31 July, 2020; originally announced August 2020.

  35. arXiv:2007.15030  [pdf, ps, other]

    cs.LG cs.AI cs.CR stat.ML

    Dynamic Defense Against Byzantine Poisoning Attacks in Federated Learning

    Authors: Nuria Rodríguez-Barroso, Eugenio Martínez-Cámara, M. Victoria Luzón, Francisco Herrera

    Abstract: Federated learning, as a distributed learning that conducts the training on the local devices without accessing the training data, is vulnerable to Byzantine poisoning adversarial attacks. We argue that the federated learning model has to avoid those kinds of adversarial attacks through filtering out the adversarial clients by means of the federated aggregation operator. We propose a dynamic fede…

    Submitted 24 February, 2022; v1 submitted 29 July, 2020; originally announced July 2020.

    Comments: 10 pages

    Journal ref: Future Generation Computer Systems, 133 (2022), 1-9

  36. Revisiting Data Complexity Metrics Based on Morphology for Overlap and Imbalance: Snapshot, New Overlap Number of Balls Metrics and Singular Problems Prospect

    Authors: José Daniel Pascual-Triana, David Charte, Marta Andrés Arroyo, Alberto Fernández, Francisco Herrera

    Abstract: Data Science and Machine Learning have become fundamental assets for companies and research institutions alike. As one of its fields, supervised classification allows for class prediction of new samples, learning from given training data. However, some properties can cause datasets to be problematic to classify. In order to evaluate a dataset a priori, data complexity metrics have been used exte…

    Submitted 15 July, 2020; originally announced July 2020.

    Comments: 23 pages, 9 figures, preprint

    Journal ref: Knowledge and Information Systems (Knowl Inf Syst 63, 1961-1989 (2021))

  37. arXiv:2007.00914  [pdf, other]

    cs.LG cs.AI cs.CR stat.ML

    Federated Learning and Differential Privacy: Software tools analysis, the Sherpa.ai FL framework and methodological guidelines for preserving data privacy

    Authors: Nuria Rodríguez-Barroso, Goran Stipcich, Daniel Jiménez-López, José Antonio Ruiz-Millán, Eugenio Martínez-Cámara, Gerardo González-Seco, M. Victoria Luzón, Miguel Ángel Veganzones, Francisco Herrera

    Abstract: The high demand for artificial intelligence services at the edge that also preserve data privacy has pushed the research on novel machine learning paradigms that fit those requirements. Federated learning has the ambition to protect data privacy through distributed learning methods that keep the data in their data silos. Likewise, differential privacy aims to improve the protection of data priv…

    Submitted 6 October, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: 46 pages, 5 figures

    MSC Class: 68T01 ACM Class: I.2.11

    Journal ref: Information Fusion 64 (2020) 270-292

  38. arXiv:2006.01409  [pdf, other]

    eess.IV cs.CV

    COVIDGR dataset and COVID-SDNet methodology for predicting COVID-19 based on Chest X-Ray images

    Authors: S. Tabik, A. Gómez-Ríos, J. L. Martín-Rodríguez, I. Sevillano-García, M. Rey-Area, D. Charte, E. Guirado, J. L. Suárez, J. Luengo, M. A. Valero-González, P. García-Villanova, E. Olmedo-Sánchez, F. Herrera

    Abstract: Currently, Coronavirus disease (COVID-19), one of the most infectious diseases in the 21st century, is diagnosed using RT-PCR testing, CT scans and/or Chest X-Ray (CXR) images. CT (Computed Tomography) scanners and RT-PCR testing are not available in most medical centers and hence in many cases CXR images become the most time/cost effective tool for assisting clinicians in making decisions. Deep l…

    Submitted 11 November, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

    Comments: Paper accepted in Journal of Biomedical And Health Informatics

  39. An analysis on the use of autoencoders for representation learning: fundamentals, learning task case studies, explainability and challenges

    Authors: David Charte, Francisco Charte, María J. del Jesus, Francisco Herrera

    Abstract: In many machine learning tasks, learning a good representation of the data can be the key to building a well-performing solution. This is because most learning algorithms operate with the features in order to find models for the data. For instance, classification performance can improve if the data is mapped to a space where classes are easily separated, and regression can be facilitated by findin…

    Submitted 21 May, 2020; originally announced May 2020.

    MSC Class: 68T05

    Journal ref: Neurocomputing 404 (2020) 93-107

  40. A Showcase of the Use of Autoencoders in Feature Learning Applications

    Authors: David Charte, Francisco Charte, María J. del Jesus, Francisco Herrera

    Abstract: Autoencoders are techniques for data representation learning based on artificial neural networks. Unlike other feature learning methods, which may focus on finding specific transformations of the feature space, they can be adapted to fulfill many purposes, such as data visualization, denoising, anomaly detection and semantic hashing. This work presents these applications and provides d…

    Submitted 8 May, 2020; originally announced May 2020.

    Comments: This manuscript was accepted as conference paper in IWINAC 2019. The final authenticated publication is available online at https://doi.org/10.1007/978-3-030-19651-6_40

    Journal ref: In: From Bioinspired Systems and Biomedical Applications to Machine Learning/IWINAC 2019. LNCS vol 11487. Springer (2019)

  41. arXiv:2004.09969  [pdf, other]

    cs.NE cs.AI

    Fairness in Bio-inspired Optimization Research: A Prescription of Methodological Guidelines for Comparing Meta-heuristics

    Authors: Antonio LaTorre, Daniel Molina, Eneko Osaba, Javier Del Ser, Francisco Herrera

    Abstract: Bio-inspired optimization (including Evolutionary Computation and Swarm Intelligence) is a growing research topic with many competitive bio-inspired algorithms being proposed every year. In such an active area, preparing a successful proposal of a new bio-inspired algorithm is not an easy task. Given the maturity of this research field, proposing a new optimization technique with innovative elemen…

    Submitted 19 April, 2020; originally announced April 2020.

    Comments: 43 pages, 4 figures

  42. arXiv:2003.10768  [pdf, other]

    cs.NE cs.AI

    Multifactorial Cellular Genetic Algorithm (MFCGA): Algorithmic Design, Performance Comparison and Genetic Transferability Analysis

    Authors: Eneko Osaba, Aritz D. Martinez, Jesus L. Lobo, Javier Del Ser, Francisco Herrera

    Abstract: Multitasking optimization is an incipient research area that has lately gained notable research momentum. Unlike the traditional optimization paradigm, which focuses on solving a single task at a time, multitasking addresses how multiple optimization problems can be tackled simultaneously by performing a single search process. The main objective to achieve this goal efficiently is to exploit synergie…

    Submitted 24 March, 2020; originally announced March 2020.

    Comments: Accepted for its presentation at WCCI 2020

  43. arXiv:2003.02601  [pdf, other]

    cs.LG stat.ML

    Fuzzy k-Nearest Neighbors with monotonicity constraints: Moving towards the robustness of monotonic noise

    Authors: Sergio González, Salvador García, Sheng-Tun Li, Robert John, Francisco Herrera

    Abstract: This paper proposes a new model based on Fuzzy k-Nearest Neighbors for classification with monotonic constraints, Monotonic Fuzzy k-NN (MonFkNN). Real-life data-sets often do not comply with monotonic constraints due to class noise. MonFkNN incorporates a new calculation of fuzzy memberships, which increases robustness against monotonic noise without the need for relabeling. Our proposal has been…

    Submitted 5 March, 2020; originally announced March 2020.

    Comments: Accepted in Neurocomputing

  44. arXiv:2002.12133  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Simultaneously Evolving Deep Reinforcement Learning Models using Multifactorial Optimization

    Authors: Aritz D. Martinez, Eneko Osaba, Javier Del Ser, Francisco Herrera

    Abstract: In recent years, Multifactorial Optimization (MFO) has gained notable momentum in the research community. MFO is known for its inherent capability to efficiently address multiple optimization tasks at the same time, while transferring information among such tasks to improve their convergence speed. On the other hand, the quantum leap made by Deep Q Learning (DQL) in the Machine Learning field ha…

    Submitted 23 March, 2020; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: 8 pages, 5 figures, submitted to IEEE Conference on Evolutionary Computation 2020 (IEEE CEC)

  45. Recent Trends in the Use of Statistical Tests for Comparing Swarm and Evolutionary Computing Algorithms: Practical Guidelines and a Critical Review

    Authors: J. Carrasco, S. García, M. M. Rueda, S. Das, F. Herrera

    Abstract: A key aspect of the design of evolutionary and swarm intelligence algorithms is studying their performance. Statistical comparisons are also a crucial part which allows for reliable conclusions to be drawn. In the present paper we gather and examine the approaches taken from different perspectives to summarise the assumptions made by these statistical tests, the conclusions reached and the steps f…

    Submitted 21 February, 2020; originally announced February 2020.

    Comments: 52 pages, 10 figures, 19 tables

    Journal ref: SWEVO, Volume 54, May 2020, 100665

  46. Comprehensive Taxonomies of Nature- and Bio-inspired Optimization: Inspiration versus Algorithmic Behavior, Critical Analysis and Recommendations (from 2020 to 2024)

    Authors: Daniel Molina, Javier Poyatos, Javier Del Ser, Salvador García, Amir Hussain, Francisco Herrera

    Abstract: In recent years, bio-inspired optimization methods, which mimic biological processes to solve complex problems, have gained popularity in the literature. The proliferation of proposals proves the growing interest in this field. The increase in nature- and bio-inspired algorithms, applications, and guidelines highlights growing interest in this field. However, the exponential rise in the number o…

    Submitted 17 April, 2024; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: 89 pages, 9 figures

    ACM Class: I.2.8

    Journal ref: Cognitive Computation 12:5 (2020) 897-939

  47. arXiv:2002.02164  [pdf, other]

    cs.LG cs.AI nlin.CG stat.ML

    LUNAR: Cellular Automata for Drifting Data Streams

    Authors: Jesus L. Lobo, Javier Del Ser, Francisco Herrera

    Abstract: With the advent of huge volumes of data produced in the form of fast streams, real-time machine learning has become a challenge of relevance emerging in a plethora of real-world applications. Processing such fast streams often demands high memory and processing resources. In addition, they can be affected by non-stationary phenomena (concept drift), by which learning methods have to detect change…

    Submitted 6 February, 2020; originally announced February 2020.

    Comments: 36 pages, 6 figures, 4 tables

  48. arXiv:2001.11486  [pdf, other]

    cs.LG stat.ML

    MNIST-NET10: A heterogeneous deep networks fusion based on the degree of certainty to reach 0.1 error rate. Ensembles overview and proposal

    Authors: S. Tabik, R. F. Alvear-Sandoval, M. M. Ruiz, J. L. Sancho-Gómez, A. R. Figueiras-Vidal, F. Herrera

    Abstract: Ensemble methods have been widely used for improving the results of the best single classification model. A large body of work has achieved better performance mainly by applying one specific ensemble method. However, very few works have explored complex fusion schemes using heterogeneous ensembles with new aggregation strategies. This paper is three-fold: 1) It provides an overview of the most p…

    Submitted 7 April, 2020; v1 submitted 30 January, 2020; originally announced January 2020.

  49. arXiv:2001.05759  [pdf, other]

    cs.LG cs.DC stat.ML

    Smart Data driven Decision Trees Ensemble Methodology for Imbalanced Big Data

    Authors: Diego García-Gil, Salvador García, Ning Xiong, Francisco Herrera

    Abstract: Differences in data size per class, also known as imbalanced data distribution, have become a common problem affecting data quality. Big Data scenarios pose a new challenge to traditional imbalanced classification algorithms, since they are not prepared to work with such amounts of data. Split data strategies and lack of data in the minority class due to the use of the MapReduce paradigm have posed new…

    Submitted 3 September, 2021; v1 submitted 16 January, 2020; originally announced January 2020.

  50. arXiv:1910.10045  [pdf, other]

    cs.AI cs.LG cs.NE

    Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI

    Authors: Alejandro Barredo Arrieta, Natalia Díaz-Rodríguez, Javier Del Ser, Adrien Bennetot, Siham Tabik, Alberto Barbado, Salvador García, Sergio Gil-López, Daniel Molina, Richard Benjamins, Raja Chatila, Francisco Herrera

    Abstract: In recent years, Artificial Intelligence (AI) has achieved notable momentum that may deliver the best of expectations over many application sectors across the field. For this to occur, the entire community stands in front of the barrier of explainability, an inherent problem of AI techniques brought by sub-symbolism (e.g. ensembles or Deep Neural Networks) that was not present in the last hyp…

    Submitted 26 December, 2019; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: 67 pages, 13 figures, accepted for its publication in Information Fusion