Showing 1–10 of 10 results for author: Galatolo, A

Searching in archive cs.
  1. arXiv:2406.10660  [pdf, other]

    cs.CL

    DIEKAE: Difference Injection for Efficient Knowledge Augmentation and Editing of Large Language Models

    Authors: Alessio Galatolo, Meriem Beloucif, Katie Winkle

    Abstract: Pretrained Language Models (PLMs) store extensive knowledge within their weights, enabling them to recall vast amounts of information. However, relying on this parametric knowledge brings some limitations, such as outdated information or gaps in the training data. This work addresses these problems by distinguishing between two separate solutions: knowledge editing and knowledge augmentation. We introd…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: WIP

  2. arXiv:2406.04116  [pdf, ps, other]

    cs.AI cs.CL

    Promoting Fairness and Diversity in Speech Datasets for Mental Health and Neurological Disorders Research

    Authors: Eleonora Mancini, Ana Tanevska, Andrea Galassi, Alessio Galatolo, Federico Ruggeri, Paolo Torroni

    Abstract: Current research in machine learning and artificial intelligence is largely centered on modeling and performance evaluation, less so on data collection. However, recent research has demonstrated that limitations and biases in data may negatively impact trustworthiness and reliability. These aspects are particularly impactful in sensitive domains such as mental health and neurological disorders, where…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 34 pages

  3. arXiv:2311.15698  [pdf, other]

    cs.CL cs.AI

    Cerbero-7B: A Leap Forward in Language-Specific LLMs Through Enhanced Chat Corpus Generation and Evaluation

    Authors: Federico A. Galatolo, Mario G. C. A. Cimino

    Abstract: This study introduces a novel approach for generating high-quality, language-specific chat corpora using a self-chat mechanism. We combine a generator LLM for creating new samples and an embedder LLM to ensure diversity. A new Masked Language Modelling (MLM) model-based quality assessment metric is proposed for evaluating and filtering the corpora. Utilizing llama2-70b as the generator and a m…

    Submitted 27 November, 2023; originally announced November 2023.

  4. arXiv:2212.07839  [pdf, other]

    cs.CV cs.CL cs.LG

    TeTIm-Eval: a novel curated evaluation data set for comparing text-to-image models

    Authors: Federico A. Galatolo, Mario G. C. A. Cimino, Edoardo Cogotti

    Abstract: Evaluating and comparing text-to-image models is a challenging problem. Significant advances in the field have recently been made, piquing the interest of various industrial sectors. As a consequence, a gold standard in the field should cover a variety of tasks and application contexts. In this paper, a novel evaluation approach is tested, on the basis of: (i) a curated data set, made by high-qua…

    Submitted 15 December, 2022; originally announced December 2022.

  5. arXiv:2105.13244  [pdf, other]

    cs.CV

    Using Early-Learning Regularization to Classify Real-World Noisy Data

    Authors: Alessio Galatolo, Alfred Nilsson, Roderick Karlemstrand, Yineng Wang

    Abstract: The memorization problem is well-known in the field of computer vision. Liu et al. propose a technique called Early-Learning Regularization, which improves accuracy on the CIFAR datasets when label noise is present. This project replicates their experiments and investigates the performance on a real-world dataset with intrinsic noise. Results show that their experimental results are consistent. We…

    Submitted 1 June, 2021; v1 submitted 27 May, 2021; originally announced May 2021.

  6. arXiv:2102.01645  [pdf, other]

    cs.NE cs.AI cs.LG

    Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search

    Authors: Federico A. Galatolo, Mario G. C. A. Cimino, Gigliola Vaglini

    Abstract: In this research work we present CLIP-GLaSS, a novel zero-shot framework to generate an image (or a caption) corresponding to a given caption (or image). CLIP-GLaSS is based on the CLIP neural network, which, given an image and a descriptive caption, provides similar embeddings. In contrast, CLIP-GLaSS takes a caption (or an image) as an input, and generates the image (or the caption) whose CLIP e…

    Submitted 1 October, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

    Journal ref: IMPROVE, ISBN 978-989-758-511-1, pages 166-174 (2021)

  7. Solving the scalarization issues of Advantage-based Reinforcement Learning Algorithms

    Authors: Federico A. Galatolo, Mario G. C. A. Cimino, Gigliola Vaglini

    Abstract: In this research, some of the issues that arise from the scalarization of the multi-objective optimization problem in the Advantage Actor Critic (A2C) reinforcement learning algorithm are investigated. The paper shows how a naive scalarization can lead to overlapping gradients. Furthermore, the possibility that the entropy regularization term can be a source of uncontrolled noise is discussed. Wit…

    Submitted 1 October, 2021; v1 submitted 8 April, 2020; originally announced April 2020.

    Journal ref: Computers & Electrical Engineering, 92, 107117 (2021)

  8. arXiv:1905.06684  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Formal derivation of Mesh Neural Networks with their Forward-Only gradient Propagation

    Authors: Federico A. Galatolo, Mario G. C. A. Cimino, Gigliola Vaglini

    Abstract: This paper proposes the Mesh Neural Network (MNN), a novel architecture which allows neurons to be connected in any topology, to efficiently route information. In MNNs, information is propagated between neurons through a state transition function. State and error gradients are then directly computed from state updates without backward computation. The MNN architecture and the error propagation…

    Submitted 30 September, 2021; v1 submitted 16 May, 2019; originally announced May 2019.

    Journal ref: Galatolo, F. A., Cimino, M. G., & Vaglini, G. (2021). Formal Derivation of Mesh Neural Networks with Their Forward-Only Gradient Propagation. Neural Processing Letters, 1-16

  9. arXiv:1903.01341  [pdf]

    cs.NE cs.LG stat.ML

    Using stigmergy as a computational memory in the design of recurrent neural networks

    Authors: Federico A. Galatolo, Mario G. C. A. Cimino, Gigliola Vaglini

    Abstract: In this paper, a novel architecture of Recurrent Neural Network (RNN) is designed and tested. The proposed RNN adopts a computational memory based on the concept of stigmergy. The basic principle of a Stigmergic Memory (SM) is that the activity of deposit/removal of a quantity in the SM stimulates the next activities of deposit/removal. Accordingly, subsequent SM activities tend to reinforce…

    Submitted 9 January, 2019; originally announced March 2019.

  10. arXiv:1811.10574  [pdf]

    cs.NE cs.LG stat.ML

    Using stigmergy to incorporate the time into artificial neural networks

    Authors: Federico A. Galatolo, Mario G. C. A. Cimino, Gigliola Vaglini

    Abstract: A current research trend in neurocomputing involves the design of novel artificial neural networks incorporating the concept of time into their operating model. In this paper, a novel architecture that employs stigmergy is proposed. Computational stigmergy is used to dynamically increase (or decrease) the strength of a connection, or the activation level, of an artificial neuron when stimulated (o…

    Submitted 25 October, 2018; originally announced November 2018.