Zum Hauptinhalt springen

Showing 1–50 of 62 results for author: Neto, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.06593  [pdf, other

    stat.ML cs.LG stat.AP

    Advancing Causal Inference: A Nonparametric Approach to ATE and CATE Estimation with Continuous Treatments

    Authors: Hugo Gobato Souto, Francisco Louzada Neto

    Abstract: This paper introduces a generalized ps-BART model for the estimation of Average Treatment Effect (ATE) and Conditional Average Treatment Effect (CATE) in continuous treatments, addressing limitations of the Bayesian Causal Forest (BCF) model. The ps-BART model's nonparametric nature allows for flexibility in capturing nonlinear relationships between treatment and outcome variables. Across three di… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

  2. arXiv:2409.05665  [pdf, other

    stat.ML cs.LG

    K-Fold Causal BART for CATE Estimation

    Authors: Hugo Gobato Souto, Francisco Louzada Neto

    Abstract: This research aims to propose and evaluate a novel model named K-Fold Causal Bayesian Additive Regression Trees (K-Fold Causal BART) for improved estimation of Average Treatment Effects (ATE) and Conditional Average Treatment Effects (CATE). The study employs synthetic and semi-synthetic datasets, including the widely recognized Infant Health and Development Program (IHDP) benchmark dataset, to va… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

  3. arXiv:2407.10322  [pdf, other

    cs.CY cs.SE

    Building Collaborative Learning: Exploring Social Annotation in Introductory Programming

    Authors: Francisco Gomes de Oliveira Neto, Felix Dobslaw

    Abstract: The increasing demand for software engineering education presents learning challenges in courses due to the diverse range of topics that require practical applications, such as programming or software design, all of which are supported by group work and interaction. Social Annotation (SA) is an approach to teaching that can enhance collaborative learning among students. In SA, both students and te… ▽ More

    Submitted 17 June, 2024; originally announced July 2024.

  4. arXiv:2406.14971  [pdf, other

    cs.CL cs.AI cs.LG

    Domain Adaptation of Llama3-70B-Instruct through Continual Pre-Training and Model Merging: A Comprehensive Evaluation

    Authors: Shamane Siriwardhana, Mark McQuade, Thomas Gauthier, Lucas Atkins, Fernando Fernandes Neto, Luke Meyers, Anneketh Vij, Tyler Odenthal, Charles Goddard, Mary MacCarthy, Jacob Solawetz

    Abstract: We conducted extensive experiments on domain adaptation of the Meta-Llama-3-70B-Instruct model on SEC data, exploring its performance on both general and domain-specific benchmarks. Our focus included continual pre-training (CPT) and model merging, aiming to enhance the model's domain-specific capabilities while mitigating catastrophic forgetting. Through this study, we evaluated the impact of int… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 8 pages, 6 figures

  5. Unveiling Assumptions: Exploring the Decisions of AI Chatbots and Human Testers

    Authors: Francisco Gomes de Oliveira Neto

    Abstract: The integration of Large Language Models (LLMs) and chatbots introduces new challenges and opportunities for decision-making in software testing. Decision-making relies on a variety of information, including code, requirements specifications, and other software artifacts that are often unclear or exist solely in the developer's mind. To fill in the gaps left by unclear information, we often rely o… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Published at the 1st ACM International Conference on AI-Powered Software (AIWare 2024)

  6. arXiv:2406.06623  [pdf, other

    cs.LG stat.ML

    Spectrum: Targeted Training on Signal to Noise Ratio

    Authors: Eric Hartford, Lucas Atkins, Fernando Fernandes Neto, David Golchinfar

    Abstract: Efficiently post-training large language models remains a challenging task due to the vast computational resources required. We present Spectrum, a method that accelerates LLM training by selectively targeting layer modules based on their signal-to-noise ratio (SNR), and freezing the remaining modules. Our approach, which utilizes an algorithm to compute module SNRs prior to training, has shown to… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  7. arXiv:2405.12712  [pdf, other

    cs.SE cs.AI cs.CL cs.HC

    From Human-to-Human to Human-to-Bot Conversations in Software Engineering

    Authors: Ranim Khojah, Francisco Gomes de Oliveira Neto, Philipp Leitner

    Abstract: Software developers use natural language to interact not only with other humans, but increasingly also with chatbots. These interactions have different properties and flow differently based on what goal the developer wants to achieve and who they interact with. In this paper, we aim to understand the dynamics of conversations that occur during modern software development after the integration of A… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: Accepted at the 1st ACM International Conference on AI-powered Software (AIware) 2024

  8. arXiv:2405.03824  [pdf, other

    cs.SE

    Breaking Barriers: Investigating the Sense of Belonging Among Women and Non-Binary Students in Software Engineering

    Authors: Lina Boman, Jonatan Andersson, Francisco Gomes de Oliveira Neto

    Abstract: Women in computing were among the first programmers in the early 20th century and were substantial contributors to the industry. Today, men dominate the software engineering industry. Research and data show that women are far less likely to pursue a career in this industry, and those that do are less likely than men to stay in it. Reasons for women and other underrepresented minorities to leave th… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  9. arXiv:2404.16226  [pdf

    cs.CL

    Computational analysis of the language of pain: a systematic review

    Authors: Diogo A. P. Nunes, Joana Ferreira-Gomes, Fani Neto, David Martins de Matos

    Abstract: Objectives: This study aims to systematically review the literature on the computational processing of the language of pain, or pain narratives, whether generated by patients or physicians, identifying current trends and challenges. Methods: Following the PRISMA guidelines, a comprehensive literature search was conducted to select relevant studies on the computational processing of the language of… ▽ More

    Submitted 10 May, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: 36 pages, 16 tables, 2 figures, systematic review

    ACM Class: I.2.7; J.3; A.1

  10. arXiv:2404.14901  [pdf, other

    cs.SE cs.AI cs.CL cs.HC cs.LG

    Beyond Code Generation: An Observational Study of ChatGPT Usage in Software Engineering Practice

    Authors: Ranim Khojah, Mazen Mohamad, Philipp Leitner, Francisco Gomes de Oliveira Neto

    Abstract: Large Language Models (LLMs) are frequently discussed in academia and the general public as support tools for virtually any use case that relies on the production of text, including software engineering. Currently there is much debate, but little empirical evidence, regarding the practical usefulness of LLM-based tools such as ChatGPT for engineers in industry. We conduct an observational study of… ▽ More

    Submitted 21 May, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: Accepted at the ACM International Conference on the Foundations of Software Engineering (FSE) 2024

  11. arXiv:2403.05669  [pdf, other

    stat.ML cs.LG

    Spectral Clustering of Categorical and Mixed-type Data via Extra Graph Nodes

    Authors: Dylan Soemitro, Jeova Farias Sales Rocha Neto

    Abstract: Clustering data objects into homogeneous groups is one of the most important tasks in data mining. Spectral clustering is arguably one of the most important algorithms for clustering, as it is appealing for its theoretical soundness and is adaptable to many real-world data settings. For example, mixed data, where the data is composed of numerical and categorical features, is typically handled via… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  12. arXiv:2403.01638  [pdf, other

    cs.CL

    Multi-level Product Category Prediction through Text Classification

    Authors: Wesley Ferreira Maia, Angelo Carmignani, Gabriel Bortoli, Lucas Maretti, David Luz, Daniel Camilo Fuentes Guzman, Marcos Jardel Henriques, Francisco Louzada Neto

    Abstract: This article investigates applying advanced machine learning models, specifically LSTM and BERT, for text classification to predict multiple categories in the retail sector. The study demonstrates how applying data augmentation techniques and the focal loss function can significantly enhance accuracy in classifying products into multiple categories using a robust Brazilian retail dataset. The LSTM… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

  13. arXiv:2402.09786  [pdf, other

    cs.CV cs.AI cs.CY cs.LG

    Examining Pathological Bias in a Generative Adversarial Network Discriminator: A Case Study on a StyleGAN3 Model

    Authors: Alvin Grissom II, Ryan F. Lei, Matt Gusdorff, Jeova Farias Sales Rocha Neto, Bailey Lin, Ryan Trotter

    Abstract: Generative adversarial networks (GANs) generate photorealistic faces that are often indistinguishable by humans from real faces. While biases in machine learning models are often assumed to be due to biases in training data, we find pathological internal color and luminance biases in the discriminator of a pre-trained StyleGAN3-r model that are not explicable by the training data. We also find tha… ▽ More

    Submitted 28 August, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  14. arXiv:2311.01475  [pdf, other

    cs.CV cs.LG eess.IV

    Patch-Based Deep Unsupervised Image Segmentation using Graph Cuts

    Authors: Isaac Wasserman, Jeova Farias Sales Rocha Neto

    Abstract: Unsupervised image segmentation aims at grouping different semantic patterns in an image without the use of human annotation. Similarly, image clustering searches for groupings of images based on their semantic content without supervision. Classically, both problems have captivated researchers as they drew from sound mathematical concepts to produce concrete applications. With the emergence of dee… ▽ More

    Submitted 15 January, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

  15. arXiv:2309.03351  [pdf, other

    cs.CV cs.LG eess.IV stat.AP

    Using Neural Networks for Fast SAR Roughness Estimation of High Resolution Images

    Authors: Li Fan, Jeova Farias Sales Rocha Neto

    Abstract: The analysis of Synthetic Aperture Radar (SAR) imagery is an important step in remote sensing applications, and it is a challenging problem due to its inherent speckle noise. One typical solution is to model the data using the $G_I^0$ distribution and extract its roughness information, which in turn can be used in posterior imaging tasks, such as segmentation, classification and interpretation. Th… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

  16. arXiv:2306.13200  [pdf, other

    cs.CV cs.LG

    Improving Log-Cumulant Based Estimation of Roughness Information in SAR imagery

    Authors: Jeova Farias Sales Rocha Neto, Francisco Alixandre Avila Rodrigues

    Abstract: Synthetic Aperture Radar (SAR) image understanding is crucial in remote sensing applications, but it is hindered by its intrinsic noise contamination, called speckle. Sophisticated statistical models, such as the $\mathcal{G}^0$ family of distributions, have been employed to SAR data and many of the current advancements in processing this imagery have been accomplished through extracting informati… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

  17. arXiv:2306.13166  [pdf, other

    cs.CV

    A Sparse Graph Formulation for Efficient Spectral Image Segmentation

    Authors: Rahul Palnitkar, Jeova Farias Sales Rocha Neto

    Abstract: Spectral Clustering is one of the most traditional methods to solve segmentation problems. Based on Normalized Cuts, it aims at partitioning an image using an objective function defined by a graph. Despite their mathematical attractiveness, spectral approaches are traditionally neglected by the scientific community due to their practical issues and underperformance. In this paper, we adopt a spars… ▽ More

    Submitted 7 June, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

  18. arXiv:2305.07511  [pdf, ps, other

    cs.LG cs.AI cs.CY eess.IV

    eXplainable Artificial Intelligence on Medical Images: A Survey

    Authors: Matteus Vargas Simão da Silva, Rodrigo Reis Arrais, Jhessica Victoria Santos da Silva, Felipe Souza Tânios, Mateus Antonio Chinelatto, Natalia Backhaus Pereira, Renata De Paris, Lucas Cesar Ferreira Domingos, Rodrigo Dória Villaça, Vitor Lopes Fabris, Nayara Rossi Brito da Silva, Ana Claudia Akemi Matsuki de Faria, Jose Victor Nogueira Alves da Silva, Fabiana Cristina Queiroz de Oliveira Marucci, Francisco Alves de Souza Neto, Danilo Xavier Silva, Vitor Yukio Kondo, Claudio Filipi Gonçalves dos Santos

    Abstract: Over the last few years, the number of works about deep learning applied to the medical field has increased enormously. The necessity of a rigorous assessment of these models is required to explain these results to all people involved in medical exams. A recent field in the machine learning area is explainable artificial intelligence, also known as XAI, which targets to explain the results of such… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  19. Chronic pain patient narratives allow for the estimation of current pain intensity

    Authors: Diogo A. P. Nunes, Joana Ferreira-Gomes, Daniela Oliveira, Carlos Vaz, Sofia Pimenta, Fani Neto, David Martins de Matos

    Abstract: Chronic pain is a multi-dimensional experience, and pain intensity plays an important part, impacting the patients emotional balance, psychology, and behaviour. Standard self-reporting tools, such as the Visual Analogue Scale for pain, fail to capture this burden. Moreover, this type of tools is susceptible to a degree of subjectivity, dependent on the patients clear understanding of how to use it… ▽ More

    Submitted 17 November, 2022; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: 29 pages, 6 figures, 7 tables

    ACM Class: I.2.7; I.5.4; J.3; J.4

  20. LODUS: A Multi-Level Framework for Simulating Environment and Population -- A Contagion Experiment on a Pandemic World

    Authors: Gabriel Fonseca Silva, Vinícius Cassol, Amyr Borges Fortes Neto, Andre Antonitsch, Diogo Schaffer, Soraia Raupp Musse, Rodrigo de Marsillac Linn

    Abstract: Nowadays we are experiencing a way of life that never existed before. The pandemic has sharply changed our habits, customs, and behavior. In addition, a lot of work was suddenly requested for city managers challenging them to develop strategies to try stopping the pandemic progression. Urban environments must be dynamic and managers need fast decisions when working on crisis situations. In this pa… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: 9 pages, 7 figures, 4 equations

  21. arXiv:2208.07853  [pdf, other

    cs.CV stat.ML

    Estimating Appearance Models for Image Segmentation via Tensor Factorization

    Authors: Jeova Farias Sales Rocha Neto

    Abstract: Image Segmentation is one of the core tasks in Computer Vision and solving it often depends on modeling the image appearance data via the color distributions of each it its constituent regions. Whereas many segmentation algorithms handle the appearance models dependence using alternation or implicit methods, we propose here a new approach to directly estimate them from the image without prior info… ▽ More

    Submitted 15 November, 2023; v1 submitted 16 August, 2022; originally announced August 2022.

  22. arXiv:2207.09065  [pdf, other

    cs.SE cs.IT

    Automated Black-Box Boundary Value Detection

    Authors: Felix Dobslaw, Robert Feldt, Francisco de Oliveira Neto

    Abstract: The input domain of software systems can typically be divided into sub-domains for which the outputs are similar. To ensure high quality it is critical to test the software on the boundaries between these sub-domains. Consequently, boundary value analysis and testing has been part of the toolbox of software testers for long and is typically taught early to students. However, despite its many argue… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

  23. Parametrized constant-depth quantum neuron

    Authors: Jonathan H. A. de Carvalho, Fernando M. de Paula Neto

    Abstract: Quantum computing has been revolutionizing the development of algorithms. However, only noisy intermediate-scale quantum devices are available currently, which imposes several restrictions on the circuit implementation of quantum algorithms. In this paper, we propose a framework that builds quantum neurons based on kernel machines, where the quantum neurons differ from each other by their feature… ▽ More

    Submitted 28 September, 2023; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: Accepted version. 21 pages, 14 figures

  24. arXiv:2110.13575  [pdf, other

    cs.SE cs.AI cs.NE

    Automated Support for Unit Test Generation: A Tutorial Book Chapter

    Authors: Afonso Fontes, Gregory Gay, Francisco Gomes de Oliveira Neto, Robert Feldt

    Abstract: Unit testing is a stage of testing where the smallest segment of code that can be tested in isolation from the rest of the system - often a class - is tested. Unit tests are typically written as executable code, often in a format provided by a unit testing framework such as pytest for Python. Creating unit tests is a time and effort-intensive process with many repetitive, manual elements. To ill… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: This is a preprint of a chapter from the upcoming book, "Optimising the Software Development Process with Artificial Intelligence" (Springer, 2022)

  25. arXiv:2109.00402  [pdf, other

    cs.CL cs.IR q-bio.QM

    Chronic Pain and Language: A Topic Modelling Approach to Personal Pain Descriptions

    Authors: Diogo A. P. Nunes, Joana Ferreira Gomes, Fani Neto, David Martins de Matos

    Abstract: Chronic pain is recognized as a major health problem, with impacts not only at the economic, but also at the social, and individual levels. Being a private and subjective experience, it is impossible to externally and impartially experience, describe, and interpret chronic pain as a purely noxious stimulus that would directly point to a causal agent and facilitate its mitigation, contrary to acute… ▽ More

    Submitted 17 March, 2022; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: 9 pages, 5 figures, 6 tables

    ACM Class: I.2.7; I.5.3; I.5.4; J.3; J.4

  26. arXiv:2108.10218  [pdf

    cs.CL cs.IR cs.SI q-bio.QM

    Modeling chronic pain experiences from online reports using the Reddit Reports of Chronic Pain dataset

    Authors: Diogo A. P. Nunes, Joana Ferreira-Gomes, Fani Neto, David Martins de Matos

    Abstract: Objective: Reveal and quantify qualities of reported experiences of chronic pain on social media, from multiple pathological backgrounds, by means of the novel Reddit Reports of Chronic Pain (RRCP) dataset, using Natural Language Processing techniques. Materials and Methods: Define and validate the RRCP dataset for a set of subreddits related to chronic pain. Identify the main concerns discussed i… ▽ More

    Submitted 18 November, 2022; v1 submitted 23 August, 2021; originally announced August 2021.

    Comments: 24 pages, 26 figures, 8 tables

    ACM Class: I.2.7; I.5.3; I.5.4; J.3; J.4

    Journal ref: Information 2023, 14(4), 237

  27. On Applying the Lackadaisical Quantum Walk Algorithm to Search for Multiple Solutions on Grids

    Authors: Jonathan H. A. de Carvalho, Luciano S. de Souza, Fernando M. de Paula Neto, Tiago A. E. Ferreira

    Abstract: Quantum computing promises to improve the information processing power to levels unreachable by classical computation. Quantum walks are heading the development of quantum algorithms for searching information on graphs more efficiently than their classical counterparts. A quantum-walk-based algorithm standing out in the literature is the lackadaisical quantum walk. The lackadaisical quantum walk i… ▽ More

    Submitted 9 January, 2023; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: Accepted manuscript. 23 pages, 7 figures

    Journal ref: Information Sciences 622 (2023) 873-888

  28. arXiv:2102.11121  [pdf, other

    cs.CV

    Direct Estimation of Appearance Models for Segmentation

    Authors: Jeova F. S. Rocha Neto, Pedro Felzenszwalb, Marilyn Vazquez

    Abstract: Image segmentation algorithms often depend on appearance models that characterize the distribution of pixel values in different image regions. We describe a new approach for estimating appearance models directly from an image, without explicit consideration of the pixels that make up each region. Our approach is based on novel algebraic expressions that relate local image statistics to the appeara… ▽ More

    Submitted 15 September, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: To appear in the SIAM Journal on Imaging Sciences (SIIMS)

    MSC Class: 68U10; 62M05; 62H30; 65C20

  29. arXiv:2011.12999  [pdf, other

    cs.GR cs.CV cs.SD eess.AS

    Learning to dance: A graph convolutional adversarial network to generate realistic dance motions from audio

    Authors: João P. Ferreira, Thiago M. Coutinho, Thiago L. Gomes, José F. Neto, Rafael Azevedo, Renato Martins, Erickson R. Nascimento

    Abstract: Synthesizing human motion through learning techniques is becoming an increasingly popular approach to alleviating the requirement of new data capture to produce animations. Learning to move naturally from music, i.e., to dance, is one of the more complex motions humans often perform effortlessly. Each dance movement is unique, yet such movements maintain the core characteristics of the dance style… ▽ More

    Submitted 30 November, 2020; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: Accepted at the Elsevier Computers & Graphics (C&G) 2020

  30. Using mutation testing to measure behavioural test diversity

    Authors: Francisco Gomes de Oliveira Neto, Felix Dobslaw, Robert Feldt

    Abstract: Diversity has been proposed as a key criterion to improve testing effectiveness and efficiency.It can be used to optimise large test repositories but also to visualise test maintenance issues and raise practitioners' awareness about waste in test artefacts and processes. Even though these diversity-based testing techniques aim to exercise diverse behavior in the system under test (SUT), the divers… ▽ More

    Submitted 18 October, 2020; originally announced October 2020.

    Comments: Published at the 15th International Workshop on Mutation Analysis

  31. arXiv:2008.13028  [pdf, other

    cs.DB cs.HC

    STULL: Unbiased Online Sampling for Visual Exploration of Large Spatiotemporal Data

    Authors: Guizhen Wang, Jingjing Guo, Mingjie Tang, José Florencio de Queiroz Neto, Calvin Yau, Anas Daghistani, Morteza Karimzadeh, Walid G. Aref, David S. Ebert

    Abstract: Online sampling-supported visual analytics is increasingly important, as it allows users to explore large datasets with acceptable approximate answers at interactive rates. However, existing online spatiotemporal sampling techniques are often biased, as most researchers have primarily focused on reducing computational latency. Biased sampling approaches select data with unequal probabilities and p… ▽ More

    Submitted 29 August, 2020; originally announced August 2020.

    Comments: IEEE VIS (InfoVis/VAST/SciVis) 2020 ACM 2012 CCS - Human-centered computing, Visualization, Visualization design and evaluation methods

    ACM Class: H.3.3

  32. Critical Point Calculations by Numerical Inversion of Functions

    Authors: C. N. Parajara, G. M. Platt, F. D. Moura Neto, M. Escobar, G. B. Libotte

    Abstract: In this work, we propose a new approach to the problem of critical point calculation, based on the formulation of Heidemann and Khalil (1980). This leads to a $2 \times 2$ system of nonlinear algebraic equations in temperature and molar volume, which makes possible the prediction of critical points of the mixture through an adaptation of the technique of inversion of functions from the plane to th… ▽ More

    Submitted 30 May, 2020; originally announced June 2020.

  33. arXiv:2006.06573  [pdf, other

    cs.CV cs.LG eess.IV

    Spectral Image Segmentation with Global Appearance Modeling

    Authors: Jeova F. S. Rocha Neto, Pedro F. Felzenszwalb

    Abstract: We introduce a new spectral method for image segmentation that incorporates long range relationships for global appearance modeling. The approach combines two different graphs, one is a sparse graph that captures spatial relationships between nearby pixels and another is a dense graph that captures pairwise similarity between all pairs of pixels. We extend the spectral method for Normalized Cuts t… ▽ More

    Submitted 6 October, 2022; v1 submitted 11 June, 2020; originally announced June 2020.

    ACM Class: I.4; I.5

  34. An Empirical Study of Bots in Software Development -- Characteristics and Challenges from a Practitioner's Perspective

    Authors: Linda Erlenhov, Francisco Gomes de Oliveira Neto, Philipp Leitner

    Abstract: Software engineering bots - automated tools that handle tedious tasks - are increasingly used by industrial and open source projects to improve developer productivity. Current research in this area is held back by a lack of consensus of what software engineering bots (DevBots) actually are, what characteristics distinguish them from other tools, and what benefits and challenges are associated with… ▽ More

    Submitted 29 October, 2020; v1 submitted 28 May, 2020; originally announced May 2020.

    Comments: To be published at the ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE)

  35. Challenges and guidelines on designing test cases for test bots

    Authors: Linda Erlenhov, Francisco Gomes de Oliveira Neto, Martin Chukaleski, Samer Daknache

    Abstract: Test bots are automated testing tools that autonomously and periodically run a set of test cases that check whether the system under test meets the requirements set forth by the customer. The automation decreases the amount of time a development team spends on testing. As development projects become larger, it is important to focus on improving the test bots by designing more effective test cases… ▽ More

    Submitted 21 April, 2020; originally announced April 2020.

    Comments: To be published in IEEE/ACM 42nd International Conference on Software Engineering Workshops (ICSEW'20), May 23--29, 2020, Seoul, Republic of Korea

  36. Boundary Value Exploration for Software Analysis

    Authors: Felix Dobslaw, Francisco Gomes de Oliveira Neto, Robert Feldt

    Abstract: For software to be reliable and resilient, it is widely accepted that tests must be created and maintained alongside the software itself. One safeguard from vulnerabilities and failures in code is to ensure correct behavior on the boundaries between the input space sub-domains. So-called boundary value analysis (BVA) and boundary value testing (BVT) techniques aim to exercise those boundaries and… ▽ More

    Submitted 12 October, 2020; v1 submitted 18 January, 2020; originally announced January 2020.

  37. arXiv:1912.04030  [pdf, other

    cs.NI cs.IT cs.LG

    Adaptive Modulation and Coding based on Reinforcement Learning for 5G Networks

    Authors: Mateus P. Mota, Daniel C. Araujo, Francisco Hugo Costa Neto, Andre L. F. de Almeida, F. Rodrigo P. Cavalcanti

    Abstract: We design a self-exploratory reinforcement learning (RL) framework, based on the Q-learning algorithm, that enables the base station (BS) to choose a suitable modulation and coding scheme (MCS) that maximizes the spectral efficiency while maintaining a low block error rate (BLER). In this framework, the BS chooses the MCS based on the channel quality indicator (CQI) reported by the user equipment… ▽ More

    Submitted 25 November, 2019; originally announced December 2019.

    Comments: Accepted for presentation at the IEEE GLOBECOM 2019

  38. arXiv:1907.03475  [pdf, other

    cs.SE

    Estimating Return on Investment for GUI Test Automation Tools

    Authors: Felix Dobslaw, Robert Feldt, David Michaelsson, Patrick Haar, Francisco G. de Oliveira Neto, Richard Torkar

    Abstract: Automated graphical user interface (GUI) tests can reduce manual testing activities and increase test frequency. This motivates the conversion of manual test cases into automated GUI tests. However, it is not clear whether such automation is cost-effective given that GUI automation scripts add to the code base and demand maintenance as a system evolves. In this paper, we introduce a method for est… ▽ More

    Submitted 1 November, 2019; v1 submitted 8 July, 2019; originally announced July 2019.

    Comments: 12 pages

  39. arXiv:1905.12804   

    cs.SD cs.IR cs.LG eess.AS stat.ML

    A Music Classification Model based on Metric Learning and Feature Extraction from MP3 Audio Files

    Authors: Angelo C. Mendes da Silva, Mauricio A. Nunes, Raul Fonseca Neto

    Abstract: The development of models for learning music similarity and feature extraction from audio media files is an increasingly important task for the entertainment industry. This work proposes a novel music classification model based on metric learning and feature extraction from MP3 audio files. The metric learning process considers the learning of a set of parameterized distances employing a structure… ▽ More

    Submitted 17 September, 2019; v1 submitted 29 May, 2019; originally announced May 2019.

    Comments: In a review process, I found some errors and made some changes in methodology that improved my results. Once I finish the experiments, I will upload the new version

  40. arXiv:1811.12081  [pdf, other

    eess.SP cs.LG stat.ML

    Deep Haar Scattering Networks in Pattern Recognition: A promising approach

    Authors: Fernando Fernandes Neto, Alemayehu Admasu Solomon, Rodrigo de Losso, Claudio Garcia, Pedro Delano Cavalcanti

    Abstract: The aim of this paper is to discuss the use of Haar scattering networks, which is a very simple architecture that naturally supports a large number of stacked layers, yet with very few parameters, in a relatively broad set of pattern recognition problems, including regression and classification tasks. This architecture, basically, consists of stacking convolutional filters, that can be thought as… ▽ More

    Submitted 29 November, 2018; originally announced November 2018.

  41. arXiv:1810.01061  [pdf

    stat.ML cs.LG

    Feature Selection Approach with Missing Values Conducted for Statistical Learning: A Case Study of Entrepreneurship Survival Dataset

    Authors: Diego Nascimento, Anderson Ara, Francisco Louzada Neto

    Abstract: In this article, we investigate the features which enhanced discriminate the survival in the micro and small business (MSE) using the approach of data mining with feature selection. According to the complexity of the data set, we proposed a comparison of three data imputation methods such as mean imputation (MI), k-nearest neighbor (KNN) and expectation maximization (EM) using mutually the selecti… ▽ More

    Submitted 2 October, 2018; originally announced October 2018.

  42. arXiv:1809.09849  [pdf, other

    cs.SE

    A Method to Assess and Argue for Practical Significance in Software Engineering

    Authors: Richard Torkar, Carlo A. Furia, Robert Feldt, Francisco Gomes de Oliveira Neto, Lucas Gren, Per Lenberg, Neil A. Ernst

    Abstract: A key goal of empirical research in software engineering is to assess practical significance, which answers whether the observed effects of some compared treatments show a relevant difference in practice in realistic scenarios. Even though plenty of standard techniques exist to assess statistical significance, connecting it to practical significance is not straightforward or routinely done; indeed… ▽ More

    Submitted 25 December, 2020; v1 submitted 26 September, 2018; originally announced September 2018.

    Comments: 13 pages, 9 figures, 3 tables. Minor rev update

  43. arXiv:1808.07089  [pdf, other

    cs.IR cs.LG

    CoBaR: Confidence-Based Recommender

    Authors: Fernando S. Aguiar Neto, Arthur F. da Costa, Marcelo G. Manzato

    Abstract: Neighborhood-based collaborative filtering algorithms usually adopt a fixed neighborhood size for every user or item, although groups of users or items may have different lengths depending on users' preferences. In this paper, we propose an extension to a non-personalized recommender based on confidence intervals and hierarchical clustering to generate groups of users with optimal sizes. The evalu… ▽ More

    Submitted 21 August, 2018; originally announced August 2018.

  44. arXiv:1807.05593  [pdf, other

    cs.SE

    Visualizing test diversity to support test optimisation

    Authors: Francisco Gomes de Oliveira Neto, Robert Feldt, Linda Erlenhov, José Benardi de Souza Nunes

    Abstract: Diversity has been used as an effective criteria to optimise test suites for cost-effective testing. Particularly, diversity-based (alternatively referred to as similarity-based) techniques have the benefit of being generic and applicable across different Systems Under Test (SUT), and have been used to automatically select or prioritise large sets of test cases. However, it is a challenge to feedb… ▽ More

    Submitted 17 July, 2018; v1 submitted 15 July, 2018; originally announced July 2018.

  45. arXiv:1804.03236  [pdf

    stat.ML cs.LG

    Building Function Approximators on top of Haar Scattering Networks

    Authors: Fernando Fernandes Neto

    Abstract: In this article we propose building general-purpose function approximators on top of Haar Scattering Networks. We advocate that this architecture enables a better comprehension of feature extraction, in addition to its implementation simplicity and low computational costs. We show its approximation and feature extraction capabilities in a wide range of different problems, which can be applied on s… ▽ More

    Submitted 9 April, 2018; originally announced April 2018.

    Comments: 7 pages, 5 figures, to appear in International Journal of Machine Learning and Computing, vol. 8 number 3

  46. arXiv:1802.07140  [pdf, other

    cs.SE

    A Testability Analysis Framework for Non-Functional Properties

    Authors: Michael Felderer, Bogdan Marculescu, Francisco Gomes de Oliveira Neto, Robert Feldt, Richard Torkar

    Abstract: This paper presents background, the basic steps and an example for a testability analysis framework for non-functional properties.

    Submitted 20 February, 2018; originally announced February 2018.

  47. arXiv:1802.02033  [pdf, other

    cs.SE

    Ways of Applying Artificial Intelligence in Software Engineering

    Authors: Robert Feldt, Francisco G. de Oliveira Neto, Richard Torkar

    Abstract: As Artificial Intelligence (AI) techniques have become more powerful and easier to use they are increasingly deployed as key components of modern software systems. While this enables new functionality and often allows better adaptation to user needs it also creates additional problems for software engineers and exposes companies to new risks. Some work has been done to better understand the intera… ▽ More

    Submitted 7 February, 2018; v1 submitted 6 February, 2018; originally announced February 2018.

  48. arXiv:1801.03523  [pdf

    stat.ML cs.NE physics.comp-ph q-fin.CP

    Generative Models for Stochastic Processes Using Convolutional Neural Networks

    Authors: Fernando Fernandes Neto

    Abstract: The present paper aims to demonstrate the usage of Convolutional Neural Networks as a generative model for stochastic processes, enabling researchers from a wide range of fields (such as quantitative finance and physics) to develop a general tool for forecasts and simulations without the need to identify/assume a specific system structure or estimate its parameters.

    Submitted 8 January, 2018; originally announced January 2018.

  49. arXiv:1712.01697  [pdf, ps, other

    cs.CV cs.GR cs.NE eess.IV

    Dialectical Multispectral Classification of Diffusion-Weighted Magnetic Resonance Images as an Alternative to Apparent Diffusion Coefficients Maps to Perform Anatomical Analysis

    Authors: Wellington Pinheiro dos Santos, Francisco Marcos de Assis, Ricardo Emmanuel de Souza, Plínio Batista dos Santos Filho, Fernando Buarque de Lima Neto

    Abstract: Multispectral image analysis is a relatively promising field of research with applications in several areas, such as medical imaging and satellite monitoring. A considerable number of current methods of analysis are based on parametric statistics. Alternatively, some methods in Computational Intelligence are inspired by biology and other sciences. Here we claim that Philosophy can be also consider… ▽ More

    Submitted 3 December, 2017; originally announced December 2017.

    Journal ref: Computerized Medical Imaging and Graphics, v. 33, p. 442-460, 2009

  50. arXiv:1711.04188  [pdf, ps, other

    cs.SE

    Assessing Agile Transformation Success Factors

    Authors: Amadeu Silveira Campanelli, Florindo Silote Neto, Fernando Silva Parreiras

    Abstract: Research on success factors involved in the agile transformation process is not conclusive and there is still need for guidelines to help in the transformation process considering the organizational context (culture, values, needs, reality and goals). The usage of success factors as a tool to help agile adoption raises the following research question: What are the success factors for an organizati… ▽ More

    Submitted 11 November, 2017; originally announced November 2017.