Zum Hauptinhalt springen

Showing 1–50 of 692 results for author: De, P

Searching in archive cs. Search in all archives.
.
  1. CryptoAnalytics: Cryptocoins Price Forecasting with Machine Learning Techniques

    Authors: Pasquale De Rosa, Pascal Felber, Valerio Schiavoni

    Abstract: This paper introduces CryptoAnalytics, a software toolkit for cryptocoins price forecasting with machine learning (ML) techniques. Cryptocoins are tradable digital assets exchanged for specific trading prices. While history has shown the extreme volatility of such trading prices, the ability to efficiently model and forecast the time series resulting from the exchange price volatility remains an o… ▽ More

    Submitted 6 September, 2024; originally announced September 2024.

    Journal ref: SoftwareX, Volume 26, May 2024

  2. Practical Forecasting of Cryptocoins Timeseries using Correlation Patterns

    Authors: Pasquale De Rosa, Pascal Felber, Valerio Schiavoni

    Abstract: Cryptocoins (i.e., Bitcoin, Ether, Litecoin) are tradable digital assets. Ownerships of cryptocoins are registered on distributed ledgers (i.e., blockchains). Secure encryption techniques guarantee the security of the transactions (transfers of coins among owners), registered into the ledger. Cryptocoins are exchanged for specific trading prices. The extreme volatility of such trading prices acros… ▽ More

    Submitted 5 September, 2024; originally announced September 2024.

    Journal ref: DEBS 2023: Proceedings of the 17th ACM International Conference on Distributed and Event-based Systems

  3. arXiv:2409.03043  [pdf, other

    cs.CV cs.AI cs.LG

    Can Your Generative Model Detect Out-of-Distribution Covariate Shift?

    Authors: Christiaan Viviers, Amaan Valiuddin, Francisco Caetano, Lemar Abdi, Lena Filatova, Peter de With, Fons van der Sommen

    Abstract: Detecting Out-of-Distribution~(OOD) sensory data and covariate distribution shift aims to identify new test examples with different high-level image statistics to the captured, normal and In-Distribution (ID) set. Existing OOD detection literature largely focuses on semantic shift with little-to-no consensus over covariate shift. Generative models capture the ID data in an unsupervised manner, ena… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: ECCV 2024

  4. arXiv:2409.02562  [pdf, other

    cs.CV

    Interacting Multiple Model-based Joint Homography Matrix and Multiple Object State Estimation

    Authors: Paul Johannes Claasen, Johan Pieter de Villiers

    Abstract: A novel MOT algorithm, IMM Joint Homography State Estimation (IMM-JHSE), is proposed. By jointly modelling the camera projection matrix as part of track state vectors, IMM-JHSE removes the explicit influence of camera motion compensation techniques on predicted track position states, which was prevalent in previous approaches. Expanding upon this, static and dynamic camera motion models are combin… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: Preprint submitted to Information Fusion

  5. arXiv:2409.01152  [pdf, other

    cs.CL cs.IR

    Real World Conversational Entity Linking Requires More Than Zeroshots

    Authors: Mohanna Hoveyda, Arjen P. de Vries, Maarten de Rijke, Faegheh Hasibi

    Abstract: Entity linking (EL) in conversations faces notable challenges in practical applications, primarily due to the scarcity of entity-annotated conversational datasets and sparse knowledge bases (KB) containing domain-specific, long-tail entities. We designed targeted evaluation scenarios to measure the efficacy of EL models under resource constraints. Our evaluation employs two KBs: Fandom, exemplifyi… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

  6. Extending the C/C++ Memory Model with Inline Assembly

    Authors: Paulo Emílio de Vilhena, Ori Lahav, Viktor Vafeiadis, Azalea Raad

    Abstract: Programs written in C/C++ often include inline assembly: a snippet of architecture-specific assembly code used to access low-level functionalities that are impossible or expensive to simulate in the source language. Although inline assembly is widely used, its semantics has not yet been formally studied. In this paper, we overcome this deficiency by investigating the effect of inline assembly on… ▽ More

    Submitted 2 September, 2024; v1 submitted 30 August, 2024; originally announced August 2024.

    ACM Class: F.3.2

  7. arXiv:2408.13167  [pdf, other

    stat.ML cs.LG

    A density ratio framework for evaluating the utility of synthetic data

    Authors: Thom Benjamin Volker, Peter-Paul de Wolf, Erik-Jan van Kesteren

    Abstract: Synthetic data generation is a promising technique to facilitate the use of sensitive data while mitigating the risk of privacy breaches. However, for synthetic data to be useful in downstream analysis tasks, it needs to be of sufficient quality. Various methods have been proposed to measure the utility of synthetic data, but their results are often incomplete or even misleading. In this paper, we… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

  8. arXiv:2408.12945  [pdf, other

    cs.CV

    Find the Assembly Mistakes: Error Segmentation for Industrial Applications

    Authors: Dan Lehman, Tim J. Schoonbeek, Shao-Hsuan Hung, Jacek Kustra, Peter H. N. de With, Fons van der Sommen

    Abstract: Recognizing errors in assembly and maintenance procedures is valuable for industrial applications, since it can increase worker efficiency and prevent unplanned down-time. Although assembly state recognition is gaining attention, none of the current works investigate assembly error localization. Therefore, we propose StateDiffNet, which localizes assembly errors based on detecting the differences… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

    Comments: 23 pages (14 main paper, 2 references, 7 supplementary), 15 figures (8 main paper, 7 supplementary). Accepted at ECCV Vision-based InduStrial InspectiON (VISION) workshop

  9. arXiv:2408.12175  [pdf, other

    cs.LG stat.ML

    How disentangled are your classification uncertainties?

    Authors: Ivo Pascal de Jong, Andreea Ioana Sburlea, Matias Valdenegro-Toro

    Abstract: Uncertainty Quantification in Machine Learning has progressed to predicting the source of uncertainty in a prediction: Uncertainty from stochasticity in the data (aleatoric), or uncertainty from limitations of the model (epistemic). Generally, each uncertainty is evaluated in isolation, but this obscures the fact that they are often not truly disentangled. This work proposes a set of experiments t… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: 11 pages, 11 figures

  10. arXiv:2408.11700  [pdf, other

    cs.CV

    Supervised Representation Learning towards Generalizable Assembly State Recognition

    Authors: Tim J. Schoonbeek, Goutham Balachandran, Hans Onvlee, Tim Houben, Shao-Hsuan Hung, Jacek Kustra, Peter H. N. de With, Fons van der Sommen

    Abstract: Assembly state recognition facilitates the execution of assembly procedures, offering feedback to enhance efficiency and minimize errors. However, recognizing assembly states poses challenges in scalability, since parts are frequently updated, and the robustness to execution errors remains underexplored. To address these challenges, this paper proposes an approach based on representation learning… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 8 pages, 8 figures

  11. arXiv:2408.08995  [pdf, other

    cs.AI

    On the Undecidability of Artificial Intelligence Alignment: Machines that Halt

    Authors: Gabriel Adriano de Melo, Marcos Ricardo Omena De Albuquerque Maximo, Nei Yoshihiro Soma, Paulo Andre Lima de Castro

    Abstract: The inner alignment problem, which asserts whether an arbitrary artificial intelligence (AI) model satisfices a non-trivial alignment function of its outputs given its inputs, is undecidable. This is rigorously proved by Rice's theorem, which is also equivalent to a reduction to Turing's Halting Problem, whose proof sketch is presented in this work. Nevertheless, there is an enumerable set of prov… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: Submitted for the Scientific Reports AI Alignment Collection

  12. arXiv:2407.17904  [pdf, other

    cs.CV

    Exploring the Effect of Dataset Diversity in Self-Supervised Learning for Surgical Computer Vision

    Authors: Tim J. M. Jaspers, Ronald L. P. D. de Jong, Yasmina Al Khalil, Tijn Zeelenberg, Carolus H. J. Kusters, Yiping Li, Romy C. van Jaarsveld, Franciscus H. A. Bakker, Jelle P. Ruurda, Willem M. Brinkman, Peter H. N. De With, Fons van der Sommen

    Abstract: Over the past decade, computer vision applications in minimally invasive surgery have rapidly increased. Despite this growth, the impact of surgical computer vision remains limited compared to other medical fields like pathology and radiology, primarily due to the scarcity of representative annotated data. Whereas transfer learning from large annotated datasets such as ImageNet has been convention… ▽ More

    Submitted 26 July, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

    Comments: accepted - Data Engineering in Medical Imaging (DEMI) Workshop @ MICCAI2024

  13. arXiv:2407.14772  [pdf, other

    cs.CV

    Subgraph Clustering and Atom Learning for Improved Image Classification

    Authors: Aryan Singh, Pepijn Van de Ven, Ciarán Eising, Patrick Denny

    Abstract: In this study, we present the Graph Sub-Graph Network (GSN), a novel hybrid image classification model merging the strengths of Convolutional Neural Networks (CNNs) for feature extraction and Graph Neural Networks (GNNs) for structural modeling. GSN employs k-means clustering to group graph nodes into clusters, facilitating the creation of subgraphs. These subgraphs are then utilized to learn repr… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

  14. arXiv:2407.09885  [pdf, other

    cs.DB

    Statistical Validation of Column Matching in the Database Schema Evolution of the Brazilian Public School Census

    Authors: Muriki G. Yamanaka, Diogo H. de Almeida, Paulo R. Lisboa de Almeida, Simone Dominico, Leticia M. Peres, Marcos S. Sunye, Eduardo C. de Almeida

    Abstract: Publicly available datasets are subject to new versions, with each new version potentially reflecting changes to the data. These changes may involve adding or removing attributes, changing data types, and modifying values or their semantics. Integrating these datasets into a database poses a significant challenge: how to keep track of the evolving database schema while incorporating different vers… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: Accepted for presentation at the Simposio Brasileiro de Bancos de Dados (SBBD) 2024

  15. arXiv:2407.08855  [pdf, other

    eess.IV cs.CV

    BraTS-PEDs: Results of the Multi-Consortium International Pediatric Brain Tumor Segmentation Challenge 2023

    Authors: Anahita Fathi Kazerooni, Nastaran Khalili, Xinyang Liu, Debanjan Haldar, Zhifan Jiang, Anna Zapaishchykova, Julija Pavaine, Lubdha M. Shah, Blaise V. Jones, Nakul Sheth, Sanjay P. Prabhu, Aaron S. McAllister, Wenxin Tu, Khanak K. Nandolia, Andres F. Rodriguez, Ibraheem Salman Shaikh, Mariana Sanchez Montano, Hollie Anne Lai, Maruf Adewole, Jake Albrecht, Udunna Anazodo, Hannah Anderson, Syed Muhammed Anwar, Alejandro Aristizabal, Sina Bagheri , et al. (55 additional authors not shown)

    Abstract: Pediatric central nervous system tumors are the leading cause of cancer-related deaths in children. The five-year survival rate for high-grade glioma in children is less than 20%. The development of new treatments is dependent upon multi-institutional collaborative clinical trials requiring reproducible and accurate centralized response assessment. We present the results of the BraTS-PEDs 2023 cha… ▽ More

    Submitted 16 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

  16. arXiv:2407.02099  [pdf, other

    cs.CL

    Helpful assistant or fruitful facilitator? Investigating how personas affect language model behavior

    Authors: Pedro Henrique Luz de Araujo, Benjamin Roth

    Abstract: One way to personalize and steer generations from large language models (LLM) is to assign a persona: a role that describes how the user expects the LLM to behave (e.g., a helpful assistant, a teacher, a woman). This paper investigates how personas affect diverse aspects of model behavior. We assign to seven LLMs 162 personas from 12 categories spanning variables like gender, sexual orientation, a… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 20 pages, 12 figures

  17. arXiv:2407.02075  [pdf, other

    cs.CV

    Label Anything: Multi-Class Few-Shot Semantic Segmentation with Visual Prompts

    Authors: Pasquale De Marinis, Nicola Fanelli, Raffaele Scaringi, Emanuele Colonna, Giuseppe Fiameni, Gennaro Vessio, Giovanna Castellano

    Abstract: We present Label Anything, an innovative neural network architecture designed for few-shot semantic segmentation (FSS) that demonstrates remarkable generalizability across multiple classes with minimal examples required per class. Diverging from traditional FSS methods that predominantly rely on masks for annotating support images, Label Anything introduces varied visual prompts -- points, boundin… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  18. arXiv:2406.19509  [pdf

    cs.DB

    Semantic orchestration and exploitation of material data: A dataspace solution demonstrated on steel and copper applications

    Authors: Yoav Nahshon, Lukas Morand, Matthias Büschelberger, Dirk Helm, Kiran Kumaraswamy, Paul Zierep, Matthias Weber, Pablo de Andrés

    Abstract: In the field of materials science and manufacturing, a vast amount of heterogeneous data exists, encompassing measurement and simulation data, machine data, publications, and more. This data serves as the bedrock of valuable knowledge that can be leveraged for various engineering applications. However, efficiently storing and handling such diverse data remain significantly challenging, often due t… ▽ More

    Submitted 3 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

  19. arXiv:2406.18787  [pdf, other

    cs.LG stat.ML

    Unified Uncertainties: Combining Input, Data and Model Uncertainty into a Single Formulation

    Authors: Matias Valdenegro-Toro, Ivo Pascal de Jong, Marco Zullich

    Abstract: Modelling uncertainty in Machine Learning models is essential for achieving safe and reliable predictions. Most research on uncertainty focuses on output uncertainty (predictions), but minimal attention is paid to uncertainty at inputs. We propose a method for propagating uncertainty in the inputs through a Neural Network that is simultaneously able to estimate input, data, and model uncertainty.… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 4 pages, 3 figures, with appendix. LatinX in AI Research Workshop @ ICML 2024 Camera Ready

  20. arXiv:2406.18589  [pdf, other

    cs.CV cs.LG

    Text-Guided Alternative Image Clustering

    Authors: Andreas Stephan, Lukas Miklautz, Collin Leiber, Pedro Henrique Luz de Araujo, Dominik Répás, Claudia Plant, Benjamin Roth

    Abstract: Traditional image clustering techniques only find a single grouping within visual data. In particular, they do not provide a possibility to explicitly define multiple types of clustering. This work explores the potential of large vision-language models to facilitate alternative image clustering. We propose Text-Guided Alternative Image Consensus Clustering (TGAICC), a novel approach that leverages… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  21. arXiv:2406.18247  [pdf, other

    eess.IV cs.CV cs.LG

    Generative artificial intelligence in ophthalmology: multimodal retinal images for the diagnosis of Alzheimer's disease with convolutional neural networks

    Authors: I. R. Slootweg, M. Thach, K. R. Curro-Tafili, F. D. Verbraak, F. H. Bouwman, Y. A. L. Pijnenburg, J. F. Boer, J. H. P. de Kwisthout, L. Bagheriye, P. J. González

    Abstract: Background/Aim. This study aims to predict Amyloid Positron Emission Tomography (AmyloidPET) status with multimodal retinal imaging and convolutional neural networks (CNNs) and to improve the performance through pretraining with synthetic data. Methods. Fundus autofluorescence, optical coherence tomography (OCT), and OCT angiography images from 328 eyes of 59 AmyloidPET positive subjects and 108 A… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  22. arXiv:2406.17032  [pdf, other

    cs.CV

    DWARF: Disease-weighted network for attention map refinement

    Authors: Haozhe Luo, Aurélie Pahud de Mortanges, Oana Inel, Abraham Bernstein, Mauricio Reyes

    Abstract: The interpretability of deep learning is crucial for evaluating the reliability of medical imaging models and reducing the risks of inaccurate patient recommendations. This study addresses the "human out of the loop" and "trustworthiness" issues in medical image analysis by integrating medical professionals into the interpretability process. We propose a disease-weighted attention map refinement n… ▽ More

    Submitted 28 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  23. arXiv:2406.14150  [pdf, other

    cs.LG

    Multi-modal Transfer Learning between Biological Foundation Models

    Authors: Juan Jose Garau-Luis, Patrick Bordes, Liam Gonzalez, Masa Roller, Bernardo P. de Almeida, Lorenz Hexemer, Christopher Blum, Stefan Laurent, Jan Grzegorzewski, Maren Lang, Thomas Pierrot, Guillaume Richard

    Abstract: Biological sequences encode fundamental instructions for the building blocks of life, in the form of DNA, RNA, and proteins. Modeling these sequences is key to understand disease mechanisms and is an active research area in computational biology. Recently, Large Language Models have shown great promise in solving certain biological tasks but current approaches are limited to a single sequence moda… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    MSC Class: 68T07 (Primary)

  24. arXiv:2406.11772  [pdf, other

    cs.CV cs.AI

    Deep Learning methodology for the identification of wood species using high-resolution macroscopic images

    Authors: David Herrera-Poyatos, Andrés Herrera-Poyatos, Rosana Montes, Paloma de Palacios, Luis G. Esteban, Alberto García Iruela, Francisco García Fernández, Francisco Herrera

    Abstract: Significant advancements in the field of wood species identification are needed worldwide to support sustainable timber trade. In this work we contribute to automate the identification of wood species via high-resolution macroscopic images of timber. The main challenge of this problem is that fine-grained patterns in timber are crucial in order to accurately identify wood species, and these patter… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 17 pages and 6 figures

    ACM Class: I.2.1; I.2.10

  25. arXiv:2406.06494  [pdf, other

    cs.LG cs.AI

    Scaling Continuous Latent Variable Models as Probabilistic Integral Circuits

    Authors: Gennaro Gala, Cassio de Campos, Antonio Vergari, Erik Quaeghebeur

    Abstract: Probabilistic integral circuits (PICs) have been recently introduced as probabilistic models enjoying the key ingredient behind expressive generative models: continuous latent variables (LVs). PICs are symbolic computational graphs defining continuous LV models as hierarchies of functions that are summed and multiplied together, or integrated over some LVs. They are tractable if LVs can be analyti… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  26. arXiv:2405.20904  [pdf, ps, other

    math.CO cs.DM

    Solving systems of equations on antichains for the computation of the ninth Dedekind Number

    Authors: Patrick De Causmaecker, Lennart Van Hirtum

    Abstract: We study systems of equations on antichains, together with a way to count the number of solutions. We start with two simple examples, generalise and show more applications. One of the results was used in the recent computation of D(9), the others have potential to speed up existing techniques in the future.

    Submitted 18 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

  27. arXiv:2405.14806  [pdf, other

    physics.data-an cs.LG hep-ph stat.ML

    Lorentz-Equivariant Geometric Algebra Transformers for High-Energy Physics

    Authors: Jonas Spinner, Victor Bresó, Pim de Haan, Tilman Plehn, Jesse Thaler, Johann Brehmer

    Abstract: Extracting scientific understanding from particle-physics experiments requires solving diverse learning problems with high precision and good data efficiency. We propose the Lorentz Geometric Algebra Transformer (L-GATr), a new multi-purpose architecture for high-energy physics. L-GATr represents high-energy data in a geometric algebra over four-dimensional space-time and is equivariant under Lore… ▽ More

    Submitted 9 July, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 10+12 pages, 5+2 figures, 2 tables, v2: Extend acknowledgements, add link to github repo

    Report number: MIT-CTP/5723

  28. arXiv:2405.13469  [pdf, other

    astro-ph.EP astro-ph.IM cs.LG stat.AP

    Machine Learning for Exoplanet Detection in High-Contrast Spectroscopy: Revealing Exoplanets by Leveraging Hidden Molecular Signatures in Cross-Correlated Spectra with Convolutional Neural Networks

    Authors: Emily O. Garvin, Markus J. Bonse, Jean Hayoz, Gabriele Cugno, Jonas Spiller, Polychronis A. Patapis, Dominique Petit Dit de la Roche, Rakesh Nath-Ranga, Olivier Absil, Nicolai F. Meinshausen, Sascha P. Quanz

    Abstract: The new generation of observatories and instruments (VLT/ERIS, JWST, ELT) motivate the development of robust methods to detect and characterise faint and close-in exoplanets. Molecular mapping and cross-correlation for spectroscopy use molecular templates to isolate a planet's spectrum from its host star. However, reliance on signal-to-noise ratio (S/N) metrics can lead to missed discoveries, due… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 27 pages, 24 figures. Submitted for publication in A&A January 2, 2024. After first iteration with the referee, resubmitted May 17, 2024

    Journal ref: A&A 689, A143 (2024)

  29. Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries

    Authors: Christiaan G. A. Viviers, Lena Filatova, Maurice Termeer, Peter H. N. de With, Fons van der Sommen

    Abstract: Accurate 6-DoF pose estimation of surgical instruments during minimally invasive surgeries can substantially improve treatment strategies and eventual surgical outcome. Existing deep learning methods have achieved accurate results, but they require custom approaches for each object and laborious setup and training environments often stretching to extensive simulations, whilst lacking real-time com… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: Early author version of paper. Refer to the full paper at https://ieeexplore.ieee.org/document/10478293

    Journal ref: IEEE Transactions on Image Processing (2024) (Volume: 33) Page(s): 2462 - 2476

  30. arXiv:2405.09787  [pdf, other

    eess.IV cs.CV cs.LG

    Analysis of the BraTS 2023 Intracranial Meningioma Segmentation Challenge

    Authors: Dominic LaBella, Ujjwal Baid, Omaditya Khanna, Shan McBurney-Lin, Ryan McLean, Pierre Nedelec, Arif Rashid, Nourel Hoda Tahon, Talissa Altes, Radhika Bhalerao, Yaseen Dhemesh, Devon Godfrey, Fathi Hilal, Scott Floyd, Anastasia Janas, Anahita Fathi Kazerooni, John Kirkpatrick, Collin Kent, Florian Kofler, Kevin Leu, Nazanin Maleki, Bjoern Menze, Maxence Pajot, Zachary J. Reitman, Jeffrey D. Rudie , et al. (96 additional authors not shown)

    Abstract: We describe the design and results from the BraTS 2023 Intracranial Meningioma Segmentation Challenge. The BraTS Meningioma Challenge differed from prior BraTS Glioma challenges in that it focused on meningiomas, which are typically benign extra-axial tumors with diverse radiologic and anatomical presentation and a propensity for multiplicity. Nine participating teams each developed deep-learning… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 16 pages, 11 tables, 10 figures, MICCAI

  31. arXiv:2405.06478  [pdf

    cs.CY cs.AI cs.SI

    Attention is all they need: Cognitive science and the (techno)political economy of attention in humans and machines

    Authors: Pablo González de la Torre, Marta Pérez-Verdugo, Xabier E. Barandiaran

    Abstract: This paper critically analyses the "attention economy" within the framework of cognitive science and techno-political economics, as applied to both human and machine interactions. We explore how current business models, particularly in digital platform capitalism, harness user engagement by strategically shaping attentional patterns. These platforms utilize advanced AI and massive data analytics t… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  32. Doing Personal LAPS: LLM-Augmented Dialogue Construction for Personalized Multi-Session Conversational Search

    Authors: Hideaki Joko, Shubham Chatterjee, Andrew Ramsay, Arjen P. de Vries, Jeff Dalton, Faegheh Hasibi

    Abstract: The future of conversational agents will provide users with personalized information responses. However, a significant challenge in developing models is the lack of large-scale dialogue datasets that span multiple sessions and reflect real-world user preferences. Previous approaches rely on experts in a wizard-of-oz setup that is difficult to scale, particularly for personalized tasks. Our method,… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted at SIGIR 2024 (Full Paper)

  33. arXiv:2405.03004  [pdf, other

    cs.CL cs.LG

    Exploring prompts to elicit memorization in masked language model-based named entity recognition

    Authors: Yuxi Xia, Anastasiia Sedova, Pedro Henrique Luz de Araujo, Vasiliki Kougia, Lisa Nußbaumer, Benjamin Roth

    Abstract: Training data memorization in language models impacts model capability (generalization) and safety (privacy risk). This paper focuses on analyzing prompts' impact on detecting the memorization of 6 masked language model-based named entity recognition models. Specifically, we employ a diverse set of 400 automatically generated prompts, and a pairwise dataset where each pair consists of one person's… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  34. arXiv:2405.01190  [pdf, other

    cs.IT stat.AP

    On the Impact of Dynamic Beamforming on EMF Exposure and Network Coverage: A Stochastic Geometry Perspective

    Authors: Quentin Gontier, Charles Wiame, Joe Wiart, François Horlin, Christo Tsigros, Claude Oestges, Philippe De Doncker

    Abstract: This paper introduces a new mathematical framework for dynamic beamforming-based cellular networks, grounded in stochastic geometry. The framework is used to study the electromagnetic field exposure (EMFE) of active and idle users as a function of the distance between them. A novel multi-cosine antenna pattern is introduced, offering more accurate modeling by incorporating both main and side lobes… ▽ More

    Submitted 23 August, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: This work is being submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  35. arXiv:2404.19045  [pdf, other

    cs.RO

    Maritime Vessel Tank Inspection using Aerial Robots: Experience from the field and dataset release

    Authors: Mihir Dharmadhikari, Nikhil Khedekar, Paolo De Petris, Mihir Kulkarni, Morten Nissov, Kostas Alexis

    Abstract: This paper presents field results and lessons learned from the deployment of aerial robots inside ship ballast tanks. Vessel tanks including ballast tanks and cargo holds present dark, dusty environments having simultaneously very narrow openings and wide open spaces that create several challenges for autonomous navigation and inspection operations. We present a system for vessel tank inspection u… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted to the IEEE ICRA Workshop on Field Robotics 2024

  36. arXiv:2404.18905  [pdf, other

    stat.ME cs.LG stat.ML

    Detecting critical treatment effect bias in small subgroups

    Authors: Piersilvio De Bartolomeis, Javier Abad, Konstantin Donhauser, Fanny Yang

    Abstract: Randomized trials are considered the gold standard for making informed decisions in medicine, yet they often lack generalizability to the patient populations in clinical practice. Observational studies, on the other hand, cover a broader patient population but are prone to various biases. Thus, before using an observational study for decision-making, it is crucial to benchmark its treatment effect… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted for presentation at the Conference on Uncertainty in Artificial Intelligence (UAI) 2024

  37. arXiv:2404.18562  [pdf, ps, other

    cs.AR

    Time Reversal for Near-Field Communications on Multi-chip Wireless Networks

    Authors: Fátima Rodríguez-Galán, Ama Bandara, Elana Pereira de Santana, Peter Haring Bolívar, Eduard Alarcón, Sergi Abadal

    Abstract: Wireless Network-on-Chip (WNoC) has been proposed as a low-latency, versatile, and broadcast-capable complement to current interconnects in the quest for satisfying the ever-increasing communications needs of modern computing systems. However, to realize the promise of WNoC, multiple wireless links operating at several tens of Gb/s need to be created within a computing package. Unfortunately, the… ▽ More

    Submitted 30 April, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  38. arXiv:2404.17325  [pdf, ps, other

    cs.ET eess.SP

    Towards Scalable Multi-Chip Wireless Networks with Near-Field Time Reversal

    Authors: Ama Bandara, Fátima Rodríguez-Galán, Pau Talarn, Elana Pereira de Santana, Peter Haring Bolívar, Eduard Alarcón, Sergi Abadal

    Abstract: The concept of Wireless Network-on-Chip (WNoC) has emerged as a potential solution to address the escalating communication demands of modern computing systems due to their low-latency, versatility, and reconfigurability. However, for WNoC to fulfill its potential, it is essential to establish multiple high-speed wireless links across chips. Unfortunately, the compact and enclosed nature of computi… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  39. arXiv:2404.17080  [pdf, other

    cs.DM cs.DS math.CO

    Solving the Graph Burning Problem for Large Graphs

    Authors: Felipe de Carvalho Pereira, Pedro Jussieu de Rezende, Tallys Yunes, Luiz Fernando Batista Morato

    Abstract: We propose an exact algorithm for the Graph Burning Problem ($\texttt{GBP}$), an NP-hard optimization problem that models the spread of influence on social networks. Given a graph $G$ with vertex set $V$, the objective is to find a sequence of $k$ vertices in $V$, namely, $v_1, v_2, \dots, v_k$, such that $k$ is minimum and $\bigcup_{i = 1}^{k} \{u\! \in\! V\! : d(u, v_i) \leq k - i\} = V$, where… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 10 pages, 1 figure and 2 tables

    MSC Class: 68R05 (Primary) 05C85; 90C10 (Secondary) ACM Class: G.2.1

  40. arXiv:2404.15009  [pdf, other

    cs.CV eess.IV

    The Brain Tumor Segmentation in Pediatrics (BraTS-PEDs) Challenge: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs)

    Authors: Anahita Fathi Kazerooni, Nastaran Khalili, Xinyang Liu, Deep Gandhi, Zhifan Jiang, Syed Muhammed Anwar, Jake Albrecht, Maruf Adewole, Udunna Anazodo, Hannah Anderson, Ujjwal Baid, Timothy Bergquist, Austin J. Borja, Evan Calabrese, Verena Chung, Gian-Marco Conte, Farouk Dako, James Eddy, Ivan Ezhov, Ariana Familiar, Keyvan Farahani, Andrea Franson, Anurag Gottipati, Shuvanjan Haldar, Juan Eugenio Iglesias , et al. (46 additional authors not shown)

    Abstract: Pediatric tumors of the central nervous system are the most common cause of cancer-related death in children. The five-year survival rate for high-grade gliomas in children is less than 20%. Due to their rarity, the diagnosis of these entities is often delayed, their treatment is mainly based on historic treatment concepts, and clinical trials require multi-institutional collaborations. Here we pr… ▽ More

    Submitted 11 July, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2305.17033

  41. arXiv:2404.13691  [pdf, other

    cs.CV cs.RO

    A Complete System for Automated 3D Semantic-Geometric Mapping of Corrosion in Industrial Environments

    Authors: Rui Pimentel de Figueiredo, Stefan Nordborg Eriksen, Ignacio Rodriguez, Simon Bøgh

    Abstract: Corrosion, a naturally occurring process leading to the deterioration of metallic materials, demands diligent detection for quality control and the preservation of metal-based objects, especially within industrial contexts. Traditional techniques for corrosion identification, including ultrasonic testing, radio-graphic testing, and magnetic flux leakage, necessitate the deployment of expensive and… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  42. arXiv:2404.12757  [pdf, other

    cs.IT cs.NI

    Meta Distribution of Passive Electromagnetic Field Exposure in Cellular Networks

    Authors: Quentin Gontier, Charles Wiame, François Horlin, Christo Tsigros, Claude Oestges, Philippe De Doncker

    Abstract: This paper focuses on the meta distribution of electromagnetic field exposure (EMFE) experienced by a passive user in a cellular network implementing dynamic beamforming. The meta distribution serves as a valuable tool for extracting fine-grained insights into statistics of individual passive user EMFE across the network. A comprehensive stochastic geometry framework is established for this analys… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  43. arXiv:2404.12712  [pdf, other

    cs.CV cs.AI cs.LG

    uTRAND: Unsupervised Anomaly Detection in Traffic Trajectories

    Authors: Giacomo D'Amicantonio, Egor Bondarau, Peter H. N. de With

    Abstract: Deep learning-based approaches have achieved significant improvements on public video anomaly datasets, but often do not perform well in real-world applications. This paper addresses two issues: the lack of labeled data and the difficulty of explaining the predictions of a neural network. To this end, we present a framework called uTRAND, that shifts the problem of anomalous trajectory prediction… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  44. arXiv:2404.10157  [pdf, other

    cs.CV cs.LG

    Salient Object-Aware Background Generation using Text-Guided Diffusion Models

    Authors: Amir Erfan Eshratifar, Joao V. B. Soares, Kapil Thadani, Shaunak Mishra, Mikhail Kuznetsov, Yueh-Ning Ku, Paloma de Juan

    Abstract: Generating background scenes for salient objects plays a crucial role across various domains including creative design and e-commerce, as it enhances the presentation and context of subjects by integrating them into tailored environments. Background generation can be framed as a task of text-conditioned outpainting, where the goal is to extend image content beyond a salient object's boundaries on… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted for publication at CVPR 2024's Generative Models for Computer Vision workshop

  45. arXiv:2404.09236  [pdf, ps, other

    cs.CC

    The complexity of convexity number and percolation time in the cycle convexity

    Authors: Carlos V. G. C. Lima, Thiago Marcilon, Pedro Paulo de Medeiros

    Abstract: The subject of graph convexity is well explored in the literature, the so-called interval convexities above all. In this work, we explore the cycle convexity, an interval convexity whose interval function is $I(S) = S \cup \{u \mid G[S \cup \{u\}]$ has a cycle containing $u\}$. In this convexity, we prove that determine whether the convexity number of a graph $G$ is at least $k$ is \NP-complete an… ▽ More

    Submitted 6 May, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

  46. arXiv:2404.06131  [pdf, other

    cs.LO

    Weak Simplicial Bisimilarity for Polyhedral Models and SLCS_eta -- Extended Version

    Authors: Nick Bezhanishvili, Vincenzo Ciancia, David Gabelaia, Mamuka Jibladze, Diego Latella, Mieke Massink, Erik P. de Vink

    Abstract: In the context of spatial logics and spatial model checking for polyhedral models -- mathematical basis for visualisations in continuous space -- we propose a weakening of simplicial bisimilarity. We additionally propose a corresponding weak notion of $\pm$-bisimilarity on cell-poset models, a discrete representation of polyhedral models. We show that two points are weakly simplicial bisimilar iff… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  47. arXiv:2404.04736  [pdf, other

    cs.CV cs.AI cs.LG

    ProtoAL: Interpretable Deep Active Learning with prototypes for medical imaging

    Authors: Iury B. de A. Santos, André C. P. L. F. de Carvalho

    Abstract: The adoption of Deep Learning algorithms in the medical imaging field is a prominent area of research, with high potential for advancing AI-based Computer-aided diagnosis (AI-CAD) solutions. However, current solutions face challenges due to a lack of interpretability features and high data demands, prompting recent efforts to address these issues. In this study, we propose the ProtoAL method, wher… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

  48. arXiv:2403.19260  [pdf, other

    cs.CL

    NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data

    Authors: Manuel Tonneau, Pedro Vitor Quinta de Castro, Karim Lasri, Ibrahim Farouq, Lakshminarayanan Subramanian, Victor Orozco-Olvera, Samuel P. Fraiberger

    Abstract: To address the global issue of online hate, hate speech detection (HSD) systems are typically developed on datasets from the United States, thereby failing to generalize to English dialects from the Majority World. Furthermore, HSD models are often evaluated on non-representative samples, raising concerns about overestimating model performance in real-world settings. In this work, we introduce Nai… ▽ More

    Submitted 24 June, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: ACL 2024 main conference. Data and models available at https://github.com/worldbank/NaijaHate

  49. arXiv:2403.15431  [pdf, other

    eess.SP cs.HC cs.LG

    Transferring BCI models from calibration to control: Observing shifts in EEG features

    Authors: Ivo Pascal de Jong, Lüke Luna van den Wittenboer, Matias Valdenegro-Toro, Andreea Ioana Sburlea

    Abstract: Public Motor Imagery-based brain-computer interface (BCI) datasets are being used to develop increasingly good classifiers. However, they usually follow discrete paradigms where participants perform Motor Imagery at regularly timed intervals. It is often unclear what changes may happen in the EEG patterns when users attempt to perform a control task with such a BCI. This may lead to generalisation… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  50. arXiv:2403.15185  [pdf, other

    cs.CL

    Investigating the Performance of Language Models for Completing Code in Functional Programming Languages: a Haskell Case Study

    Authors: Tim van Dam, Frank van der Heijden, Philippe de Bekker, Berend Nieuwschepen, Marc Otten, Maliheh Izadi

    Abstract: Language model-based code completion models have quickly grown in use, helping thousands of developers write code in many different programming languages. However, research on code completion models typically focuses on imperative languages such as Python and JavaScript, which results in a lack of representation for functional programming languages. Consequently, these models often perform poorly… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: To appear in the First Special Event on AI Foundation Models and Software Engineering (FORGE 2024)