
Showing 1–50 of 196 results for author: Pérez, A

Searching in archive cs.
  1. arXiv:2409.03542  [pdf, other]

    cs.LG

    Risk-based Calibration for Probabilistic Classifiers

    Authors: Aritz Pérez, Carlos Echegoyen, Guzmán Santafé

    Abstract: We introduce a general iterative procedure called risk-based calibration (RC) designed to minimize the empirical risk under the 0-1 loss (empirical error) for probabilistic classifiers. These classifiers are based on modeling probability distributions, including those constructed from the joint distribution (generative) and those based on the class conditional distribution (conditional). RC can be…

    Submitted 5 September, 2024; originally announced September 2024.

  2. arXiv:2409.01184  [pdf, other]

    cs.CV

    PitVis-2023 Challenge: Workflow Recognition in videos of Endoscopic Pituitary Surgery

    Authors: Adrito Das, Danyal Z. Khan, Dimitrios Psychogyios, Yitong Zhang, John G. Hanrahan, Francisco Vasconcelos, You Pang, Zhen Chen, Jinlin Wu, Xiaoyang Zou, Guoyan Zheng, Abdul Qayyum, Moona Mazher, Imran Razzak, Tianbin Li, Jin Ye, Junjun He, Szymon Płotka, Joanna Kaleta, Amine Yamlahi, Antoine Jund, Patrick Godau, Satoshi Kondo, Satoshi Kasai, Kousuke Hirasawa, et al. (7 additional authors not shown)

    Abstract: The field of computer vision applied to videos of minimally invasive surgery is ever-growing. Workflow recognition pertains to the automated recognition of various aspects of a surgery, including which surgical steps are performed and which surgical instruments are used. This information can later be used to assist clinicians when learning the surgery, during live surgery, and when writing operat…

    Submitted 2 September, 2024; originally announced September 2024.

  3. Integrating Quantum Computing Resources into Scientific HPC Ecosystems

    Authors: Thomas Beck, Alessandro Baroni, Ryan Bennink, Gilles Buchs, Eduardo Antonio Coello Perez, Markus Eisenbach, Rafael Ferreira da Silva, Muralikrishnan Gopalakrishnan Meena, Kalyan Gottiparthi, Peter Groszkowski, Travis S. Humble, Ryan Landfield, Ketan Maheshwari, Sarp Oral, Michael A. Sandoval, Amir Shehata, In-Saeng Suh, Christopher Zimmer

    Abstract: Quantum Computing (QC) offers significant potential to enhance scientific discovery in fields such as quantum chemistry, optimization, and artificial intelligence. Yet QC faces challenges due to the noisy intermediate-scale quantum era's inherent external noise issues. This paper discusses the integration of QC as a computational accelerator within classical scientific high-performance computing (…

    Submitted 28 August, 2024; originally announced August 2024.

  4. arXiv:2408.04902  [pdf, other]

    cs.LO

    Algorithms for Markov Binomial Chains

    Authors: Alejandro Alarcón Gonzalez, Niel Hens, Tim Leys, Guillermo A. Pérez

    Abstract: We study algorithms to analyze a particular class of Markov population processes that is often used in epidemiology. More specifically, Markov binomial chains are the model that arises from stochastic time-discretizations of classical compartmental models. In this work we formalize this class of Markov population processes and focus on the problem of computing the expected time to termination in a…

    Submitted 9 August, 2024; originally announced August 2024.

  5. In-Situ Techniques on GPU-Accelerated Data-Intensive Applications

    Authors: Yi Ju, Mingshuai Li, Adalberto Perez, Laura Bellentani, Niclas Jansson, Stefano Markidis, Philipp Schlatter, Erwin Laure

    Abstract: The computational power of High-Performance Computing (HPC) systems is constantly increasing; however, their input/output (IO) performance grows relatively slowly, and their storage capacity is also limited. This imbalance presents significant challenges for applications such as Molecular Dynamics (MD) and Computational Fluid Dynamics (CFD), which generate massive amounts of data for further visua…

    Submitted 30 July, 2024; originally announced July 2024.

  6. Understanding the Impact of Synchronous, Asynchronous, and Hybrid In-Situ Techniques in Computational Fluid Dynamics Applications

    Authors: Yi Ju, Adalberto Perez, Stefano Markidis, Philipp Schlatter, Erwin Laure

    Abstract: High-Performance Computing (HPC) systems provide input/output (IO) performance growing relatively slowly compared to peak computational performance and have limited storage capacity. Computational Fluid Dynamics (CFD) applications aiming to leverage the full power of Exascale HPC systems, such as the solver Nek5000, will generate massive data for further processing. These data need to be efficient…

    Submitted 30 July, 2024; originally announced July 2024.

  7. arXiv:2407.17361  [pdf, other]

    cs.CV cs.AI

    MuST: Multi-Scale Transformers for Surgical Phase Recognition

    Authors: Alejandra Pérez, Santiago Rodríguez, Nicolás Ayobi, Nicolás Aparicio, Eugénie Dessevres, Pablo Arbeláez

    Abstract: Phase recognition in surgical videos is crucial for enhancing computer-aided surgical systems as it enables automated understanding of sequential procedural stages. Existing methods often rely on fixed temporal windows for video analysis to identify dynamic surgical phases. Thus, they struggle to simultaneously capture short-, mid-, and long-term information necessary to fully understand complex s…

    Submitted 24 July, 2024; originally announced July 2024.

  8. arXiv:2407.08474  [pdf, other]

    cs.HC cs.SE

    DIDUP: Dynamic Iterative Development for UI Prototyping

    Authors: Jenny Ma, Karthik Sreedhar, Vivian Liu, Sitong Wang, Pedro Alejandro Perez, Lydia B. Chilton

    Abstract: Large language models (LLMs) are remarkably good at writing code. A particularly valuable case of human-LLM collaboration is code-based UI prototyping, a method for creating interactive prototypes that allows users to view and fully engage with a user interface. We conduct a formative study of GPT Pilot, a leading LLM-generated code-prototyping system, and find that its inflexibility towards chang…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 5 pages, 3 figures

  9. arXiv:2407.06391  [pdf, other]

    cs.LO

    Around Classical and Intuitionistic Linear Processes

    Authors: Juan C. Jaramillo, Dan Frumin, Jorge A. Pérez

    Abstract: Curry-Howard correspondences between Linear Logic (LL) and session types provide a firm foundation for concurrent processes. As the correspondences hold for intuitionistic and classic versions of LL (ILL and CLL), we obtain two different families of type systems for concurrency. An open question remains: how do these two families exactly relate to each other? Based upon a translation from CLL to I…

    Submitted 22 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: Full version, 19 pages + appendices

  10. arXiv:2405.13348  [pdf, other]

    cs.LG

    On the Challenges of Creating Datasets for Analyzing Commercial Sex Advertisements to Assess Human Trafficking Risk and Organized Activity

    Authors: Pablo Rivas, Tomas Cerny, Alejandro Rodriguez Perez, Javier Turek, Laurie Giddens, Gisela Bichler, Stacie Petter

    Abstract: Our study addresses the challenges of building datasets to understand the risks associated with organized activities and human trafficking through commercial sex advertisements. These challenges include data scarcity, rapid obsolescence, and privacy concerns. Traditional approaches, which are not automated and are difficult to reproduce, fall short in addressing these issues. We have developed a r…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: LXAI Workshop at the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024)

    ACM Class: I.2.7

  11. arXiv:2403.18111  [pdf, other]

    cs.HC

    Scrolly2Reel: Retargeting Graphics for Social Media Using Narrative Beats

    Authors: Duy K. Nguyen, Jenny Ma, Pedro Alejandro Perez, Lydia B. Chilton

    Abstract: Content retargeting is crucial for social media creators. Once great content is created, it is important to reach as broad an audience as possible. This is particularly important in journalism where younger audiences are shifting away from print and towards short-video platforms. Many newspapers already create rich graphics for the web that they want to be able to reuse for social media. One examp…

    Submitted 19 June, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Comments: 9 pages, 3 figures

  12. arXiv:2403.02019  [pdf, other]

    cs.FL cs.LG

    Active Learning of Mealy Machines with Timers

    Authors: Véronique Bruyère, Bharat Garhewal, Guillermo A. Pérez, Gaëtan Staquet, Frits W. Vaandrager

    Abstract: We present the first algorithm for query learning of a general class of Mealy machines with timers (MMTs) in a black-box context. Our algorithm is an extension of the L# algorithm of Vaandrager et al. to a timed setting. Like the algorithm for learning timed automata proposed by Waga, our algorithm is inspired by ideas of Maler & Pnueli. Based on the elementary languages of, both Waga's and our al…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 77 pages, 19 figures

    MSC Class: 68Q45 ACM Class: F.4.3

  13. arXiv:2402.13785  [pdf, other]

    cs.AI

    Synthesis of Hierarchical Controllers Based on Deep Reinforcement Learning Policies

    Authors: Florent Delgrange, Guy Avni, Anna Lukina, Christian Schilling, Ann Nowé, Guillermo A. Pérez

    Abstract: We propose a novel approach to the problem of controller design for environments modeled as Markov decision processes (MDPs). Specifically, we consider a hierarchical MDP: a graph with each vertex populated by an MDP called a "room". We first apply deep reinforcement learning (DRL) to obtain low-level policies for each room, scaling to large rooms of unknown structure. We then apply reactive synthe…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 19 pages main text, 17 pages Appendix (excluding references)

  14. arXiv:2402.13237  [pdf, other]

    cs.LO cs.FL

    Continuous Pushdown VASS in One Dimension are Easy

    Authors: Guillermo A. Perez, Shrisha Rao

    Abstract: A pushdown vector addition system with states (PVASS) extends the model of vector addition systems with a pushdown stack. The algorithmic analysis of PVASS has applications such as static analysis of recursive programs manipulating integer variables. Unfortunately, reachability analysis, even for one-dimensional PVASS, is not known to be decidable. We relax the model of one-dimensional PVASS to mak…

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 2 tables, 6 figures, 12 pages

  15. arXiv:2402.13219  [pdf, other]

    cs.AI cs.HC cs.LG cs.MA eess.SY

    Analyzing Operator States and the Impact of AI-Enhanced Decision Support in Control Rooms: A Human-in-the-Loop Specialized Reinforcement Learning Framework for Intervention Strategies

    Authors: Ammar N. Abbas, Chidera W. Amazu, Joseph Mietkiewicz, Houda Briwa, Andres Alonzo Perez, Gabriele Baldissone, Micaela Demichela, Georgios G. Chasparis, John D. Kelleher, Maria Chiara Leva

    Abstract: In complex industrial and chemical process control rooms, effective decision-making is crucial for safety and efficiency. The experiments in this paper evaluate the impact and applications of an AI-based decision support system integrated into an improved human-machine interface, using dynamic influence diagrams, a hidden Markov model, and deep reinforcement learning. The enhanced support system a…

    Submitted 20 February, 2024; originally announced February 2024.

    Journal ref: International Journal of Human-Computer Interaction, 2024

  16. arXiv:2402.11901  [pdf, other]

    cs.AI

    Real-World Planning with PDDL+ and Beyond

    Authors: Wiktor Piotrowski, Alexandre Perez

    Abstract: Real-world applications of AI Planning often require a highly expressive modeling language to accurately capture important intricacies of target systems. Hybrid systems are ubiquitous in the real world, and PDDL+ is the standardized modeling language for capturing such systems as planning domains. PDDL+ enables accurate encoding of mixed discrete-continuous system dynamics, exogenous activity, and…

    Submitted 19 February, 2024; originally announced February 2024.

  17. arXiv:2402.09121  [pdf, ps, other]

    cs.FL

    Inform: From Compartmental Models to Stochastic Bounded Counter Machines

    Authors: Tim Leys, Guillermo A. Perez

    Abstract: Compartmental models are used in epidemiology to capture the evolution of infectious diseases such as COVID-19 in a population by assigning members of it to compartments with labels such as susceptible, infected, and recovered. In a stochastic compartmental model the flow of individuals between compartments is determined probabilistically. We establish that certain stochastic compartment models ca…

    Submitted 14 February, 2024; originally announced February 2024.

  18. arXiv:2402.08332  [pdf, other]

    math.CO cs.DM cs.DS

    Detecting $K_{2,3}$ as an induced minor

    Authors: Clément Dallard, Maël Dumas, Claire Hilaire, Martin Milanič, Anthony Perez, Nicolas Trotignon

    Abstract: We consider a natural generalization of chordal graphs, in which every minimal separator induces a subgraph with independence number at most $2$. Such graphs can be equivalently defined as graphs that do not contain the complete bipartite graph $K_{2,3}$ as an induced minor, that is, graphs from which $K_{2,3}$ cannot be obtained by a sequence of edge contractions and vertex deletions. We develo…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 18 pages

    MSC Class: 05C75 (Primary); 05C85; 05C83; 05C40; 05C69 (Secondary)

  19. arXiv:2402.05137  [pdf, other]

    astro-ph.IM astro-ph.CO astro-ph.GA cs.LG

    LtU-ILI: An All-in-One Framework for Implicit Inference in Astrophysics and Cosmology

    Authors: Matthew Ho, Deaglan J. Bartlett, Nicolas Chartier, Carolina Cuesta-Lazaro, Simon Ding, Axel Lapel, Pablo Lemos, Christopher C. Lovell, T. Lucas Makinen, Chirag Modi, Viraj Pandya, Shivam Pandey, Lucia A. Perez, Benjamin Wandelt, Greg L. Bryan

    Abstract: This paper presents the Learning the Universe Implicit Likelihood Inference (LtU-ILI) pipeline, a codebase for rapid, user-friendly, and cutting-edge machine learning (ML) inference in astrophysics and cosmology. The pipeline includes software for implementing various neural architectures, training schemata, priors, and density estimators in a manner easily adaptable to any research workflow. It i…

    Submitted 2 July, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: 22 pages, 10 figures, accepted in the Open Journal of Astrophysics. Code available at https://github.com/maho3/ltu-ili

    Journal ref: 2024 OJA, Vol. 7

  20. arXiv:2401.14763  [pdf, ps, other]

    cs.LO

    Comparing Session Type Systems derived from Linear Logic

    Authors: Bas van den Heuvel, Jorge A. Pérez

    Abstract: Session types are a typed approach to message-passing concurrency, where types describe sequences of intended exchanges over channels. Session type systems have been given strong logical foundations via Curry-Howard correspondences with linear logic, a resource-aware logic that naturally captures structured interactions. These logical foundations provide an elegant framework to specify and (static…

    Submitted 22 August, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: Preprint to appear in JLAMP; revised/extended version of https://doi.org/10.4204/EPTCS.314.1

  21. arXiv:2401.12485  [pdf, other]

    cs.LG cs.AI quant-ph stat.ML

    Adiabatic Quantum Support Vector Machines

    Authors: Prasanna Date, Dong Jun Woun, Kathleen Hamilton, Eduardo A. Coello Perez, Mayanka Chandra Shekhar, Francisco Rios, John Gounley, In-Saeng Suh, Travis Humble, Georgia Tourassi

    Abstract: Adiabatic quantum computers can solve difficult optimization problems (e.g., the quadratic unconstrained binary optimization problem), and they seem well suited to train machine learning models. In this paper, we describe an adiabatic quantum approach for training support vector machines. We show that the time complexity of our quantum approach is an order of magnitude better than the classical ap…

    Submitted 22 January, 2024; originally announced January 2024.

  22. arXiv:2401.11174  [pdf, other]

    cs.CV cs.AI cs.LG

    Pixel-Wise Recognition for Holistic Surgical Scene Understanding

    Authors: Nicolás Ayobi, Santiago Rodríguez, Alejandra Pérez, Isabela Hernández, Nicolás Aparicio, Eugénie Dessevres, Sebastián Peña, Jessica Santander, Juan Ignacio Caicedo, Nicolás Fernández, Pablo Arbeláez

    Abstract: This paper presents the Holistic and Multi-Granular Surgical Scene Understanding of Prostatectomies (GraSP) dataset, a curated benchmark that models surgical scene understanding as a hierarchy of complementary tasks with varying levels of granularity. Our approach enables a multi-level comprehension of surgical activities, encompassing long-term tasks such as surgical phases and steps recognition…

    Submitted 25 January, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

    Comments: Preprint submitted to Medical Image Analysis. Official extension of previous MICCAI 2022 (https://link.springer.com/chapter/10.1007/978-3-031-16449-1_42) and ISBI 2023 (https://ieeexplore.ieee.org/document/10230819) orals. Data and codes are available at https://github.com/BCV-Uniandes/GraSP

  23. arXiv:2401.01148  [pdf, ps, other]

    stat.ML cs.LG

    PAC-Bayes-Chernoff bounds for unbounded losses

    Authors: Ioar Casado, Luis A. Ortega, Andrés R. Masegosa, Aritz Pérez

    Abstract: We introduce a new PAC-Bayes oracle bound for unbounded losses. This result can be understood as a PAC-Bayesian version of the Cramér-Chernoff bound. The proof technique relies on controlling the tails of certain random variables involving the Cramér transform of the loss. We highlight several applications of the main theorem. First, we show that our result naturally allows exact optimization of t…

    Submitted 6 February, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: Updated Section 5

  24. arXiv:2401.00496  [pdf, other]

    cs.CV cs.AI cs.LG

    SAR-RARP50: Segmentation of surgical instrumentation and Action Recognition on Robot-Assisted Radical Prostatectomy Challenge

    Authors: Dimitrios Psychogyios, Emanuele Colleoni, Beatrice Van Amsterdam, Chih-Yang Li, Shu-Yu Huang, Yuchong Li, Fucang Jia, Baosheng Zou, Guotai Wang, Yang Liu, Maxence Boels, Jiayu Huo, Rachel Sparks, Prokar Dasgupta, Alejandro Granados, Sebastien Ourselin, Mengya Xu, An Wang, Yanan Wu, Long Bai, Hongliang Ren, Atsushi Yamada, Yuriko Harai, Yuto Ishikawa, Kazuyuki Hayashi, et al. (25 additional authors not shown)

    Abstract: Surgical tool segmentation and action recognition are fundamental building blocks in many computer-assisted intervention applications, ranging from surgical skills assessment to decision support systems. Nowadays, learning-based action recognition and segmentation approaches outperform classical methods, relying, however, on large, annotated datasets. Furthermore, action recognition and tool segme…

    Submitted 23 January, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

  25. arXiv:2311.15451  [pdf, other]

    cs.CL cs.LG

    Uncertainty-aware Language Modeling for Selective Question Answering

    Authors: Qi Yang, Shreya Ravikumar, Fynn Schmitt-Ulms, Satvik Lolla, Ege Demir, Iaroslav Elistratov, Alex Lavaee, Sadhana Lolla, Elaheh Ahmadi, Daniela Rus, Alexander Amini, Alejandro Perez

    Abstract: We present an automatic large language model (LLM) conversion approach that produces uncertainty-aware LLMs capable of estimating uncertainty with every prediction. Our approach is model- and data-agnostic, is computationally-efficient, and does not rely on external models or systems. We evaluate converted models on the selective question answering setting -- to answer as many questions as possibl…

    Submitted 26 November, 2023; originally announced November 2023.

  26. arXiv:2311.13118  [pdf, other]

    cs.LG cs.AI cs.CL cs.CY cs.SI

    Combatting Human Trafficking in the Cyberspace: A Natural Language Processing-Based Methodology to Analyze the Language in Online Advertisements

    Authors: Alejandro Rodriguez Perez, Pablo Rivas

    Abstract: This project tackles the pressing issue of human trafficking in online C2C marketplaces through advanced Natural Language Processing (NLP) techniques. We introduce a novel methodology for generating pseudo-labeled datasets with minimal supervision, serving as a rich resource for training state-of-the-art NLP models. Focusing on tasks like Human Trafficking Risk Prediction (HTRP) and Organized Acti…

    Submitted 21 November, 2023; originally announced November 2023.

    MSC Class: 68T50; 62H30; 91C99; 68T01 ACM Class: I.2.7; I.5.4; K.4.1; K.4.2

  27. arXiv:2311.09369  [pdf, other]

    stat.ML cs.CY cs.LG

    Time-dependent Probabilistic Generative Models for Disease Progression

    Authors: Onintze Zaballa, Aritz Pérez, Elisa Gómez-Inhiesto, Teresa Acaiturri-Ayesta, Jose A. Lozano

    Abstract: Electronic health records contain valuable information for monitoring patients' health trajectories over time. Disease progression models have been developed to understand the underlying patterns and dynamics of diseases using these data as sequences. However, analyzing temporal data from EHRs is challenging due to the variability and irregularities present in medical records. We propose a Markovi…

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 17 pages

  28. arXiv:2311.05309  [pdf, other]

    cond-mat.soft cond-mat.mtrl-sci cs.CE physics.chem-ph

    Liquid phase fast electron tomography unravels the true 3D structure of colloidal assemblies

    Authors: Daniel Arenas Esteban, Da Wang, Ajinkya Kadu, Noa Olluyn, Ana Sánchez Iglesias, Alejandro Gomez Perez, Jesus Gonzalez Casablanca, Stavros Nicolopoulos, Luis M. Liz-Marzán, Sara Bals

    Abstract: Electron tomography has become a commonly used tool to investigate the three-dimensional (3D) structure of nanomaterials, including colloidal nanoparticle assemblies. However, electron microscopy is typically carried out under high vacuum conditions. Therefore, pre-treatment sample preparation is needed for assemblies obtained by (wet) colloid chemistry methods, including solvent evaporation and d…

    Submitted 23 November, 2023; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: 32 pages, 12 figures, 2 tables, submitted

  29. arXiv:2311.03812  [pdf, ps, other]

    cs.CL

    Conversations in Galician: a Large Language Model for an Underrepresented Language

    Authors: Eliseo Bao, Anxo Pérez, Javier Parapar

    Abstract: The recent proliferation of Large Conversation Language Models has highlighted the economic significance of widespread access to this type of AI technology in the current information age. Nevertheless, prevailing models have primarily been trained on corpora consisting of documents written in popular languages. The dearth of such cutting-edge tools for low-resource languages further exacerbates…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 5 pages

  30. arXiv:2310.17410  [pdf, ps, other]

    cs.AI cs.LO

    Synthesizing Efficiently Monitorable Formulas in Metric Temporal Logic

    Authors: Ritam Raha, Rajarshi Roy, Nathanael Fijalkow, Daniel Neider, Guillermo A. Perez

    Abstract: In runtime verification, manually formalizing a specification for monitoring system executions is a tedious and error-prone process. To address this issue, we consider the problem of automatically synthesizing formal specifications from system executions. To demonstrate our approach, we consider the popular specification language Metric Temporal Logic (MTL), which is particularly tailored towards…

    Submitted 26 October, 2023; originally announced October 2023.

  31. arXiv:2310.15234  [pdf, other]

    astro-ph.CO astro-ph.GA cs.LG

    Field-level simulation-based inference with galaxy catalogs: the impact of systematic effects

    Authors: Natalí S. M. de Santi, Francisco Villaescusa-Navarro, L. Raul Abramo, Helen Shao, Lucia A. Perez, Tiago Castro, Yueying Ni, Christopher C. Lovell, Elena Hernandez-Martinez, Federico Marinacci, David N. Spergel, Klaus Dolag, Lars Hernquist, Mark Vogelsberger

    Abstract: It has been recently shown that a powerful way to constrain cosmological parameters from galaxy redshift surveys is to train graph neural networks to perform field-level likelihood-free inference without imposing cuts on scale. In particular, de Santi et al. (2023) developed models that could accurately infer the value of $\Omega_{\rm m}$ from catalogs that only contain the positions and radial velocit…

    Submitted 9 May, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 39 pages, 25 figures. For the reference in the abstract (de Santi et al. 2023) see arXiv:2302.14101

  32. arXiv:2310.13664  [pdf, other]

    cs.CL

    Explainable Depression Symptom Detection in Social Media

    Authors: Eliseo Bao, Anxo Pérez, Javier Parapar

    Abstract: Users of social platforms often perceive these sites as supportive spaces to post about their mental health issues. Those conversations contain important traces about individuals' health risks. Recently, researchers have exploited this online information to construct mental health detection models, which aim to identify users at risk on platforms like Twitter, Reddit or Facebook. Most of these mod…

    Submitted 20 August, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted for publication in Health Information Science and Systems

  33. arXiv:2309.13349  [pdf, other]

    cs.NE stat.ML

    Speeding-up Evolutionary Algorithms to solve Black-Box Optimization Problems

    Authors: Judith Echevarrieta, Etor Arza, Aritz Pérez

    Abstract: Population-based evolutionary algorithms are often considered when approaching computationally expensive black-box optimization problems. They employ a selection mechanism to choose the best solutions from a given population after comparing their objective values, which are then used to generate the next population. This iterative process explores the solution space efficiently, leading to improve…

    Submitted 29 January, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

  34. arXiv:2308.14101  [pdf, other]

    cs.CV

    Superpixels algorithms through network community detection

    Authors: Anthony Perez

    Abstract: Community detection is a powerful tool from complex networks analysis that finds applications in various research areas. Several image segmentation methods rely for instance on community detection algorithms as a black box in order to compute undersegmentations, i.e. a small number of regions that represent areas of interest of the image. However, to the best of our knowledge, the efficiency of su…

    Submitted 27 August, 2023; originally announced August 2023.

  35. arXiv:2308.13609  [pdf, ps, other]

    cs.LO math.NT

    Integer Programming with GCD Constraints

    Authors: Rémy Defossez, Christoph Haase, Alessio Mansutti, Guillermo A. Perez

    Abstract: We study the non-linear extension of integer programming with greatest common divisor constraints of the form $\gcd(f,g) \sim d$, where $f$ and $g$ are linear polynomials, $d$ is a positive integer, and $\sim$ is a relation among $\leq, =, \neq$ and $\geq$. We show that the feasibility problem for these systems is in NP, and that an optimal solution minimizing a linear objective function, if it ex…

    Submitted 25 August, 2023; originally announced August 2023.

  36. arXiv:2308.10758  [pdf, ps, other]

    cs.CL cs.IR

    DepreSym: A Depression Symptom Annotated Corpus and the Role of LLMs as Assessors of Psychological Markers

    Authors: Anxo Pérez, Marcos Fernández-Pichel, Javier Parapar, David E. Losada

    Abstract: Computational methods for depression detection aim to mine traces of depression from online publications posted by Internet users. However, solutions trained on existing collections exhibit limited generalisation and interpretability. To tackle these issues, recent studies have shown that identifying depressive symptoms can lead to more robust models. The eRisk initiative fosters research on this…

    Submitted 21 August, 2023; originally announced August 2023.

  37. arXiv:2308.07738  [pdf, other]

    cs.AI

    Formally-Sharp DAgger for MCTS: Lower-Latency Monte Carlo Tree Search using Data Aggregation with Formal Methods

    Authors: Debraj Chakraborty, Damien Busatto-Gaston, Jean-François Raskin, Guillermo A. Pérez

    Abstract: We study how to efficiently combine formal methods, Monte Carlo Tree Search (MCTS), and deep learning in order to produce high-quality receding horizon policies in large Markov Decision processes (MDPs). In particular, we use model-checking techniques to guide the MCTS algorithm in order to generate offline samples of high-quality decisions on a representative set of states of the MDP. Those sampl…

    Submitted 15 August, 2023; originally announced August 2023.

  38. arXiv:2308.01165  [pdf, ps, other]

    cs.LO

    Termination in Concurrency, Revisited

    Authors: Joseph W. N. Paulus, Jorge A. Pérez, Daniele Nantes-Sobrinho

    Abstract: Termination is a central property in sequential programming models: a term is terminating if all its reduction sequences are finite. Termination is also important in concurrency in general, and for message-passing programs in particular. A variety of type systems that enforce termination by typing have been developed. In this paper, we rigorously compare several type systems for $\pi$-calculus proce…

    Submitted 2 August, 2023; originally announced August 2023.

  39. arXiv:2308.00250  [pdf, other]

    cs.SE

    CONSTRUCT: A Program Synthesis Approach for Reconstructing Control Algorithms from Embedded System Binaries in Cyber-Physical Systems

    Authors: Ali Shokri, Alexandre Perez, Souma Chowdhury, Chen Zeng, Gerald Kaloor, Ion Matei, Peter-Patel Schneider, Akshith Gunasekaran, Shantanu Rane

    Abstract: We introduce a novel approach to automatically synthesize a mathematical representation of the control algorithms implemented in industrial cyber-physical systems (CPS), given the embedded system binary. The output model can be used by subject matter experts to assess the system's compliance with the expected behavior and for a variety of forensic applications. Our approach first performs static a…

    Submitted 31 July, 2023; originally announced August 2023.

  40. arXiv:2308.00231  [pdf, other]

    cs.LG cs.AI

    Capsa: A Unified Framework for Quantifying Risk in Deep Neural Networks

    Authors: Sadhana Lolla, Iaroslav Elistratov, Alejandro Perez, Elaheh Ahmadi, Daniela Rus, Alexander Amini

    Abstract: The modern pervasiveness of large-scale deep neural networks (NNs) is driven by their extraordinary performance on complex problems but is also plagued by their sudden, unexpected, and often catastrophic failures, particularly in challenging scenarios. Existing algorithms that provide risk-awareness to NNs are complex and ad-hoc. Specifically, these methods require significant engineering changes,…

    Submitted 31 July, 2023; originally announced August 2023.

    Comments: Neural Information Processing Systems (NeurIPS) 2022. Workshop on Machine Learning for Autonomous Driving (ML4AD)

    Journal ref: Neural Information Processing Systems (NeurIPS) 2022. Workshop on Machine Learning for Autonomous Driving (ML4AD)

  41. arXiv:2306.16899  [pdf, other]

    cs.DS cs.CC

    An improved kernelization algorithm for Trivially Perfect Editing

    Authors: Maël Dumas, Anthony Perez

    Abstract: In the Trivially Perfect Editing problem one is given an undirected graph $G = (V,E)$ and an integer $k$ and seeks to add or delete at most $k$ edges in $G$ to obtain a trivially perfect graph. In a recent work, Dumas, Perez and Todinca [Algorithmica 2023] proved that this problem admits a kernel with $O(k^3)$ vertices. This result heavily relies on the fact that the size of trivially perfect modu…

    Submitted 26 October, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

  42. arXiv:2306.15562  [pdf, other]

    cs.DC

    Challenges and Opportunities for RISC-V Architectures towards Genomics-based Workloads

    Authors: Gonzalo Gomez-Sanchez, Aaron Call, Xavier Teruel, Lorena Alonso, Ignasi Moran, Miguel Angel Perez, David Torrents, Josep Ll. Berral

    Abstract: The use of large-scale supercomputing architectures is a hard requirement for scientific computing Big-Data applications. An example is genomics analytics, where millions of data transformations and tests per patient need to be done to find relevant clinical indicators. Therefore, to ensure open and broad access to high-performance technologies, governments and academia are pushing toward the int…

    Submitted 27 June, 2023; originally announced June 2023.

    Journal ref: Presented at the ISC High-Performance Computing 2023

  43. arXiv:2306.09628  [pdf, other]

    cs.CV stat.ML

    Structural Restricted Boltzmann Machine for image denoising and classification

    Authors: Arkaitz Bidaurrazaga, Aritz Pérez, Roberto Santana

    Abstract: Restricted Boltzmann Machines are generative models that consist of a layer of hidden variables connected to another layer of visible units, and they are used to model the distribution over visible variables. In order to gain a higher representability power, many hidden units are commonly used, which, in combination with a large number of visible units, leads to a high number of trainable paramete…

    Submitted 16 June, 2023; originally announced June 2023.

  44. arXiv:2306.06649  [pdf, ps, other]

    stat.ML cs.LG

    Efficient Learning of Minimax Risk Classifiers in High Dimensions

    Authors: Kartheek Bondugula, Santiago Mazuelas, Aritz Pérez

    Abstract: High-dimensional data is common in multiple areas, such as health care and genomics, where the number of features can be tens of thousands. In such scenarios, the large number of features often leads to inefficient learning. Constraint generation methods have recently enabled efficient learning of L1-regularized support vector machines (SVMs). In this paper, we leverage such methods to obtain an e…

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: Accepted for the 39th Conference on Uncertainty in Artificial Intelligence (UAI 2023)

  45. arXiv:2306.04204  [pdf, ps, other]

    cs.PL

    Monitoring Blackbox Implementations of Multiparty Session Protocols

    Authors: Bas van den Heuvel, Jorge A. Pérez, Rares A. Dobre

    Abstract: We present a framework for the distributed monitoring of networks of components that coordinate by message-passing, following multiparty session protocols specified as global types. We improve over prior works by (i) supporting components whose exact specification is unknown ("blackboxes") and (ii) covering protocols that cannot be analyzed by existing techniques. We first give a procedure for syn…

    Submitted 3 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Full version with appendices of our RV'23 paper

  46. arXiv:2305.09634  [pdf, other]

    cs.GT

    Bi-Objective Lexicographic Optimization in Markov Decision Processes with Related Objectives

    Authors: Damien Busatto-Gaston, Debraj Chakraborty, Anirban Majumdar, Sayan Mukherjee, Guillermo A. Pérez, Jean-François Raskin

    Abstract: We consider lexicographic bi-objective problems on Markov Decision Processes (MDPs), where we optimize one objective while guaranteeing optimality of another. We propose a two-stage technique for solving such problems when the objectives are related (in a way that we formalize). We instantiate our technique for two natural pairs of objectives: minimizing the (conditional) expected number of steps…

    Submitted 15 August, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

  47. Automata with Timers

    Authors: Véronique Bruyère, Guillermo A. Pérez, Gaëtan Staquet, Frits W. Vaandrager

    Abstract: In this work, we study properties of deterministic finite-state automata with timers, a subclass of timed automata proposed by Vaandrager et al. as a candidate for an efficiently learnable timed model. We first study the complexity of the configuration reachability problem for such automata and establish that it is PSPACE-complete. Then, as simultaneous timeouts (we call these races) can occur in…

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: 35 pages, 9 figures

    ACM Class: F.4.3

    Journal ref: Formal Modeling and Analysis of Timed Systems (FORMATS) 2023 pp. 33-49

  48. arXiv:2305.07345  [pdf, other]

    cs.PF cs.DS math.OC stat.AP

    On the Fair Comparison of Optimization Algorithms in Different Machines

    Authors: Etor Arza, Josu Ceberio, Ekhiñe Irurozki, Aritz Pérez

    Abstract: An experimental comparison of two or more optimization algorithms requires the same computational resources to be assigned to each algorithm. When a maximum runtime is set as the stopping criterion, all algorithms need to be executed on the same machine if they are to use the same resources. Unfortunately, the implementation code of the algorithms is not always available, which means that running…

    Submitted 7 August, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Journal ref: Ann. Appl. Stat. 18(1): 42-62 (March 2024)

  49. arXiv:2305.05739  [pdf, ps, other]

    cs.LO cs.AI

    Graph-Based Reductions for Parametric and Weighted MDPs

    Authors: Kasper Engelen, Guillermo A. Pérez, Shrisha Rao

    Abstract: We study the complexity of reductions for weighted reachability in parametric Markov decision processes. That is, we say a state p is never worse than q if for all valuations of the polynomial indeterminates it is the case that the maximal expected weight that can be reached from p is greater than the same value from q. In terms of computational complexity, we establish that determining whether p…

    Submitted 9 May, 2023; originally announced May 2023.

  50. arXiv:2303.12558  [pdf, other]

    cs.LG cs.AI

    Wasserstein Auto-encoded MDPs: Formal Verification of Efficiently Distilled RL Policies with Many-sided Guarantees

    Authors: Florent Delgrange, Ann Nowé, Guillermo A. Pérez

    Abstract: Although deep reinforcement learning (DRL) has many success stories, the large-scale deployment of policies learned through these advanced techniques in safety-critical scenarios is hindered by their lack of formal guarantees. Variational Markov Decision Processes (VAE-MDPs) are discrete latent space models that provide a reliable framework for distilling formally verifiable controllers from any R…

    Submitted 21 April, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: ICLR 2023, 10 pages main text, 14 pages appendix (excluding references)