Zum Hauptinhalt springen

Showing 1–16 of 16 results for author: Hernandez, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13264  [pdf, other

    cs.AI cs.LG cs.SE

    Do Multimodal Foundation Models Understand Enterprise Workflows? A Benchmark for Business Process Management Tasks

    Authors: Michael Wornow, Avanika Narayan, Ben Viggiano, Ishan S. Khare, Tathagat Verma, Tibor Thompson, Miguel Angel Fuentes Hernandez, Sudharsan Sundar, Chloe Trujillo, Krrish Chawla, Rongfei Lu, Justin Shen, Divya Nagaraj, Joshua Martinez, Vardhan Agrawal, Althea Hudson, Nigam H. Shah, Christopher Re

    Abstract: Existing ML benchmarks lack the depth and diversity of annotations needed for evaluating models on business process management (BPM) tasks. BPM is the practice of documenting, measuring, improving, and automating enterprise workflows. However, research has focused almost exclusively on one task - full end-to-end automation using agents based on multimodal foundation models (FMs) like GPT-4. This f… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2312.00286  [pdf, other

    quant-ph cs.CC

    Complexity-theoretic foundations of BosonSampling with a linear number of modes

    Authors: Adam Bouland, Daniel Brod, Ishaun Datta, Bill Fefferman, Daniel Grier, Felipe Hernandez, Michal Oszmaniec

    Abstract: BosonSampling is the leading candidate for demonstrating quantum computational advantage in photonic systems. While we have recently seen many impressive experimental demonstrations, there is still a formidable distance between the complexity-theoretic hardness arguments and current experiments. One of the largest gaps involves the ratio of photons to modes: all current hardness evidence assumes a… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: 26 pages, 3 figures, to appear at QIP 2024

  3. arXiv:2302.14078  [pdf, other

    cs.LG math.DS

    Analyzing Populations of Neural Networks via Dynamical Model Embedding

    Authors: Jordan Cotler, Kai Sheng Tai, Felipe Hernández, Blake Elias, David Sussillo

    Abstract: A core challenge in the interpretation of deep neural networks is identifying commonalities between the underlying algorithms implemented by distinct networks trained for the same task. Motivated by this problem, we introduce DYNAMO, an algorithm that constructs low-dimensional manifolds where each point corresponds to a neural network model, and two points are nearby if the corresponding neural n… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: 12+8 pages, 11 figures

  4. A polynomial-time approximation to a minimum dominating set in a graph

    Authors: Frank Hernandez, Ernesto Parra, Jose Maria Sigarreta, Nodari Vakhania

    Abstract: A {\em dominating set} of a graph $G=(V,E)$ is a subset of vertices $S\subseteq V$ such that every vertex $v\in V\setminus S$ has at least one neighbor in $S$. Finding a dominating set with the minimum cardinality in a connected graph $G=(V,E)$ is known to be NP-hard. A polynomial-time approximation algorithm for this problem, described here, works in two stages. At the first stage a dominant set… ▽ More

    Submitted 10 July, 2022; originally announced July 2022.

    Journal ref: Theoretical Computer Science 930 (2022) 142-156

  5. arXiv:2111.06470  [pdf, ps, other

    cs.IT

    The complete weight enumerator of a subclass of optimal three-weight cyclic codes

    Authors: Gerardo Vega, Félix Hernández

    Abstract: A class of optimal three-weight cyclic codes of dimension 3 over any finite field was presented by Vega [Finite Fields Appl., 42 (2016) 23-38]. Shortly thereafter, Heng and Yue [IEEE Trans. Inf. Theory, 62(8) (2016) 4501-4513] generalized this result by presenting several classes of cyclic codes with either optimal three weights or a few weights. On the other hand, a class of optimal five-weight c… ▽ More

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: arXiv admin note: text overlap with arXiv:1508.05077

  6. arXiv:2107.04579  [pdf, ps, other

    cs.IT

    Optimal three-weight cyclic codes whose duals are also optimal

    Authors: Gerardo Vega, Félix Hernández

    Abstract: A class of optimal three-weight cyclic codes of dimension 3 over any finite field was presented by Vega [Finite Fields Appl., 42 (2016) 23-38]. Shortly thereafter, Heng and Yue [IEEE Trans. Inf. Theory, 62(8) (2016) 4501-4513] generalized this result by presenting several classes of cyclic codes with either optimal three weights or a few weights. Here we present a new class of optimal three-weight… ▽ More

    Submitted 9 July, 2021; originally announced July 2021.

  7. arXiv:2102.12736   

    stat.ML cs.LG

    Time-Series Imputation with Wasserstein Interpolation for Optimal Look-Ahead-Bias and Variance Tradeoff

    Authors: Jose Blanchet, Fernando Hernandez, Viet Anh Nguyen, Markus Pelger, Xuhui Zhang

    Abstract: Missing time-series data is a prevalent practical problem. Imputation methods in time-series data often are applied to the full panel data with the purpose of training a model for a downstream out-of-sample task. For example, in finance, imputation of missing returns may be applied prior to training a portfolio optimization model. Unfortunately, this practice may result in a look-ahead-bias in the… ▽ More

    Submitted 11 April, 2023; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: This paper has been superseded by arXiv:2202.00871

  8. arXiv:2011.09249  [pdf, ps, other

    cs.CL

    The Ubiqus English-Inuktitut System for WMT20

    Authors: François Hernandez, Vincent Nguyen

    Abstract: This paper describes Ubiqus' submission to the WMT20 English-Inuktitut shared news translation task. Our main system, and only submission, is based on a multilingual approach, jointly training a Transformer model on several agglutinative languages. The English-Inuktitut translation task is challenging at every step, from data selection, preparation and tokenization to quality evaluation down the l… ▽ More

    Submitted 18 November, 2020; originally announced November 2020.

    Comments: System Description paper for WMT 2020 English-Inuktitut News Translation Task

  9. arXiv:2007.15296  [pdf, ps, other

    cs.CL

    Leverage Unlabeled Data for Abstractive Speech Summarization with Self-Supervised Learning and Back-Summarization

    Authors: Paul Tardy, Louis de Seynes, François Hernandez, Vincent Nguyen, David Janiszek, Yannick Estève

    Abstract: Supervised approaches for Neural Abstractive Summarization require large annotated corpora that are costly to build. We present a French meeting summarization task where reports are predicted based on the automatic transcription of the meeting audio recordings. In order to build a corpus for this task, it is necessary to obtain the (automatic or manual) transcription of each meeting, and then to s… ▽ More

    Submitted 17 September, 2020; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: To be published in Proceedings of SPECOM 2020

  10. WLCG Networks: Update on Monitoring and Analytics

    Authors: Marian Babik, Shawn McKee, Pedro Andrade, Brian Paul Bockelman, Robert Gardner, Edgar Mauricio Fajardo Hernandez, Edoardo Martelli, Ilija Vukotic, Derek Weitzel, Marian Zvada

    Abstract: WLCG relies on the network as a critical part of its infrastructure and therefore needs to guarantee effective network usage and prompt detection and resolution of any network issues including connection failures, congestion and traffic routing. The OSG Networking Area, in partnership with WLCG, is focused on being the primary source of networking information for its partners and constituents. It… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: Accepted for publication in CHEP 2019 proceedings

  11. StashCache: A Distributed Caching Federation for the Open Science Grid

    Authors: Derek Weitzel, Marian Zvada, Ilija Vukotic, Rob Gardner, Brian Bockelman, Mats Rynge, Edgar Fajardo Hernandez, Brian Lin, Matyas Selmeci

    Abstract: Data distribution for opportunistic users is challenging as they neither own the computing resources they are using or any nearby storage. Users are motivated to use opportunistic computing to expand their data processing capacity, but they require storage and fast networking to distribute data to that processing. Since it requires significant management overhead, it is rare for resource providers… ▽ More

    Submitted 16 May, 2019; originally announced May 2019.

    Comments: In Practice and Experience in Advanced Research Computing (PEARC 19), July 28-August 1, 2019, Chicago, IL, USA. ACM, New York, NY, USA, 7 pages

  12. arXiv:1810.02090  [pdf

    cs.CR

    Shakedown: compiler-based moving target protection for Return Oriented Programing attacks on an industrial IoT device

    Authors: Fady Copty, Francisco Hernandez, Dov Murik, Olmo Rayón

    Abstract: Cybercriminals use Return Oriented Programming techniques to attack systems and IoT devices. While defenses have been developed, not all of them are applicable to constrained devices. We present Shakedown, which is a compile-time randomizing build tool which creates several versions of the binary, each with a distinct memory layout. An attack developed against one device will not work on another d… ▽ More

    Submitted 11 October, 2018; v1 submitted 4 October, 2018; originally announced October 2018.

    Comments: 1st SMESEC Workshop - Heraklion, Greece (2018)

  13. TED-LIUM 3: twice as much data and corpus repartition for experiments on speaker adaptation

    Authors: François Hernandez, Vincent Nguyen, Sahar Ghannay, Natalia Tomashenko, Yannick Estève

    Abstract: In this paper, we present TED-LIUM release 3 corpus dedicated to speech recognition in English, that multiplies by more than two the available data to train acoustic models in comparison with TED-LIUM 2. We present the recent development on Automatic Speech Recognition (ASR) systems in comparison with the two previous releases of the TED-LIUM Corpus from 2012 and 2014. We demonstrate that, passing… ▽ More

    Submitted 13 June, 2019; v1 submitted 12 May, 2018; originally announced May 2018.

    Comments: Submitted to SPECOM 2018, 20th International Conference on Speech and Computer; TED-LIUM 3 corpus available on https://lium.univ-lemans.fr/en/ted-lium3/

    ACM Class: I.2.7

    Journal ref: SPECOM 2018. Lecture Notes in Computer Science, vol 11096, pp 198-208

  14. arXiv:1705.06202  [pdf, other

    cs.DC astro-ph.IM

    Data Access for LIGO on the OSG

    Authors: Derek Weitzel, Brian Bockelman, Duncan A. Brown, Peter Couvares, Frank Würthwein, Edgar Fajardo Hernandez

    Abstract: During 2015 and 2016, the Laser Interferometer Gravitational-Wave Observatory (LIGO) conducted a three-month observing campaign. These observations delivered the first direct detection of gravitational waves from binary black hole mergers. To search for these signals, the LIGO Scientific Collaboration uses the PyCBC search pipeline. To deliver science results in a timely manner, LIGO collaborated… ▽ More

    Submitted 17 May, 2017; originally announced May 2017.

    Comments: 6 pages, 3 figures, submitted to PEARC17

  15. Cryptanalysis of a Classical chaos-based cryptosystem with some quantum cryptography features

    Authors: David Arroyo, Fernando Hernandez, Amalia B. Orúe

    Abstract: The application of synchronization theory to build up new cryptosystems has been a hot topic during the last two decades. In this paper we analyze a recent proposal in this field. We pinpoint the main limitations of the software implementation of chaos-based systems designed on the grounds of synchronization theory. In addition, we show that the cryptosystem under evaluation possesses serious secu… ▽ More

    Submitted 26 October, 2016; originally announced October 2016.

    Comments: Accepted in International Journal of Bifurcation and Chaos, In Press

  16. arXiv:1205.4667  [pdf

    hep-ex cs.DL

    Status Report of the DPHEP Study Group: Towards a Global Effort for Sustainable Data Preservation in High Energy Physics

    Authors: Z. Akopov, Silvia Amerio, David Asner, Eduard Avetisyan, Olof Barring, James Beacham, Matthew Bellis, Gregorio Bernardi, Siegfried Bethke, Amber Boehnlein, Travis Brooks, Thomas Browder, Rene Brun, Concetta Cartaro, Marco Cattaneo, Gang Chen, David Corney, Kyle Cranmer, Ray Culbertson, Sunje Dallmeier-Tiessen, Dmitri Denisov, Cristinel Diaconu, Vitaliy Dodonov, Tony Doyle, Gregory Dubois-Felsmann , et al. (65 additional authors not shown)

    Abstract: Data from high-energy physics (HEP) experiments are collected with significant financial and human effort and are mostly unique. An inter-experimental study group on HEP data preservation and long-term analysis was convened as a panel of the International Committee for Future Accelerators (ICFA). The group was formed by large collider-based experiments and investigated the technical and organisati… ▽ More

    Submitted 21 May, 2012; originally announced May 2012.

    Report number: DPHEP-2012-001