Showing 1–10 of 10 results for author: Pfeiffer, P

Searching in archive cs.
  1. arXiv:2407.09276

    cs.CL cs.LG

    H2O-Danube3 Technical Report

    Authors: Pascal Pfeiffer, Philipp Singer, Yauhen Babakhin, Gabor Fodor, Nischay Dhankhar, Sri Satish Ambati

    Abstract: We present H2O-Danube3, a series of small language models consisting of H2O-Danube3-4B, trained on 6T tokens and H2O-Danube3-500M, trained on 4T tokens. Our models are pre-trained on high quality Web data consisting of primarily English tokens in three stages with different data mixes before final supervised tuning for chat version. The models exhibit highly competitive metrics across a multitude…

    Submitted 12 July, 2024; originally announced July 2024.

  2. arXiv:2401.16818

    cs.CL cs.LG

    H2O-Danube-1.8B Technical Report

    Authors: Philipp Singer, Pascal Pfeiffer, Yauhen Babakhin, Maximilian Jeblick, Nischay Dhankhar, Gabor Fodor, Sri Satish Ambati

    Abstract: We present H2O-Danube, a series of small 1.8B language models consisting of H2O-Danube-1.8B, trained on 1T tokens, and the incremental improved H2O-Danube2-1.8B trained on an additional 2T tokens. Our models exhibit highly competitive metrics across a multitude of benchmarks and, as of the time of this writing, H2O-Danube2-1.8B achieves the top ranking on Open LLM Leaderboard for all models below…

    Submitted 15 April, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

  3. arXiv:2310.13012

    cs.CL cs.AI

    H2O Open Ecosystem for State-of-the-art Large Language Models

    Authors: Arno Candel, Jon McKinney, Philipp Singer, Pascal Pfeiffer, Maximilian Jeblick, Chun Ming Lee, Marcos V. Conde

    Abstract: Large Language Models (LLMs) represent a revolution in AI. However, they also pose many significant risks, such as the presence of biased, private, copyrighted or harmful text. For this reason we need open, transparent and safe solutions. We introduce a complete open-source ecosystem for developing and testing LLMs. The goal of this project is to boost open alternatives to closed-source approaches…

    Submitted 23 October, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Demo - ACL Empirical Methods in Natural Language Processing

  4. arXiv:2309.09618

    cs.LG cs.OS

    A Discussion on Generalization in Next-Activity Prediction

    Authors: Luka Abb, Peter Pfeiffer, Peter Fettke, Jana-Rebecca Rehse

    Abstract: Next activity prediction aims to forecast the future behavior of running process instances. Recent publications in this field predominantly employ deep learning techniques and evaluate their prediction performance using publicly available event logs. This paper presents empirical evidence that calls into question the effectiveness of these current evaluation approaches. We show that there is an en…

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: Pre-print, published at the AI4BPM workshop at BPM 2023

  5. arXiv:2306.08161

    cs.CL cs.AI cs.HC cs.IR cs.LG

    h2oGPT: Democratizing Large Language Models

    Authors: Arno Candel, Jon McKinney, Philipp Singer, Pascal Pfeiffer, Maximilian Jeblick, Prithvi Prabhu, Jeff Gambera, Mark Landry, Shivam Bansal, Ryan Chesler, Chun Ming Lee, Marcos V. Conde, Pasha Stetsenko, Olivier Grellier, SriSatish Ambati

    Abstract: Applications built on top of Large Language Models (LLMs) such as GPT-4 represent a revolution in AI due to their human-level capabilities in natural language processing. However, they also pose many significant risks such as the presence of biased, private, or harmful text, and the unauthorized inclusion of copyrighted material. We introduce h2oGPT, a suite of open-source code repositories for…

    Submitted 16 June, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: Work in progress by H2O.ai, Inc

  6. arXiv:2107.14048

    cs.CY cs.CV

    Corridor for new mobility Aachen-Düsseldorf: Methods and concepts of the research project ACCorD

    Authors: Laurent Kloeker, Amarin Kloeker, Fabian Thomsen, Armin Erraji, Lutz Eckstein, Serge Lamberty, Adrian Fazekas, Eszter Kalló, Markus Oeser, Charlotte Fléchon, Jochen Lohmiller, Pascal Pfeiffer, Martin Sommer, Helen Winter

    Abstract: With the Corridor for New Mobility Aachen - Düsseldorf, an integrated development environment is created, incorporating existing test capabilities, to systematically test and validate automated vehicles in interaction with connected Intelligent Transport Systems Stations (ITS-Ss). This is achieved through a time- and cost-efficient toolchain and methodology, in which simulation, closed test sites…

    Submitted 13 July, 2021; originally announced July 2021.

  7. arXiv:2107.07728

    cs.SD cs.LG eess.AS

    Recognizing bird species in diverse soundscapes under weak supervision

    Authors: Christof Henkel, Pascal Pfeiffer, Philipp Singer

    Abstract: We present a robust classification approach for avian vocalization in complex and diverse soundscapes, achieving second place in the BirdCLEF2021 challenge. We illustrate how to make full use of pre-trained convolutional neural networks, by using an efficient modeling and training routine supplemented by novel augmentation methods. Thereby, we improve the generalization of weakly labeled crowd-sou…

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: All authors contributed equally, 8 pages, 4 figures, submitted to CEUR-WS

  8. arXiv:2106.08027

    cs.LG cs.AI stat.ML

    Multivariate Business Process Representation Learning utilizing Gramian Angular Fields and Convolutional Neural Networks

    Authors: Peter Pfeiffer, Johannes Lahann, Peter Fettke

    Abstract: Learning meaningful representations of data is an important aspect of machine learning and has recently been successfully applied to many domains like language understanding or computer vision. Instead of training a model for one specific task, representation learning is about training a model to capture all useful information in the underlying data and make it accessible for a predictor. For pred…

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: Accepted at the Business Process Management Conference 2021

  9. arXiv:1804.10120

    cs.MS gr-qc

    Automatic generation of CUDA code performing tensor manipulations using C++ expression templates

    Authors: Adam G. M. Lewis, Harald P. Pfeiffer

    Abstract: We present a C++ library, TLoops, which uses a hierarchy of expression templates to represent operations upon tensorial quantities in single lines of C++ code that resemble analytic equations. These expressions may be run as-is, but may also be used to emit equivalent low-level C or CUDA code, which either performs the operations more quickly on the CPU, or allows them to be rapidly ported to run…

    Submitted 24 April, 2018; originally announced April 2018.

    Comments: 46 pages, 5 figures

  10. arXiv:1711.06276

    gr-qc astro-ph.CO astro-ph.HE cs.CE

    Eccentric, nonspinning, inspiral, Gaussian-process merger approximant for the detection and characterization of eccentric binary black hole mergers

    Authors: E. A. Huerta, C. J. Moore, Prayush Kumar, Daniel George, Alvin J. K. Chua, Roland Haas, Erik Wessel, Daniel Johnson, Derek Glennon, Adam Rebei, A. Miguel Holgado, Jonathan R. Gair, Harald P. Pfeiffer

    Abstract: We present $\texttt{ENIGMA}$, a time domain, inspiral-merger-ringdown waveform model that describes non-spinning binary black holes systems that evolve on moderately eccentric orbits. The inspiral evolution is described using a consistent combination of post-Newtonian theory, self-force and black hole perturbation theory. Assuming eccentric binaries that circularize prior to coalescence, we smooth…

    Submitted 24 January, 2018; v1 submitted 16 November, 2017; originally announced November 2017.

    Comments: 19 pages, 10 figures, 1 Appendix. v2: we use numerical relativity simulations to quantify the importance of including higher-order waveform multipoles for the detection of eccentric binary black hole mergers, references added. Accepted to Phys. Rev. D

    ACM Class: J.2

    Journal ref: Phys. Rev. D 97, 024031 (2018)