Zum Hauptinhalt springen

Showing 1–8 of 8 results for author: Raiciu, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.13657  [pdf, ps, other

    cs.CL

    FuLG: 150B Romanian Corpus for Language Model Pretraining

    Authors: Vlad-Andrei Bădoiu, Mihai-Valentin Dumitru, Alexandru M. Gherghescu, Alexandru Agache, Costin Raiciu

    Abstract: Research in the field of language models is rapidly evolving, with many open models being released to the public. Openly available pretraining corpora usually focus on only a handful of languages, with many others either missing completely or extremely underrepresented. In this report, we introduce FuLG, a hundred-fifty-billion-token Romanian corpus extracted from CommonCrawl. We present our metho… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  2. arXiv:2407.12819  [pdf, other

    cs.DC cs.CL cs.LG cs.NI

    A Look Into Training Large Language Models on Next Generation Datacenters

    Authors: Alexandru M. Gherghescu, Vlad-Andrei Bădoiu, Alexandru Agache, Mihai-Valentin Dumitru, Iuliu Vasilescu, Radu Mantu, Costin Raiciu

    Abstract: Is it still worth doing computer networking research? What are relevant problems in this space given the supremacy of hyperscalers in deployed large networks? We take an unconventional approach to finding relevant research directions, by starting from Microsoft's plans to build a $100 billion datacenter for ML. Our goal is to understand what models could be trained in such a datacenter, as well as… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2406.13679  [pdf, other

    cs.NI cs.LG

    Prose-to-P4: Leveraging High Level Languages

    Authors: Mihai-Valentin Dumitru, Vlad-Andrei Bădoiu, Costin Raiciu

    Abstract: Languages such as P4 and NPL have enabled a wide and diverse range of networking applications that take advantage of programmable dataplanes. However, software development in these languages is difficult. To address this issue, high-level languages have been designed to offer programmers powerful abstractions that reduce the time, effort and domain-knowledge required for developing networking appl… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  4. arXiv:2404.01630  [pdf, other

    cs.NI

    SMaRTT-REPS: Sender-based Marked Rapidly-adapting Trimmed & Timed Transport with Recycled Entropies

    Authors: Tommaso Bonato, Abdul Kabbani, Daniele De Sensi, Rong Pan, Yanfang Le, Costin Raiciu, Mark Handley, Timo Schneider, Nils Blach, Ahmad Ghalayini, Daniel Alves, Michael Papamichael, Adrian Caulfield, Torsten Hoefler

    Abstract: With the rapid growth of machine learning (ML) workloads in datacenters, existing congestion control (CC) algorithms fail to deliver the required performance at scale. ML traffic is bursty and bulk-synchronous and thus requires quick reaction and strong fairness. We show that existing CC algorithms that use delay as a main signal react too slowly and are not always fair. We design SMaRTT, a simple… ▽ More

    Submitted 27 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Fixed typo and wrong y axis of one plot

  5. arXiv:2309.15996  [pdf, other

    cs.OS

    Loupe: Driving the Development of OS Compatibility Layers

    Authors: Hugo Lefeuvre, Gaulthier Gain, Vlad-Andrei Bădoiu, Daniel Dinca, Vlad-Radu Schiller, Costin Raiciu, Felipe Huici, Pierre Olivier

    Abstract: Supporting mainstream applications is fundamental for a new OS to have impact. It is generally achieved by developing a layer of compatibility allowing applications developed for a mainstream OS like Linux to run unmodified on the new OS. Building such a layer, as we show, results in large engineering inefficiencies due to the lack of efficient methods to precisely measure the OS features required… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted to appear at ASPLOS'24 (https://www.asplos-conference.org/asplos2024/)

  6. arXiv:2112.06566  [pdf, other

    cs.OS

    FlexOS: Towards Flexible OS Isolation

    Authors: Hugo Lefeuvre, Vlad-Andrei Bădoiu, Alexander Jung, Stefan Teodorescu, Sebastian Rauch, Felipe Huici, Costin Raiciu, Pierre Olivier

    Abstract: At design time, modern operating systems are locked in a specific safety and isolation strategy that mixes one or more hardware/software protection mechanisms (e.g. user/kernel separation); revisiting these choices after deployment requires a major refactoring effort. This rigid approach shows its limits given the wide variety of modern applications' safety/performance requirements, when new hardw… ▽ More

    Submitted 14 January, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: Artifact Evaluation Repository: https://github.com/project-flexos/asplos22-ae

  7. Unikraft: Fast, Specialized Unikernels the Easy Way

    Authors: Simon Kuenzer, Vlad-Andrei Bădoiu, Hugo Lefeuvre, Sharan Santhanam, Alexander Jung, Gaulthier Gain, Cyril Soldani, Costin Lupu, Ştefan Teodorescu, Costi Răducanu, Cristian Banu, Laurent Mathy, Răzvan Deaconescu, Costin Raiciu, Felipe Huici

    Abstract: Unikernels are famous for providing excellent performance in terms of boot times, throughput and memory consumption, to name a few metrics. However, they are infamous for making it hard and extremely time consuming to extract such performance, and for needing significant engineering effort in order to port applications to them. We introduce Unikraft, a novel micro-library OS that (1) fully modular… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

    Comments: 19 pages, 22 figures, 7 tables, conference proceedings

  8. arXiv:1604.02847  [pdf, other

    cs.NI

    SymNet: scalable symbolic execution for modern networks

    Authors: Radu Stoenescu, Matei Popovici, Lorina Negreanu, Costin Raiciu

    Abstract: We present SymNet, a network static analysis tool based on symbolic execution. SymNet quickly analyzes networks by injecting symbolic packets and tracing their path through the network. Our key novelty is SEFL, a language we designed for network processing that is symbolic-execution friendly. SymNet is easy to use: we have developed parsers that automatically generate SEFL models from router and… ▽ More

    Submitted 11 April, 2016; originally announced April 2016.

    Comments: 13 pages