Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Fernandes, J B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.16728  [pdf, other

    cs.DC

    Auto Tuning for OpenMP Dynamic Scheduling applied to FWI

    Authors: Felipe H. S. da Silva, João B. Fernandes, Idalmis M. Sardina, Tiago Barros, Samuel Xavier-de-Souza, Italo A. S. Assis

    Abstract: Because Full Waveform Inversion (FWI) works with a massive amount of data, its execution requires much time and computational resources, being restricted to large-scale computer systems such as supercomputers. Techniques such as FWI adapt well to parallel computing and can be parallelized in shared memory systems using the application programming interface (API) OpenMP. The management of parallel… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  2. PATSMA: Parameter Auto-tuning for Shared Memory Algorithms

    Authors: Joao B. Fernandes, Felipe H. S. da Silva, Samuel Xavier-de-Souza, Italo A. S. Assis

    Abstract: Programs with high levels of complexity often face challenges in adjusting execution parameters, particularly when these parameters vary based on the execution context. These dynamic parameters significantly impact the program's performance, such as loop granularity, which can vary depending on factors like the execution environment, program input, or the choice of compiler. Given the expensive na… ▽ More

    Submitted 14 June, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Journal ref: SoftwareX, Volume 27, 2024, 101789

  3. arXiv:2401.04494  [pdf, other

    cs.DC

    Adaptive Asynchronous Work-Stealing for distributed load-balancing in heterogeneous systems

    Authors: João B. Fernandes, Ítalo A. S. de Assis, Idalmis M. S. Martins, Tiago Barros, Samuel Xavier-de-Souza

    Abstract: Supercomputers have revolutionized how industries and scientific fields process large amounts of data. These machines group hundreds or thousands of computing nodes working together to execute time-consuming programs that require a large amount of computational resources. Over the years, supercomputers have expanded to include new and different technologies characterizing them as heterogeneous. Ho… ▽ More

    Submitted 23 January, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

    Comments: 32 pages, 5 figures

  4. Auto-tuning of dynamic scheduling applied to 3D reverse time migration on multicore systems

    Authors: Ítalo A. S. Assis, João B. Fernandes, Tiago Barros, Samuel Xavier-de-Souza

    Abstract: Reverse time migration (RTM) is an algorithm widely used in the oil and gas industry to process seismic data. It is a computationally intensive task that suits well in parallel computers. Methods such as RTM can be parallelized in shared memory systems through scheduling iterations of parallel loops to threads. However, several aspects, such as memory size and hierarchy, number of cores, and input… ▽ More

    Submitted 5 July, 2020; v1 submitted 16 May, 2019; originally announced May 2019.