Zum Hauptinhalt springen

Showing 1–7 of 7 results for author: Malas, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:1809.06532  [pdf, other

    cs.DL

    Nanopublications: A Growing Resource of Provenance-Centric Scientific Linked Data

    Authors: Tobias Kuhn, Albert Meroño-Peñuela, Alexander Malic, Jorrit H. Poelen, Allen H. Hurlbert, Emilio Centeno Ortiz, Laura I. Furlong, Núria Queralt-Rosinach, Christine Chichester, Juan M. Banda, Egon Willighagen, Friederike Ehrhart, Chris Evelo, Tareq B. Malas, Michel Dumontier

    Abstract: Nanopublications are a Linked Data format for scholarly data publishing that has received considerable uptake in the last few years. In contrast to the common Linked Data publishing practice, nanopublications work at the granular level of atomic information snippets and provide a consistent container format to attach provenance and metadata at this atomic level. While the nanopublications format i… ▽ More

    Submitted 18 September, 2018; originally announced September 2018.

    Journal ref: In Proceedings of IEEE eScience 2018

  2. arXiv:1708.05256  [pdf, other

    cs.PF cs.CV cs.LG

    Deep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific Data

    Authors: Thorsten Kurth, Jian Zhang, Nadathur Satish, Ioannis Mitliagkas, Evan Racah, Mostofa Ali Patwary, Tareq Malas, Narayanan Sundaram, Wahid Bhimji, Mikhail Smorkalov, Jack Deslippe, Mikhail Shiryaev, Srinivas Sridharan, Prabhat, Pradeep Dubey

    Abstract: This paper presents the first, 15-PetaFLOP Deep Learning system for solving scientific pattern classification problems on contemporary HPC architectures. We develop supervised convolutional architectures for discriminating signals in high-energy physics data as well as semi-supervised architectures for localizing and classifying extreme weather in climate data. Our Intelcaffe-based implementation… ▽ More

    Submitted 17 August, 2017; originally announced August 2017.

    Comments: 12 pages, 9 figures

  3. arXiv:1510.05218  [pdf, other

    cs.CE cs.DC cs.PF

    Optimization of an electromagnetics code with multicore wavefront diamond blocking and multi-dimensional intra-tile parallelization

    Authors: Tareq M. Malas, Julian Hornich, Georg Hager, Hatem Ltaief, Christoph Pflaum, David E. Keyes

    Abstract: Understanding and optimizing the properties of solar cells is becoming a key issue in the search for alternatives to nuclear and fossil energy sources. A theoretical analysis via numerical simulations involves solving Maxwell's Equations in discretized form and typically requires substantial computing effort. We start from a hybrid-parallel (MPI+OpenMP) production code that implements the Time Har… ▽ More

    Submitted 18 October, 2015; originally announced October 2015.

  4. arXiv:1510.04995  [pdf, other

    cs.DC cs.PF

    Multi-dimensional intra-tile parallelization for memory-starved stencil computations

    Authors: Tareq Malas, Georg Hager, Hatem Ltaief, David Keyes

    Abstract: Optimizing the performance of stencil algorithms has been the subject of intense research over the last two decades. Since many stencil schemes have low arithmetic intensity, most optimizations focus on increasing the temporal data access locality, thus reducing the data traffic through the main memory interface with the ultimate goal of decoupling from this bottleneck. There are, however, only fe… ▽ More

    Submitted 16 October, 2015; originally announced October 2015.

  5. arXiv:1410.5561  [pdf, other

    cs.PF

    Towards energy efficiency and maximum computational intensity for stencil algorithms using wavefront diamond temporal blocking

    Authors: Tareq Malas, Georg Hager, Hatem Ltaief, David Keyes

    Abstract: We study the impact of tunable parameters on computational intensity (i.e., inverse code balance) and energy consumption of multicore-optimized wavefront diamond temporal blocking (MWD) applied to different stencil-based update schemes. MWD combines the concepts of diamond tiling and multicore-aware wavefront blocking in order to achieve lower cache size requirements than standard single-core wave… ▽ More

    Submitted 21 October, 2014; originally announced October 2014.

  6. Multicore-optimized wavefront diamond blocking for optimizing stencil updates

    Authors: Tareq Malas, Georg Hager, Hatem Ltaief, Holger Stengel, Gerhard Wellein, David Keyes

    Abstract: The importance of stencil-based algorithms in computational science has focused attention on optimized parallel implementations for multilevel cache-based processors. Temporal blocking schemes leverage the large bandwidth and low latency of caches to accelerate stencil updates and approach theoretical peak performance. A key ingredient is the reduction of data traffic across slow data paths, espec… ▽ More

    Submitted 12 October, 2014; originally announced October 2014.

  7. Optimizing the Performance of Streaming Numerical Kernels on the IBM Blue Gene/P PowerPC 450 Processor

    Authors: Tareq M. Malas, Aron J. Ahmadia, Jed Brown, John A. Gunnels, David E. Keyes

    Abstract: Several emerging petascale architectures use energy-efficient processors with vectorized computational units and in-order thread processing. On these architectures the sustained performance of streaming numerical kernels, ubiquitous in the solution of partial differential equations, represents a challenge despite the regularity of memory access. Sophisticated optimization techniques are required t… ▽ More

    Submitted 17 January, 2012; originally announced January 2012.