Matthew O'Keefe, Ph.D.

Greater Minneapolis-St. Paul Area
7K followers 500+ connections

About

Results-driven senior technologist with over 25 years of success in building and…

Experience

  • Confluent

    St Michael, Minnesota, United States

  • -

    Minneapolis, Minnesota, United States

  • -

    Greater Minneapolis-St. Paul Area

  • -

    Santa Clara, California

  • -

    Santa Clara, California

Education

Licenses & Certifications

  • Commercial Drone Pilot

    FAA

  • PADI Open Water Diver

    PADI

  • 101, 103, 104, 105, 106, 115, Coastal Navigation, Marine Weather, Docking

    American Sailing Association

  • United States Coast Guard Captain

    U.S. Coast Guard

Volunteer Experience

  • Scoutmaster

    Sea Scout Ship Mendota

    Present (8 years 8 months)

    Children

    Focusing on sailing and water sports and adventures with the Sea Scouts of Ship Mendota.

Publications

  • High Performance Solid State Storage Under Linux

    IEEE

    We discuss the impact on Linux system design of a prototype PCI Express SSD that operates at least an order-of-magnitude faster than most drives available today. We develop benchmarking strategies and focus on several areas where current Linux systems need improvement, and suggest methods of taking full advantage of such high-performance solid state storage.
    We demonstrate that an SSD can perform with high throughput, high operation rates, and low latency under the most difficult conditions. This suggests that high performance SSDs can dramatically improve parallel I/O performance for future high performance computing (HPC) systems.

  • Bringsel: A Tool for Measuring Storage System Reliability, Uniformity, Performance and Scalability

    Cray Users Group

    Bringsel is a primary I/O testing program that enables the use of either POSIX or MPI-IO calls to perform benchmarking and evaluation to measure the reliability, uniformity, performance and scalability of file systems and storage technologies. It enables the creation of a large number of directories and files using both a threading model (POSIX) and the MPI library for multiple nodes to coordinate testing activity. Bringsel has run on a variety of large scale computing platforms, including Cray XTs, SGI Origin systems, Sun enterprise-scale SMP systems and Linux clusters.

    Other authors
    • John Kaitschuck
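
The POSIX threading mode described in the Bringsel abstract can be illustrated with a minimal sketch. This is not Bringsel's code: the function names `create_files` and `run`, the directory and file naming, and the reported statistics are all assumptions made for illustration, and the MPI-IO path is omitted. Each worker thread creates its own directory, writes a fixed number of files, and records per-file latency so that both performance (mean) and uniformity (worst case) can be inspected.

```python
import os
import statistics
import tempfile
import time
from concurrent.futures import ThreadPoolExecutor


def create_files(root, worker_id, n_files, size):
    """Create one directory and write n_files files of `size` bytes into it.

    Returns the per-file create+write latency in seconds.
    """
    directory = os.path.join(root, f"dir{worker_id:03d}")
    os.makedirs(directory)
    payload = b"\0" * size
    latencies = []
    for i in range(n_files):
        start = time.perf_counter()
        with open(os.path.join(directory, f"file{i:05d}"), "wb") as handle:
            handle.write(payload)
        latencies.append(time.perf_counter() - start)
    return latencies


def run(root, n_workers=4, n_files=50, size=4096):
    """Run the workers concurrently and summarize the latencies."""
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        results = list(
            pool.map(lambda w: create_files(root, w, n_files, size), range(n_workers))
        )
    flat = [latency for worker in results for latency in worker]
    return {
        "files": len(flat),
        "mean_s": statistics.mean(flat),
        "max_s": max(flat),
    }


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as scratch:
        print(run(scratch))
```

A large gap between `mean_s` and `max_s` would point to non-uniform behavior, which is one of the properties the abstract says the tool measures.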
  • Disk-Based Technology for Multi-Petabyte Archives

    Cray Users Group

    Cray supercomputers create and process very large data sets, many of which are archived. The archives are commonly tape-based hierarchical storage management (HSM) systems. In this paper, we describe and discuss the usage requirements of current tape-based HSM systems and other data migration technologies. We then propose a potential alternative, a complementary strategy that uses disk-based MAID storage for deep archiving to speed access and improve data management scalability.

  • Performance, Reliability, and Operational Issues for High Performance NAS Storage on Cray Platforms

    Cray Users Group

    This report summarizes performance and failure analysis results for the BlueArc Titan NFS server in the Cray CASA test bed in Chippewa Falls, Wisconsin. The goals of this project are: to observe and describe the file transfer performance and fault recovery behavior of the BlueArc under light and heavy loads with varying file sizes and system access patterns; to measure the BlueArc Titan NAS server for reliability, uniformity, performance, and scalability in operational scenarios; and to develop configuration guidelines and best practices to achieve the highest performance, most efficient utilization and effective data management using the BlueArc platform capabilities.


Patents

  • Remote Backup and Restore

    Issued US9405776B2

    The present invention relates generally to backup of computer systems and, more particularly, but not by way of limitation, to non-redundant backup of computer systems on a cluster of backup servers. In one embodiment of the present invention, a method includes partitioning a fingerprint namespace among a cluster of backup servers, the fingerprint namespace comprising a universe of fingerprints for representing units of data, each backup server of the cluster of backup servers managing units of data having fingerprints corresponding to an assigned partition of the fingerprint namespace. The method further includes receiving backup information from a client computing device for a block of data comprising units of data, the backup information including at least a fingerprint for each of the units of data and client-specific backup information. In addition, the method includes, utilizing the fingerprint for each of the units of data, deduplicating the units of data in parallel at the cluster of backup servers in accordance with the partitioning step, the deduplicating step comprising identifying ones of the units of data already stored by the cluster of backup servers.

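
The partitioning idea in this abstract can be sketched in a few lines. The sketch below is an illustration under stated assumptions, not the patented implementation: SHA-256 as the fingerprint function, contiguous range partitioning on the first 16 bits of the fingerprint, and the names `BackupServer`, `Cluster`, `owner`, and `backup` are all hypothetical. Each server handles only fingerprints in its own range, so the per-server checks are independent and could run in parallel; the loop here is sequential for brevity.

```python
import hashlib


def fingerprint(unit: bytes) -> str:
    """Fingerprint a unit of data (SHA-256 is an assumption of this sketch)."""
    return hashlib.sha256(unit).hexdigest()


class BackupServer:
    """Stores the units whose fingerprints fall in this server's partition."""

    def __init__(self):
        self.store = {}

    def already_stored(self, fingerprints):
        return {fp for fp in fingerprints if fp in self.store}

    def put(self, fp, unit):
        self.store[fp] = unit


class Cluster:
    def __init__(self, n_servers):
        self.servers = [BackupServer() for _ in range(n_servers)]

    def owner(self, fp: str) -> int:
        # Map the first 16 bits of the fingerprint onto contiguous ranges,
        # one range per server.
        return int(fp[:4], 16) * len(self.servers) // 65536

    def backup(self, units):
        """Back up a block of units; return how many were newly stored."""
        # Group fingerprints by owning server. Duplicates inside the block
        # collapse because each per-server group is keyed by fingerprint.
        by_server = {}
        for unit in units:
            fp = fingerprint(unit)
            by_server.setdefault(self.owner(fp), {})[fp] = unit

        stored = 0
        for index, group in by_server.items():
            server = self.servers[index]
            known = server.already_stored(group.keys())
            for fp, unit in group.items():
                if fp not in known:
                    server.put(fp, unit)
                    stored += 1
        return stored
```

For example, backing up `[b"a", b"b", b"a"]` stores two units, and a later backup of `[b"b", b"c"]` stores only one, because `b"b"` is already held by the server that owns its fingerprint range.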
  • Metadata structures and related locking techniques to improve performance and scalability in a cluster file system

    Issued US 7,487,228

    A cluster file system is disclosed. A plurality of disk servers, operating as a single distributed disk server layer, are connected to the one or more physical disks. Each disk server stores metadata for each of the files. A plurality of lock servers, having one or more locks for each file and associated file system metadata operates as a single centralized distributed lock server to coordinate access to the files stored by the disk server layer. A plurality of asynchronous file servers, operating as a single distributed file server layer in response to a request for a file from a user program: (i) retrieves the requested file from the disk server layer and at least one lock, associated with the retrieved file, from the lock server, and (ii) retrieves metadata for at least one file that is related to the requested files, and at least one lock, for each file that has had its metadata retrieved.

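
As a rough illustration of the layering in this abstract, the sketch below models a disk server, a lock server, and a file server in one process. It is a toy model under assumptions, not the patented design: the abstract does not define what makes a file "related", so this sketch treats files in the same directory as related, and the class and method names (`DiskServer`, `LockServer`, `FileServer`, `open`, `siblings`) are hypothetical.

```python
import posixpath
import threading
from collections import defaultdict


class LockServer:
    """Hands out one lock per file path."""

    def __init__(self):
        self._locks = defaultdict(threading.Lock)
        self._guard = threading.Lock()

    def lock_for(self, path):
        with self._guard:
            return self._locks[path]


class DiskServer:
    """Holds file contents and derives metadata from them."""

    def __init__(self, files):
        self.files = files  # maps path -> bytes

    def read(self, path):
        return self.files[path]

    def metadata(self, path):
        return {"size": len(self.files[path])}

    def siblings(self, path):
        # Assumption: "related" files are those in the same directory.
        parent = posixpath.dirname(path)
        return [
            other
            for other in self.files
            if other != path and posixpath.dirname(other) == parent
        ]


class FileServer:
    def __init__(self, disk, locks):
        self.disk = disk
        self.locks = locks
        self.meta_cache = {}

    def open(self, path):
        # (i) Retrieve the requested file while holding its lock.
        with self.locks.lock_for(path):
            data = self.disk.read(path)
        # (ii) Retrieve metadata, each under its own lock, for related files.
        for other in self.disk.siblings(path):
            with self.locks.lock_for(other):
                self.meta_cache[other] = self.disk.metadata(other)
        return data
```

Opening `/a/x` in a store that also holds `/a/y` and `/b/z` returns the contents of `/a/x` and leaves metadata for `/a/y` in `meta_cache`, so a follow-up request for that sibling's metadata would not need another round trip to the disk layer.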
  • Distributed file system using disk servers, lock servers and file servers

    Issued US 7,406,473

    Abstract:
    A distributed file system is disclosed. A plurality of disk servers, operating as a single distributed disk server layer, are connected to the one or more physical disks. Each disk server stores meta-data for each of the files. A plurality of lock servers, having one or more locks for each file operates as a single distributed lock server to coordinate access to the files stored by the disk server layer. A plurality of asynchronous file servers, operating as a single distributed file server layer in response to a request for a file from a user program: (i) retrieves the requested file from the disk server layer and at least one lock, associated with the retrieved file, from the lock server, and (ii) retrieves meta-data for at least one file that is related to the requested files, and at least one lock, for each file that has had its meta-data retrieved.

Languages

  • English

Organizations

  • Wayzata Yacht Club

    Member

    Present

    WYC is one of the top sailboat racing yacht clubs in North America.

  • American Sailing Association

    Member

    Present

    ASA promotes sailing and provides training and certifications for cruising and day sailors.
