Zum Hauptinhalt springen

Showing 1–6 of 6 results for author: Ayad, L A K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.11819  [pdf, other

    cs.DS cs.DB

    Text Indexing for Long Patterns using Locally Consistent Anchors

    Authors: Lorraine A. K. Ayad, Grigorios Loukides, Solon P. Pissis

    Abstract: In many real-world database systems, a large fraction of the data is represented by strings: sequences of letters over some alphabet. This is because strings can easily encode data arising from different sources. It is often crucial to represent such string datasets in a compact form but also to simultaneously enable fast pattern matching queries. This is the classic text indexing problem. The fou… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Extended version of a PVLDB 2023 paper. Abstract abridged to satisfy arXiv requirements

  2. arXiv:2405.04052  [pdf, other

    cs.DS

    Minimizing the Minimizers via Alphabet Reordering

    Authors: Hilde Verbeek, Lorraine A. K. Ayad, Grigorios Loukides, Solon P. Pissis

    Abstract: Minimizers sampling is one of the most widely-used mechanisms for sampling strings [Roberts et al., Bioinformatics 2004]. Let $S=S[1]\ldots S[n]$ be a string over a totally ordered alphabet $Σ$. Further let $w\geq 2$ and $k\geq 1$ be two integers. The minimizer of $S[i\mathinner{.\,.} i+w+k-2]$ is the smallest position in $[i,i+w-1]$ where the lexicographically smallest length-$k$ substring of… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Extended version of a paper accepted at CPM 2024

  3. arXiv:2310.09023  [pdf, other

    cs.DS

    Sparse Suffix and LCP Array: Simple, Direct, Small, and Fast

    Authors: Lorraine A. K. Ayad, Grigorios Loukides, Solon P. Pissis, Hilde Verbeek

    Abstract: Sparse suffix sorting is the problem of sorting $b=o(n)$ suffixes of a string of length $n$. Efficient sparse suffix sorting algorithms have existed for more than a decade. Despite the multitude of works and their justified claims for applications in text indexing, the existing algorithms have not been employed by practitioners. Arguably this is because there are no simple, direct, and efficient a… ▽ More

    Submitted 4 July, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: LATIN 2024 + experiments

  4. arXiv:1902.04785  [pdf, other

    cs.DS

    Constructing Antidictionaries in Output-Sensitive Space

    Authors: Lorraine A. K. Ayad, Golnaz Badkobeh, Gabriele Fici, Alice Héliou, Solon P. Pissis

    Abstract: A word $x$ that is absent from a word $y$ is called minimal if all its proper factors occur in $y$. Given a collection of $k$ words $y_1,y_2,\ldots,y_k$ over an alphabet $Σ$, we are asked to compute the set $\mathrm{M}^{\ell}_{y_{1}\#\ldots\#y_{k}}$ of minimal absent words of length at most $\ell$ of word $y=y_1\#y_2\#\ldots\#y_k$, $\#\notinΣ$. In data compression, this corresponds to computing th… ▽ More

    Submitted 13 February, 2019; originally announced February 2019.

    Comments: Version accepted to DCC 2019

  5. arXiv:1810.02099  [pdf, other

    cs.DS

    Longest Property-Preserved Common Factor

    Authors: Lorraine A. K Ayad, Giulia Bernardini, Roberto Grossi, Costas S. Iliopoulos, Nadia Pisanti, Solon P. Pissis, Giovanna Rosone

    Abstract: In this paper we introduce a new family of string processing problems. We are given two or more strings and we are asked to compute a factor common to all strings that preserves a specific property and has maximal length. Here we consider three fundamental string properties: square-free factors, periodic factors, and palindromic factors under three different settings, one per property. In the firs… ▽ More

    Submitted 4 October, 2018; originally announced October 2018.

    Comments: Extended version of SPIRE 2018 paper

  6. arXiv:1801.04425  [pdf, ps, other

    cs.DS

    Longest Common Prefixes with $k$-Errors and Applications

    Authors: Lorraine A. K. Ayad, Panagiotis Charalampopoulos, Costas S. Iliopoulos, Solon P. Pissis

    Abstract: Although real-world text datasets, such as DNA sequences, are far from being uniformly random, average-case string searching algorithms perform significantly better than worst-case ones in most applications of interest. In this paper, we study the problem of computing the longest prefix of each suffix of a given string of length $n$ over a constant-sized alphabet that occurs elsewhere in the strin… ▽ More

    Submitted 13 January, 2018; originally announced January 2018.