Zum Hauptinhalt springen

Showing 1–12 of 12 results for author: Badkobeh, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16475  [pdf, other

    cs.DS

    Bijective BWT based compression schemes

    Authors: Golnaz Badkobeh, Hideo Bannai, Dominik Köppl

    Abstract: We investigate properties of the bijective Burrows-Wheeler transform (BBWT). We show that for any string $w$, a bidirectional macro scheme of size $O(r_B)$ can be induced from the BBWT of $w$, where $r_B$ is the number of maximal character runs in the BBWT. We also show that $r_B = O(z\log^2 n)$, where $n$ is the length of $w$ and $z$ is the number of Lempel-Ziv 77 factors of $w$. Then, we show a… ▽ More

    Submitted 19 August, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: Slightly extended version of paper accepted to SPIRE 2024

  2. arXiv:2209.00271  [pdf, ps, other

    cs.DS cs.FL

    Maximal Closed Substrings

    Authors: Golnaz Badkobeh, Alessandro De Luca, Gabriele Fici, Simon Puglisi

    Abstract: A string is closed if it has length 1 or has a nonempty border without internal occurrences. In this paper we introduce the definition of a \emph{maximal closed substring} (MCS), which is an occurrence of a closed substring that cannot be extended to the left nor to the right into a longer closed substring. MCSs with exponent at least $2$ are commonly called \emph{runs}; those with exponent smalle… ▽ More

    Submitted 14 May, 2024; v1 submitted 1 September, 2022; originally announced September 2022.

  3. arXiv:2106.01763  [pdf, other

    cs.DS

    Internal Shortest Absent Word Queries in Constant Time and Linear Space

    Authors: Golnaz Badkobeh, Panagiotis Charalampopoulos, Dmitry Kosolobov, Solon P. Pissis

    Abstract: Given a string $T$ of length $n$ over an alphabet $Σ\subset \{1,2,\ldots,n^{O(1)}\}$ of size $σ$, we are to preprocess $T$ so that given a range $[i,j]$, we can return a representation of a shortest string over $Σ$ that is absent in the fragment $T[i]\cdots T[j]$ of $T$. We present an $O(n)$-space data structure that answers such queries in constant time and can be constructed in $O(n\log_σn)$ tim… ▽ More

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: 13 pages, 1 figure, 4 tables

  4. arXiv:2011.12742  [pdf, other

    cs.DS cs.FL

    Left Lyndon tree construction

    Authors: Golnaz Badkobeh, Maxime Crochemore

    Abstract: We extend the left-to-right Lyndon factorisation of a word to the left Lyndon tree construction of a Lyndon word. It yields an algorithm to sort the prefixes of a Lyndon word according to the infinite ordering defined by Dolce et al. (2019). A straightforward variant computes the left Lyndon forest of a word. All algorithms run in linear time on a general alphabet, that is, in the letter-compariso… ▽ More

    Submitted 25 November, 2020; originally announced November 2020.

    Comments: 18 pages

    MSC Class: 68W32; 68W27 ACM Class: F.2.2; E.1

  5. arXiv:1902.04785  [pdf, other

    cs.DS

    Constructing Antidictionaries in Output-Sensitive Space

    Authors: Lorraine A. K. Ayad, Golnaz Badkobeh, Gabriele Fici, Alice Héliou, Solon P. Pissis

    Abstract: A word $x$ that is absent from a word $y$ is called minimal if all its proper factors occur in $y$. Given a collection of $k$ words $y_1,y_2,\ldots,y_k$ over an alphabet $Σ$, we are asked to compute the set $\mathrm{M}^{\ell}_{y_{1}\#\ldots\#y_{k}}$ of minimal absent words of length at most $\ell$ of word $y=y_1\#y_2\#\ldots\#y_k$, $\#\notinΣ$. In data compression, this corresponds to computing th… ▽ More

    Submitted 13 February, 2019; originally announced February 2019.

    Comments: Version accepted to DCC 2019

  6. arXiv:1811.08231  [pdf, ps, other

    math.CO cs.DM

    Avoiding conjugacy classes on the 5-letter alphabet

    Authors: Golnaz Badkobeh, Pascal Ochem

    Abstract: We construct an infinite word $w$ over the $5$-letter alphabet such that for every factor $f$ of $w$ of length at least two, there exists a cyclic permutation of $f$ that is not a factor of $w$. In other words, $w$ does not contain a non-trivial conjugacy class. This proves the conjecture in Gamard et al. [TCS 2018]

    Submitted 20 November, 2018; originally announced November 2018.

  7. arXiv:1805.10042  [pdf, ps, other

    cs.DS cs.DM cs.FL

    Algorithms for Anti-Powers in Strings

    Authors: Golnaz Badkobeh, Gabriele Fici, Simon J. Puglisi

    Abstract: A string $S[1,n]$ is a power (or tandem repeat) of order $k$ and period $n/k$ if it can decomposed into $k$ consecutive equal-length blocks of letters. Powers and periods are fundamental to string processing, and algorithms for their efficient computation have wide application and are heavily studied. Recently, Fici et al. (Proc. ICALP 2016) defined an {\em anti-power} of order $k$ to be a string… ▽ More

    Submitted 25 May, 2018; originally announced May 2018.

    Journal ref: Published in Informnation Processing Letters Volume 137, September 2018, Pages 57-60

  8. arXiv:1705.09538  [pdf, ps, other

    cs.DS

    On Two LZ78-style Grammars: Compression Bounds and Compressed-Space Computation

    Authors: Golnaz Badkobeh, Travis Gagie, Shunsuke Inenaga, Tomasz Kociumaka, Dmitry Kosolobov, Simon J. Puglisi

    Abstract: We investigate two closely related LZ78-based compression schemes: LZMW (an old scheme by Miller and Wegman) and LZD (a recent variant by Goto et al.). Both LZD and LZMW naturally produce a grammar for a string of length $n$; we show that the size of this grammar can be larger than the size of the smallest grammar by a factor $Ω(n^{\frac{1}3})$ but is always within a factor… ▽ More

    Submitted 25 July, 2017; v1 submitted 26 May, 2017; originally announced May 2017.

    Comments: 12 pages, accepted to SPIRE 2017

  9. arXiv:1305.6395  [pdf, ps, other

    cs.FL math.CO

    On the Number of Closed Factors in a Word

    Authors: Golnaz Badkobeh, Gabriele Fici, Zsuzsanna Lipták

    Abstract: A closed word (a.k.a. periodic-like word or complete first return) is a word whose longest border does not have internal occurrences, or, equivalently, whose longest repeated prefix is not right special. We investigate the structure of closed factors of words. We show that a word of length $n$ contains at least $n+1$ distinct closed factors, and characterize those words having exactly $n+1$ closed… ▽ More

    Submitted 1 December, 2014; v1 submitted 28 May, 2013; originally announced May 2013.

    Comments: Accepted to LATA 2015

    MSC Class: 68R15

  10. arXiv:1207.5723  [pdf, ps, other

    cs.DM

    Fewest repetitions in infinite binary words

    Authors: Golnaz Badkobeh, Maxime Crochemore

    Abstract: A square is the concatenation of a nonempty word with itself. A word has period p if its letters at distance p match. The exponent of a nonempty word is the quotient of its length over its smallest period. In this article we give a proof of the fact that there exists an infinite binary word which contains finitely many squares and simultaneously avoids words of exponent larger than 7/3. Our infi… ▽ More

    Submitted 24 July, 2012; originally announced July 2012.

  11. Binary Jumbled String Matching for Highly Run-Length Compressible Texts

    Authors: Golnaz Badkobeh, Gabriele Fici, Steve Kroon, Zsuzsanna Lipták

    Abstract: The Binary Jumbled String Matching problem is defined as: Given a string $s$ over $\{a,b\}$ of length $n$ and a query $(x,y)$, with $x,y$ non-negative integers, decide whether $s$ has a substring $t$ with exactly $x$ $a$'s and $y$ $b$'s. Previous solutions created an index of size O(n) in a pre-processing step, which was then used to answer queries in constant time. The fastest algorithms for cons… ▽ More

    Submitted 31 May, 2013; v1 submitted 12 June, 2012; originally announced June 2012.

    Comments: v2: only small cosmetic changes; v3: new title, weakened conjectures on size of Corner Index (we no longer conjecture it to be always linear in size of RLE); removed experimental part on random strings (these are valid but limited in their predictive power w.r.t. general strings); v3 published in IPL

    MSC Class: 68W32; 68P05; 68P20 ACM Class: G.2.1

    Journal ref: Information Processing Letters, 113: 604-608 (2013)

  12. Finite-Repetition threshold for infinite ternary words

    Authors: Golnaz Badkobeh, Maxime Crochemore

    Abstract: The exponent of a word is the ratio of its length over its smallest period. The repetitive threshold r(a) of an a-letter alphabet is the smallest rational number for which there exists an infinite word whose finite factors have exponent at most r(a). This notion was introduced in 1972 by Dejean who gave the exact values of r(a) for every alphabet size a as it has been eventually proved in 2009.… ▽ More

    Submitted 17 August, 2011; originally announced August 2011.

    Comments: In Proceedings WORDS 2011, arXiv:1108.3412

    ACM Class: 68515

    Journal ref: EPTCS 63, 2011, pp. 37-43