Comment on 

Volume 11, article R8 (January 5, 2004)

"Strings with maximally many distinct subsequences and substrings"

Abraham Flaxman, Aram W. Harrow and Gregory B. Sorkin


Comment by the authors, July 15, 2004: (For earlier comments, click here)

Professor Zoltán Kása of Babes-Bolyai University has pointed out that all of our results for substrings were discovered by Professor Antal Iványi of Eötvös Loránd University in 1987 [Ivá87]. As in our Theorem 8, [Ivá87] used de Bruijn sequences to construct words of any alphabet size with the maximal number of distinct substrings, and showed that for alphabet size > 2 there is an infinite word whose n-prefixes suffice, and that for binary alphabets there is not. We are grateful to Kása and Iványi for their correspondence.

The connection between de Bruijn sequences and strings with maximally many distinct substrings is also used by Anisiu, Blázsik and Kása [ABK02] to calculate, for a given string length, the number of strings with maximally many distinct substrings.

Iványi's work [Ivá87] actually addresses "d-substrings", a notion introduced by Hunyadvári and Iványi [HI84] which interpolates between substrings and subsequences as d is varied. A d-substring of a string is a subsequence which can be obtained with gaps of at most d; for d=1 it is a substring and for d=infinity it is a subsequence. Kása [Kás98] also considers d-complexity (the number of d-substrings), focussing on the existence of a string with a given 1-complexity.

We note that if d is at least as large as the alphabet size, the string we construct having a maximum number of distinct subsequences has an equal number of distinct d-substrings, so our result generalizes immediately to d-complexity in that case. For d smaller than the alphabet size, a simple argument (obtained in a discussion with Don Coppersmith, and paralleling the recursion for Fibonacci numbers) shows that the maximum d-complexity of a string of length n is of asymptotic order Ln, where both L and the hidden constants depend on d.

References

[ABK02]
Mira-Cristiana Anisiu, Zoltán Blázsik, and Zoltán Kása, Maximal complexity of finite words, Pure Math. Appl. 13 (2002), no. 1-2, 39-48, Algebraic systems (Felix-Oradea, 2001). MR 2004e:68083

[HI84]
László Hunyadvári and Antal Iványi, On some complexity measures of words, Conference on Automata, Languages, and Mathematical Systems (Salgótarján, Hungary, May 21-23, 1984), no. DM 84-2, Karl Marx Univ. of Economics, Dept. of Mathematics, 1984, pp. 67-82.

[Ivá87]
Antal Iványi, On the d-complexity of words, Ann. Univ. Sci. Budapest. Sect. Comput. 8 (1987), 69-90 (1988). MR 90d:68063

[Kás98]
Zoltán Kása, On the d-complexity of strings, Pure Math. Appl. 9 (1998), no. 1-2, 119-128, Modern applied analysis (Ilieni, 1997). MR 2000c:68112




Back to Volume 11