Faster Compressed Dictionary Matching
In: String Processing and Information Retrieval ISBN: 9783642163203 SPIRE; (2010)
Online
unknown
Zugriff:
Given a set D of d patterns, the dictionary matching problem is to index D such that for any query text T, we can locate the occurrences of any pattern within T efficiently. When D contains a total of n characters drawn from an alphabet of size σ, Hon et al. (2008) gave an nHk(D) + o(n log σ)-bit index which supports a query in O(|T|(loge n + log d)+occ) time, where e > 0 and Hk(D) denotes the kth order entropy of D. Very recently, Belazzougui (2010) proposed an elegant scheme, which takes n log σ + O(n) bits of index space and supports a query in optimal O(|T|+occ) time. In this paper, we provide connections between Belazzougui's index and the XBW compression of Ferragina et al. (2005), and show that Belazzougui's index can be slightly modified to be stored in nHk(D)+O(n) bits, while query time remains optimal; this improves the compressed index by Hon et al. (2008) in both space and time.
Titel: |
Faster Compressed Dictionary Matching
|
---|---|
Autor/in / Beteiligte Person: | Ku, Tsung-Han ; Hon, Wing-Kai ; Thankachan, Sharma V. ; Jeffrey Scott Vitter ; Shah, Rahul |
Link: | |
Quelle: | String Processing and Information Retrieval ISBN: 9783642163203 SPIRE; (2010) |
Veröffentlichung: | Springer Berlin Heidelberg, 2010 |
Medientyp: | unknown |
ISBN: | 978-3-642-16320-3 (print) |
DOI: | 10.1007/978-3-642-16321-0_19 |
Schlagwort: |
|
Sonstiges: |
|