Comparative Study for String Matching Algorithms
DOI:
https://doi.org/10.25098/2.2.33Keywords:
string matching, DNA, Protein, Boyer-Moore, Knuth-Morris-Pratt, Rabin-KarpAbstract
String matching became important application nowadays, the increasing of database such as websites, document, DNA, etc., leads to the urgent needs for string matching; string matching has many applications such as DNA, protein matching, internet search engine, all these types of application of string matching, beside the huge amount of database lead to increase the need to fast and efficient string matching algorithms. This study is about comparing among most well-known string matching algorithms; it focuses on four types of string matching algorithms, each one of them working in a different way. The four are tested with four types of data; ASCII (256 character), English alphabet (26 characters), DNA (4 character), and protein (20 character), with different pattern length (100, 50, 20, 10, 5) results shown based on number of comparisons and time.
References
D. Gussfield, "Algorithms on strings, trees, and sequences," Computer Science and Computional Biology (Cambrigde, 1999), 1997.
G. Navarro, "A guided tour to approximate string matching," ACM computing surveys (CSUR), vol. 33, pp. 31-88, 2001.
C. C.-T. Lecroq. (1997, 22/5). EXACT STRING MATCHING ALGORITHMS. Available: http://www-igm.univ-mlv.fr/~lecroq/string/
Oracle. (2016). Comparison: Exact String Match. Available: http://www.oracle.com/webfolder/technetwork/data-quality/edqhelp/Content/processor_library/matching/comparisons/exact_string_match.htm
D. E. Knuth, et al., "Fast pattern matching in strings," SIAM journal on computing, vol. 6, pp. 323-350, 1977.
K. A. Berman and J. L. Paul, Algorithms: sequential, parallel, and distributed: Course Technology Ptr, 2005.
R. S. Boyer and J. S. Moore, "A fast string searching algorithm," Communications of the ACM, vol. 20, pp. 762-772, 1977.
C. Charras and T. Lecroq, Handbook of exact string matching algorithms: King's College, 2004.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
SJCUS's open access articles are published under a Creative Commons Attribution CC-BY-NC-ND 4.0 license.
