rec06 - 6.006 Intro to Algorithms Recitation 06 February...

Info iconThis preview shows pages 1–2. Sign up to view the full content.

View Full Document Right Arrow Icon
6.006 Intro to Algorithms Recitation 06 February 18, 2011 Rolling Hash (Rabin-Karp Algorithm) Objective If we have text string S and pattern string P , we want to determine whether or not P is found in S , i.e. P is a substring of S . Notes on Strings Strings are arrays of characters. Characters however can be interpreted as integers, with their exact values depending on what type of encoding is being used (e.g. ASCII, Unicode). This means we can treat strings as arrays of integers. Finding a way to convert an array of integers into a single integer allows us to hash strings with hash functions that expect numbers as input. Since strings are arrays and not single elements, comparing two strings for equality is not as straightforward as comparing two integers for equality. To check to see if string A and string B are equal, we would have to iterate through all of A ’s elements and all of B ’s elements, making sure that A [ i ] = B [ i ] for all i . This means that string comparison depends on the length of the strings. Comparing two n -length strings takes O ( n ) time. Also, since hashing a string usually involves iterating through the string’s elements, hashing a string of length n also takes O ( n ) time. Method Say P has length L and S has length n . One way to search for P in S : 1. Hash P to get h ( P ) O(L) 2. Iterate through all length L substrings of S , hashing those substrings and comparing to h ( P ) O(nL) 3. If a substring hash value does match h ( P ) , do a string comparison on that substring and P , stopping if they do match and continuing if they do not. O(L) This method takes O ( nL ) time. We can improve on this runtime by using a rolling hash . In step 2. we looked at O ( n ) substrings independently and took O ( L ) to hash them all. These sub- strings however have a lot of overlap. For example, looking at length 5 substrings of “algorithms”, the first two substrings are “algor” and “lgori”. Wouldn’t it be nice if we could take advantage of
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Image of page 2
This is the end of the preview. Sign up to access the rest of the document.

This note was uploaded on 11/11/2011 for the course MATH 180 taught by Professor Byrns during the Spring '11 term at Montgomery College.

Page1 / 4

rec06 - 6.006 Intro to Algorithms Recitation 06 February...

This preview shows document pages 1 - 2. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online