notes08 - Our plan is to compare each word of the text...

Info iconThis preview shows page 1. Sign up to view the full content.

View Full Document Right Arrow Icon
This is the end of the preview. Sign up to access the rest of the document.

Unformatted text preview: Our plan is to compare each word of the text against the words in the dictionary. But we dont have to read every word of the dictionary for each word of the file; since the dictionary is sorted alphabetically, if we sort the words in our text file, we can just make one pass through both files in parallel. sort < lowcase > sorted The sort program can take arguments to do sorting in many different ways; for example, you can ask it to sort the lines of a file based on the third word of each line. But in this case, we want the simplest possible sort: The sort key is the entire line, and were sorting in character-code order (which is the same as alphabetical order since we eliminated capital letters). Common words like the will occur many times in our text. Theres no need to spell-check the same word repeatedly. Since weve sorted the file, all instances of the same word are next to each other in the file, so we can ask Unix to eliminate consecutive equal lines: uniq < sorted > nodup...
View Full Document

Ask a homework question - tutors are online