project07 - CSE 231, Fall 2007 Programming Project 07...

Info iconThis preview shows pages 1–2. Sign up to view the full content.

View Full Document Right Arrow Icon
CSE 231, Fall 2007 Programming Project 07 Assignment Overview This assignment is worth 50 points (5.0% of the course grade) and must be completed and turned in before 11:59pm on Monday, October 22, 2007 . The purpose of this project is to work on dictionaries, sets, and functions. Task Build a simple search engine. Background Search engines played a significant role in developing the Web as we now know it. The details of Google’s search algorithm are trade secrets, but we can build a search engine to find the files that contain words we specify. For example, what files in your computer have both the words “mail” and “tool” in them? Program Specifications We will search text files (*.txt) in the current folder/directory for words and word combinations specified by the user. Algorithm 1. Read in a list of “stop” words from stopWords.txt 2. Read text files (*.txt) in current folder/directory. 3. Build a mapping from each word to the files that contain the word. Exclude “stop” words. Remove non-letters. Make words lower case.
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
Image of page 2
This is the end of the preview. Sign up to access the rest of the document.

Page1 / 2

project07 - CSE 231, Fall 2007 Programming Project 07...

This preview shows document pages 1 - 2. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online