IR-part2

29 introducon to informaon retrieval s ec 621

Info iconThis preview shows page 1. Sign up to view the full content.

View Full Document Right Arrow Icon
This is the end of the preview. Sign up to access the rest of the document.

Unformatted text preview: requency weigh*ng Introduc)on to Informa)on Retrieval Introduc*on to Informa(on Retrieval (Inverse) Document frequency weigh*ng Introduc)on to Informa)on Retrieval Sec. 6.2.1 Document frequency §༊  Rare terms are more informa*ve than frequent terms §༊  Recall stop words §༊  Consider a term in the query that is rare in the collec*on (e.g., arachnocentric) §༊  A document containing this term is very likely to be relevant to the query arachnocentric §༊  → We want a high weight for rare terms like arachnocentric. Introduc)on to Informa)on Retrieval Sec. 6.2.1 Document frequency, con*nued §༊  Frequent terms are less informa*ve than rare terms §༊  Consider a query term that is frequent in the collec*on (e.g., high, increase, line) §༊  A document containing such a term is more likely to be relevant than a document that doesn...
View Full Document

This document was uploaded on 02/14/2014.

Ask a homework question - tutors are online