Say there are M = 500K distinct terms among these


Introduction to Information Retrieval, Sec. 1.1

...
Cleopatra  1 0 0 0 0 0
mercy      1 0 1 1 1 1
worser     1 0 1 1 1 0

Answers to query

• Antony and Cleopatra, Act III, Scene ii
  Agrippa [Aside to DOMITIUS ENOBARBUS]:
    Why, Enobarbus,
    When Antony found Julius Caesar dead,
    He cried almost to roaring; and he wept
    When at Philippi he found Brutus slain.
• Hamlet, Act III, Scene ii
  Lord Polonius:
    I did enact Julius Caesar I was killed i' the Capitol;
    Brutus killed me.

Bigger collections

• Consider N = 1 million documents, each with about 1000 words.
• Avg 6 bytes/word including spaces/punctuation
• 6GB of data in the documents.
• Say there are M = 500K distinct terms among these.

Can't build the matrix

• 500K x 1M matrix has...
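The size estimates in the "Bigger collections" slide can be checked with a few lines of arithmetic. The sketch below uses only the figures stated there (N, words per document, bytes per word, M). The variable names are illustrative, 1 GB is taken as 10^9 bytes, and the one-bit-per-cell storage figure is an assumption added here to give a sense of scale, not something the slides state.

```python
# Back-of-the-envelope sizes for the collection described in the slides.
N_DOCS = 1_000_000       # N: number of documents
WORDS_PER_DOC = 1_000    # about 1000 words per document
BYTES_PER_WORD = 6       # average, including spaces/punctuation
M_TERMS = 500_000        # M: number of distinct terms

# Raw text size: documents x words x bytes per word.
collection_bytes = N_DOCS * WORDS_PER_DOC * BYTES_PER_WORD

# A term-document incidence matrix has one cell per (term, document) pair.
matrix_cells = M_TERMS * N_DOCS

# Assumption (not from the slides): each cell is stored as a single bit.
matrix_bytes_at_one_bit = matrix_cells // 8

print(f"collection size:      {collection_bytes / 1e9:.1f} GB")
print(f"matrix cells:         {matrix_cells:.1e}")
print(f"matrix at 1 bit/cell: {matrix_bytes_at_one_bit / 1e9:.1f} GB")
```

Under these assumptions the dense matrix needs far more space than the documents themselves, which is consistent with the slide title "Can't build the matrix".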