{[ promptMessage ]}

Bookmark it

{[ promptMessage ]}

text analysis part 1 wordstlk - Advanced Quantitative...

Info iconThis preview shows pages 1–15. Sign up to view the full content.

View Full Document Right Arrow Icon
Advanced Quantitative Research Methodology, Lecture Notes: Text Analysis I: How to Read 100 Million Blogs (& Classify Deaths Without Physicians) 1 Gary King http://GKing.Harvard.Edu April 26, 2009 1 c Copyright 2009 Gary King, All Rights Reserved. Gary King http://GKing.Harvard.Edu () Advanced Quantitative Research Methodology, Lecture Notes: April 26, 2009 1 / 31
Background image of page 1

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full Document Right Arrow Icon
References Daniel Hopkins and Gary King. “ Extracting Systematic Social Science Meaning from Text Gary King (Harvard, IQSS) Text Analysis 2 / 31
Background image of page 2
References Daniel Hopkins and Gary King. “ Extracting Systematic Social Science Meaning from Text commercialized via: Gary King (Harvard, IQSS) Text Analysis 2 / 31
Background image of page 3

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full Document Right Arrow Icon
References Daniel Hopkins and Gary King. “ Extracting Systematic Social Science Meaning from Text commercialized via: Gary King and Ying Lu. “ Verbal Autopsy Methods with Multiple Causes of Death ,” forthcoming, Statistical Science Gary King (Harvard, IQSS) Text Analysis 2 / 31
Background image of page 4
References Daniel Hopkins and Gary King. “ Extracting Systematic Social Science Meaning from Text commercialized via: Gary King and Ying Lu. “ Verbal Autopsy Methods with Multiple Causes of Death ,” forthcoming, Statistical Science In use by (among others): Gary King (Harvard, IQSS) Text Analysis 2 / 31
Background image of page 5

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full Document Right Arrow Icon
References Daniel Hopkins and Gary King. “ Extracting Systematic Social Science Meaning from Text commercialized via: Gary King and Ying Lu. “ Verbal Autopsy Methods with Multiple Causes of Death ,” forthcoming, Statistical Science In use by (among others): Copies at http://gking.harvard.edu Gary King (Harvard, IQSS) Text Analysis 2 / 31
Background image of page 6
Inputs and Target Quantities of Interest Gary King (Harvard, IQSS) Text Analysis 3 / 31
Background image of page 7

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full Document Right Arrow Icon
Inputs and Target Quantities of Interest Input Data: Gary King (Harvard, IQSS) Text Analysis 3 / 31
Background image of page 8
Inputs and Target Quantities of Interest Input Data: Large set of text documents (blogs, web pages, emails, etc.) Gary King (Harvard, IQSS) Text Analysis 3 / 31
Background image of page 9

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full Document Right Arrow Icon
Inputs and Target Quantities of Interest Input Data: Large set of text documents (blogs, web pages, emails, etc.) A set of (mutually exclusive and exhaustive) categories Gary King (Harvard, IQSS) Text Analysis 3 / 31
Background image of page 10
Inputs and Target Quantities of Interest Input Data: Large set of text documents (blogs, web pages, emails, etc.) A set of (mutually exclusive and exhaustive) categories A small set of documents hand-coded into the categories Gary King (Harvard, IQSS) Text Analysis 3 / 31
Background image of page 11

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full Document Right Arrow Icon
Inputs and Target Quantities of Interest Input Data: Large set of text documents (blogs, web pages, emails, etc.) A set of (mutually exclusive and exhaustive) categories A small set of documents hand-coded into the categories Quantities of interest Gary King (Harvard, IQSS) Text Analysis 3 / 31
Background image of page 12
Inputs and Target Quantities of Interest Input Data: Large set of text documents (blogs, web pages, emails, etc.) A set of (mutually exclusive and exhaustive) categories A small set of documents hand-coded into the categories Quantities of interest individual document classifications (spam filters) Gary King (Harvard, IQSS) Text Analysis 3 / 31
Background image of page 13

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full Document Right Arrow Icon
Inputs and Target Quantities of Interest Input Data: Large set of text documents (blogs, web pages, emails, etc.)
Background image of page 14
Image of page 15
This is the end of the preview. Sign up to access the rest of the document.

{[ snackBarMessage ]}