Advanced Quantitative Research Methodology, Lecture Notes:
Text Analysis II: Unsupervised Learning via Cluster Analysis

Gary King (Harvard, IQSS)
http://GKing.Harvard.Edu
April 27, 2009

© Copyright 2009 Gary King, All Rights Reserved.

Reading

Justin Grimmer and Gary King. 2009. “Quantitative Discovery of Qualitative Information: A General Purpose Document Clustering Methodology.” http://gking.harvard.edu/files/abs/discov-abs.shtml

The Problem: Discovery from Unstructured Text

Examples: scholarly literature, news stories, medical information, blog posts, comments, product reviews, emails, social media updates, audio-to-text summaries, speeches, press releases, legal decisions, etc.

10 minutes of worldwide email = 1 LOC equivalent

An essential part of discovery is classification: “one of the most central and generic of all our conceptual exercises. . . . the foundation not only for conceptualization, language, and speech, but also for mathematics, statistics, and data analysis. . . . Without classification, there could be no advanced conceptualization, reasoning, language, data analysis or, for that matter, social science research.” (Bailey, 1994).