124.11.lec15

124.11.lec15 - Dan Jurafsky Lecture 15: Relaon Extracon...

Info iconThis preview shows pages 1–7. Sign up to view the full content.

View Full Document Right Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon

Info iconThis preview has intentionally blurred sections. Sign up to view the full version.

View Full DocumentRight Arrow Icon
This is the end of the preview. Sign up to access the rest of the document.

Unformatted text preview: Dan Jurafsky Lecture 15: Relaon Extracon (and Informaon Extracon in general) Lots of slides from many people, including Rion Snow, Jim Marn, Chris Manning, and William Cohen, CS 124/LINGUIST 180: From Languages to Information 2 Background: Information Extraction Extract informa5on from text Some5mes called text analy)cs commercially Extract en##es (the people, organiza5ons, loca5ons, 5mes, dates, genes, diseases, medicines, etc. in a text) Extract the rela#ons between en55es Figure out the larger events that are taking place 3 Information Extraction Crea5ng knowledge bases and ontologies Implica5ons for cogni5ve modeling Digital Libaries Google scholar, Citeseer need to extract the 5tle, author and references Bioinforma5cs Patent analysis SpeciFc market segments for stock analysis SEC Flings Intelligence analysis Outline Reminder: Named En5ty Tagging Rela5on Extrac5on Hand-built paUerns Seed (bootstrap) methods Supervised classifca5on Distant supervision What is Information Extraction Filling slots in a database from sub-segments of text. As a task: October 14, 2002, 4:00 a.m. PT For years, Microsoft Corporation CEO Bill Gates railed against the economic philosophy of open-source software with Orwellian fervor, denouncing its communal licensing as a "cancer" that stifled technological innovation. Today, Microsoft claims to "love" the open- source concept, by which software code is made public to encourage improvement and development by outside programmers. Gates himself says Microsoft will gladly disclose its crown jewels--the coveted code behind the Windows operating system--to select customers. "We can be open source. We love the concept of shared source," said Bill Veghte, a Microsoft VP. "That's a super-important shift for us in terms of code access. Richard Stallman, founder of the Free Software Foundation, countered saying NAME TITLE ORGANIZATION Slide from William Cohen What is Information Extraction Filling slots in a database from sub-segments of text. As a task: October 14, 2002, 4:00 a.m. PT For years, Microsoft Corporation CEO Bill Gates railed against the economic philosophy of open-source software with Orwellian fervor, denouncing its communal licensing as a "cancer" that stifled technological innovation. Today, Microsoft claims to "love" the open- source concept, by which software code is made public to encourage improvement and development by outside programmers. Gates himself says Microsoft will gladly disclose its crown jewels--the coveted code behind the Windows operating system--to select customers. "We can be open source. We love the concept of shared source," said Bill Veghte , a Microsoft VP . "That's a super-important shift for us in terms of code access. Richard Stallman , founder of the Free Software Foundation , countered saying NAME TITLE ORGANIZATION Bill Gates CEO Microsoft Bill Veghte VP Microsoft Richard Stallman founder Free Soft.....
View Full Document

Page1 / 97

124.11.lec15 - Dan Jurafsky Lecture 15: Relaon Extracon...

This preview shows document pages 1 - 7. Sign up to view the full document.

View Full Document Right Arrow Icon
Ask a homework question - tutors are online