22 Pages

570-Day-9-slides

Course: WLEWIS 570, Fall 2009
School: Washington
Rating:
 
 
 
 
 

Word Count: 1057

Document Preview

Parsing Chunk Chunk Parsing Also called chunking, light parsing, or partial parsing. Method: Assign some additional structure to input over tagging Used when full parsing not feasible or not desirable. Because of the expense of full-parsing, often treated as a stop-gap solution. Chunk Parsing No rich hierarchy, as in parsing. Usually one layer above tagging. The process: 1. Tokenize 2. Tag 3. Chunk...

Register Now

Unformatted Document Excerpt

Coursehero >> Washington >> Washington >> WLEWIS 570

Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.

Course Hero has millions of student submitted documents similar to the one below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
Parsing Chunk Chunk Parsing Also called chunking, light parsing, or partial parsing. Method: Assign some additional structure to input over tagging Used when full parsing not feasible or not desirable. Because of the expense of full-parsing, often treated as a stop-gap solution. Chunk Parsing No rich hierarchy, as in parsing. Usually one layer above tagging. The process: 1. Tokenize 2. Tag 3. Chunk Chunk Parsing Like tokenizing and tagging in a few respects: 1. Can skip over material in the input 1. Often finite-state (or finite-state like) methods are used (applied over tags) 2. Often application specific (i.e., the chunks tagged have uses for particular applications) Chunk Parsing Chief Motivations: to find data or to ignore data Example from Bird and Loper: find the argument structures for the verb give. Can "discover" significant grammatical structures before developing a grammar: gave NP gave up NP in NP gave NP up gave NP help Chunk Parsing Like parsing, except: It is not exhaustive, and doesn't pretend to be. Structures and data can be skipped when not convenient or not desired Nested structures typical in parsing Structures of fixed depth produced [S[NP The cow [PP in [NP the barn]]] ate Not in chunking [NP The cow] in [NP the barn] ate Chunk Parsing Finds contiguous, non-overlapping spans of related text, and groups them into chunks. Because contiguity is given, finite state methods can be adapted to chunking Longest Match Abney 1995 discusses longest match heuristic: One automaton for each phrasal category Start automata at position i (where i=0 initially) Winner is the automaton with the longest match Longest Match He took chunks from the PTB: NP D N NP D Adj N VP V Encoded each rule as an automaton Stored longest matching pattern (the winner) If no match for a given word, skipped it (in other words, didn't chunk it) Results: Precision .92, Recall .88 An Application Data-Driven Linguistics Ontology Development (NSF BCE-0411348) One focus: locate linguistically annotated (read: tagged) text and extract linguistically relevant terms from text Attempt to discover "meaning" of the terms Intended to build out content of the ontology (GOLD) Focus on Interlinear Glossed Text (IGT) An Application Interlinear Glossed Text (IGT), some examples: (1) Afisi a-na-ph-a nsomba (Baker 1988:254) hyenas SP-PST-kill-ASP fish `The hyenas killed the fish.' An Application More examples: (4) a. yerexa-n p'at'uhan-e bats-ets child-NOM window-ACC openAOR.3SG `The child opened the window.' (Megerdoomian ??) (37) t3ile-3u33i p3-s 1pl-TOP buy-NOM 'What we to buy is meat.' need is meat.' pies meat 3u3. COP (LaPolla 2000) An Application (4) a. yerexa-n p'at'uhan-e bats-ets child-NOM window-ACC open-AOR.3SG `The child opened the window.' (Megerdoomian ??) Problem: How do we `discover' the meaning of the linguistically salient terms, such as NOM, ACC, AOR, 3SG? Perhaps we can discover the meanings by examining the contexts in which the occur. POS can be a context. Problem: POS tags rarely used in IGT How do you assign POS tags to a language you know nothing about? IGT gives us aligned text for free!! An Application (4) a. yerexa-n p'at'uhan-e bats-ets window-ACC child-NOM open-AOR.3SG `The child opened the window.' (Megerdoomian ??) DT NN VBP DT NN IGT gives us aligned text for free!! POS tag the English translation Align with the glosses and language data That helps. We now know that NOM and ACC attach to nouns, not verbs (nominal inflections) And AOR and 3SG attach to verbs (verbal inflections) An Application (4) a. yerexa-n p'at'uhan-e bats-ets child-NOM window-ACC open-AOR.3SG `The child opened the window.' (Megerdoomian ??) DT NN VBP DT NN In the LaPolla example, we know that NOM does not attach to nouns, but to verbs. Must be some other kind of NOM. (37) t3ile-3u33i p3-s 1pl-TOP buy-NOM 'What we to buy is meat.' need is meat.' pies meat 3u3. COP (LaPolla 2000) An Application (4) a. yerexa-n p'at'uhan-e bats-ets child-NOM window-ACC open-AOR.3SG `The child opened the window.' (Megerdoomian ??) DT NN VBP DT NN How we tagged: Globally applied most frequent tags (stupid tagger) Repaired tags where context dictated a change (e.g., TO preceding race = VB) Technique similar to Brill 1995 An Application (4) a. yerexa-n p'at'uhan-e bats-ets child-NOM window-ACC open-AOR.3SG `The child opened the window.' (Megerdoomian ??) DT NN VBP DT NN But can we get more information about NOM, ACC, etc.? Can chunking tell us something more about these terms? Yes! An Application (4) a. yerexa-n p'at'uhan-e bats-ets child-NOM window-ACC open-AOR.3SG `The child opened the window.' (Megerdoomian ??) DT NN VBP DT NN Chunk phrases, mainly NPs Since relationship (in simple sentences) between NPs and verbs tells us something about the verbs' arguments (Bird and Loper 2005)... We can tap this information to discover more about the linguistic tags An Application (4) a. yerexa-n p'at'uhan-e bats-ets child-NOM window-ACC open-AOR.3SG `The child opened the window.' (Megerdoomian ??) DT NN VBP DT NN NP VP NP Apply Abney 1995's longest match heuristic to get as many chunks as possible (especially NP) Leverage English canonical SVO (NVN) order to identify simple argument structures Use these to discover more information about the terms Thus... An Application (4) a. yerexa-n p'at'uhan-e bats-ets child-NOM window-ACC open-AOR.3SG `The child opened the window.' (Megerdoomian ??) DT NN VBP DT NN NP VP NP We know that NOM attaches to subject NPs may be a case marker indicating subject ACC attaches to object NPs may be a case marker indicating object An Application What we do next: look at co-occurrence relations (clustering) of Terms with terms Host categories with terms To determine more information about the terms Done by building feature vectors of the various linguistic grammatical terms ("grams") representing their contexts And measuring relative distances between these vectors (in particular, for terms we know) Linguistic "Gram" Space PASS CAUS PRES FUT PAST F 1SG M 3SG DAT GEN NOM ACC PL INSTR POSS LOC ABS ERG 18 19 13 14 12 11 17 10 16 3 4 1 2 15 8 9 5 7 6 P P !H~ + P !H~ + P H ! + P H ! + P P "H Q* P V "H Q* P P !H~ + # Q * P !H~ + # Q * + H ! AGREEMENT STD CASES H+ D P !H~ + # Q * P P ! + H P OBL CASES NOMINAL # Q * V P ! + H # Q * TNS/ASPECT " H Q* PH"Q* +H D VERBAL @ P P P P P P P P
Find millions of documents on Course Hero - Study Guides, Lecture Notes, Reference Materials, Practice Exams and more. Course Hero has millions of course specific materials providing students with the best way to expand their education.

Below is a small sample set of documents:

Washington - WLEWIS - 570
Project 2 Reducing dimensionalityClusteringSpecific Issues related to Project 2 Lowering the number of dimensions makes the problem more manageable Less memory Less time Less noise Doesn't have to be particularly sophisticated Get rid of
Washington - WLEWIS - 580
Understanding Metonymies in Discourse (2002)Katja Markert Language Technology Group Division of Informatics University of Edinburgh Edinburgh, U.K. Udo Hahn Computational Linguistics Lab Freiburg University Freiburg, GermanyPresented by Mike O'Lea
UCLA - CS - 118
C hapte 1 r I ntroductionA note on the use of these ppt slides:We're making these slides freely available to all (faculty, students, readers). They're in PowerPoint form so you can add, modify, and delete slides (including this one) and slide cont
Webster FL - EDTC - 5560
The following recipe for chocolate chip cookies recently appeared in Chemical & Engineering News (C&EN, Jun 19, 1995, p. 100). It was attributed to Jeannene Ackerman of Witco Corp. Ingredients: 1- 532.35 cm3 gluten 2- 4.9 cm3 NaHCO3 3- 4.9 cm3 refine
Webster FL - EDTC - 5560
WS 5.6 Types of Reactions Complete the reactions by writing the products. Remember : when you form an element, don't forget about the 7 diatomic gases (N2, O2, F2, H2, Cl2, Br2, I2) & to balance ionic formulas (drop & swap)! Composition Reactions 1.
Webster FL - EDTC - 5560
Penny Lab(2 days)Name _ partner _The familiar Lincoln penny has been made of 95% copper from 1909 - 1982. However, the price of copper had risen so much that the U.S. Treasury needed to change (no pun intended) (okay maybe it was intended) the
Webster FL - EDTC - 5560
Final ExaminationInstructions: Read each question carefully. Answer all the questions. Time limit 4 hours. Begin immediately. HISTORY: Describe the history of the papacy from its origins to the present day; concentrating specifically (but not exclus
Webster FL - EDTC - 5560
WS 1.8 Standards of MeasurementsAfter watching the short video clip "Timelab 2000: history of the English standards of measurement", imagine that you have just colonized a new land, and you have been put in charge of devising a new standard of measu
Webster FL - EDTC - 5560
Density LabName _ Lab Partner _In this lab, you will be determining the density of an unknown metal object (PART A), and the density of an unknown liquid (PART B). You already know that an object's density is a ratio of its mass and volume (D=m/V
Webster FL - EDTC - 5560
SCIENCE FAIR PROJECT PROPOSAL FORM Name/Group Members: _ _(no more than 4 to a group)_ _ Date: _ Proposal Topic: _ Resources List the web sites, encyclopedia pages, and/or book titles. 1. _ 2. _ 3. _ Additional Info._ __ Why I /We think this woul
Webster FL - EDTC - 5560
Words to live by. I can please only one person per day. Today is not your day. Tomorrow isn't looking good either. I don't have an attitude problem. You have a perception problem. Last night I laid in bed looking up at the stars in the sky and I t
Webster FL - EDTC - 5560
WS 9.1 Electrolytes 1. Describe the differences between a strong electrolyte, a weak electrolyte, and a nonelectrolyte. Be sure to give an example of each of these electrolytes in the process.2. According to the demo in class, discuss the process u
Webster FL - EDTC - 5560
WS 9.5 pH of Solutions 1. What is the pH scale? 2. What exactly do the initials "pH" stand for? 3. Arrange the following groups in order from the most acidic to the most basic. The pH of each item is also listed. a) bleach-12.0, lemons-2.3, milk of m
Webster FL - EDTC - 5560
pH lab Name: _ DAY 1 - Indicator Reference Set Part 1 - Making a serial dilution You will make your own indicator reference set which will be used to determine the pH of several acids & bases located on top of the lab shelf. - pay attention to the de
Webster FL - EDTC - 5560
mega- (M) 106 kilo(k) 10 3 -base unit deci- (d) 10 -1 centi- (c) 10 -2 milli- (m) 10-3 micro- () 10-6
Webster FL - EDTC - 5560
1s 2s 2p 3s 3p 3d 4s 4p 4d 4f 5s 5p 5d 5f 6s 6p 7sUse this chart to figure out the order of the orbital energy levels
Webster FL - EDTC - 5560
solublerule 1: rule 2: rule 3: rule 4: Li+, Na+, K+, NH4 +, NO3 Cl-, Br-, ISO 42Li+, Na+, K+, NH4+ except except exceptinsolubleAg +, Hg22+ , Pb2+ Ba2+ , Ca2+ , Pb2+ OH-, CO32-, PO43-, S2-, O2-
Webster FL - EDTC - 5560
prefixes halogens: (-X) -F fluoro-Cl chloro-Br bromo-I iodoalkyls: (name alkane, drop "e", add "-yl" ending) N-containing: -NH2 amino-NO2 nitro-suffixes C-C -ane C=C -ene C=C -yne -OH -olXaromaticX X X para Xreactions halogenation (alkane) C-
Webster FL - EDTC - 5560
WS 7.2 Kinetic Theory - Temperature & Volume 1. What is kinetic energy? What is the equation for calculating it? 2. What is temperature, as defined in class? 3. If the kinetic energy of a gas is increased, which variable in the equation for K.E. is a
Webster FL - EDTC - 5560
WS 7.6 Graham's Law 1. What exactly is temperature a measurement of? _ 2. Why is it important to include the word "average" in your answer? _ 3. What two factors does an object's kinetic energy depend on? __ and _ 4. What specifically is the equation
Webster FL - EDTC - 5560
WS 7.9 Review Sheet pg 1 1. To what temperature (C) would 12.3 g of He have to be cooled to fit in a 34.0 L tank at 1.17 atm?Ans: _ 2. What would be the density of CH4 at STP? . at 132C and 725 mmHg?Ans: _ Ans: _ 3. A gas sample occupies a volume
Webster FL - EDTC - 5560
WS 7.10 Gas Laws Demo Worksheet Throughout this packet, you will be seeing various gas-related demos. Keep track of each by taking notes and making diagrams on this worksheet. Methane Mamba The Vapor RampHot Air BalloonPouring CO2Screaming Meta
Webster FL - EDTC - 5560
Wet Dry Ice LabName: _ Partner: _When a chunk of ice is taken from a freezer and placed in a warm room, it gains heat and its temperature increased to 0C, then it melts (phase change). If it is heated further, at 100C, the water will boil (anothe
Webster FL - EDTC - 5560
* Home Lab *A Watched Pot Never Boils (?)Name: _Do the following experiment with one (or both) of your parents (or guardians). For full credit, have him/her/them sign it below to confirm that they worked with you from beginning to end.Signatu
Webster FL - EDTC - 5560
WS 6.3Logarithms & Exponential Equations In this equation, n is the logarithm of y.y = xnFor example, 103 = 1000. Therefore, log 1000 = 3. Power Property of Logarithms: log xn = n log x Example: Solve for n: log 5n = 62 n log 5 = 62 n = 62 log
Webster FL - EDTC - 5560
WS 6.5 Carbon-14 Dating In the spaces below, write and illustrate as though it were a comic strip, a description of the entire process of C-14 dating in the 6 squares provided, showing: 1) how and where C-14 is produced, 2) how it decays and establis
Webster FL - EDTC - 5560
WS 6.6 - Applications of Nuclear Chemistry preparation of artificial elements radioactive dating radioactive tracers (labels) cancer treatment medical diagnosis food preservation Biological Effects acute (short term) damage chronic (long term
Webster FL - EDTC - 5560
Name _ Period _ Due Date Tues 4/30 Tues 4/30 Thu 5/2 Thu 5/2 Tues 5/7 Tues 5/7 Wed 5/8 Thu 5/9 Tue 5/14 Tue 5/14 Thu 5/16 Fri 5/17 Mon 5/20 Mon 5/20 Tues 5/21 Wed 5/22 Thu 5/23 Fri 5/24 Assignment _ WS 10.1 _ WS 10.2 (will do in class) _ WS 10.3 (#1-
Webster FL - EDTC - 5560
Fun Things To Do at Wal-Mart 1. Set all the alarm clocks to go off at ten minute intervals throughout the day. 2. Challenge other customers to duels with tubes of gift wrap. 3. Leave cryptic messages on the typewriters. 4. Re-dress the mannequins as
Webster FL - EDTC - 5560
WS 10.3 ISOMERS The building blocks: Define "isomer":CNOHFClBrIHHHHexample(w/o the H's):C4H10HC C C C HH H H HandC4H10C C C CC C C CCaution!: Molecules are constantantly being bent, twisted and flipped as t
Webster FL - EDTC - 5560
WS 10.4 ORGANIC NAMINGName 1) butaneC C C CName (yours): _ Line formula Condensed structural formula CH3 CH2 CH2 CH3 Mol. form C4 H10Complete structural formula2) heptane 3) 2-fluorooctane 4) 3-iododecane 5) 2,2,3- trichlorohexane 6) 1,1,1 -
Webster FL - EDTC - 5560
WS 10.7 Organic Nomenclature, part 2Name 1) 3-fluoro,1-butene 2) 2-heptene 3) 2,3-difluoro1-pentene 4) 6-iodo,4-methyl2-decyne 5)1,2,5,5- tetrachloro3-hexyne 6) 1,1,1 - tribromo 2-butyne 7) 8) 9)C= C C C = F Cl C C C= C C Br Br C= C C C BrComplet
Webster FL - EDTC - 5560
WS 10.11 AROMATIC COMPOUNDS 1. What do the terms ortho, meta, and para mean? 2. Write the IUPAC name for each of the following: (remember- if there are 2 functional groups, use o, m, p)CH3 CH2CH3a) ( common name = toluene )Bre)Brb)Br CH3
Webster FL - EDTC - 5560
A Book Report byA witch is going to be a teacherThe kids go to the gym and find out Miss Brewbacker is a witch.The Main character Miss Brewbaker witch gymnast wears all blackThese are the characters Eddie Melody Lisa HowieMy Favorit
Webster FL - EDTC - 5560
WS 2.3 - Science Math PracticeName _Use a scientific calculator to answer the following. Watch sig figs and units! 1) 3.67 m x 24.31 m = _ 3) (4.5173 dm) 3 = _ 5) 456.4 g (3.27 cm)2(3.22 cm) 7) 3.42 x 1023 x 5.63 = _ = 2) 7900 g = _ 451.2 mL 4) 3
Webster FL - EDTC - 5560
WS 2.4 Specific Heat Show all work neatly.q = m c T1. How much heat is required to raise the temp of 654 g of water from 34.5C to 89.7C? Ans _Specific Heats substance c (J/gC) water ethanol graphite diamond iron copper silver gold ice 4.184 2.4
Webster FL - EDTC - 5560
WS 2.7 Demo LogDuring this unit, you will be watching some demos. Record observations and conclusions here. All demos are fair-game for the quiz! Super-Cold CondensationWhat liquid is condensing off the can? _ How cold is this liquid? _ How cold i
Webster FL - EDTC - 5560
WS 8.1 Solutions 1. Identify the solute and solvent in the following solutions: a) 10.0 g of sugar & 40.0 g of water b) 75 g of KBr & 100 g water c) 50 g of water & 5.0 g of NaCl d) 18.0 L of nitrogen & 12.0 L of oxygen e) 200 g of alcohol & 200 g of
Webster FL - EDTC - 5560
WS 8.3 Solubility Curves Based on the solubility below, decide whether each of the following is A: unsaturated, B: saturated, C: supersaturated, or whether D: not enough information is given. * assume it's dissolved * 1) 50 g KCl in 100 g of water at
Webster FL - EDTC - 5560
WS 8.4 Percentage Problems SHOW WORK! 1. A class is comprised of 13 boys and 19 girls. What is the % boys? % girls? Ans: _ _ 2. A solution is comprised of 14.65 g NaNO3 and 56.23 g water. What is the % NaNO3 ? % water? Ans: _ _ 3. 17.89 mg of iron, 3
Webster FL - EDTC - 5560
HOME LAB: RADIAL CHROMATOGRAPHY Name: _ Do the following activity with one of your parents or guardians. Have him/her sign it below to confirm that they worked on it with you from beginning to end. Signature of Parent/Guardian: __ Date: _ Objective:
Webster FL - EDTC - 5560
WS 4.6Polar Bonds & Electronegativity1. An electronegativity difference between 0 to 0.4 is considered a _ bond. 2. An electronegativity difference between 0.5 to 1.9 is considered a _ bond. 3. An electronegativity difference between 2.0 and 4 is
Webster FL - EDTC - 5560
WS 4.8 moleculeMolecular Geometry Lewis structure e- regions on central atom egeometry molecular geometry P or NP polarityCH2 SH2 O 2NF3CCl4.O3..O-O=O . .SO2H3 O +HF
Webster FL - EDTC - 5560
Mad PersonalsMAD'S PERSONAL ADS: Occult-Vegetarian-Ambidextrous, Truck-driving Republican juggler wishes to meet woman of similar interests. Zack, BOX 90MAD'S PERSONAL ADS: Undercover Espionage Agent - 29, would like to meet nice woman 18-30 for
Webster FL - EDTC - 5560
WS 3.1 - EARLY THOUGHTS For each individual listed below, describe their theories / discoveries on the structure of matter. Where appropriate, sketch a diagram of their atomic models. Empedocles - (440 B.C.) Democritus - (400 B.C.) Aristotle - (340 B
Mich Tech - CS - 3421
I. Last Time: A. Hw#2 posted Introduction to JLS / using it to simulate stuff B. Lab coming soon.(Today or tomorrow) (Building bigger / more interesting devices in JLS) C. The SR-Latch: Cross Coupled "NOR" gates
Mich Tech - CS - 3421
I. Last Time: A. Tests: Should be returned on Thurs. (Maybe earlier) B. Hw #3 posted - minor modification this morning. (1st question is new - turn in graph of signals) C. Lab #1 Report Due Thurs Lab demos will be NE
Mich Tech - CS - 3421
I. Last Time: A. Lab Demo Procedure: 1. Bring your lab! CD, Flash, or a laptop 2. Wait in the CS lounge area (fish bowl) When door opens, you're up. 3. If you haven't signed up, do so. 4. If you ha
University of Texas - EBC - 372
July 2, 2008 Dear Friends, It has been said that a journey of a thousand miles begins with a single step. My journey will begin with the push of a pedal. In the summer of 2009, two teams of twenty-five will be biking from Austin, Texas to Anchorage,
University of Texas - EBC - 372
July 2, 2008 Dear Friends, It has been said that a journey of a thousand miles begins with a single step. My journey will begin with the push of a pedal. In the summer of 2009, two teams of twenty-five will be biking from Austin, Texas to Anchorage,
Mich Tech - CS - 3421
I. Last Time: A. Lab #3 Posted (relatively easy / straight forward) Assembly labs: Will require a "High-Level Language" version and an assembly language version. Do the HLL version first! (Due next Wed!) Why H
Mich Tech - CS - 3421
<?xml version="1.0" encoding="UTF-8"?><Error><Code>NoSuchKey</Code><Message>The specified key does not exist.</Message><Key>cb91c282d379155849d0c436d3bd5e29b35bb0c9.txt</Key><RequestId>3B224107C9C905ED</RequestId><HostId>AeCtAAoZR0XewHBDid/mpvKv0jmE
Butler - MA - 125
MA 125 01 - Linear Math and Calculus Butler University Summer 2009INSTRUCTOR: OFFICE: OFFICE HRS: OFFICE PHONE:Duane Leatherman JH 253E Usually after class, unless announced otherwise By appointment 940-9439 HOME PHONE: 940-6383 (Resco)E-mail:
Butler - MA - 125
MA 125 - BUSINESS CALCULUS BUTLER UNIVERSITY MATERIAL COVEREDTEXTBOOK: Mathematics with Applications by Hungerford and Lial, 9th Edition Chapter 1 - Algebra and Equations 1.1 1.5 p. 2 p. 32 The Real Numbers Exponents and RadicalsChapter 2 - Graphs
Butler - MA - 125
Butler - MA - 125
Math 125 Worksheet 1 Show your work on all parts. 1. If x = -5, y = 20, and z = 0.5, find y2z2 x3 z . xName:_2. Simplify the following.16x - 4{3x - 5[12 + 7(2x -6)] - 15} =3. Change the following to fractional and/or radical form and then si
Butler - MA - 125
Butler - MA - 125
Butler - MA - 125
Math 125 Worksheet 5 Showing your work, solve the following problems from the textbook. 1. Find both the maturity value and simple interest for the following. (a) problem #4, page 254Name:_(b) problem #12, page 2552.(a) problem #16, page 254
Butler - MA - 125
Butler - MA - 125