7 Pages

C00-2139

Course: C 00, Fall 2009
School: UPenn
Rating:
 
 
 
 
 

Word Count: 5037

Document Preview

Alignment-Based ABL: Learning Menno van Zaanen School of C o m p u t e r S t u d i e s University of Leeds LS2 9 J T L(~eds UK menno@scs, l e e d s , a c . uk Abstract This ])al)er introdu(:es a new tyl)e of grammar learning algorithm, iilst)ired l)y sl;ring edit distance (Wagner and Fis(:her, 1974). The algorithm takes a (:ortms of tlat S(~lltell(:es as input and returns a (:ortms of lat)elled, l)ra(:ket(~(1...

Register Now

Unformatted Document Excerpt

Coursehero >> Pennsylvania >> UPenn >> C 00

Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.

Course Hero has millions of student submitted documents similar to the one below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
Alignment-Based ABL: Learning Menno van Zaanen School of C o m p u t e r S t u d i e s University of Leeds LS2 9 J T L(~eds UK menno@scs, l e e d s , a c . uk Abstract This ])al)er introdu(:es a new tyl)e of grammar learning algorithm, iilst)ired l)y sl;ring edit distance (Wagner and Fis(:her, 1974). The algorithm takes a (:ortms of tlat S(~lltell(:es as input and returns a (:ortms of lat)elled, l)ra(:ket(~(1 sen-~ ten(:(~s. The method works on 1)airs of unstru(:tllr(?(l SelltellC(~,s that have one or more words in (:onunon. W]lc, ll two senten('es are (tivided i n t o parts that are the same in 1)oth s(mten(:es and parl;s tha|; are (litl'erent, this intbrmation is used to lind l)arl;s that are interchangeal)le. These t)arts are taken as t)ossil)le (:onstituents of the same tyl)e. After this alignment learning stel) , the selection learning stc l) sel(~('ts the most l)rot)at)le constituents from all 1)ossit)le (:onstituents. This m(;thod was used to t)ootstrat) structure (m the ATIS (:ortms (Mar(:us et al., 1f)93) and on the OVIS ~ (:ort)us (Bommma et ~d., 1997). While the results are en(:om:aging (we ol)|;ained Ul) to 89.25 % non-crossing l)ra(:kets precision), this 1)at)er will 1)oint out some of the shortcomings of our at)l)roa(:h and will suggest 1)ossible solul;ions. 1 Introduction sl;ru('ture to sentences whi(:h are similar to the ,~;tru(:ture peot)le would give to sentences, lint not ne(:essarily in the same |lille or Sl);~(;e l'estrictions. The algorithm (:onsists of two t)hases. The tirst t)hase is a constituent generator, whi(:]l generates a m()tiw~ted set of possible constituents 1)y aligning sentenc(:s. The se(:ond i)hase restri(:ts tllis set l)y selecting the best constituents from the set. The rest of this t)aper is organized as ibllows. Firstly, we will start t)y describing l)revious work in machine learning of language stru(:ture and then we will give a descrit)tion of the ABL algorithm. Next, some results of al)t)lying the ABL algorithm to different corpora will 1)e given, followed 1)y a discussion of the algorithm alia flltllre resear(;h. 2 Previous Work Unsupervised learning of syntactic structure is one of the hardest 1)rol)lems in NLP. Although people are adept at learning grammatical structure, it is ditficult to model this 1)recess and therefore it is hard to make a eomtmter learn strllCtllre. We do not claim that the algorithm described here models the hmnan l)rocess of language learning. Instead, the algorithm should, given unstructured sentences, find the best structure. This means that the algorithm should assign 1Opcnbam" Vcrvoer hfformatie Systeeln (OVIS) stands for Pul)lic Transt)ort hfformation System. I,e;wning metl,o(ls can t)e grouped into suitorvised and unsut)ervised nmthods. Sul)ervised methods are initial|seal with structured input (i.e. stru(:ture(] sent(m(:es for grannnar learning methods), while mlsut)ervised methods learn l)y using mlstru(:tured data only. In 1)ractice, SUl)ervised methods outpertbrm mlsut)ervised methods, since they can adapt their o u t p u t based on the structured exami)les in the initial|sat|on t)hase whereas unSUl)ervised lnethods emmet. However, it is worthwhile to investigate mlsupcrvised gramlnar learning methods, since "the costs of annotation are prohibitively time and ext)ertise intensive, and the resulting corpora may 1)e too suscet)tible to restri(:tion to a particular domain, apt)lication, or genre". (Kehler and Stolcke, 1.999) There have 1)een several approaches to the unsupervised learning of syntactic structures. We will give a short overview here. 961 Memory based learifing (MBL) keeps track of possible contexts and assigns word types based on that information (Daelemans, 1995). Redington et al. (1998) present a method that bootstraps syntactic categories using distributional information and Magerman and Marcus (1990) describe a m e t h o d that finds constituent boundaries using m u t u a l information values of the part of speech n-grams within a sentence. Algorithms that use the minimmn description length (MDL) principle build grammars that describe the input sentences using the minimal nunfl)er of bits. This idea stems from intbrnmtion theory. Examples of these systems can be found in (Grfinwald, 1994) and (de Marcken, Wh,at Wh,at Wh, at Wh, at is is is is a family fare th,e payload of an African Swallow & family fare)x (the payload of an African Swallow)x Figure 1: Example bootstrapping structure For each sentence sl in the corpus: For every other sentence s2 in the corpus: Align s~ to s2 Find the identical and distinct parts between s~ and s2 Assign non-terminals to the constituents (i.e. distinct parts of s~ and s2) Figure 2: Alignment learning algorithm fences as shown in figure 1. 2 The constituents a .family fare and the payload of an African Swallow both have the same syntactic type (they are both NPs), so they can be replaced by each other. This means that when the constituent in the first sentence is replaced by the constituent in the second sentence, the result is a wflid sentence in the language; it is the second sentence. The main goal of the algorithm is to establish that a family .fare and the payload of art, African Swallow are constituents and have the same type. This is done by reversing Harris's idea: 'i1" (a group o.f) words car-,, be; replaced by each other, they are constituents and h.ave th,e same type. So the algorithm now has to find groups of words that can be replaced by each other and after replacement still generate valid sentences. 1996). The system by Wolff (1982) pertbrms a heuristic search while creating and Inerging symbols directed by an evaluation function. Chen (1.995) presents a Bayesian g r a m m a r induction method, which is tbllowed by a postpass using the inside-outside algorithm (Baker, 1979; Lari and Young, 1990). Most work described here cmmot learn complex structures such as recursion, while other systems only use limited context to find constituents. However, the two phases in ABL are closely related to some previous work. Tim alignment learning phase is etlb.ctively a compression technique comparat)le to MDL or Bayesian g r a m m a r induction methods. ABL remembers all possible constituents, building a search space. The selection h;arning phase searches this space, directed by a probabilistic evaluation function. The algorithm consists of two steps: 3 Algorithm 1. Alignment Leanfing 2. Selection Learning 3.1 Alignment We will describe an algorithm that learns structure using a corpus of plain (mlstructured) sentences. It does not need a structured training set to initialize, all structural information is gathered from the unstructured sentences. The output of the algorithm is a labelled, bracketed version of the inlmt corpus. Although the algorithm does not generate a (context-fl'ee) grammar, it is trivial to deduce one from the structured corpus. The algorithm builds on Harris's idea (1951) that states that constituents of the same type can be replaced by each oth,er. Consider the sen- Learning The model learns by comparing all sentences in the intmt corpus to each other in pairs. An overview of the algorithm can be tbund in figure 2. Aligning sentences results in "linking" identical words in the sentences. Adjacent linked words are then grouped. This process reveals 2All sentences in the examlfles can be fbund in the ATIS corlms. 962 .f,'o,,,. Sa,,. F,'a,.ci.,'co (to Dallas). ./'rout (Dallas to)| San Francisco 02 (Sa,, l.o) Dallas 02 O, DaUas #o Sa,,. J';'a,.cisco)2 [;1"0 ~II, .fF()Ii't (San Francisco), to (Dallas)2 (Dalla.gj to (Sa,,. F i g u r e 3: A m b i g u o u s a l i g n m e n t s 1;t1(; groul)S of identical words, 1)ut it also llIlC()vers the groups of distinct wor(ls in t h e sentences. In figure 1 What is is the identical p a r t of the sentences and a fam, ily J'a~v, and the payload of an A./ricau, Swallow are the d i s t i n c t l)arts. T h e distinct p a r t s are interchangeable, so t h e y are (tetermilmd to 1)e c o n s t i t u e n t s o17 the s a m e I;yl)e. We will now Cxl)lain the stel)s in the alignm e n | learning i)hase in more de, tail. also be tbmM in the o t h e r sentence. (In figure 1, What is is a g r o u p like this.) T h e rest of the sentences can also b e g r o u p e d . T h e words in these grout)s arm words t h a t are distinct in the two sentences. W h e n all of these groups fl:om sentence, one would 1)e relflaced by the respective groups of sentence two, sentence two is g e n e r a t e d . (a family fare a n d th,c payload of an African Swallow art: of this t y p e of g r o u p in figure 1.) E a c h pair of these distinct groups consists of possilfle constil;uents Of the same type. :~ As can be, seen in tigure 3, it is possible t h a t e m p t y g r o u p s can lm learned. a.l.a Existing Constituents 3.1.1 Edit Distance q[b find the identi(:al word grouI)S in |;he sentences, we use the edit; distan(:e a l g o r i t h m by W a g n e r and Fischer (197d:), which finds the m i n i m u m nmnl)er of edit o p e r a t i o n s (insertion, (lelei;ion and sul)stii;ul;ion) l;o change one sente, nce into the other, ld(mti(:al wor(ts in the sent(races can 1)e t'(mnd at ])]a(;es W]l(~,l'e lie edit ope r a t i o n was al)plied. T h e insl;antia,tiol~ of the algoril;hm t h a t fin(is l;}le l o n g e s t COllllllOll Slll)S(}(]ll(}ll(;( ~, ill t w o Sell- tences s o m e t i m e s "links" words t h a t are, too far apart, in figure 3 when: 1)esides the o(:cm'rences of.from,, the ocem:rences of San }4"au,ci.sco or Dallas are linked, this results in u n i n t e n d e d constituents. We woukt r;d;her have the lnodel linking to, r e s u l t i n g in a sl;1"u(;I;llre with the 1101111 phrases groul)ed w i t h the same t y p e corre(:tly. Linking San Francisco or Dallas results i~l c o n s t i t u e n t s t h a t vary widely in size. T h i s stems from the large d i s t a n c e b e t w e e n t h e linked urords in the tirsi; sentence mid in th(; s(:cond sentence. T h i s t y p e of alignlnent can t)e ruled out by biasing t h e cost f i m c t i o n using distances At seine 1)oint it m a y be t)ossible t h a t the m o d e l lem'ns a co11stituent t h a t was a l r e a d y stored. T h i s m a y hal)l)en w h e n a new sentence is comp a r e d to a senlaen(;e in the p a r t i a l l y s t r u c t u r e d corpus. In this case,, no new tyl)e, is intro(hu:ed~ lint the, consti|;ucnl; in l;he new sentence gel;s l;he same t y p e of the c o n s t i t u e n t in the sentence in the p a r t i a l l y s t r u c t m : e d corpus. It m a y even t)e the case t h a t a p a r t i a l l y si;ruct u r e d sentence is c o m p a r e d to a n o t h e r p a r t i a l l y sl;rtlctllre(1 selll;elR,e. T h i s occm:s whel~ a s(:nfence t h a t (;onl;ains some sl;ructure, which was learner1 1)y COlnl)aring to a sentelme in the part;]ally s t r u c t u r e ( l (;Ol;pllS~ is (;Olllt)ar(~,(] 1;o allo t h e r (t)art;ially stru(:ture(t) sente, n(:e. W h e n the ('omparison of these two se,nl;ence, s yields a c o n s t i t u e n t thai: was ah:ea(ly t)resent in b o t h senten(:es, the tyl)es of these constitueld;S are merged. All c o n s t i t u e n t s of these types are ut)d a t e d , so the, y have the same tyl)e. B y merging tyl)es of c o n s t i t u e n t s we make t;he assuml)tion t h a t co]lstil;uents in a (:ertain cont e x t can only have one tyl)e. In section 5.2 we discuss the, imt)li(:atiolls of this assmnpl;ion and p r o p o s e an a l t e r n a t i v e at)t)roach. 3.2 Selection Learning between words. 3.1.2 Grouping An edit d i s t a n c e a l g o r i t h m links identical words in two sentences. W h e n a d j a c e n t wor(ls are linked in l)oth sentences, t h e y can l)e g r o u p e d . A groul) like this is a p a r t of a senten(:e t h a t can The first step in the algorithm may at some point generate COllstituents that overlap with other constituents, hi figure 4 Give me all flights .from Dallas to Boston receives two overlal)ping structures. One constituent is learned 3Since the alger||Inn does not know any (linguist;|c) llalIICS for the types, the alger|finn chooses natural numbers to denote different types. 963 ( Book Delta 128 Give me ( ) f l w n Dallas to Boston help on classes ) ?'Give m(all.fligh, ts)'f,'om Dallas to Boston) l?igure 4: Overlapping constituents by comparing against Book Delta 128 f i r m Dallas to Boston and the other (overlapl)ing) constituent is tbund by aligning with Give me help methods are ~pplied afl;er the aligmnent learning phase, since more specific informatioil (in the form of 1)etter counts) can be found at that time. In section 4 we will ewfluate all three methods on the ATIS and OVIS corpus. 3.2.1 Viterbi on classes. The solution to this problem has to do with selecting the correct constituents (or at least the better constituents) out of the possible constitnents. Selecting constituents can be done in several dittbrent ways. A B L : i n c r Assume that the first constituent learned is the correct one. This means that when a new constituent overlaps with older constituents, it can 1)e ignored (i.e. they are not stored in the cortms). A B L : l e a f The model corot)rites the probability of a constituent counting the nmnber of times the particular words of the constituent have occurred in the learned text as a constituent, normalized by the total number of constituents. Since more than just two constituents can overlap, all possible combinations of overlapping constitueni;s should be considered when comImting the best combination of constituents, which is the product of the probabilities of the separate constituents as in SCFGs (cf. (Booth, 1969)). A Viterbi style algorithm optimization (1967) is used to etficiently select the best combination of constituents. When conll)uting the t)r()t)ability of a combination of constituents, multiplying the separate probabilities the of constituents biases towards a low nnmber of constituents. Theretbre, we comtmte the probability of a set of constituents using a normalized version, the geometric mean 4, rather than its product. (Caraballo and Charniak, 1998) 4 Results Ple,f(c) = ]c' C C : yield(c') = yicld(c)l ICI where C is the entire set: of constituents. A B L : b r a n e h In addition to the words of the sentence delimited by the constituent, the model computes the probability based on the part of the sentence delimited by the words of the constituent and its non-terminal (i.e. a normalised probability of ABL:leaf). Pb,.~na,,(clroot(c ) = r) = e c: y/el(l(,-') -- y i e l d ( c ) A ; "1 Ic" c : ,'oot(c") = The first method is non-probabilistic and may be applied every time a constituent is found that overlaps with a known constituent (i.e. while learning). The two other methods are probabilistic. The model computes the probability of the constituents and then uses that probability to select constituents with the highest probability. These The three different ABL algorithms m~d two 1)aseline systems have been tested on the ATIS and OVIS corpora. The ATIS corlms ti'om the P(;nn Treebank consists of 716 sentences containing 11,777 (:onstituents. The larger OVIS corpus is a Dutch corpus containing sentences on travel intbrmation. It consists of exactly 10,000 sentences. We have removed all sentences containing only one word, resulting in a corpus of 6,797 sentences and 48,562 constituents. The sentences of the corpora are stript)ed of their structures. These plain sentences are used in the learning algorithms and the resulting structure is compared to the structure of the original corpus. All ABL methods are tested ten times. Th(, ABL:incr method is applied to random orders of the input corpus. The probabilistic ABL methods select constituents at random when different combinations of constituents have the same probability. The results in table 1 show the 4The geometric mean of a set of constituents ^... A = VFI =, P( d 964 NCBP LEFT 32.6O I{IGI/T 82.70 ABL:INCll 83.24 (1.17) AI3L:LEAF 81.42 (0.11) ABL:BI/ANCII 8, .31 (0.01) AT1S NCBI{ 76.82 92.91 87.21 (0.67) 86.27 (0.06) 89.31 (0.01) ZCS 1.J2 38.83 18.56 (2.32) 21.63 (0.5O) 29.75 (0.00) NCB] ) 51.23 75.85 88.71 (0.79) 85.32 (0.02) 89.2.5 (0.oo) OVIS NCBR 73.17 86.66 ZCS 25.22 48.08 30.87 (0.09) 42.20 (0.01) 84.36 (1.]0) 79.96 (0.03) 8>o4 (0.0|)) laJ)h, 1: Results of I;he ATIS and OVIS corpora mean ;rod standard deviations (between bra(:kets). The two base, line systcnis, left and right, onty t)uiM left: mid right brnnching trees respectively. Three, metrics hnve been compnl;cd. N C B P stmlds for Non-(]rossing Bra.(:kets Precision, which denotes the percentage, of learned (:onstituents th~,t do not overlai) with any consl;it;uent;s in I;he m'igi'n, al (:orpus. NCIH~ is the Non-Crossing Brackets ll.e(:all mid shows |;he t)(;rt'ent~ge of constituents in the original c o l t)us thai; (1o not overlap with :my constituents in the learned (:oft)us. Finnlly, Z(LS' strums ti)l' Zero-(Jrossing Sentences a,nd r(',l)reseuts the perc(ml;age of sentence, s that (t(1 not have m~y overlnt)l)ing constii;uenl;s. 4.2 ABL Compared to Other Methods 4.1 Evaluation 'l-'tm incr modet 1)erfi)rms (tuii:e well (:onsi(hwing the t'~mt that it; (:;mnot re(:ov(w t'roln incorre(:t (:()nstituents, with a t)re(:ision a,nd re(:~dl of ()V(~l' 8t) %. The order of the senl;en(:es how(we, r is quite iml)orbmt , since |tie sl;ml(tard deviation of the inc'r model is quite high (est)e~(:ialty with the ZCS, reaching 3.22 % on the OV!S (:orpus). We expected the prot)nl)ilistic nmtho(ts to i)erform t)o,l;ter, trot the lc((f modet performs slightly worse. The, ZCS, however, is somewhat better, re,suiting in 21.63 % on the AT1S corpus. Furthermore, d;he standard deviations of the le,:f model (&lid Of the branch, model) are c]ose to 0 %. The st;~tisti(:al methods generate more precise, results. Ttm bra'n,ch, modet d e a r l y outl)erfornl all o~,her models. Using more Sl)e(:itic statistics generate better results. Although the resull;s of the N FIS (:orpus mM OVlS corIms differ, the, conclusions that (:ml })e reached are similm:. It; is difficult to corot)are the results of the ABL model ag~dnst other lnethods, since, often d i f thrent corpora or m(',trics m:e used. The methods describe, d by Pereira and Schabcs (1.9(.)2) comes reasonably close to ours. The unsupervised nmthod le~rns structure on plain sentences from the ATIS corlms resulting in 37.35 % precision, while the "un.supcrvised ABL signili(:mltly outperforms this method, reaching 85.31% l)recision. Only their s'uperviscd version results in n slightly higher pre('ision of 90.36 %. The syste,nl th;d; simt)ly buihts right branchins structures results in 82.70 % precision mid 92.91% teeM1 on the ATIS cortms, where ABL got 8 5 . 3 1 % and 8 9 . 3 1 % . This wa,s expected, sin(:e English is a right |)rmmhing language; a left branching sysl;Clll t)(~rff)l.'ltle(| lllllCh woFsc (32.60 % pre(:ision and 76.82 % rccnll). C(mversely, right branching wouht not do very well on ~ ,l~q)mmse, corpus (~ left 1)r~m(:hing langua.ge). Sin(:e A]31, does not have a 1)ref(~renc(~ fi)r direction built; in, we exi)ect ABL to t)ertbrm similarly on n Ja,t)anese (:orpus. 5 5.1 Discussion Recurs|on and Future Extensions All ABL methods des('ribed here can lem:n recursive structures and have been fomtd when ~t)plying ABI, to the NIl?IS and OVIS (:orlms. As (:ml be sc(m in figure 5, the learned recursive structure, is similm: to the, original structure. Some structure has t)een removed to make it easier to see where the recurs|on occurs. Roughly, recursive structures arc built in two steps. First, the algorithm generates the structure with difl'cro,nt non-terminals. Then, the two nonq;ermimds are merged as described in so,el;ion 3.1.3. The merging of the non-terminals m~y occur anywhere in the cortms , sin(:e all merged non-terndnals are ut)dated. 965 learned original learned original Please ezplain the (field FLT D A Y in the (table)is)is Please explain (the .field FLT D A Y in (the table)NP)Np Explain classes QW and (QX and (Y)a2)~'e Explain classes ((QW)Np and (QX)NI, and (Y)NP)NP Fignre 5: Recursive structures learned in the ATIS corpus Show me the ( morning )x flights Show me the ( nonstop )x fli.qhts Figure 6: Wrong syntactic type 5.2 Wrong Syntactic Type In section 3.1.3 we made the assumt)tion t h a t a constituent in a certain context can only have one type. This assumption introduces some problems. The sentence John likes visiting relatives illustrates such a problem. The constituent visiting relatives can be a noun phrase or n verb phrase. Another prol)lem is ilhlstrated in figure 6. W h e n applying the ABL learning algorithm to these sentences, it will determine that morning and nonstop are of the same type. Untbrtunately, morning is a noun, while nonstop is an adverb) A fixture extension will not only look at the type of the constituents, lint also at the context; of the constituents. Ii5 the example, the constituent morning nlay also take the t)lace of a subject position in other sentences~ 1)ut the constituent nonstop never will. This intbrnmtion can be used to determine when to merge constituent types, efl'ectively loosening the assunlption. for example (Redington et al., 1998).) The idea 1)ehind equivalence classes is that words which are closely related are grouped together. A big advantage of equivalence classes is t h a t they can be learned in an unsupervised way, so the resulting algorithm remains nnsui)ervised. Words t h a t are in the same equivalence class are. said to be sufficiently equivalent, so the aligmnent algoritlnn may assunm they are sin> ilar and may thus link them. Now sentences that do not have words in common, but do have words in the same equivalence class in common, can be used to learn structure. When using equivalence classes, more constituents are learned and more terminals in constitnents may l)e seen as similar (according to the equivalence classes). This results in a much richer structm'ed corlms. 5.4 Alternative Statistics 5.3 Weakening Exact Match W h e n the ABL algorithms try to learn with two conlpletely distinct sentences, nothing can be learned. If we weaken the exact match between words in the alignme...

Find millions of documents on Course Hero - Study Guides, Lecture Notes, Reference Materials, Practice Exams and more. Course Hero has millions of course specific materials providing students with the best way to expand their education.

Below is a small sample set of documents:

UPenn - P - 00
An Information-Theory-Based Feature Type Analysis for the Modelling of Statistical ParsingSUI Zhifang , ZHAO Jun , Dekai WU Hong Kong University of Science & Technology Department of Computer Science Human Language Technology Center Clear Water B
UPenn - P - 03
A spoken dialogue interface for TV operations based on data collected by using WOZ methodJun Yeun-Bae Goto Kim NHK STRL NHK STRL Human Science Human Science Tokyo 157-8510 Tokyo 157-8510 Japan Japangoto.j-fw @nhk.or.jp kimu.y-go @nhk.or.jpMasaru
UPenn - P - 03
Loosely Tree-Based Alignment for Machine TranslationDaniel Gildea University of Pennsylvania dgildea@cis.upenn.eduAbstractWe augment a model of translation based on re-ordering nodes in syntactic trees in order to allow alignments not conforming
UPenn - C - 90
Toward Memory-based TranslationSatoshi S A T O and Ma.koto N A G A O Dept. of Electrical Engineering, K y o t o University Y o s h i d a - h o n m a c h i , Sa.kyo, K.yoto, 606, Ja.pan sa.to@kuee.kyoto-u.ac.jpAbstractAn essential problem of examp
UPenn - J - 93
Machine Translation: A Knowledge-Based Approach Sergei Nirenburg, Jaime Carbonell, Masaru Tomita, and Kenneth Goodman(Carnegie Mellon University) San Mateo, CA: Morgan Kaufmann Publishers, 1992, xiv + 258 pp. Hardbound, ISBN 1-55860-128-7, $39.95T
UPenn - C - 00
Automatic Corpus-Based Thai Word Extraction with the C4.5 Learning AlgorithmVIRACH SORNLERTLAMVANICH, TANAPONG POTIPITI AND THATSANEE CHAROENPORN National Electronics and Computer Technology Centel, National Science and Technology Development Agency
UPenn - C - 90
Reversible Unification Based M a c h m . FranslatlonGertjan van Noord OTS RUU Trans 10 3,512 JK Utrecht Valmoord~hutruu59.BH~netMarch 28, 1990Abstract[n this paper it will be shown how unification g r a m m a r s can be used to build a reversib
UPenn - C - 00
Chart-Based Transfer Rule Application in Machine TranslationAdam MeyersNew York University meyers@cs.nyu.edu M i c h i k o Kosaka Monlnouth University kosaka@monmouth.eduR a l p h GrishInanNew York University grishman@cs.nyu.eduAbstract35"ans
UPenn - P - 99
Corpus-Based Identification of Non-Anaphoric N o u n PhrasesD a v i d L. B e a n and E l l e n R i l o f fD e p a r t m e n t of C o m p u t e r Science University of U t a h Salt Lake City, U t a h 84112 {bean,riloff}@cs.utah.eduAbstract Corefer
UPenn - P - 90
ZERO MORPHEMES IN UNIFICATION-BASED COMBINATORY CATEGORIAL GRAMMAR Chinatsu Aone The University of Texas at Austin & MCC 3500 West Balcones Center Dr. Austin, TX 78759 (aone@mcc.com) ABSTRACT In this paper, we report on our use of zero morphemes in U
UPenn - P - 96
A N e w Statistical Parser Based on B i g r a m Lexical D e p e n d e n c i e sCollins* Dept. of Computer and Information Science University of Pennsylvania P h i l a d e l p h i a , P A , 19104, U . S . A . mcollins@gradient, cis.upenn, eduMichae
UPenn - P - 99
Designing a Task-Based Evaluation M e t h o d o l o g y for a Spoken Machine Translation S y s t e mKavita Thomas L a n g u a g e Technologies I n s t i t u t e Carnegie Mellon University 5000 Forbes Avenue P i t t s b u r g h , PA 15213, USAkavita
UPenn - P - 03
An Ontology-based Semantic Tagger for IE systemNarj` s Boufaden e Department of Computer Science Universit de Montr al e e Quebec, H3C 3J7 Canada boufaden@iro.umontreal.caAbstractIn this paper, we present a method for the semantic tagging of word
UPenn - C - 96
NL Domain Explanations in Knowledge Based MATGalia Angelova, Kalina Bontcheva 1Bulgarian Academy of Sciences, Linguistic Modelling Laboratory A c a d . G, B o n c h e v Str. 2 5 A , 1113 S o f i a , B u l g a r i a , { galja,kalina} @ b g c i c t .
UPenn - P - 03
Deverbal Compound Noun Analysis Based on Lexical Conceptual StructureTeruo Koyama Koichi Takeuchi Kyo Kageura Human and Social Information Research Division National Institute of Informatics 2-1-2 Hitotsubashi, Chiyodaku, Tokyo 101-8430, Japan koich
UPenn - D - 07
Large-Scale Named Entity Disambiguation Based on Wikipedia DataSilviu CucerzanMicrosoft Research One Microsoft Way, Redmond, WA 98052, USA silviu@microsoft.comAbstractThis paper presents a large-scale system for the recognition and semantic disa
UPenn - P - 01
A Syntax-based Statistical Translation ModelKenji Yamada and Kevin Knight Information Sciences Institute University of Southern California 4676 Admiralty Way, Suite 1001 Marina del Rey, CA 90292 kyamada,knight @isi.edu AbstractWe present a syntax-b
UPenn - C - 02
Semantics-based Representation for Multimodal Interpretation in Conversational SystemsJoyce ChaiIBM T. J. Watson Research Center 19 Skyline Drive Hawthorne, NY 10532, USA{jchai@us.ibm.com}Abstract To support context-based multimodal interpretati
UPenn - A - 92
A Simple Rule-Based Part of Speech TaggerEric Brill * D e p a r t m e n t of C o m p u t e r S c i e n c e University of Pennsylvania P h i l a d e l p h i a , P e n n s y l v a n i a 19104U.S.A.brill@unagi.cis.upenn.edu Abstract Automatic part o
UPenn - P - 05
A Hierarchical Phrase-Based Model for Statistical Machine TranslationDavid Chiang Institute for Advanced Computer Studies (UMIACS) University of Maryland, College Park, MD 20742, USA dchiang@umiacs.umd.eduAbstractWe present a statistical phrase-b
UPenn - P - 06
Investigations on Event-Based SummarizationMingli Wu Department of Computing The Hong Kong Polytechnic University Kowloon, Hong Kong csmlwu@comp.polyu.edu.hkAbstractWe investigate independent and relevant event-based extractive mutli-document su
UPenn - N - 06
Thai Grapheme-Based Speech RecognitionPaisarn Charoenpornsawat, Sanjika Hewavitharana, Tanja SchultzInteractive Systems Laboratories, School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213 {paisarn, sanjika, tanja}@cs.cmu.eduA
UPenn - P - 01
An Algebra for Semantic Construction in Constraint-based GrammarsAnn Copestake Computer Laboratory University of Cambridge New Museums Site Pembroke St, Cambridge, UKaac@cl.cam.ac.ukAlex Lascarides Division of Informatics University of Edinburgh
UPenn - C - 02
Machine Translation Based on NLG from XML-DBYohei Seki Aoyama Gakuin / Department of Informatics, University The Graduate University for Advanced Studies (Sokendai) Abstract Ken'ichi Harada Department of Computing Science Keio UniversityThe purpos
UPenn - E - 06
Word Sense Induction: Triplet-Based Clustering and Automatic EvaluationStefan Bordag Natural Language Processing Department University of Leipzig Germany sbordag@informatik.uni-leipzig.deAbstractIn this paper a novel solution to automatic and uns
UPenn - P - 89
Unification-BasedSemantic InterpretationRobert C. Moore Artificial Intelligence Center SRI International Menlo Park, CA 94025 AbstractWe show how unification can be used to specify the semantic interpretation of natural-language expressions, inc
UPenn - N - 04
Feature-based Pronunciation Modeling for Speech RecognitionKaren Livescu and James Glass MIT Computer Science and Articial Intelligence Laboratory Cambridge, MA 02139, USA {klivescu, glass}@csail.mit.eduAbstractWe present an approach to pronuncia
UPenn - J - 92
Class-Based n-gram Models of Natural LanguageP e t e r F. B r o w n " P e t e r V. d e S o u z a * R o b e r t L. Mercer* IBM T. J. Watson Research Center V i n c e n t J. D e l l a Pietra* J e n i f e r C. Lai*We address the problem of predicting
UPenn - J - 95
Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech TaggingEric Brill*The Johns Hopkins UniversityRecently, there has been a rebirth of empiricism in the field of natural language processing.
UPenn - E - 06
Adaptive Transformation-based Learning for Improving Dictionary TaggingBurcu Karagol-Ayan, David Doermann, and Amy Weinberg Institute for Advanced Computer Studies (UMIACS) University of Maryland College Park, MD 20742 {burcu,doermann,weinberg}@umia
UPenn - E - 06
Phrase-Based Backoff Models for Machine Translation of Highly Inected LanguagesMei Yang Department of Electrical Engineering University of Washington Seattle, WA, USA yangmei@ee.washington.edu Katrin Kirchhoff Department of Electrical Engineering Un
UPenn - P - 04
Towards a Semantic Classication of Spanish Verbs Based on Subcategorisation InformationEva Esteve Ferrer Department of Informatics University of Sussex Brighton, BN1 9QH, UK E.Esteve-Ferrer@sussex.ac.uk AbstractWe present experiments aiming at an a
UPenn - N - 03
A Phrase-Based Unigram Model for Statistical Machine TranslationChristoph Tillmann and Fei Xia IBM T.J. Watson Research Center Yorktown Heights, NY 10598 {ctill,feixia}@us.ibm.comAbstractIn this paper, we describe a phrase-based unigram model fo
UPenn - CIS - 610
TensorTextures: Multilinear Image-Based RenderingM. Alex O. Vasilescu and Demetri Terzopoulos University of Toronto, Department of Computer Science New York University, Courant Institute of Mathematical SciencesFigure 1: Frames from the Treasure C
UPenn - P - 84
Features and ValuesLauri Karttunen University of Texas at Austin Artificial Intelligence Center SRI International and Center for the Study of Language and Information Stanford UniversityAbstractThe paper discusses the linguistic aspects of a new
UPenn - T - 87
Unification a n d the n e w g r a m m a t i s m Steve Pulman University of Cambridge Computer Laboratory Corn Exchange Street Cambridge C B 2 3QG, UK.Whatare w e talking about?The prototypical unification grammar consists of a context-free skel
UPenn - H - 01
Guidelines for Annotating Temporal InformationInderjeet Mani, George WilsonThe MITRE Corporation, W640 11493 Sunset Hills Road Reston, Virginia 20190-5214, USA +1-703-883-6149Lisa FerroThe MITRE Corporation, K329 202 Burlington Road, Rte. 62 Bed
UPenn - C - 86
The computational complexity of sentence derivation in functional unification grammarGraeme Ritchie Department of Artificial Intelligence University of Edinburgh Edinburgh EHI IHNAbstract Functional unification (FU) grammar is a general linguisti
UPenn - E - 87
DECLARATIVE k VIEVNOOEL FOR DEPENDENCY PARSING INTO BLACKBOARD METHOOOLOGY-Vatkonen, K., J i p p i n e n , H., L e h t o t a , A. and Ytltammi, KIELIKOHE-pr~ject, SITRA Foundation P.O.Box 329, S F - 0 0 1 2 1 H e t s i n k i FinLand t e L . i n
UPenn - A - 97
Layout & Language: Preliminary experiments in assigning logical structure to table cellsMatthew Hurst and Shona Douglas Language Technology Group, Human Communication Research Centre, University of Edinburgh, Edinburgh EH8 9LW UK { M a t t h e w . H
UPenn - C - 88
AN I N T E G R A T E D MODEL F O R T H E TREATMENT OF TIME I N MT- SYSTEMSM. Meya Siemens CDS c/Luis Muntadas,5 CORNELLA, 08940-BARCELONA SpainJ. Vidul EUROTRA-E Ctra. Vallvidriera, 25.27 08017-BARCELONAAbstractOne of the ways to achieve a goo
UPenn - C - 90
Towards a Unification-Based PhonologyRichard Wiese Seminar f'dr Allgem. Spraehwissenschaft Heinrich-Heine-Univer sit,it DUsseldorf D-4000 Di.isseldorf 1 wiesedd0rud81.bitnet 1 Introduction. The Problem Phonological theory has undergone a number cf m
UPenn - C - 86
ASAELEMENTARY CONTRACTS PRAGMATIC BASIS OF LANGUAGEINTERACTIONE.L. Pershina A[ Laboratory, Computer Center Siberian Division of the USSR Ae. Sei. Novosibirsk 630090, USSR ABSTRACT Language interaction (LI) as a part of interpersonal communica
UPenn - E - 85
ON THE REPRESENTATION OF QUERY TERM RELATIONS BY SOFT BOOLEAN oPERATORSGerard Salton D e p a r t m e n t o f Computer S c i e n c e Cornell University Ithaca, NY 14853, USAABSTRACT The l a n g u a g e a n a l y s i s component i n m o s t t e x t
UPenn - C - 86
ConceptualLexicon Using an Object-OrientedLanguageShoiehi Y O K O Y A M A Electrotechnical Laboratory Tsukuba, Ibaraki, JapanKenji H A N A K A T A Universitat Stuttgart Stuttgart, F. R. G e r m a n yAbstractThis paper d e s c r i b e s the
UPenn - E - 91
STRUCTURAL NON-CORRESPONDENCE IN TRANSLATION Henry S. Thompson, Human Communication Research Centre, University of Edinburgh, 2 Buccleuch Place, Edinburgh, EH8 9LW, UIC ht@uk.ac.ed.cogsciLouisa Sadler, Dept. of Language and Linguistics, University
UPenn - C - 82
ADAPTIVE DIALOGUE - THE BASIS FOR PERSONAL COMPUTER SYSTEMVictor Briabrin Computing C e n t e r , Academy o f S c i e n c e s , Hosoow, USSR1. P e r s o n a l Computer S y s t e m s (POS) r e p r e s e n t nowadays a s i g n i f t e a u t t r e n
UPenn - C - 90
Complex Features in Description of Chinese LanguageFeng Zhiwei Imtitute of Applied Linguistics Chinese Academy of Social Sciences 51 Chaoyangmen Nanxiaojie 100010 Beijing, ChinaAbstract In this paper, the similarity of" multi-vahw label fimction" a
UPenn - C - 96
CHINESE STRING SEARCHING USING TtIE K M P ALGORITHMRobert W.P. LukDepartment of Computing, Hong Kong Polytechnic University,Kowloon,Hong Kong E-mail: csrluk@comp.polyu.edu.hkAbstract This paper is about the modification of KMP (Knuth, Morris and
UPenn - CIT - 591
ArraysApr 10, 2009A problem with simple variablesOne variable holds one valueThe value may change over time, but at any given time, a variable holds a single value If you want to keep track of many values, you need many variables All
UPenn - CIT - 591
Numbers and ArraysWidening and narrowing Numeric types are arranged in a continuum: Wider double float long int short byte,char Narrower You can easily assign a narrower type to a wider type: doublewide; intnarrow; wide=narrow; But if you want
UPenn - STAT - 112
Stat 112Review Notes for Chapter 4, Lecture Notes 6-91. Best Simple Linear Regression: Among the variables X 1 , K , X K , the variable which best predicts Y based on a simple linear regression is the variable for which the simple linear regressi
Auburn Montgomery - MATH - 190
Decimal Expansion of FractionsBrent MurphyP QProblem: Under What conditions will the decimal expansion of p/q terminate? Under what conditions will it repeat? p/q can be investigated as p*(1/q).Terminating When placing 1 over q as a fraction t
Auburn Montgomery - MATH - 190
Project 1.2 Decimal Expansions of Rational NumbersJacob Brozenick Anthony Mayle Kenny Milnes And Tim SweetserProblem Descriptions1. Determine which values of q in the expression p/q will cause the termination of the resulting decimal expansion.
Auburn Montgomery - MATH - 190
Calculus Project 1.2By Dorothy McCammon, Tammy Boals, George Reeves, Robert StevensPart 1 When you have a fraction x/y, y can be divided into x to obtain that fraction in decimal form. There are two different types of decimal numbers you can obt
Auburn Montgomery - MATH - 190
PROBLEM 1: Under what conditions will the decimal expansion p/q terminate? Repeat? PROBLEM 2: Suppose that we are given the decimal expansion of a rational number. How can we represent the decimal in the rational form p/q? PROBLEM 3: Express each
UPenn - M - 95
STATISTICAL SIGNIFICANCE OF MUC-6 RESULT SNancy Chinchor, Ph.D.Science Applications International Corporatio n 10260 Campus Point Drive, M/S A2- F San Diego, CA 9212 1 chinchor@gso.saic.com (619) 458-261 4 INTRODUCTIONThe results of the MUC-6 eva
UPenn - D - 07
Improving Query Spelling Correction Using Web Search ResultsQing Chen Natural Language Processing Lab Northeastern University Shenyang, Liaoning, China, 110004 chenqing@ics.neu.edu.cn Ming Zhou Microsoft Research Asia 5F Sigma Center Zhichun Road, H
UPenn - M - 92
U,SC : MUC-4 Test Results and AnalysisD . Moldovan, S. Cha, M . Chung, K. Hendrickson, J . Kim, and S. Kowalsk iParallel Knowledge Processing Laborator y University of Southern Californi a Los Angeles, California 90089-256 2 moldovan@gringo .usc .
UPenn - H - 90
Recent Results from the A R M Continuous Speech Recognition ProjectM a r t i n Russell and K e i t h P o n t i n gSpeech Research Unit RSKE, Malvern, Worcs WR14 3PS, UKIntroductionThis paper describes some of the most recent work on continuous s
UPenn - M - 92
BBN PLUM : MUC-4 Test Results and Analysi sRalph Weischedel, Damaris Ayuso, Sean Boisen , Heidi Fox, Herbert Gish, Robert Ingria, BBN Systems and Technologie s 10 Moulton St . Cambridge, MA 0213 8 weischedel@bbn.com GOALSOur mid-term to long-term g