6 Pages

C96-2141

Course: C 96, Fall 2009
School: UPenn
Rating:
 
 
 
 
 

Word Count: 3606

Document Preview

M H M - B a s e d Word Alignment in Statistical Translation Stephan Vogel Hermann Ney Christoph Tillmann L e h r s t u h l ffir I n f o r m a t i k V, R W T H A a c h e n D-52056 Aachen, Germany {vogel, n e y , t illmann}@inf ormat ik. rwth-aachen, de Abstract In this paper, we describe a new model for word alignment in statistical translation and present experimental results. The idea of the model is to make the...

Register Now

Unformatted Document Excerpt

Coursehero >> Pennsylvania >> UPenn >> C 96

Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.

Course Hero has millions of student submitted documents similar to the one below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
M H M - B a s e d Word Alignment in Statistical Translation Stephan Vogel Hermann Ney Christoph Tillmann L e h r s t u h l ffir I n f o r m a t i k V, R W T H A a c h e n D-52056 Aachen, Germany {vogel, n e y , t illmann}@inf ormat ik. rwth-aachen, de Abstract In this paper, we describe a new model for word alignment in statistical translation and present experimental results. The idea of the model is to make the alignment probabilities dependent on the differences in the alignment positions rather than on the absolute positions. To achieve this goal, the approach uses a first-order Hidden Markov model (HMM) for the word alignment problem as they are used successfully in speech recognition for the time alignment problem. The difference to the time alignment HMM is that there is no monotony constraint for the possible word orderings. We describe the details of the model and test the model on several bilingual corpora. 2 Review: Translation M o d e l The goal is the translation of a text given in some language F into a target language E. For convenience, we choose for the following exposition as language pair French and English, i.e. we are given a French string f~ = fx ...fj...fJ, which is to be translated into an English string e / = el...ei...cl. Among all possible English strings, we will choose the one with the highest probability which is given by Bayes' decision rule: a{ = = argmax{P,.(c{lAa)} q a r g m a x {Pr(ejt) . l ' r ( f l e [ ) } el ~ 1 Introduction Pr(e{) is the language model of the target language, whereas Pr(fJle{) is the string translation model. The argmax operation denotes the search problem. In this paper, we address the problem of introducing structures into the probabilistic dependencies in order to model the string translation probability Pr(f~ le{). 3 Alignment Models A key issne in modeling the string translation probability Pr(J'~le I) is the question of how we define the correspondence between the words of the English sentence and the words of the French sentence. In typical cases, we can assume a sort of pairwise dependence by considering all word pairs (fj, ei) for a given sentence pair I.-/1[~'J', We furelqlj' ther constrain this model by assigning each French word to exactly one English word. Models describing these types of dependencies are referred to as In this paper, we address the problem of word alignments for a bilingual corpus. In the recent years, there have been a number of papers considering this or similar problems: (Brown et al., 1990), (Dagan et al., 1993), (Kay et al., 1993), (Fung et al., 1993). In our approach, we use a first-order Hidden Markov model (HMM) (aelinek, 1976), which is similar, but not identical to those used in speech recognition. The key component of this approach is to make the alignment probabilities dependent not on the absolute position of the word alignment, but on its relative position; i.e. we consider the differences in the index of the word positions rather than the index itself. The organization of the paper is as follows. After reviewing the statistical approach to machine translation, we first describe the conventional model (mixture model). We then present our first-order HMM approach in lull detail. Finally we present some experimental results and compare our model with the conventional model. alignment models. In this section, we describe two models for word alignrnent in detail: ,. a mixture-based alignment model, which was introduced in (Brown et al., 1990); an HMM-based alignment model. In this paper, we address the question of how to define specific models for the alignment probabilities. The notational convention will be as follows. We use the symbol Pr(.) to denote general 836 probability distributions with (nearly) no Sl)eeitic asSUml)tions. In contrast, for modcl-t)ased prol)-ability distributions, we use the generic symbol v(.). Alignment with Mixture Distri|mtion Here, we describe the mixture-based alignment model in a fornmlation which is different fronl the original formulation ill ( B r o w n el, a[., 1990). We will ,is(: this model as reference tbr the IIMMbased alignments to lie 1)resented later. The model is based on a decomposition of the joint probability [br ,l'~ into a product over the probabilities for each word J): 3.1 a j=l wheFe~ fo[' norll-la]iz;i,t i o n 17(~/SOllS~ the 8elltC][ce For unilbrm alignment probabilities, it can be shown (Brown et al., 1990), that there is only one optinnnn and therefore the I,',M algorithm (Baum, 1!)72) always tinds the global optimum. For mixture alignment model with nonunilbrm alignment probabilities (subsequently referred to as IBM2 model), there ~tre to() many alignrnent parameters Pill j, I) to be estimated for smMl c o l pora. Therefore, a specific model tbr tile Mignment in:obabilities is used: p ( i l j , 1) = l r(i-j~-) .I (~) Ei':l "( it --" J J-) length probability p(J] l) has been included. The next step now is to assutne a sort O['l,airwise interact, ion between tim French word fj an(l each, F,nglish word ci, i = 1, ...l. These dep('ndencies are captured in the lbrm of a rnixtnre distritmtion: 1 This model assumes that the position distance relative to the diagonal line of the (j, i) plane is the dominating factor (see Fig. 1). 'lb train this model, we use the ,naximutn likelihood criterion in the so-called ulaximmn al)proximation, i.e. the likelihood criterion covers only tile most lik(-.ly align: inch, rather than the set of all alignm(,nts: d P,'(f(I,:I) ~ II ~"IUHO, ~)v(J} I,:~)] j=l (a) p(J)le{) = ~_.p(i, fjlc I) i=1 I In training, this criterion amounts to a sequence of iterations, each of which consists of two steps: * posilion alignmcnl: (riven the model parameters, deLerlniim the mosL likely position align]lient. = ~_~p(ilj, l).p(fjle~) i=1 Putting everything together, we have the following mixture-based ntodel: J l r,'(fi!l~I) = p(JIO ' H ~_~ [~,(ilJ, l). ~,(j)led] (1) j = l i=t with the following ingredients: sentence length prob~d)ility: P(Jll); mixture alignment probability: p ( i l j , I); translation probM)ility: p(f[e). Assuming a tmifornl ~flignment prol)ability 1 paramc, lcr cstimalion: Given the position alignment, i.e. goiug along the alignment paths for all sentence pairs, perform maxitnulu likelihood estimation of the model parameters; for model-De(' distributions, these estimates result in relative frequencies. .p(ilj, 1) = 7 l)ue to the natnre of tile nfixture tnod(:l, there is no interaction between d j a c e n t word positions. Theretbre, the optimal position i for each position j can be determined in(lependently of the neighbouring positions. Thus l.he resulting training procedure is straightforward. a.2 we arrive at the lh'st model proposed t)y (Brown et al., 1990). This model will be referred to as IB M 1 model. To train the translation probabilities p(J'fc), we use a bilingual (;orpus consisting of sentence pairs Alignment with HMM [:/';4"1 : ', . , s Using the ,,laxin,ul , likelihood criterion, we ol)tain the following iterative L a equation (Brown et al., 1990): /)(fie) = ~ $' will, A(f,e) = ~ 2 ~5(f,J).~) }~ a(e,e~.~) We now propose all HMM-based alignment model. '['he motivation is that typicMly we have a strong localization effect in aligning the words in parallel texts (for language pairs fi:om ]ndoeuropean languages): the words are not distrilmted arbitrarily over the senteuce ])ositions, but tend to form clusters. Fig. 1 illustrates this effect for the language pair G e r m a n - 15'nglish. Each word of the German sentence is assigned to a word of the English sentence. The alignments have a strong tendency to preserve the local neighborhood when going from the one langnage to the other language. In mm,y cases, although not al~ ways, there is an even stronger restriction: the differeuce in the position index is smMler than 3. 837 DAYS BOTH ON EIGHT + + + + + + + + +j~ + + + + + + + + + ~J ~+ + +++++++/++.-. + + + + + + +/+ + + + + + + + + + ~x~ + + + + + + + + + +/+ D + + + + + AT IT MAKE CAN WE IF THINK I where, in addition, we have assmned that tile translation probability del)ends only oil aj and not oil aj-:l. Putting everything together, we have the ibllowing llMM-based model: a ++ + + ~ + + + _~ + + +~ + + +jg + + +~ +++ g + + ++ + + + + + + + + + + + + + + + ++ ++++ + + + + + + + + + + + + + + + ++ + + + + Pr(f:i'le{) = ~ I-I [p(ajlaj - ' , l).p(Y)lea,)] af J=, with the following ingredients: IlMM alignment probability: p ( a j l a j _ l , I); (4) WELL p(i]i', I) or z aa Figure 1: Word alignment for a G e r m a n - English sentence pair. To describe these word-by-word aligmnents, we introduce the mapping j ---+ aj, which assigns a word fj in position j to a word el in position { = aj. The concept of these alignments is similar to the ones introduced by (Brown et al., 1990), but we wilt use another type of dependence in the probability distributions. Looking at such alignments produced by a hmnan expert, it is evident that the mathematical model should try to capture the strong dependence of aj on the previous aligmnent. Therefore the probability of alignment aj for position j should have a dependence on the previous alignment aj _ 1 : translation probabflity: p(f]e). In addition, we assume that the t{MM alignment probabilities p(i[i', [) depend only on the jump width (i - i'). Using a set of non-negative parameters { s ( i - i')}, we can write the IIMM alignment probabilities in the form: 4i- p(ili', i) = E ' 1=1 i') s(1 - i') (5) This form ensures that for each word position i', i' = 1, ..., I, the ItMM alignment probabilities satisfy the normMization constraint. Note the similarity between Equations (2) and (5). The mixtm;e model can be interpreted as a zeroth-order model in contrast to the first-order tlMM model. As with the IBM2 model, we use again the maximum approximation: J Pr(fiSle~) "~ max]--[ a' / .ll. j,,, [p(asl<*j-1, z)p(fjl<~,)] task of finding the (6) j=l p(ajiaj_l,i) , In this case, the optimal where we have inchided the conditioning on the total length [ of the English sentence for normalization reasons. A sinfilar approach has been chosen by (Da.gan et al., 1993). Thus the problem formulation is similar to that of the time alignment problem in speech recognition, where the so-called IIidden Markov models have been successfully used for a long time (Jelinek, 1976). Using the same basic principles, we can rewrite the probability by introducing the 'hidden' alignments af := al...aj...aa for a sentence pair If,a; e{]: alignment is more involved than in the case of the mixture model (lBM2). Thereibre, we have to resort to dynainic programming for which we have the following typical reeursion formula: Q(i, j) = p(fj lel) ,nvax [p(ili', 1). Q(i', j - 1)] i =l,.,,I Here, Q(i, j) is a sort of partial probability as in time alignment for speech recognition (Jelinek, 197@. 4 Experimental Results 4.1 T h e T a s k a n d t h e C o r p u s The models were tested on several tasks: the Avalanche Bulletins published by the Swiss Federal Institute for Snow and Avalanche Research (SHSAR) in Davos, Switzerland and made awtilable by the Eup "q I ropean Corpus Initiative (I,CI/MCI, 1994); the Verbmobil Corpus consisting of spontaneously spoken dialogs in the domain of appointment scheduling (Wahlster, 1993); Pr(f~al es) = ~_,Vr(fal, aT[ eI't, a7 ,1 = ~ 1-IP"(k,"stfT-',"{ -*,e/) a I j=l So far there has been no basic restriction of the approach. We now assume a first-order dependence on the alignments aj only: Vr(fj,aslf{ a -~, J-* , e l ) I 838 ,, the E u T r a n s C,orpus which contains tyl)ical phrases from the tourists and t.ravel docnain. (EuTrans, 1996). 'l'able ] gives the details on the size of tit<; corp o r a a,ud t;]t<'it' vocal>ulary. It shottld I>e noted that in a.ll thes(; three ca.ses the ratio el' vocal)t,]ary size a.ml numl)er of running words is not very faw)rable. Tall)le, I: (,orpol :L (,o~pt s AvalancJte ] A[ [ r a i l s l,angua.ge Frolt ch (~('~ l lall Spanish I,;nglish ( l e 11 a n Voc. Size 1993 2265 --1:77@2008 15888 t 63(} Words 62849 ,]4805 is the' globa.l o p t i m i z a t i o n criterion, the tables also show the perplexities of the translation probabilities and of the alignment probabilities. T h e last line in Table 2 gives the perplexity measures wh(m a.lJplying the rtlaxilnun| a p p r o x i m a t i o n and COml>uting the perph'~xity in t;]lis a p p r o x i m a t i o n . These values are equal to the ones after initializing the IBM2 and HMM models, as they should be. From Ta,ble 3, we can see. that the m i x t u r e alignment gives slightly better perplexity values for the translation l)roba.1)ilities, whereas the I I M M model produces a smaller perplexity for the alignment l>rohal)ilities. In the calculatiot, of the, perplexities, th<' seld;en(;e length probal)ility was not in= eluded. Tahle 2: IBM I: Translation, a,ligmnent and total pert)h'~xil.y as a. fimction of' the iteration. Iteration 0 1 Verlmlobil 150279 English 25,] 27 dO 17 2`]/13 For several years 1)et;weeu 83 and !)2, the Avalanche Bulletins are awdlabte for I>oth Getntan and I!'ren(;]l. T h e following is a tyl)ical sen-t<;nce t>air fS;onl the <;or:IreS: Bei zu('.rst recht holnm, Sl)~i.tev tM'eren 'l'eml)eraJ, uren sind vou Samsta.g his 1)ienstag t n o f gett auf <l<'~t; All>ennor(ls<'.ite un</ am All>en-. ha.uptkanml oberhalb 2000 m 60 his 80 cm Neuschnee gel'aJlen. l)ar des temp&'atures d' abord dlevdes, puis plus basses, 60 h 8(1 cm de neige sent tombs de samedi h. m a r d i matin sur le versant herd el; la eft're des Alpes au-dessus de 2000 l[1. An exa,nq)le fi'om the Vet%mobil corpus is given in Figure 1. 4.2 Training and ILesults 2 9 Tra,nslatiotl. 99.36 3.72 2.67 t.87 1.86 Alignrnent 20.07 20.07 20.07 20.07 20.07 Total 1994.00 7/1.57 53.62 37.55 37.36 77.!)5 10 Max. 3.88 20.07 'l'able 3:'1 rans] ~+tion, aligmn en t and totaJ perplexity as a function of the itcra.tion for the IBM2 (A) and the IIMM model (13) Iter. A 0 l 2 Tratmlat;i(m 3.883.17 3.25 3.22 A ligniN.elJ t 20.07 10.82 10.15 10.10 'l'otal l,;ach of the three COrlJora. were ttsed to train 1)oth alignnmnt models, the mixture-I>ased alignment model in Eq.(1) and the llMM-base<l a.lignntent mod('l in Eq.(d). ltere, we will consider the exp<'.rimenta.l tesl;s on tit<'. Avalanche corpus in more detail. T h e traii, ing procedure consiste(l of the following steps: , Initialization training: IBMI model trahted for t0 iterations of the i';M algorithm. ,, l{,efinement traiuiug: T h e translation pcoba1)ilities Dotn the initialization training wet'(; use+d to initialize both the IBM2 model and the I I M M-based nligntnent mo<t<'+l IBM2 ilnum IIMM imum Model: 5 iteratious using Lit(" m a x a.I)proximatiolt (Eq+(3)) Model: 5 iterations usiug l l e max-. al)l)roximation (Fq.(6)) 3 ,] 1~ 5 0 3.20 3.18 3.88 ] 0.06 10.05 20.07 1 3 4 5 3.37 3.46 ;{./17 "Ld6 3.`]5 7.99 6.17 5.90 5.85 5.8,] 77.95 34.27 33.03 32.48 32.18 32.00 77.95 26.98 2t.36 20.48 20.2/1 20.18 'l'h(, resulting perl>h:'~xity (inverse g<~olu(;l.ric avera,ge of the likelihoods) for the dilferent lno(lels ave given iu tim Tal>[es 2 and 3 for the Awdanehe <:<)rims. In adclitiou t;o the total i>erl>lexity, whi<'.h Anoth<2r inl;crc:sting question is whether the IIMM alignntent model helps in finding good and sharply fo('usscd word+to-word (-orres]Jondences. As an (;xamf,1o, Table 4 gives a COmlm+rison of the translatioJ~ probabilities p(fl e) bctweett the mixture and the IIMM alignnw+nt model For the (,e, u +l word Alpensiidhang. T h e counts of the words a.re given in brackets. The, re is virLually no ,:lilfc~rc~nce between the translation l.al>les for the two nn)dels (1BM2 and IIMM). But+ itt general, the tl M M model seems to giw'. slightly better resuits in the cases of (;, ttna t COml+olmd words like Alpcus'iidha'n,(I vcrsant sud des Alpes which require ['u,tction words in the trattslation. 839 Table 4: Alpens/idhang. IBM1 Alpes (684) des (1968) le (1419) sud (416) sur (769) versant (431) Alpes (684) sud (41.6) versant (431) Alpes (684) des (1968) sud (416) versant (431) 0.171 0.035 0.039 0.427 0.040 0.284 0.276 0.371 0.356 0.284 0.028 0.354 0.333 IBM2 The Verbmobil Corpus consists of spontaneously spoken dialogs in the domain of appointment scheduling. The assumption that every word in the source language is aligned to a word in the target language breaks down for many sentence pairs, resulting in poor alignment. This in turn affects the quality of the translation probabilities. Several extensions to the current IIMM based model could be used to tackle these problems: * The results presented here did not use the concept of the empty word. For the HMMbased model this, however, requires a secondorder rather than a first-order model. . We could allow for multi-word phrases in both languages. In addition to the absolute or relative alignment positions, the alignment probabilities can be assumed to depend on part of speech tags or on the words themselves. (confer model 4 in (Brown et al., 1990)). HMM This is a result of the smoother position alignments produced by the HMM model. A pronounced example is given in Figure 2. 'She problem of the absolute position alignment can he demonstrated at the positions (a) and (c): both Schneebretlgefahr und Schneeverfrachtungen have a high probability on neige. The IBM2 models chooses the position near the diagonal, as this is the one with the higher probability. Again, Schneebrettgefahr generates de which explains the wrong alignment near the diagonal in (c). However, this strength of the HMM model can also be a weakness as in the case of est developpe ist ... entstanden (see (b) in Figure 2. The required two large jumps are correctly found by the mixture model, but not by the HMM model. These cases suggest an extention to the HMM model. In general, there are only a small number of big jumps in the position alignments in a given sentence pair. Therefore a model could be useful that distinguishes between local and big jumps. The models have also been tested on the Verbmobil Translation Corpus as well as on a small Corpus used in the EuTrans project. The sentences in the EuTrans corpus are in general short phrases with simple grammatical structures. However, the training corpus is very small and the produced alignments are generally of poor quality. There is no marked difference...

Find millions of documents on Course Hero - Study Guides, Lecture Notes, Reference Materials, Practice Exams and more. Course Hero has millions of course specific materials providing students with the best way to expand their education.

Below is a small sample set of documents:

UPenn - P - 03
Chunk-based Statistical TranslationTaro Watanabe, Eiichiro Sumita and Hiroshi G. Okuno {taro.watanabe, eiichiro.sumita}@atr.co.jp ATR Spoken Language Translation Department of Intelligence Science Research Laboratories and Technology 2-2-2 Hikarida
UPenn - C - 00
ABL: Alignment-Based LearningMenno van Zaanen School of C o m p u t e r S t u d i e s University of Leeds LS2 9 J T L(~eds UK menno@scs, l e e d s , a c . ukAbstract This ])al)er introdu(:es a new tyl)e of grammar learning algorithm, iilst)ired l)
UPenn - P - 00
An Information-Theory-Based Feature Type Analysis for the Modelling of Statistical ParsingSUI Zhifang , ZHAO Jun , Dekai WU Hong Kong University of Science &amp; Technology Department of Computer Science Human Language Technology Center Clear Water B
UPenn - P - 03
A spoken dialogue interface for TV operations based on data collected by using WOZ methodJun Yeun-Bae Goto Kim NHK STRL NHK STRL Human Science Human Science Tokyo 157-8510 Tokyo 157-8510 Japan Japangoto.j-fw @nhk.or.jp kimu.y-go @nhk.or.jpMasaru
UPenn - P - 03
Loosely Tree-Based Alignment for Machine TranslationDaniel Gildea University of Pennsylvania dgildea@cis.upenn.eduAbstractWe augment a model of translation based on re-ordering nodes in syntactic trees in order to allow alignments not conforming
UPenn - C - 90
Toward Memory-based TranslationSatoshi S A T O and Ma.koto N A G A O Dept. of Electrical Engineering, K y o t o University Y o s h i d a - h o n m a c h i , Sa.kyo, K.yoto, 606, Ja.pan sa.to@kuee.kyoto-u.ac.jpAbstractAn essential problem of examp
UPenn - J - 93
Machine Translation: A Knowledge-Based Approach Sergei Nirenburg, Jaime Carbonell, Masaru Tomita, and Kenneth Goodman(Carnegie Mellon University) San Mateo, CA: Morgan Kaufmann Publishers, 1992, xiv + 258 pp. Hardbound, ISBN 1-55860-128-7, $39.95T
UPenn - C - 00
Automatic Corpus-Based Thai Word Extraction with the C4.5 Learning AlgorithmVIRACH SORNLERTLAMVANICH, TANAPONG POTIPITI AND THATSANEE CHAROENPORN National Electronics and Computer Technology Centel, National Science and Technology Development Agency
UPenn - C - 90
Reversible Unification Based M a c h m . FranslatlonGertjan van Noord OTS RUU Trans 10 3,512 JK Utrecht Valmoord~hutruu59.BH~netMarch 28, 1990Abstract[n this paper it will be shown how unification g r a m m a r s can be used to build a reversib
UPenn - C - 00
Chart-Based Transfer Rule Application in Machine TranslationAdam MeyersNew York University meyers@cs.nyu.edu M i c h i k o Kosaka Monlnouth University kosaka@monmouth.eduR a l p h GrishInanNew York University grishman@cs.nyu.eduAbstract35&quot;ans
UPenn - P - 99
Corpus-Based Identification of Non-Anaphoric N o u n PhrasesD a v i d L. B e a n and E l l e n R i l o f fD e p a r t m e n t of C o m p u t e r Science University of U t a h Salt Lake City, U t a h 84112 {bean,riloff}@cs.utah.eduAbstract Corefer
UPenn - P - 90
ZERO MORPHEMES IN UNIFICATION-BASED COMBINATORY CATEGORIAL GRAMMAR Chinatsu Aone The University of Texas at Austin &amp; MCC 3500 West Balcones Center Dr. Austin, TX 78759 (aone@mcc.com) ABSTRACT In this paper, we report on our use of zero morphemes in U
UPenn - P - 96
A N e w Statistical Parser Based on B i g r a m Lexical D e p e n d e n c i e sCollins* Dept. of Computer and Information Science University of Pennsylvania P h i l a d e l p h i a , P A , 19104, U . S . A . mcollins@gradient, cis.upenn, eduMichae
UPenn - P - 99
Designing a Task-Based Evaluation M e t h o d o l o g y for a Spoken Machine Translation S y s t e mKavita Thomas L a n g u a g e Technologies I n s t i t u t e Carnegie Mellon University 5000 Forbes Avenue P i t t s b u r g h , PA 15213, USAkavita
UPenn - P - 03
An Ontology-based Semantic Tagger for IE systemNarj` s Boufaden e Department of Computer Science Universit de Montr al e e Quebec, H3C 3J7 Canada boufaden@iro.umontreal.caAbstractIn this paper, we present a method for the semantic tagging of word
UPenn - C - 96
NL Domain Explanations in Knowledge Based MATGalia Angelova, Kalina Bontcheva 1Bulgarian Academy of Sciences, Linguistic Modelling Laboratory A c a d . G, B o n c h e v Str. 2 5 A , 1113 S o f i a , B u l g a r i a , { galja,kalina} @ b g c i c t .
UPenn - P - 03
Deverbal Compound Noun Analysis Based on Lexical Conceptual StructureTeruo Koyama Koichi Takeuchi Kyo Kageura Human and Social Information Research Division National Institute of Informatics 2-1-2 Hitotsubashi, Chiyodaku, Tokyo 101-8430, Japan koich
UPenn - D - 07
Large-Scale Named Entity Disambiguation Based on Wikipedia DataSilviu CucerzanMicrosoft Research One Microsoft Way, Redmond, WA 98052, USA silviu@microsoft.comAbstractThis paper presents a large-scale system for the recognition and semantic disa
UPenn - P - 01
A Syntax-based Statistical Translation ModelKenji Yamada and Kevin Knight Information Sciences Institute University of Southern California 4676 Admiralty Way, Suite 1001 Marina del Rey, CA 90292 kyamada,knight @isi.edu AbstractWe present a syntax-b
UPenn - C - 02
Semantics-based Representation for Multimodal Interpretation in Conversational SystemsJoyce ChaiIBM T. J. Watson Research Center 19 Skyline Drive Hawthorne, NY 10532, USA{jchai@us.ibm.com}Abstract To support context-based multimodal interpretati
UPenn - A - 92
A Simple Rule-Based Part of Speech TaggerEric Brill * D e p a r t m e n t of C o m p u t e r S c i e n c e University of Pennsylvania P h i l a d e l p h i a , P e n n s y l v a n i a 19104U.S.A.brill@unagi.cis.upenn.edu Abstract Automatic part o
UPenn - P - 05
A Hierarchical Phrase-Based Model for Statistical Machine TranslationDavid Chiang Institute for Advanced Computer Studies (UMIACS) University of Maryland, College Park, MD 20742, USA dchiang@umiacs.umd.eduAbstractWe present a statistical phrase-b
UPenn - P - 06
Investigations on Event-Based SummarizationMingli Wu Department of Computing The Hong Kong Polytechnic University Kowloon, Hong Kong csmlwu@comp.polyu.edu.hkAbstractWe investigate independent and relevant event-based extractive mutli-document su
UPenn - N - 06
Thai Grapheme-Based Speech RecognitionPaisarn Charoenpornsawat, Sanjika Hewavitharana, Tanja SchultzInteractive Systems Laboratories, School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213 {paisarn, sanjika, tanja}@cs.cmu.eduA
UPenn - P - 01
An Algebra for Semantic Construction in Constraint-based GrammarsAnn Copestake Computer Laboratory University of Cambridge New Museums Site Pembroke St, Cambridge, UKaac@cl.cam.ac.ukAlex Lascarides Division of Informatics University of Edinburgh
UPenn - C - 02
Machine Translation Based on NLG from XML-DBYohei Seki Aoyama Gakuin / Department of Informatics, University The Graduate University for Advanced Studies (Sokendai) Abstract Ken'ichi Harada Department of Computing Science Keio UniversityThe purpos
UPenn - E - 06
Word Sense Induction: Triplet-Based Clustering and Automatic EvaluationStefan Bordag Natural Language Processing Department University of Leipzig Germany sbordag@informatik.uni-leipzig.deAbstractIn this paper a novel solution to automatic and uns
UPenn - P - 89
Unification-BasedSemantic InterpretationRobert C. Moore Artificial Intelligence Center SRI International Menlo Park, CA 94025 AbstractWe show how unification can be used to specify the semantic interpretation of natural-language expressions, inc
UPenn - N - 04
Feature-based Pronunciation Modeling for Speech RecognitionKaren Livescu and James Glass MIT Computer Science and Articial Intelligence Laboratory Cambridge, MA 02139, USA {klivescu, glass}@csail.mit.eduAbstractWe present an approach to pronuncia
UPenn - J - 92
Class-Based n-gram Models of Natural LanguageP e t e r F. B r o w n &quot; P e t e r V. d e S o u z a * R o b e r t L. Mercer* IBM T. J. Watson Research Center V i n c e n t J. D e l l a Pietra* J e n i f e r C. Lai*We address the problem of predicting
UPenn - J - 95
Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech TaggingEric Brill*The Johns Hopkins UniversityRecently, there has been a rebirth of empiricism in the field of natural language processing.
UPenn - E - 06
Adaptive Transformation-based Learning for Improving Dictionary TaggingBurcu Karagol-Ayan, David Doermann, and Amy Weinberg Institute for Advanced Computer Studies (UMIACS) University of Maryland College Park, MD 20742 {burcu,doermann,weinberg}@umia
UPenn - E - 06
Phrase-Based Backoff Models for Machine Translation of Highly Inected LanguagesMei Yang Department of Electrical Engineering University of Washington Seattle, WA, USA yangmei@ee.washington.edu Katrin Kirchhoff Department of Electrical Engineering Un
UPenn - P - 04
Towards a Semantic Classication of Spanish Verbs Based on Subcategorisation InformationEva Esteve Ferrer Department of Informatics University of Sussex Brighton, BN1 9QH, UK E.Esteve-Ferrer@sussex.ac.uk AbstractWe present experiments aiming at an a
UPenn - N - 03
A Phrase-Based Unigram Model for Statistical Machine TranslationChristoph Tillmann and Fei Xia IBM T.J. Watson Research Center Yorktown Heights, NY 10598 {ctill,feixia}@us.ibm.comAbstractIn this paper, we describe a phrase-based unigram model fo
UPenn - CIS - 610
TensorTextures: Multilinear Image-Based RenderingM. Alex O. Vasilescu and Demetri Terzopoulos University of Toronto, Department of Computer Science New York University, Courant Institute of Mathematical SciencesFigure 1: Frames from the Treasure C
UPenn - P - 84
Features and ValuesLauri Karttunen University of Texas at Austin Artificial Intelligence Center SRI International and Center for the Study of Language and Information Stanford UniversityAbstractThe paper discusses the linguistic aspects of a new
UPenn - T - 87
Unification a n d the n e w g r a m m a t i s m Steve Pulman University of Cambridge Computer Laboratory Corn Exchange Street Cambridge C B 2 3QG, UK.Whatare w e talking about?The prototypical unification grammar consists of a context-free skel
UPenn - H - 01
Guidelines for Annotating Temporal InformationInderjeet Mani, George WilsonThe MITRE Corporation, W640 11493 Sunset Hills Road Reston, Virginia 20190-5214, USA +1-703-883-6149Lisa FerroThe MITRE Corporation, K329 202 Burlington Road, Rte. 62 Bed
UPenn - C - 86
The computational complexity of sentence derivation in functional unification grammarGraeme Ritchie Department of Artificial Intelligence University of Edinburgh Edinburgh EHI IHNAbstract Functional unification (FU) grammar is a general linguisti
UPenn - E - 87
DECLARATIVE k VIEVNOOEL FOR DEPENDENCY PARSING INTO BLACKBOARD METHOOOLOGY-Vatkonen, K., J i p p i n e n , H., L e h t o t a , A. and Ytltammi, KIELIKOHE-pr~ject, SITRA Foundation P.O.Box 329, S F - 0 0 1 2 1 H e t s i n k i FinLand t e L . i n
UPenn - A - 97
Layout &amp; Language: Preliminary experiments in assigning logical structure to table cellsMatthew Hurst and Shona Douglas Language Technology Group, Human Communication Research Centre, University of Edinburgh, Edinburgh EH8 9LW UK { M a t t h e w . H
UPenn - C - 88
AN I N T E G R A T E D MODEL F O R T H E TREATMENT OF TIME I N MT- SYSTEMSM. Meya Siemens CDS c/Luis Muntadas,5 CORNELLA, 08940-BARCELONA SpainJ. Vidul EUROTRA-E Ctra. Vallvidriera, 25.27 08017-BARCELONAAbstractOne of the ways to achieve a goo
UPenn - C - 90
Towards a Unification-Based PhonologyRichard Wiese Seminar f'dr Allgem. Spraehwissenschaft Heinrich-Heine-Univer sit,it DUsseldorf D-4000 Di.isseldorf 1 wiesedd0rud81.bitnet 1 Introduction. The Problem Phonological theory has undergone a number cf m
UPenn - C - 86
ASAELEMENTARY CONTRACTS PRAGMATIC BASIS OF LANGUAGEINTERACTIONE.L. Pershina A[ Laboratory, Computer Center Siberian Division of the USSR Ae. Sei. Novosibirsk 630090, USSR ABSTRACT Language interaction (LI) as a part of interpersonal communica
UPenn - E - 85
ON THE REPRESENTATION OF QUERY TERM RELATIONS BY SOFT BOOLEAN oPERATORSGerard Salton D e p a r t m e n t o f Computer S c i e n c e Cornell University Ithaca, NY 14853, USAABSTRACT The l a n g u a g e a n a l y s i s component i n m o s t t e x t
UPenn - C - 86
ConceptualLexicon Using an Object-OrientedLanguageShoiehi Y O K O Y A M A Electrotechnical Laboratory Tsukuba, Ibaraki, JapanKenji H A N A K A T A Universitat Stuttgart Stuttgart, F. R. G e r m a n yAbstractThis paper d e s c r i b e s the
UPenn - E - 91
STRUCTURAL NON-CORRESPONDENCE IN TRANSLATION Henry S. Thompson, Human Communication Research Centre, University of Edinburgh, 2 Buccleuch Place, Edinburgh, EH8 9LW, UIC ht@uk.ac.ed.cogsciLouisa Sadler, Dept. of Language and Linguistics, University
UPenn - C - 82
ADAPTIVE DIALOGUE - THE BASIS FOR PERSONAL COMPUTER SYSTEMVictor Briabrin Computing C e n t e r , Academy o f S c i e n c e s , Hosoow, USSR1. P e r s o n a l Computer S y s t e m s (POS) r e p r e s e n t nowadays a s i g n i f t e a u t t r e n
UPenn - C - 90
Complex Features in Description of Chinese LanguageFeng Zhiwei Imtitute of Applied Linguistics Chinese Academy of Social Sciences 51 Chaoyangmen Nanxiaojie 100010 Beijing, ChinaAbstract In this paper, the similarity of&quot; multi-vahw label fimction&quot; a
UPenn - C - 96
CHINESE STRING SEARCHING USING TtIE K M P ALGORITHMRobert W.P. LukDepartment of Computing, Hong Kong Polytechnic University,Kowloon,Hong Kong E-mail: csrluk@comp.polyu.edu.hkAbstract This paper is about the modification of KMP (Knuth, Morris and
UPenn - CIT - 591
ArraysApr 10, 2009A problem with simple variablesOne variable holds one valueThe value may change over time, but at any given time, a variable holds a single value If you want to keep track of many values, you need many variables All
UPenn - CIT - 591
Numbers and ArraysWidening and narrowing Numeric types are arranged in a continuum: Wider double float long int short byte,char Narrower You can easily assign a narrower type to a wider type: doublewide; intnarrow; wide=narrow; But if you want
UPenn - STAT - 112
Stat 112Review Notes for Chapter 4, Lecture Notes 6-91. Best Simple Linear Regression: Among the variables X 1 , K , X K , the variable which best predicts Y based on a simple linear regression is the variable for which the simple linear regressi
Auburn Montgomery - MATH - 190
Decimal Expansion of FractionsBrent MurphyP QProblem: Under What conditions will the decimal expansion of p/q terminate? Under what conditions will it repeat? p/q can be investigated as p*(1/q).Terminating When placing 1 over q as a fraction t
Auburn Montgomery - MATH - 190
Project 1.2 Decimal Expansions of Rational NumbersJacob Brozenick Anthony Mayle Kenny Milnes And Tim SweetserProblem Descriptions1. Determine which values of q in the expression p/q will cause the termination of the resulting decimal expansion.
Auburn Montgomery - MATH - 190
Calculus Project 1.2By Dorothy McCammon, Tammy Boals, George Reeves, Robert StevensPart 1 When you have a fraction x/y, y can be divided into x to obtain that fraction in decimal form. There are two different types of decimal numbers you can obt
Auburn Montgomery - MATH - 190
PROBLEM 1: Under what conditions will the decimal expansion p/q terminate? Repeat? PROBLEM 2: Suppose that we are given the decimal expansion of a rational number. How can we represent the decimal in the rational form p/q? PROBLEM 3: Express each
UPenn - M - 95
STATISTICAL SIGNIFICANCE OF MUC-6 RESULT SNancy Chinchor, Ph.D.Science Applications International Corporatio n 10260 Campus Point Drive, M/S A2- F San Diego, CA 9212 1 chinchor@gso.saic.com (619) 458-261 4 INTRODUCTIONThe results of the MUC-6 eva
UPenn - D - 07
Improving Query Spelling Correction Using Web Search ResultsQing Chen Natural Language Processing Lab Northeastern University Shenyang, Liaoning, China, 110004 chenqing@ics.neu.edu.cn Ming Zhou Microsoft Research Asia 5F Sigma Center Zhichun Road, H
UPenn - M - 92
U,SC : MUC-4 Test Results and AnalysisD . Moldovan, S. Cha, M . Chung, K. Hendrickson, J . Kim, and S. Kowalsk iParallel Knowledge Processing Laborator y University of Southern Californi a Los Angeles, California 90089-256 2 moldovan@gringo .usc .