8 Pages

I05-3006

Course: I 05, Fall 2009
School: UPenn
Rating:
 
 
 
 
 

Word Count: 4007

Document Preview

Named Product Entity Recognition Based on Hierarchical Hidden Markov Model Feifan Liu, Jun Zhao, Bibo Lv, Bo Xu Hao Yu National Laboratory of Pattern Recognition FUJITSU R&D Institute of Automation Chinese Academy of Sciences Xiao Yun Road No.26 Beijing P.O. Box 2728, 100080 Chao Yang District, Beijing, 100016 {ffliu,jzhao,bblv,xubo}@nlpr.ia.ac.cn yu@frdc.fujitsu.com Abstract A hierarchical hidden Markov...

Register Now

Unformatted Document Excerpt

Coursehero >> Pennsylvania >> UPenn >> I 05

Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.

Course Hero has millions of student submitted documents similar to the one below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
Named Product Entity Recognition Based on Hierarchical Hidden Markov Model Feifan Liu, Jun Zhao, Bibo Lv, Bo Xu Hao Yu National Laboratory of Pattern Recognition FUJITSU R&D Institute of Automation Chinese Academy of Sciences Xiao Yun Road No.26 Beijing P.O. Box 2728, 100080 Chao Yang District, Beijing, 100016 {ffliu,jzhao,bblv,xubo}@nlpr.ia.ac.cn yu@frdc.fujitsu.com Abstract A hierarchical hidden Markov model (HHMM) based approach of product named entity recognition (NER) from Chinese free text is presented in this paper. Characteristics and challenges in product NER is also investigated and analyzed deliberately compared with general NER. Within a unied statistical framework, the approach we proposed is able to make probabilistically reasonable decisions to a global optimization by leveraging diverse range of linguistic features and knowledge sources. Experimental results show that our approach performs quite well in two different domains. entity recognition which can be crucial and valuable in many business IE applications, especially with the increasing research interest in Market Intelligence Management(MIM), Enterprise Content Management (ECM) [Pierre 2002] and etc. This paper describes a prototype system for product named entity recognition, ProNER, in which a HHMM-based approach is employed. Within a unied statistical framework, the approach based on a mixture model is able to make probabilistically reasonable decisions to a global optimization by exploiting diverse range of linguistic features and knowledge sources. Experimental results show that ProNER performs quite well in two different domains. 2 Related Work Up to now not much work has been done on product named entity recognition, nor systematic analysis of characteristics for this task. [Pierre 2002] developed an English NER system capable of identifying product names in product views. It employed a simple Boolean classier for identifying product name, which was constructed from the list of product names. The method is similar to token matching and has a limitation for product NER applications. [Bick et al. 2004] recognized named entities including product names based on constraint grammar based parser for Danish. This rule-based approach is highly dependent on the performance of Danish parser and suffers from its weakness in system portability. [C. Niu et al. 2003] presented a bootstrapping approach for English named entity recognition using successive learners of parsing-based decision 1 Introduction Named entity recognition(NER) plays a signicantly important role in information extraction(IE) and many other applications. Previous study on NER is mainly focused either on the proper name identication of person(PER), location(LOC), organization(ORG), time(TIM) and numeral(NUM) expressions almost in news domain, which can be viewed as general NER, or other named entity (NE) recognition in specic domain such as biology. As far as we know, however, there is little prior research work conducted by far on product named 0 This work was supported by the Natural Sciences Foundation of China(60372016,60272041) and the Natural Science Foundation of Beijing(4052027). 40 System [Zhang et al. 2003] [Sun et al. 2002] [Tsai et al. 2004] Statistical Model HMM class-based LM ME model Linguistic Feature semantic role, tokens word form, NE category tokens Combinative Points pattern rules cue words list knowledge representation Table 1: Comparison between several Chinese NER systems1 list and HMM, and promising experiment results (F-measure: 69.8%) on product NE (corresponding to our PRO) were obtained. Its main advantage lies in that manual annotation of a sizable training corpus can be avoided, but it suffers from two problems, one is that it is difcult to nd sufcient concept-based seeds needed in bootstrapping for the coverage of the variations of PRO subcategories, another it is highly dependent on parser performance as well. Research on product NER is still at its early stage, especially in Chinese free text collections. However, considerable amount of work has been done in the last decade on the general NER task and biological NER task. The typical machine learning approaches for English NE are transformation-based learning[Aberdeen et al. 1995], hidden Markov model[Bikel et al. 1997], maximum entropy model[Borthwick, 1999], support vector machine learning[Eunji Yi et al. 2004], unsupervised model[Collins et al. 1999]and etc. For Chinese NER, the prevailing methodology applied recently also lie in machine learning combining other knowledge base or heuristic rules, which can be compared on the whole in three aspects showed in Table 1. In short, the trend in NER is to adopt a statistical framework which try to exploit some knowledge base as well as different level of text features within and outside NEs. Further those ideas, we present a hybrid approach based on HHMM [S. Fine et al. 1998] which will be described in detail. Brand Name(BRA), Product Type(TYP), Product Name(PRO), and BRA and TYP are often embedded in PRO. In the following two examples, there are two BRA NEs, one TYP NE and one PRO NE all of which belong to the family of product named entities. Exam 1: (Benq)/BRA (brand) (market shares) (steadily) (ascend) Exam 2: (corporation) (will) (deliver) [Canon/BRA 334 (ten thousand) (pixels) (digital) (camera) Pro90IS/TYP]/PRO Brand Name refer to proper name of product trademark such as (Benq) in Exam 1. Product Type is a kind of product named entities indicating version or series information of product, which can consist of numbers, English characters, or other symbols such as + and etc.In our study, two principles should be followed. (1) Chinese characters are not considered to be TYP, nor subpart of TYP although some of them can contain version or series information. For instance, in 2005 (happy new year) (version) (cell phone), here (happy new year) (version)should not be considered as a TYP. (2) Numbers are essential elements in product type entity. For instance, in PowerShot (series) (digital) (camera), PowerShot is not considered as a TYP, however, in PowerShot S10 (digital) (camera), PowerShot S10 can make up of a TYP. Product Name, as showed above in Exam 2, is a kind of product named entities expressing selfcontained proper name for some specied product in real world compared to BRA and TYP which only express one attribute of product. i.e. a PRO NE must be assigned with distinctly discriminative information which can not shared with other general product-related expressions. 3 Problem Statements and Analysis 3.1 Task Denition 3.1.1 Denition of Product Named Entity In our study, only three kinds of product named entities are considered, namely 1 Note: LM(language model); ME(maximum entropy). 41 (1) Product-related expressions which are embedded with either BRA or TYP can be qualied to be a PRO entity. e.g. BenQ (ash) (disk) is a PRO entity, but the general product-related expression (ash) (market) (investigation) cannot make up of a PRO entity. (2) Product-related expressions indicating some specic version or series information which is unique for a BRA can also be considered as a PRO entity. e.g. DIGITAL IXUS (series) (digital) (camera) is a PRO because DIGITAL IXUS series is unique for Canon product, but (intelligent) (version) (cell phone) is not a PRO because the attribute of intelligent version can be assigned to any cell phone product. 3.1.2 Product Named Entity Recognition Product named entity recognition involves the identication of product-related proper names in free text and their classication into different kinds of product named entities, referring to PRO, TYP and BRA in this paper.In comparison with general NER, nested product NEs should be tagged separately rather than being tagged just as a single item, shown as Figure 1. 3.2 Challenges for Product Named Entity Recognition For general named entities, there are some cues which are very useful for entity recognition, such as (city), (Inc.), and etc. In comparison, product named entities have no such named conventions and cues, resulting in higher boundary ambiguities and more complex NE candidate triggering difculties. In comparison with general NER, more challenges in product NER result from miscellaneous classication ambiguities. Many entities with identical form can be a kind of general named entity, a kind of product named entity, or just common words. In comparison with general named entities, product named entities show more exible variant forms. The same entity can be expressed in several different forms due to spelling variation, word permutation and etc. This also compounds the difculties in product named entity recognition. In comparison with general named entities, it is more frequent that product named entities are nested as Figure 1 illustrates. More efforts have to be made to identify such named entities separately. 3.3 Our Solutions We adopt the following strategies in triggering and disambiguating process respectively. (1) As to product NER, its pivotal to control the triggering candidates efciently for the balance between precision and recall. Here we use the knowledge base such as brand word list, and other heuristic information which can be easily acquired. (2)After triggering candidates, we try to employ a statistical model to make the most of multi-level context information mentioned above in disambiguation. We choose hierarchical hidden Markov model (HHMM) [S. Fine et al. 1998] for its more powerful ability to model the multiplicity of length scales and recursive nature of sequences. 42 4 Hybrid Approach for Product NE Recognition 4.1 Overall Workow of ProNER Preprocessing: Segment, POS tagging and general NER is primarily conducted using our offshelf SegNer2.0 toolkit on input text. Generating Product NE Candidates: First, BRA or ORG and TYP are triggered by brand word list and some word features respectively. Here we categorize the triggering word features into six classes: alphabet string, alphanumeric string, digits, alphabet string with fullwidth, digits with fullwidth and other symbols except Chinese characters. Then PRO are triggered by BRA and TYP candidates as well as some clue words indicating type information to some extent such as (version), (series). In this step the model structure(topology) of HHMM[S. Fine et al. 1998] is dynamically constructed, and some conjunction words or punctuations and specied maximum length of product NE are used to control it. Disambiguating Candidates: In this module, boundary and classication ambiguities between candidates are resolved simultaneously. And Viterbi algorithm is applied for most-likely state sequences based on the HHMM topology. 4.2 Integration with Heuristic Information To get more efcient control in triggering process above, we try to integrate some heuristic information. The heuristic rules we used are as domainindependent as possible in order that they can be integrated with statistical model systematically rather than just some tricks on it. (1) Stop Word List: Common English words, English brand word, and some punctuations are extracted automatically from training set to make up of stop word list for TYP; by co-occurrence statistics between ORG and its contexts, some words are extracted from the contexts to make up of stop word list for PRO in order to overcome the case that brand word is prone to bind its surroundings to be a PRO. (2) Constrain Rules: Rule 1: For the highly frequent pattern + (number + English quantier 0.7 IS0 0.3 0.2 0.5 0.7 0.3 0.5 0.3 ES IS1 0.2 0.3 IS2 PS5 ES PS1 PS2 ES PS3 PS4 Figure 2 Structure of Hierarchical Hidden Markov Model (HHMM) word), all the corresponding TYP candidates triggered by categorized word features(CWF) should be removed. Rule 2: Product NE candidates in which some binate symbols dont match each other should be removed. Rule 3: Unreasonable symbols such as - or : should not occur in the beginning or end of product NE candidates. 4.3 HHMM for product NER application By HHMM [S. Fine et al. 1998] the product NER can be formulated as a tagging problem using Viterbi algorithm. Unlike traditional HMM in POS tagging, here the topology of HHMM is not xed and internal states can be also a similar stochastic model on themselves, called internal states compared to production states which will emit only observations. Our HHMM structure actually consists of three level approximately illustrated as gure 2 in which IS denotes internal state, PS denotes production state and ES denote end state at every level. For our application, an input sequence from our SegNer2.0 toolkit can be formalized as w1 /t1 w2 /t2 . . . wi /ti . . . wn /tn , among which wi and ti is the ith word and its part-ofspeech, n is the number of words. The POS tag set here is the combination of tag set from Peking University(PKU-POS) and our general NE categories(GNEC) including PER(person), LOC(location), ORG(organization), TIM(time expression), NUM(numeric expression). Therefore we can construct our HHMM by model the state set {S} consisting of {GNEC}, {BRA, PRO, TYP}, and {V} as well as the observation set {O} consisting of {V} which is the word set from training data. That is to say, the word forms 43 in {V} which are not included in NEs are also viewed as production states. In our model, only PRO are internal state which may activate other production states such as BRA and TYP resulting in recursive HMM. In consisd tence with S. Fines work, qi (1 d D) is used to indicate the ith state in the dth level of hierarchy. So, the product NER problem is to nd the most-likely state activation sequence Q*, a multiscale list of states, based on the dynamic topology of HHMM given a observation sequence W = w1 w2 . . . wi . . . wn , formulated as follows based on Bayes rule (P (W )=1). arriving at an end state, thus hierarchical computation is implemented; k (3) If qj = BRA, we assign equation (3) a constant value in that BRA candidates consist of only a single brand word in our method. In addition brand word can also generate ORG candidates, thus we can assign equation (3) as follows. k p([wqk begin ...wqk end ]|qj = BRA) = 0.5 (4) j j Q= arg max P (Q|W )= arg max P (Q)P (W |Q) Q Q (1) From the root node of HHMM, activity ows to all other nodes at different levels according to their transition probability. For description convenience, we take the kth level as example(activated by the mth state at the k-1th level). horizontal transition |q k | k k1 k k P (Q) p(q1 |qm ) p(q2 |q1 ) = vertical transition k k k p(qj |qj1 , qj2 ) j=3 k (4) If qj = T Y P , categorized word features(CWFs) dened in section 4.1 are applied, i.e. the words associated with the current state are replaced with their CWFs (WC) acting as observations. Then we can compute the emission probability of this TYP production state as the followk ing equation, among which |qj | is the length of observation sequence associated with the current state. k p([wqk begin ...wqk end ]|qj = T Y P ) j j k =p(wc1 |begin)p(end|wc|qj | ) k |qj | p(wcm |wcm1 ) m=2 (2) = k |qP S | All the parameters in every level of HHMM can be acquired using maximum likelihood method with smoothing from training data. 4.4 Mixture of Two Hierarchical Hidden Markov Models Now we have implemented a simple HHMM for product NER. Note that in the above model(HHMM-1), we exploit both internal and external features of product NEs only at levels of simply semantic classication and just word form. To achieve our motivation in section 3.3, we construct another HHMM(HHMM2) for exploiting multi-level contexts by mixing with HHMM-1. In HHMM-2, the difference from HHMM-1 lies in the state set SII and observation set OII . Because the input text will be processed by segment, POS tagging and general NER, as a alternative, we can also take T=t1 t2 . . . ti . . . tn as observation sequence, i.e. OII ={PKU-POS}. Accordingly, SII = {{PKU-POS}, {GNEC}, BRA, TYP, j=1 k p([wqk begin ...wqk end ]|qj ) j j P (W |Q)= k / if qj {IS} activate other states recursively k if qj {IS} (3) Where and is the number of production states in the kth level; wqk begin ...wqk end indicates the word sequence j j |q k | is the number of all states k |qP S | k corresponding to the state qj . k (1) In equation (3), if qj {{GNEC},{V}}, k )=1, because we asp([wqk begin ...wqk end ]|qj j j sume that the general NER results from the preceding toolkit are correct; k (2) If qj = P RO, production states in the (k+1)th level will be activated by this internal state through equation (2),(3) and go back when 44 Data Sets DataSetPRO1.2 OpenTestSet CloseTestSet PRO 12,432 1800 1553 BRA 5,047 803 513 TYP 10,606 1364 1296 PER 424 39 55 LOC 1,733 207 248 ORG 4,798 614 619 Table 2: Overview of Data Sets PRO}, among which PRO is internal state. Similarly, the problem is formulated as follows with HHMM-2. Q = arg max P (QII |T ) II QII 5 Experiments and analysis 5.1 Data Set Preparation A large number of web pages in mobile phone and digital domain are compiled into text collections, DataSetPRO, on which multi-level processing were performed. Our nal version, DataSetPRO1.2, consists of 1500 web pages, roughly 1,000,000 Chinese characters. Randomly selected 140 texts (digital 70, mobile phone 70) are separated from DataSetPRO1.2 as our OpenTestSet, the rest as TrainingSet, from which 160 texts are extracted as CloseTestSet. Table 2 illustrates the overview of them. 5.2 Experiments Due to various and exible forms of product NEs, though some boundaries of recognized NEs are inconsistent with manual annotation, they are also reasonable. So soft evaluation is also applied in our experiments to make the evaluation more reasonable. The main idea is that a discount score will be given to recognized NEs with wrong boundary but correct detection and classication. However, strict evaluation only score completely correct ones. All the results is conducted on OpenTestSet unless it is particularly specied. Also, the evaluation scores used below are obtained mainly by = arg max P (QII )P (T |QII ) QII (5) The description and computation of HHMM-2 is similar to HHMM-1 and is omitted here. We can see that besides making use of semantic classication of NEs in common, HHMM-1 and HHMM-2 exploit word form and part-of-speech (POS) features respectively. Word form features make the model more discriminative, while POS features result in robustness. Intuitively, the mixture of these two models is desirable for higher performance in product NER by balancing the robustness and discrimination which can be formulated in logarithmic form as follows. (Q , Q ) II = arg max{log(P (Q)) + log(P (W |Q)) Q,QII + [log(P (QII )) + log(P (T |QII ))]} (6) Where is a tuning parameter for adjusting the weight of two models. 45 Product NEs PRO TYP BRA Product NEs PRO TYP BRA Digital Domain ( 8) Close Test Open Test Precision Recall F-measure Precision 0.864 0.799 0.830 0.762 0.903 0.906 0.905 0.828 0.824 0.702 0.758 0.723 Mobile Phone Domain ( 8) Close Test Open Test Precision Recall F-measure Precision 0.917 0.935 0.926 0.799 0.959 0.976 0.967 0.842 0.911 0.741 0.818 0.893 Recall 0.744 0.944 0.705 F-measure 0.753 0.882 0.714 Recall 0.856 0.886 0.701 F-measure 0.827 0.864 0.785 Table 3: Experimental Results in Digital and Mobile Phone Domain soft metrics, and strict scores are also given for comparison in experiment 3. 1. Evaluation on the Inuence of in the Mixture Model. In the mixture model denoted as equation (6), the value reects the different contribution of two individual models to the overall system performance. The larger , the more contribution made by HHMM-2. Figure 3, 4, 5 illustrate the varying curves of recognition performance with the value on PRO, TYP, BRA respectively. Note that, if equal to 1 then two models are mixed with equivalent weight. We can see that, as goes up, the F-measures of PRO and TYP increase obviously rstly, and begin to go down slightly after a period of growing at. It can be explained that HHMM-2 mainly exploits part-of-speech and general NER features which can relieve the sparseness problem to some extent, which is more serious in HHMM-1 due to using lower level of contextual information such as word form. However, as becomes larger, the problem of imprecise modeling in HHMM2 will be more salient and begin to illustrate a side-effect in the mixture model. Whereas, the inuence of on BRA is negligible because its candidates are triggered by the relatively reliable knowledge base and its sub-model in HHMM is assigned a constant as shown in equation(4). Summings-up: (1) Mixture with HHMM-2 can make up the weakness of HHMM-1. (2) HHMM-2 can make more contributions to the mixture model under the conditions that limited annotated data is available at present. In our system, is assigned t...

Find millions of documents on Course Hero - Study Guides, Lecture Notes, Reference Materials, Practice Exams and more. Course Hero has millions of course specific materials providing students with the best way to expand their education.

Below is a small sample set of documents:

UPenn - H - 05
OPINE: Extracting Product Features and Opinions from ReviewsAna-Maria Popescu Bao Nguyen Oren Etzioni Department of Computer Science and Engineering University of Washington Seattle, WA 98195-2350 {amp,omicron,etzioni}@cs.washington.eduAbstractCo
UPenn - P - 86
BRINGINGNATURALLANGUAGEPROCESSINGTO T H E M I C R O C O M P U T E RMARKETTHES T O R Y OF Q & AGary G. Hendrix Symantec Corporation 10201 Torre Avenue Cupertino, CA 95014OVERVIEWThis is the story of how one of the new natural langu
UPenn - H - 05
Extracting Product Features and Opinions from ReviewsAna-Maria Popescu and Oren Etzioni Department of Computer Science and Engineering University of Washington Seattle, WA 98195-2350 {amp, etzioni}@cs.washington.eduAbstractConsumers are often for
UPenn - N - 07
OMS-J: An Opinion Mining System for Japanese Weblog Reviews Using a Combination of Supervised and Unsupervised ApproachesGuangwei WangGraduate School of Information Science and Technology Hokkaido University Sapporo, Japan 060-0814 wgw@media.eng.ho
UPenn - C - 90
Automatic Indexing and Government-Binding TheoryRobert J. Kuhns 205 Walnut Street Brookline, M A 02146 USAABSTRACTThis project note describes a systern that receives, parses, indexes, and routes news reports. The core of this ,'mtomatic indexer
UPenn - X - 96
A N O V E R V I E W OF THE P R O T O T Y P E I N F O R M A T I O N DISSEMINATION SYSTEM (PRIDES)Virginia Cevasco Logicon, Operating Systems Division 2100 Washington Blvd. Arlington, VA 22204 gcevasco@logicon.com (703) 486-3500INTRODUCTIONThe Prot
UPenn - P - 81
J. Norwood Crout Artificial Intelligence Corporation The INTELLECT n a t u r a l language database query system, a product of Artificial Intelligence Corporation, is the only commercially available system with true English query capability. Based on
UPenn - P - 80
FUTURE PROSPECTS FOR COMPUTATIONAL LINGUISTICS Gary G. H e n d r i x SRI International Preparation of this paper was supported by the Defense Advance Research Projects Agency The views under contract N00039-79-C-0118 with the Naval Electronic Systems
UPenn - X - 93
DICTIONARY CONSTRUCTION BY DOMAIN EXPERTSEllen Riloff and Wendy G. LehnertDepartment of Computer Science University of Massachusetts Amherst MA 01003 Sites participating in the recent message understanding conferences have increasingly focused thei
UPenn - C - 86
Q&A: A l r e a d y a S u c c e s s ?Gary G. Hendrix Symantec Corporation Cupertino, CA 95014 USA When Prof. Wolfgang Wahlster (the organizer of this COLING-86 panel on "Natural Language Interfaces: Ready for Commercial Success?") sent out invitatio
UPenn - H - 91
Bayesian Learning of Gaussian M i x t u r e Densities for Hidden Markov ModelsJean-Luc Gauvain I and Chin-Hui LeeSpeech Research Department AT&T Bell Laboratories Murray Hill, NJ 07974 ABSTRACTAn investigation into the use of Bayesian learning of
UPenn - N - 06
From Pipedreams to Products, and Promise!Janet M. BakerSaras Institute / Dibner Institute MIT Bldg E56-100 38 Memorial Drive Cambridge, MA 02139 USA HistSpch@mit.eduPatri J. PuglieseSaras Institute / Dibner Institute MIT Bldg E56-100 38 Memori
UPenn - E - 89
Fourth Conferenceof the European Chapterof theAssociation for Computational LinguisticsProceedings of the Conference10- 12 April 1989 University of Manchester Institute of Science and Technology Manchester, EnglandPublished by the Associat
UPenn - J - 96
Book ReviewsJapanese Sentence Processing Reiko Mazuka and Noriko Nagai (editors)(Duke University) Hillsdale, NJ: Lawrence Erlbaum Associates, 1995, x+360 pp; hardbound, ISBN 0-8058-1125-7, $89.95Reviewed by Patrick Sturt University of Edinburgh
UPenn - J - 96
Computational LinguisticsVolume 22, Number 2Representing Time in Natural Language: The Dynamic Interpretation of Tense and AspectAlice G. B. ter M e u l e n(Indiana University) Cambridge, MA: The MIT Press, 1995, xii+144 pp; hardbound, ISBN 0-
UPenn - E - 87
Third Conference of t h e European Chapter of t h e A s s o c i a t i o n for C o m p u t a t i o n a l LinguisticsP r o c e e d i n g s of t h e C o n f e r e n c e1-3 A p r i l 1987 U n i v e r s i t y of C o p e n h a g e n Copenhagen, Denmark
UPenn - J - 95
Briefly NotedParallel natural language processing Geert Adriaens and Udo Hahn (editors)(University of Leuren and University of Freiburg) Norwood, NJ: Ablex Publishing Company, 1994, vi + 467 pp. Hardbound, ISBN 0-89391-869-5, $79.50 ($37.50 prepaid
UPenn - J - 96
Book ReviewsNatural Language Processing for Prolog Programmers Michael A. Covington(University of Georgia) Englewood Cliffs, NJ: Prentice-Hall, 1994, xvi+348 pp; hardbound, ISBN 0-13-629213-5, no price listedReviewed by Ken Barkerand Stan Szpako
UPenn - J - 01
Book ReviewsLexicon Development for Speech and Language Processing Frank Van Eynde and Dafydd Gibbon (editors)(University of Leuven and University of Bielefeld) Dordrecht: Kluwer Academic Publishers (Text, speech and language technology series, ed
UPenn - J - 89
Book ReviewsBriefly Notedfully, and with a pedagogically attractive style, it discusses topics related to expert systems (e.g., representation, rules, shells, explanation, and confidence factors) as well as topics related to natural language proc
UPenn - J - 89
Book ReviewsText Coherence in TranslationTEXT COHERENCE IN TRANSLATION Bart Papegaaij and Klaus Schubert (BSO/Research, Utrecht)Dordrecht: Foils Publications, 1988, 211 pp. (Distributed language translation 3) ISBN 90-6765-360-8, Dfl 110.- (hb)
UPenn - J - 92
Book ReviewsCognitive M o d e l s of Speech Processing Gerry T. M. Altmann (editor)(University of Sussex) Cambridge, MA: The MIT Press (The ACL-MIT Press Series in Natural Language Processing), 1990, x + 540 pp. Hardbound, ISBN 0-262-01117-4, $55.0
UPenn - J - 88
Book ReviewsMachine Translation: Past, Present, Futurepresented as the only significant computational contributions. There is no discussion of processing as it is defined computationally. There is no mention of processing concerns that require in
UPenn - J - 81
Book ReviewsNatural Language Processing Harry Tennant Petrocelli Books, Princeton, 1981, 276 pp., Paperback, $17.50, ISBN 089433-100-0. (Dist: Van Nostrand-Reinhold, New York)How do computers understand natural language? This question is asked by
UPenn - J - 92
Computational LinguisticsVolume 18, Number 3Adaptive Parsing: Self-Extending Natural Language Interfaces Jill Fain Lehman(Carnegie Mellon University) Boston: Kluwer Academic Publishers (The Kluwer International Series in Engineering and Computer
UPenn - E - 83
AssooJatil~ Computatlona in uisticsProceeding:s of the ConferenceFirst Conference of the European Chapter of the for1-2 September 1983 Pisa, ItalyPublished by the Association for Computational LinKuistics1983, Association [or Computational L
UPenn - E - 93
Sixth Conference of the European Chapter of the Association for Computational LinguisticsProceedings of the Conference9~~21 - 23 April 1993 OTS - ResearchInstitute for Languageand Speech Utrecht University Utrecht, The NetherlandsPublishe
UPenn - E - 99
Ninth Conference of the European C h a p t e r of the Association for C o m p u t a t i o n a l Linguistics8-12 June 1999 University of Bergen Bergen, NorwayPublished by the Association for Computational LinguisticsThe conference was sponsored
UPenn - J - 00
Book ReviewsSpeech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition Daniel Jurafskyand James H. Martin(University of Colorado, Boulder) Upper Saddle River, NJ: Prentice Hall
UPenn - J - 05
Book ReviewsSpoken Dialogue Technology: Toward the Conversational User InterfaceMichael F. McTear (University of Ulster) London: Springer-Verlag, 2004, x+432 pp; paperbound, ISBN 1-85233-672-2, $59.95 Reviewed by Johannes Pittermann University of
UPenn - J - 96
Computational LinguisticsVolume 22, Number 3D e i x i s in Narrative: A C o g n i t i v e S c i e n c e Perspective Judith F. Duchan, Gail A. Bruder, and Lynne E. Hewitt (editors)(State University of New York at Buffalo) Hillsdale, NJ: Lawrence
UPenn - J - 98
Computational LinguisticsVolume 24, Number 4Multilingual Text-to-Speech Synthesis: The Bell Labs Approach Richard Sproat(editor)(Bell Laboratories, Lucent Technologies) Dordrecht: Kluwer Academic Publishers, 1998, xxvi+300 pp; hardbound, ISBN 0-
UPenn - J - 86
Book ReviewsConceptual Structuresinferences are generated, how many inferences are generated, and what knowledge sources contribute to the generation of inferences. In their book Structures and Procedures of Implicit Knowledge, Graesser and Clark
UPenn - J - 95
Briefly NotedVerbmobih A Translation System for Face-to-Face Dialog Martin Kay, Jean Mark Gawron, and Peter Norvig(Xerox Palo Alto Research Center, SRI International, and Sun Microsystems Laboratories) Stanford: Center for the Study of Language and
UPenn - J - 99
Book ReviewsType-Logical SemanticsBob Carpenter(Lucent Technologies) Cambridge, MA: The MIT Press (Language, speech, and communication series), 1997, xxi+575 pp; hardbound, ISBN 0-262-03248-1, $60.00; paperbound, ISBN 0-262-53149-6, $35.00Revie
UPenn - J - 00
Book ReviewsLocal Constraints vs. E c o n o m y David E. Johnson and Shalom Lappin (IBM Thomas J. Watson Research Center and King's College, University of London)Stanford, CA: CSLI Publications (Stanford monographs in linguistics), 1999, x+150 pp;
UPenn - J - 91
The Logical Structure of English: Computing Semantic Content Allan Ramsay(University College Dublin) London: Pitman, 1990, iv + 209 pp. Paperbound ISBN 0-273-03287-9, 19.99Reviewed by Martin Volk University of Koblenz-LandauH o w do you judge whe
UPenn - J - 04
Book ReviewHandbook for Language EngineersAli Farghaly (editor) (SYSTRAN Software Corporation) Stanford, CA: CSLI Publications (CSLI lecture notes, number 164) (distributed by the University of Chicago Press), 2003, xi+442 pp; hardbound, ISBN 1-575
UPenn - N - 04
HLT-NAACL 2004Human Language Technology Conference of the North American Chapter of the Association for Computational LinguisticsProceedings of the Main ConferenceMay 2-7, 2004 Boston, Massachusetts, USAISBN 1-932432-23-X900009 781932 432
UPenn - J - 90
Book ReviewsComputationalLinguistics: An IntroductionG6ran Maimgren describes regularities in polysemy: types of metaphoric transfer of meaning in nouns, regular extensions in verb meanings, and changes in adjective meanings as the argument chang
UPenn - J - 98
Book ReviewsCorpus-Based Methods in Language and Speech Processing Steve Young and Gerrit Bloothooft (editors)(Cambridge University and Utrecht University) Dordrecht: Kluwer Academic Publishers (Text, Speech and Language Technology series, edited b
UPenn - J - 01
Computational LinguisticsVolume 27, Number 2Learnability in Optimality Theory Bruce Tesar and Paul Smolensky(Rutgers University and The Johns Hopkins University) Cambridge, MA: The MIT Press, 2000, vii+140 pp; hardbound, ISBN 0-262-20126-7, $25.
UPenn - J - 93
Book ReviewsA n I n t r o d u c t i o n to M a c h i n e Translation W. John Hutchins and Harold L. Somers (University of East Anglia and University of Manchester Institute of Science and Technology)London: Academic Press, 1992, xxi + 362 pp. Hardb
UPenn - CIS - 610
Chapter 8 A Detour On Fractals8.1 Iterated Function Systems and FractalsA pleasant application of the Hausdor distance and of the xed point theorem for contracting mappings is a method for dening a class of self-similar fractals. For this, we can
UPenn - CIS - 610
Chapter 5 Lie Groups, Lie Algebras and the Exponential Map5.1 Lie Groups and Lie AlgebrasIn Chapter 2, we dened the notion of a Lie group as a certain type of manifold embedded in RN , for some N 1. Now that we have the general concept of a manif
UPenn - CIS - 610
Chapter 7 Geodesics on Riemannian Manifolds7.1 Geodesics, Local Existence and UniquenessIf (M, g) is a Riemannian manifold, then the concept of length makes sense for any piecewise smooth (in fact, C 1) curve on M . Then, it possible to dene the s
UPenn - CIS - 610
Chapter 6 The Classication Theorem for Compact Surfaces6.1 Cell ComplexesIt is remarkable that the compact (two-dimensional) polyhedras can be characterized up to homeomorphism. This situation is exceptional, as such a result is known to be essent
UPenn - CIS - 610
Chapter 4 The Fundamental Group, Orientability4.1 The Fundamental GroupIf we want to somehow classify surfaces, we have to deal with the issue of deciding when we consider two surfaces to be equivalent. It seems reasonable to treat homeomorphic su
UPenn - CIS - 610
Chapter 4 Polyhedra and Polytopes4.1 Polyhedra, H-Polytopes and V-PolytopesThere are two natural ways to dene a convex polyhedron, A: (1) As the convex hull of a nite set of points. (2) As a subset of En cut out by a nite number of hyperplanes, mo
UPenn - CIS - 511
Chapter 8 Phrase-Structure Grammars and Context-Sensitive Grammars8.1 Phrase-Structure GrammarsContext-free grammars can be generalized in various ways. The most general grammars generate exactly the recursively enumerable languages. Between the c
UPenn - CIS - 610
Chapter 4 Basics of Classical Lie Groups: The Exponential Map, Lie Groups, and Lie AlgebrasLe rle prpondrant de la thorie des groupes en mathmatiques a t longtemps o e e e e ee insouponn; il y a quatre-vingts ans, le nom mme de groupe tait ignor. Ce
UPenn - CIS - 511
Chapter 6 Elementary Recursive Function Theory6.1 Acceptable IndexingsIn a previous Section, we have exhibited a specic indexing of the partial recursive functions by encoding the RAM programs. Using this indexing, we showed the existence of a uni
UPenn - CIS - 610
Chapter 4 Manifolds, Tangent Spaces, Cotangent Spaces, Vector Fields, Flow, Integral Curves4.1 ManifoldsIn Chapter 2 we dened the notion of a manifold embedded in some ambient space, RN . In order to maximize the range of applications of the theor
UPenn - CIS - 610
Chapter 8 The Log-Euclidean Framework Applied to SPD Matrices and Polyane Transformations8.1 IntroductionIn this Chapter, we use what we have learned in previous chapters to describe an approach due to Arsigny, Fillard, Pennec and Ayache to dene a
UPenn - CIS - 610
Chapter 6 Riemannian Manifolds and Connections6.1 Riemannian MetricsFortunately, the rich theory of vector spaces endowed with a Euclidean inner product can, to a great extent, be lifted to various bundles associated with a manifold. The notion of
UPenn - CIS - 610
Chapter 4 Partial Orders, Lattices, Well Founded Orderings, Equivalence Relations, Distributive Lattices, Boolean Algebras, Heyting Algebras4.1 Partial OrdersThere are two main kinds of relations that play a very important role in mathematics and
UPenn - CIS - 610
Chapter 2 Relations, Functions, Partial Functions2.1 What is a Function?We use functions all the time in Mathematics and in Computer Science. But, what exactly is a function? Roughly speaking, a function, f , is a rule or mechanism, which takes in
UPenn - STAT - 102
Homework 1Spring 2007(HW for Sections 2 & 3 (Zhao) is due in class on Jan. 16th and for Section1 is due in class on Jan. 17th.)Read: Chapter 2: Sections 2.1 through 2.7 should be review. Sections 2.8 & 2.9 may be newWritten HW: Problems 2.25,
UPenn - STAT - 102
Statistics 102Lecture 2L. Brown & L. ZhaoSpring 2007Tests and Confidence Intervals for Two MeansRead: Sections 2.7 and 2.8 of Dielman Do advertisements help to increase store sales? Data from two independent samples Analysis assuming equal
UPenn - STAT - 102
Department of Statistics The Wharton School University of Pennsylvania Statistics 102L. Brown & L. ZhaoSpring 2007Administrative IssuesWeb site www-stat.wharton.upenn.edu/~stat102 TEXT: Dielman, T. Applied Regression Analysis Fourth Edition,
UPenn - STAT - 011