7 Pages

POS-info

Course: LDC 2003, Fall 2008
School: UPenn
Rating:
 
 
 
 
 

Word Count: 1712

Document Preview

PART-OF-SPEECH/MORPHOLOGICAL ARABIC ANALYSIS TAGGING The Penn Arabic Treebank uses a level of annotation more accurately described as morphological analysis than as part-of-speech tagging. In October 2001, the decision was taken to use Tim Buckwalter's morphological analyzer and main lexicon, which currently contains over 77,800 stem entries representing some 45,000 lexical items. A DESCRIPTION OF TIM...

Register Now

Unformatted Document Excerpt

Coursehero >> Pennsylvania >> UPenn >> LDC 2003

Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.

Course Hero has millions of student submitted documents similar to the one below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
PART-OF-SPEECH/MORPHOLOGICAL ARABIC ANALYSIS TAGGING The Penn Arabic Treebank uses a level of annotation more accurately described as morphological analysis than as part-of-speech tagging. In October 2001, the decision was taken to use Tim Buckwalter's morphological analyzer and main lexicon, which currently contains over 77,800 stem entries representing some 45,000 lexical items. A DESCRIPTION OF TIM BUCKWALTER'S ARABIC MORPHOLOGICAL ANALYSIS TOOL The Arabic morphological analysis and part-of-speech tagging was performed with the Buckwalter Arabic Morphological Analyzer, an open-source software package distributed by the LDC. The source code of the program and a full technical description can be downloaded for free from the LDC website: http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2002L49. What follows is a brief description of the Arabic morphology analysis algorithm and the structure of lexicon entries. The Arabic morphology analysis is based on these assumptions: 1.Arabic words are composed of three elements: prefix, stem, and suffix 2.Prefix length is 0-4 characters 3.Stem length is 1-infinite characters 4.Suffix length is 0-6 characters Given these rules, an Arabic word can be segmented as follows (using wbAlErbyp as an example): PrefixStemSuffix wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbALErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp wbAlErbyp Arabic dictionary look-up consists of asking, for each segmentation: 1. does the prefix exist in the lexicon of prefixes? 2. if so, does the stem exist in the lexicon of stem? 3. if so, does the suffix exist in the lexicon of suffixes? Note that the dictionary of prefixes contains not only the individual prefixes (wa-, fa-, li-, Al-, bi-, etc.) but all valid concatenations of these as well (waAl-, biAl-, wabiAl-, etc). The same applies to the dictionary of suffixes: (-ap, -At, -Ani, -athu, -Athum, -Anihi, -tumuwhA, etc). Here are some sample entries from the dictionary of prefixes: wlwaliNPref-Liand + for/to <pos>wa/CONJ+li/PREP+</pos> lllilNPref-Lito/for + the <pos>li/PREP+Al/DET+</pos> wllwalilNPref-Liland + to/for + the <pos>wa/CONJ+li/PREP+Al/DET+</pos> wbAlwabiAlNPref-BiAland + with/by the <pos>wa/CONJ+bi/PREP+Al/DET+</pos> The first column contains the actual string that we look up, whereas the second column has the vocalized version of the same string. The third column has the morphological category (whose function is explained further below). The fourth column has the corresponding English glosses and contains part-of-speech information for the constituent morphemes. Here are some sample entries from the dictionary of stems (lines beginning with ";; " contain the lemma ID string): ;; Earabiy~_1 ErbyEarabiy~N/apArab <pos>Earabiy~/NOUN</pos> ErbEarabNArabs <pos>Earab/NOUN</pos> ErbyEarabiy~N/apArab <pos>Earabiy~/ADJ</pos> ErbEarabNArab <pos>Earab/ADJ</pos> ;; Earabiy~_2 ErbyEarabiy~N-apArabic;Arab <pos>Earabiy~/ADJ</pos> ;; Earabiy~_3 ErbyEarabiy~N0Arabi <pos>Earabiy~/NOUN_PROP</pos> ;; Earabiy~ap_1 ErbyEarabiy~NapAtArabic (language) <pos>Earabiy~/NOUN</pos> The following are sample entries from the dictionary of suffixes: papNSuff-ap[fem.sg.] <pos>+ap/NSUFF_FEM_SG</pos> AkAkaNSuff-Ahyour two <pos>+A/NSUFF_MASC_DU_NOM+ka/POSS_PRON_2MS</pos> AkAkiNSuff-Ahyour two <pos>+A/NSUFF_MASC_DU_NOM+ki/POSS_PRON_2FS</pos> If all three word elements (prefix, stem, suffix) are found in their respective lexicons, we then use their respective morphological categories (the string in column 3) to determine whether they are compatible. We ask: 1. is the morphological category of the prefix compatible with the morphological category of the stem? (i.e., is the combination found in the list of compatible prefix-stem morphological categories?) 2. if so, is the morphological category of the prefix compatible with the morphological category of the suffix? (i.e., is the combination found in the list of compatible prefix-suffix morphological categories?) 3. if so, is the morphological category of the stem compatible with the morphological category of the suffix? (i.e., is the combination found in the list of compatible stem-suffix morphological categories?) If the answer to the last question is "yes" then the morphological analysis is valid. Example: INPUT STRING: ???? LOOK-UP WORD: wSfh SOLUTION 1: (waSafahu) [waSaf-i_1] waSaf/VERB_PERFECT+a/PVSUFF_SUBJ:3MS+hu/PVSUFF_DO:3MS (GLOSS): + describe/characterize + he/it <verb> it/him SOLUTION 2: (waSafahu) [waSaf-i_1] waSaf/VERB_PERFECT+a/PVSUFF_SUBJ:3MS+hu/PVSUFF_DO:3MS (GLOSS): + prescribe/give a prescription to + he/it <verb> it/him SOLUTION 3: (waSofh) [waSof_1] waSof/NOUN+hu/POSS_PRON_3MS (GLOSS): + description/portrayal/characterization + its/his SOLUTION 4: (waSofh) [waSof_2] waSof/NOUN+hu/POSS_PRON_3MS (GLOSS): + characteristic + its/his SOLUTION 5: (waSaf~ahu) [Saf~-u_1] wa/CONJ+Saf~/VERB_PERFECT+a/PVSUFF_SUBJ:3MS+hu/PVSUFF_DO:3MS and (GLOSS): + arrange/classify + he/it <verb> it/him SOLUTION 6: (waSaf~h) [Saf~_1] wa/CONJ+Saf~/NOUN+hu/POSS_PRON_3MS (GLOSS): and + line/row/class + its/his Solution #1 was found to be valid because: 1. All 3 components(null)+wSf+h exist in their respective lexicons (note that there is a literal entry for the null prefix): (null)(null)Pref-0(null) wSfwaSafPVdescribe;characterize hahuPVSuff-ahhe/it <verb> it/him <pos>+a/PVSUFF_SUBJ:3MS+hu/PVSUFF_DO:3MS</pos> 2. The morphological categories of all 3 components are listed as compatible pairs in the relevant compatibility tables: 1."Pref-0 PV" (listed in the table of compatible prefix-stem morphological categories) 2."PV PVSuff-ah" (listed in the table of compatible stem-suffix morphological categories) 3."Pref-0 PVSuff-ah" (listed in the table of compatible prefix-suffix morphological categories) Solution #6 was found to be valid because: 1. All 3 components w+Sf+h exist in their respective lexicons: wwaPref-Waand <pos>wa/CONJ+</pos> SfSaf~Nduline;row;class HhNSuff-hits/his <pos>+hu/POSS_PRON_3MS</pos> 2. The morphological categories of all 3 components are listed as compatible pairs in the relevant compatibility tables: 1."Pref-Wa Ndu" (listed in the table of compatible prefix-stem morphological categories) 2."Ndu NSuff-h" (listed in the table of compatible stem-suffix morphological categories) 3."Pref-Wa NSuff-h" (listed in the table of compatible prefix-suffix morphological categories) The lexicon of stems used in the morphology analysis contains 83,811 entries and 39,321 lemmas (as of Dec. 20, 2002). AFP ARABIC POS TAGS ABBREV ADJ ADV CONJ DEM_PRON_F DEM_PRON_FD DEM_PRON_FS DEM_PRON_MD DEM_PRON_MP DEM_PRON_MS DET EMPHATIC_PARTICLE EXCEPT_PART FUNC_WORD FUT INTERJ INTERROG_PART IV1P IV1S IV2D IV2FS IV2MP IV2MS IV3FD IV3FP IV3FS IV3MD IV3MP IV3MS IVSUFF_DO:1P IVSUFF_DO:1S IVSUFF_DO:2MP IVSUFF_DO:2MS IVSUFF_DO:3D IVSUFF_DO:3FS IVSUFF_DO:3MP IVSUFF_DO:3MS IVSUFF_SUBJ:2FS_MOOD:SJ IVSUFF_SUBJ:D_MOOD:I IVSUFF_SUBJ:D_MOOD:SJ IVSUFF_SUBJ:FP IVSUFF_SUBJ:MP_MOOD:I IVSUFF_SUBJ:MP_MOOD:SJ NEG_PART NO_FUNC NON_ALPHABETIC NON_ARABIC NOUN NOUN_PROP NSUFF_FEM_DU_ACCGEN NSUFF_FEM_DU_ACCGEN_POSS NSUFF_FEM_DU_NOM NSUFF_FEM_DU_NOM_POSS NSUFF_FEM_PL NSUFF_FEM_SG NSUFF_MASC_DU_ACCGEN NSUFF_MASC_DU_ACCGEN_POSS NSUFF_MASC_DU_NOM NSUFF_MASC_DU_NOM_POSS NSUFF_MASC_PL_ACCGEN NSUFF_MASC_PL_ACCGEN_POSS NSUFF_MASC_PL_NOM NSUFF_MASC_PL_NOM_POSS NSUFF_MASC_SG_ACC_INDEF NUM NUMERIC_COMMA PART POSS_PRON_1P POSS_PRON_1S POSS_PRON_2FS POSS_PRON_2MP POSS_PRON_2MS POSS_PRON_3D POSS_PRON_3FP POSS_PRON_3FS POSS_PRON_3MP POSS_PRON_3MS PREP PRON_1P PRON_1S PRON_2FS PRON_2MP PRON_2MS PRON_3D PRON_3FP PRON_3FS PRON_3MP PRON_3MS PUNC PVSUFF_DO:1P PVSUFF_DO:1S PVSUFF_DO:3D PVSUFF_DO:3FS PVSUFF_DO:3MP PVSUFF_DO:3MS PVSUFF_SUBJ:1P PVSUFF_SUBJ:1S PVSUFF_SUBJ:2FS PVSUFF_SUBJ:2MP PVSUFF_SUBJ:3FD PVSUFF_SUBJ:3FP PVSUFF_SUBJ:3FS PVSUFF_SUBJ:3MD PVSUFF_SUBJ:3MP PVSUFF_SUBJ:3MS REL_PRON REL_ADV RESULT_CLAUSE_PARTICLE SUBJUNC VERB_IMPERFECT VERB_PERFECT VERB_PASSIVE AFP POS COVERAGE STATISTICS The AFP Corpus contains 140,265 tokens, of which 16,455 are punctuation, numbers, and Latin strings, and 123,810 are Arabic word tokens. Punctuation, Numbers, Latin strings16,455 Arabic Word Tokens123,810 TOTAL140,265 Of the 123,810 Arabic word tokens, 112,215 (90.63%) were provided with an accurate morphological analysis and POS tag, and 11,595 (09.37%) Arabic word tokens were judged to be inaccurate and flagged with a Comment describing the nature of the inaccuracy. Accurately parsed Arabic Word Tokens112,21590.63% Commented Arabic Word Tokens11,59509.37% TOTAL123,810100.00% Of the 11,595 Comments, the most frequently identified problems are the inaccurate parsing of proper names (28.47%) and the improper tagging of adjectives (18.30%). A large group of Comments (29.16%) could not be interpreted automatically (via scripting languages such as Perl) and was classified as Miscellaneous. ARABIC POS QUALITY CONTROL COMPARISON, 6-26-02 Five files with a total of 853 words (and a varying number of POS choices per word) were each tagged independently by five annotators for a quality control comparison of POS annotators. Out of the total of 853 words, 128 show some disagreement. All five annotators agreed on 85% of the words; the pairwise (between 2 annotators) agreement rate is at least 92.2%. There are a total of 82 words where four annotators agreed and only one disagreed. Of those, 55 are cases of "no selection" having been chosen from among the POS choices, due to one annotator's definition of good-enough-match differing from all of the others'. The annotators have since reached agreement on which cases are truly "no selection", and thus the rate of this disagreement should fall markedly in future POS files, raising the rate of overall agreement. In addition, we plan to revise the same five files to create a gold standard, which in the future may be used to evaluate and guide new annotators during their training period. AFP POS ANNOTATORS: Current annotators: Wigdan EL MEKKI Mohamed MANSOUR Zohra BENTAOUIT Rachida FATHALLAH Dalel ZAKHARY Tasneem GHANDOUR Ichraf AMGHOUZ Niama LAADIOUI Past annotators: Fatima EL HIMYANI Alexa FIRAT Sarah TLILI Gordon WITTY
Textbooks related to the document above:
Find millions of documents on Course Hero - Study Guides, Lecture Notes, Reference Materials, Practice Exams and more. Course Hero has millions of course specific materials providing students with the best way to expand their education.

Below is a small sample set of documents:

UPenn - LDC - 96
VOICE ACROSS HISPANIC AMERICA TRANSCRIPTION = Yeshwant Muthusamy, Barb Wheatley and Joseph Picone Personal Systems Laboratory, Texas Instruments INTRODUCTION -This document describes the conventions used to validate and transcribe Spanish speech data
UPenn - LDC - 2004
*BUCKWALTER ARABIC MORPHOLOGICAL ANALYZER VERSION 2.0Portions (c) 2002-2004 QAMUS LLC (www.qamus.org),(c) 2002-2004 Trustees of the University of Pennsylvania**LDC USER AGREEMENTUse of this version of the Buckwalter Arabic Morphological Analy
UPenn - CIT - 591
Abstract Classes and InterfacesApr 10, 2009Abstract methodsYou can declare an object without defining it:Personp; publicabstractvoiddraw(intsize); Notice that the body of the method is missingSimilarly, you can declare a method without
UPenn - CIT - 590
Extreme ProgrammingApr 10, 2009Software engineering methodologiesA methodology is a formalized process or set of practices for creating software An early methodology was the waterfall model, so named because each stage flowed into the next,
UPenn - CIT - 591
Which is better?Which is better? Assume s1 and s2 are Strings:A. if(s1=s2){.} B. if(s1.equals(s2){.}Answer: B s1=s2tests whether s1 and s2 reference the same string; s1.equals(s2) tests whether they reference equal strings Strings1=&quot;ABC&quot;; Str
UPenn - CIT - 591
Objects: Extended ExampleGeneral idea Simulate (model) the following situation: A customer walks into a grocery store, picks up a few items, pays for them, and leaves Lets write a program to do this Limitations: As yet, we have no way of inte
UPenn - CIT - 591
The Rabbit HuntAn example Java programThe user interfaceThe program designThe eight classes RabbitHunt - just gets things started Controller - accepts GUI commands from user View - creates the animated display Model - coordinates all the a
UPenn - CIT - 591
Lunar LanderAn Example of Interacting ClassesApr 10, 2009LunarLanderGameThis class contains the publicstaticvoidmain(String[]args) method. In this method, you should (1) create a LunarLander object, (2) create an IOFrame object, and (3) send
UPenn - CIT - 594
StacksWhat is a stack? A stack is a Last In, First Out (LIFO) data structure Anything added to the stack goes on the top of the stack Anything removed from the stack is taken from the top of the stack Things are removed in the reverse order
UPenn - CIT - 594
RecursionApr 10, 2009Definitions IA recursive definition is a definition in which the thing being defined occurs as part of its own definition Example: An atom is a name or a number A list consists of: An open parenthesis, &quot;(&quot; Zero or m
UPenn - CIT - 597
JSPJava Server PagesReference: http:/www.apl.jhu.edu/~hall/java/Servlet Tutorial/ServletTutorialJSP.htmlApr 10, 2009A Hello World servlet(from the Tomcat installation documentation)publicclassHelloServletextendsHttpServlet{ publicvoiddoGet(H
UPenn - CIT - 597
Regular Expressions in JavaApr 10, 2009Regular Expressions A regular expression is a kind of pattern that can be applied to text (Strings, in Java) A regular expression either matches the text (or part of the text), or it fails to match I
UPenn - CIT - 597
AjaxApr 10, 2009The hypeAjax (sometimes capitalized as AJAX) stands for Asynchronous JavaScript And XML Ajax is a technique for creating better, faster, more responsive web applications Web applications with Ajax are supposed to replac
UPenn - CIT - 591
Enums(and a review of switch statements)Apr 10, 2009Enumerated valuesSometimes you want a variable that can take on only a certain listed (enumerated) set of values Examples: dayOfWeek: SUNDAY, MONDAY, TUESDAY, month: JAN, FEB, MAR,
UPenn - CIT - 591
Additional Java SyntaxApr 10, 2009Odd cornersWe have already covered all of the commonly used Java syntax Some Java features are seldom used, because: They are needed in only a few specialized situations, or Its just as easy to do withou
UPenn - CIT - 591
Which is better?Apr 10, 2009Which is better?Assume s1 and s2 are Strings: A. if(s1=s2){.} B. if(s1.equals(s2){.}?2Answer: Bs1=s2tests whether s1 and s2 reference the same string;s1.equals(s2) tests whether they reference equal str
UPenn - CIT - 591
Characters and StringsApr 10, 2009CharactersIn Java, a char is a primitive type that can hold one single character A character can be: A letter or digit A punctuation mark A space, tab, newline, or other whitespace A control character
UPenn - CIT - 594
GenericsApr 10, 2009Arrays and collectionsIn Java, array elements must all be of the same type: int[]counts=newint[10]; String[]names={&quot;Tom&quot;,&quot;Dick&quot;,&quot;Harry&quot;};Hence, arrays are type safe: The compiler will not let you put the wrong kind
UPenn - CIT - 594
Linked ListsAnatomy of a linked list A linked list consists of: A sequence of nodesmyList a b c dEach node contains a value and a link (pointer or reference) to some other node The last node contains a null link The list may have a headerMor
UPenn - LING - 001
Ling 001: Syntax IIMovement &amp; Constraints 2-11-2009Phrases In the last lecture, we talked about simple phrases; e.g. Noun Phrases like The dog The big dog The big dog that John was talking to In this lecture, we will look at how phrases and
UPenn - LING - 102
GenieandLanguageAcquisitionHowchildrenlearntospeakandwhat happensoncetheypassthecritical periodwithouthavingdoneso.Infants:010mos. Infantscandistinguishsoundsfrombirth,even ifthosesoundsarenotpartoftheirparents speech. Bysixmonths,babiesbegintol
UPenn - LING - 102
LanguageContactpresentedby MichaelL.Friesner August6,2007Thank you to Gillian Sankoff for sending me her PPT slides (among other things).TwoMainTypesof LanguageContactAgent:Nonnativespeakersaffectingalanguagethey cometospeak languageshift
UPenn - LING - 102
Acts of Conflicting IdentityThe Sociolinguistics of British pop-song pronunciation by Peter TrudgillThe Accent of pop singing At least since the 20s and the advent of Jazz, singers have adopted speech patterns while singing that are different fro
UPenn - LING - 001
A puzzle: why language? Quantitatively and qualitatively unique like elephants trunks No similar evolutionary trends in other species other species dont want to pick up peanuts with their noses all mammals have flexible noses, some use them as
UPenn - LING - 120
LING 120 Introduction to Speech AnalysisFall 2007Week 6Speech analysis II: Stops, nasals, liquidsOct. 8-12, 20072LING 120 Introduction to Speech Analysis, Fall 20073LING 120 Introduction to Speech Analysis, Fall 20074LING 1
UPenn - LING - 520
LING 520 Introduction to Phonetics IFall 2008Week 9Basic audition Speech perception Nov. 3, 20082LING 520 Introduction to Phonetics I, Fall 20083LING 520 Introduction to Phonetics I, Fall 20084LING 520 Introduction to Phonetics
UPenn - LING - 520
LING 520 Introduction to Phonetics IFall 2008Week 2English consonants and vowels Articulatory phonology Sep. 15, 20082 1. Consonants are longer when at the end of a phrase (bib, did, don, nod). 2. Voiceless stops (i.e., /p, t, k/) are asp
UPenn - COGSCI - 501
Loudness predicts prominence: fundamental frequency lends little.G. Kochanski and E. Grabe and J. Coleman and B. Rosner( 2006/08/27 09:49:02 UTC )Running title: Fundamental Frequency Lends Little Prominence The University of Oxford Phonetics Lab
UPenn - COGSCI - 501
Psychological Review Vol. 65, No. 6, 19S8THE PERCEPTRON: A PROBABILISTIC MODEL FOR INFORMATION STORAGE AND ORGANIZATION IN THE BRAIN 1F. ROSENBLATT Cornell Aeronautical LaboratoryIf we are eventually to understand the capability of higher organi
UPenn - LING - 120
LING 120 Introduction to Speech AnalysisFall 2007Week 5Speech analysis I: Vowels and FricativesOct. 1-5, 20072[From: UCL phonetics website]LING 120 Introduction to Speech Analysis, Fall 20073LING 120 Introduction to Speech Ana
UPenn - LING - 520
LING 520 Introduction to Phonetics IFall 2008Week 3Sounds in other languagesSep. 22, 2008Languages in the world There are about 7,000 languages in the world today. Over half of them (52 percent) are spoken by fewer than 10,000 people; over
UPenn - COGSCI - 501
268IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,VOL. 24, NO. 2,FEBRUARY 2002Short Papers_Two Variations on Fishers Linear Discriminant for Pattern RecognitionTristrom CookeAbstractDiscriminants are often used in patter
UPenn - LING - 106
Right Linear GrammarsLing 106 October 8, 20031.Regular languages as languages generated by FSAWhen we did distributional analysis, we saw that linguistic units in natural language (roughly words) can be classified into grammatical categories o
UPenn - LING - 520
LING 520 Introduction to Phonetics IFall 2008Week 5Acoustic theory of speech production Acoustics of vowels Oct. 6, 20082 LING 120 Introduction to Phonetics I, Fall 20083LING 120 Introduction to Phonetics I, Fall 20084n=2L
UPenn - LING - 120
LING 120 Introduction to Speech AnalysisFall 2007Week 2Anatomy of speech production Phonetic transcription RecordingSep. 10-14, 20072Nasal Cavity Oral Cavity Pharynx Larynx: vocal folds in it Trachea: the windpipe Lung: supply airstreamSa
UPenn - LING - 120
LING 120 Introduction to Speech AnalysisFall 2007Week 4Acoustics of speech production SamplingSep. 24-28, 20072 LING 120 Introduction to Speech Analysis, Fall 20073n=2L nfn =vn=nv 2Ln = 1, 2, 3.L = /2 = 2L f = v/
UPenn - LING - 120
LING 120 Introduction to Speech AnalysisFall 2007Week 3Physics of soundSep. 17-21, 20072Motion: Distance (unit: meters, 1 m 39 inches); displacement (vector); Speed = distance / time (units: meters/sec, m/s); Velocity specifies the di
UPenn - LING - 120
LING 120 Introduction to Speech AnalysisFall 2007Week 8Speech analysis IV: Variation and statistical techniques (I)Oct. 22-24, 2007Variation in speech2 Linguistic factors: phonetic context, intonation, syntax/semantics, etc. Paralin
UPenn - LING - 120
LING 120 Introduction to Speech AnalysisFall 2007Week 9Speech analysis IV: Variation and statistical techniques (II)Oct. 29 - Nov. 2, 2007Hypothesis testing Steps for Hypothesis Testing: 1. Formulate your hypotheses: - Need a Null Hypothesis
UPenn - LING - 102
LING-102, Summer 2007Instructor: Marjorie PakJuly 25, 2007Homework 4. Due Monday, July 30, at 10am. Part of the homework will be handwritten and turned in to me in class; the other part will be emailed to me before class. See below for exact in
UPenn - LING - 120
LING 120 Introduction to Speech AnalysisFall 2007Week 1Overview of the course The speech chain Linguistic organizationSep. 5-7, 2007Syllabus2http:/www.ling.upenn.edu/courses/ling120/LING 120 Introduction to Speech Analysis, Fall 2007
UPenn - LING - 102
LING-102, Summer 2007Instructor: Marjorie PakHomework 2. Due Wednesday, July 18, at the beginning of class. 1. The following spectrogram shows me saying two separate made-up words with a pause in between. Each word is composed of three vowels. On
UPenn - LING - 525
Signal Processing ToolboxFor Use with MATLABComputation Visualization ProgrammingUsers GuideVersion 4.2How to Contact The MathWorks:PHONEFAX MAIL508-647-7000 508-647-7001Phone Fax MailuINTERNETThe MathWorks, Inc. 24 Prime Park
UPenn - LING - 120
LING 120 Introduction to Speech AnalysisFall 2007Week 7Speech analysis III: Speech prosodyOct. 17-19, 20072LING 120 Introduction to Speech Analysis, Fall 20073LING 120 Introduction to Speech Analysis, Fall 20074Tone1: High level
UPenn - LING - 102
LING-102, Summer 2007 Homework 1. Due Wednesday, July 11 at the beginning of class (hard copy) Part 1. Pick any sentence from todays class handout that contains at least 5 words, and transcribe it on a separate piece of paper using the IPA. Bring you
UPenn - LING - 001
SyntaxLING 001 - October 11, 2006 Joshua TaubererSyntaxHow can the words of a language be put together?SyntaxWhat makes a valid combination or order of words? What are the relations between the words in a sentence? What is the mecha
UPenn - LING - 001
Sound StructurePart II: Phonology 1-28-2009Review of Phonetics Speech sounds are decomposable into articulatory primitives (also known as features) Consonants and Vowels Feature differences (e.g., voiced vs. voiceless, nasal vs. not nasal, labi
UPenn - LING - 001
undergraduate faculty campus student college academic curriculum freshman classroom professor moral considerateness bison whale governance utilitarianism ethic entity preference utilitarian diabetes elderly appendix geriatric directory hospice arthri
UPenn - LING - 001
Semanticsand some syntax, math, and computational linguistics tooLING 001 - October 16, 2006Joshua TaubererSemantics Why does a sentence mean what it means? What are the meanings of words and how do they come together to make larger meanings
UPenn - LING - 001
Linguistics 001: StructuresSyntax I 2-9-2009Plagiarism at Harvard Last year, a Harvard student accused of plagiarism of a teen novel Sabrina was the brainy Angel. Yet another example of how every girl had to be one or the other: Pretty or smart
UPenn - LING - 001
Who has a more sophisticated communication system, molluscs or monkeys? frequency and length of communicative interactions? role of communication in social life? number of distinct communicative displays? information content (entropy) of communi
UPenn - LING - 102
featuresofAAVEasfeaturesofPPE:a studyofadolescentsinphiladelphiapaperby: tonyawolford keelanevanini presentationby: anthonysobackground puertoricansfoundtoadoptaspectsof AAE effectsofdirectcontactwithAAE grammaticaluninflectedbe phonologica
UPenn - LING - 521
LING 521 Phonetics PracticumLiberman &amp; Yuan, Spring 2006Lecture8IntroductorystatisticsIII: Correlationandregression MoreonANOVA Mar.1,2006Correlation Istherearelationshipbetweentwovariables(e.g.,ageand speechrate)?Whatisthestrengthofthisrelati
UPenn - LING - 521
LING 521 Phonetics PracticumLiberman &amp; Yuan, Spring 2006Lecture7IntroductorystatisticsII: ttests OnewayANOVA Feb.22,2006Review:distributionofsamplemean Themeanofthesamplemean(X)isthepopulationmean(): mean(X)=Overrepeatedsamples,thesamplemeanw
UPenn - LING - 521
LING 521 Phonetics PracticumLiberman &amp; Yuan, Spring 2006Lecture5Probability Machinelearning (ataglance) Feb.8,2006Probability Itisatruthvaluethatallowsyoutobeuncertain,ifyouwish. Itdescribeshowmuchyouknowaboutasituation. Itcountshowbigaspec
UPenn - LING - 521
LING 521 Phonetics PracticumLiberman &amp; Yuan, Spring 2006Lecture6IntroductorystatisticsI: ExploratoryDataAnalysis Inferenceofpopulationmeans Feb.17,2006Fundamentalconcepts Statistics:thescienceofcollecting,analyzing,andinterpreting data. Popula
UPenn - LING - 001
undergraduate faculty campus student college academic curriculum freshman classroom professor moral considerateness bison whale governance utilitarianism ethic entity preference utilitarian diabetes elderly appendix geriatric directory hospice arthri
UPenn - LING - 001
Linguistics 001Spring 2009Professors David Embick and Charles YangBasics An introduction to the scientific study of language No prerequisites Satisfies Gen.Req. V/Sector VIIWebpage Information about readings and other matters will appear on
UPenn - LING - 102
LING-102, Summer 2007Instructor: Marjorie PakJuly 25, 2007Homework 4. Due Monday, July 30, at 10am. Part of the homework will be handwritten and turned in to me in class; the other part will be emailed to me before class. See below for exact in
UPenn - LING - 102
This is a questionnaire for a class Im taking. Were interested in language, and in the meanings and sounds of words for people in different places. There are a few parts. First I just need some information about you: What year were you born? What cit
UPenn - LING - 106
Highlights of Pinker, Chapter 4 (How Language Works) Ling 106 the arbitrariness of the sign discrete combinatorial system (examples: grammars, DNA) A generative grammar consists of a finite collection of discrete elements and a finite number of ru