User Tools

Site Tools


01_corpus:02_preprocessing:06_pos

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Last revisionBoth sides next revision
01_corpus:02_preprocessing:06_pos [2020/04/17 11:09] – [Italian] simone01_corpus:02_preprocessing:06_pos [2020/05/04 13:48] – [Swiss German dialect] simone
Line 6: Line 6:
 The whole French corpus has been annotated with [[https://team.inria.fr/almanach/fr/melt/|MElt]] (Modified French TreeBank) using the tag set [[http://french-postaggers.tiddlyspot.com/|CC Tagset]]. Available annotations are "mftb_pos" (for part of speech) and "mftb_lem" (for the lemma). The following tags are used: The whole French corpus has been annotated with [[https://team.inria.fr/almanach/fr/melt/|MElt]] (Modified French TreeBank) using the tag set [[http://french-postaggers.tiddlyspot.com/|CC Tagset]]. Available annotations are "mftb_pos" (for part of speech) and "mftb_lem" (for the lemma). The following tags are used:
  
-  * ADJ adjective +  * ''ADJ'' adjective 
-  * ADJWH interrogative adjective +  * ''ADJWH'' interrogative adjective 
-  * ADV adverb +  * ''ADV'' adverb 
-  * ADVWH interrogative adverb +  * ''ADVWH'' interrogative adverb 
-  * CC coordinating conjunction +  * ''CC'' coordinating conjunction 
-  * CLO object clitic pronoun +  * ''CLO'' object clitic pronoun 
-  * CLR reflexive clitic pronoun +  * ''CLR'' reflexive clitic pronoun 
-  * CLS subject clitic pronoun +  * ''CLS'' subject clitic pronoun 
-  * CS subordinating conjunction +  * ''CS'' subordinating conjunction 
-  * DET determiner +  * ''DET'' determiner 
-  * DETWH interrogative determiner +  * ''DETWH'' interrogative determiner 
-  * ET foreign word +  * ''ET'' foreign word 
-  * I interjection +  * ''I'' interjection 
-  * NC common noun +  * ''NC'' common noun 
-  * NPP proper noun +  * ''NPP'' proper noun 
-  * P preposition +  * ''P'' preposition 
-  * P+D preposition+determiner amalgam +  * ''P+D'' preposition+determiner amalgam 
-  * P+PRO prepositon+pronoun amalgam +  * ''P+PRO'' prepositon+pronoun amalgam 
-  * PONCT punctuation mark +  * ''PONCT'' punctuation mark 
-  * PREF prefix +  * ''PREF'' prefix 
-  * PRO full pronoun +  * ''PRO'' full pronoun 
-  * PROREL relative pronoun +  * ''PROREL'' relative pronoun 
-  * PROWH interrogative pronoun +  * ''PROWH'' interrogative pronoun 
-  * V indicative or conditional verb form +  * ''V'' indicative or conditional verb form 
-  * VIMP imperative verb form +  * ''VIMP'' imperative verb form 
-  * VINF infinitive verb form +  * ''VINF'' infinitive verb form 
-  * VPP past participle +  * ''VPP'' past participle 
-  * VPR present participle +  * ''VPR'' present participle 
-  * VS subjunctive verb form+  * ''VS'' subjunctive verb form
  
  
Line 48: Line 48:
 The [[https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/data/stts_guide.pdf|tagset]] uses the following tags: The [[https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/data/stts_guide.pdf|tagset]] uses the following tags:
  
-  * ADJA attributive adjective (including participles used adjectivally)  +  * ''ADJA'' attributive adjective (including participles used adjectivally)  
-  * ADJD predicate adjective; adjective used adverbially  +  * ''ADJD'' predicate adjective; adjective used adverbially  
-  * ADV adverb (never used as attributive adjective)  +  * ''ADV'' adverb (never used as attributive adjective)  
-  * APPR preposition left hand part of double preposition  +  * ''APPR'' preposition left hand part of double preposition  
-  * APPRART preposition with fused article  +  * ''APPRART'' preposition with fused article  
-  * APPO postposition  +  * ''APPO'' postposition  
-  * APZR right hand part of double preposition  +  * ''APZR'' right hand part of double preposition  
-  * ART article (definite or indefinite)  +  * ''ART'' article (definite or indefinite)  
-  * CARD cardinal number (words or figures); also declined  +  * ''CARD'' cardinal number (words or figures); also declined  
-  * FM foreign words (actual part of speech in original language may be appended, e.g. FMADV/ FM-NN)  +  * ''FM'' foreign words (actual part of speech in original language may be appended, e.g. FMADV/ FM-NN)  
-  * ITJ interjection  +  * ''ITJ'' interjection  
-  * KON co-ordinating conjunction  +  * ''KON'' co-ordinating conjunction  
-  * KOKOM comparative conjunction or particle  +  * ''KOKOM'' comparative conjunction or particle  
-  * KOUI preposition used to introduce infinitive clause  +  * ''KOUI'' preposition used to introduce infinitive clause  
-  * KOUS subordinating conjunction  +  * ''KOUS'' subordinating conjunction  
-  * NA adjective used as noun  +  * ''NA'' adjective used as noun  
-  * NE names and other proper nouns  +  * ''NE'' names and other proper nouns  
-  * NN noun (but not adjectives used as nouns)  +  * ''NN'' noun (but not adjectives used as nouns)  
-  * PAV [PROAV] pronominal adverb  +  * ''PAV [PROAV]'' pronominal adverb  
-  * PAVREL pronominal adverb used as relative  +  * ''PAVREL'' pronominal adverb used as relative  
-  * PDAT demonstrative determiner  +  * ''PDAT'' demonstrative determiner  
-  * PDS demonstrative pronoun  +  * ''PDS'' demonstrative pronoun  
-  * PIAT indefinite determiner (whether occurring on its own or in conjunction with another determiner)  +  * ''PIAT'' indefinite determiner (whether occurring on its own or in conjunction with another determiner)  
-  * PIS indefinite pronoun  +  * ''PIS'' indefinite pronoun  
-  * PPER personal pronoun  +  * ''PPER'' personal pronoun  
-  * PRF reflexive pronoun  +  * ''PRF'' reflexive pronoun  
-  * PPOSS possessive pronoun  +  * ''PPOSS'' possessive pronoun  
-  * PPOSAT possessive determiner  +  * ''PPOSAT'' possessive determiner  
-  * PRELAT relative depending on a noun  +  * ''PRELAT'' relative depending on a noun  
-  * PRELS relative pronoun (i.e. forms of der or welcher)  +  * ''PRELS'' relative pronoun (i.e. forms of //der// or //welcher//)  
-  * PTKA particle with adjective or adverb  +  * ''PTKA'' particle with adjective or adverb  
-  * PTKANT answer particle  +  * ''PTKANT'' answer particle  
-  * PTKNEG negative particle  +  * ''PTKNEG'' negative particle  
-  * PTKREL indeclinable relative particle  +  * ''PTKREL'' indeclinable relative particle  
-  * PTKVZ separable prefix  +  * ''PTKVZ'' separable prefix  
-  * PTKZU infinitive particle zu +  * ''PTKZU'' infinitive particle //zu// 
-  * PWS interrogative pronoun  +  * ''PWS'' interrogative pronoun  
-  * PWAT interrogative determiner  +  * ''PWAT'' interrogative determiner  
-  * PWAV interrogative adverb  +  * ''PWAV'' interrogative adverb  
-  * PWAVREL interrogative adverb used as relative  +  * ''PWAVREL'' interrogative adverb used as relative  
-  * PWREL interrogative pronoun used as relative  +  * ''PWREL'' interrogative pronoun used as relative  
-  * TRUNC truncated form of compound  +  * ''TRUNC'' truncated form of compound  
-  * VAFIN finite auxiliary verb  +  * ''VAFIN'' finite auxiliary verb  
-  * VAIMP imperative of auxiliary  +  * ''VAIMP'' imperative of auxiliary  
-  * VAINF infinitive of auxiliary  +  * ''VAINF'' infinitive of auxiliary  
-  * VAPP past participle of auxiliary  +  * ''VAPP'' past participle of auxiliary  
-  * VMFIN finite modal verb  +  * ''VMFIN'' finite modal verb  
-  * VMINF infinitive of modal  +  * ''VMINF'' infinitive of modal  
-  * VMPP past participle of auxiliary  +  * ''VMPP'' past participle of auxiliary  
-  * VVFIN finite full verb  +  * ''VVFIN'' finite full verb  
-  * VVIMP imperative of full verb  +  * ''VVIMP'' imperative of full verb  
-  * VVINF infinitive of full verb  +  * ''VVINF'' infinitive of full verb  
-  * VVIZU infinitive with incorporated zu  +  * ''VVIZU'' infinitive with incorporated //zu//  
-  * VVPP past participle of full verb +  * ''VVPP'' past participle of full verb
  
 As in the French corpus, there are also combined tags such as //VAFIN+PPER// when a personal pronoun is agglutinated to a verb (//hätti// for 'hätte ich'). As in the French corpus, there are also combined tags such as //VAFIN+PPER// when a personal pronoun is agglutinated to a verb (//hätti// for 'hätte ich').
Line 113: Line 113:
  
 The following PoS [[https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/data/italian-tagset.txt|tagset]] was used: The following PoS [[https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/data/italian-tagset.txt|tagset]] was used:
-  * ABR abbreviation +  * ''ABR'' abbreviation 
-  * ADJ adjective +  * ''ADJ'' adjective 
-  * ADV adverb +  * ''ADV'' adverb 
-  * CON conjunction +  * ''CON'' conjunction 
-  * DET:def definite article +  * ''DET:def'' definite article 
-  * DET:indef indefinite article +  * ''DET:indef'' indefinite article 
-  * FW foreign word +  * ''FW'' foreign word 
-  * INT interjection +  * ''INT'' interjection 
-  * LS list symbol +  * ''LS'' list symbol 
-  * NOM noun +  * ''NOM'' noun 
-  * NPR name +  * ''NPR'' name 
-  * NUM numeral +  * ''NUM'' numeral 
-  * PON punctuation +  * ''PON'' punctuation 
-  * PRE preposition +  * ''PRE'' preposition 
-  * PRE:det preposition+article +  * ''PRE:det'' preposition+article 
-  * PRO pronoun +  * ''PRO'' pronoun 
-  * PRO:demo demonstrative pronoun +  * ''PRO:demo'' demonstrative pronoun 
-  * PRO:indef indefinite pronoun +  * ''PRO:indef'' indefinite pronoun 
-  * PRO:inter interrogative pronoun +  * ''PRO:inter'' interrogative pronoun 
-  * PRO:pers personal pronoun +  * ''PRO:pers'' personal pronoun 
-  * PRO:poss possessive pronoun +  * ''PRO:poss'' possessive pronoun 
-  * PRO:refl reflexive pronoun +  * ''PRO:refl'' reflexive pronoun 
-  * PRO:rela relative pronoun +  * ''PRO:rela'' relative pronoun 
-  * SENT sentence marker +  * ''SENT'' sentence marker 
-  * SYM symbol +  * ''SYM'' symbol 
-  * VER:cimp verb conjunctive imperfect +  * ''VER:cimp'' verb conjunctive imperfect 
-  * VER:cond verb conditional +  * ''VER:cond'' verb conditional 
-  * VER:cpre verb conjunctive present +  * ''VER:cpre'' verb conjunctive present 
-  * VER:futu verb future tense +  * ''VER:futu'' verb future tense 
-  * VER:geru verb gerund +  * ''VER:geru'' verb gerund 
-  * VER:impe verb imperative +  * ''VER:impe'' verb imperative 
-  * VER:impf verb imperfect +  * ''VER:impf'' verb imperfect 
-  * VER:infi verb infinitive +  * ''VER:infi'' verb infinitive 
-  * VER:pper verb participle perfect +  * ''VER:pper'' verb participle perfect 
-  * VER:ppre verb participle present +  * ''VER:ppre'' verb participle present 
-  * VER:pres verb present +  * ''VER:pres'' verb present 
-  * VER:refl:infi verb reflexive infinitive +  * ''VER:refl:infi'' verb reflexive infinitive 
-  * VER:remo verb simple past+  * ''VER:remo'' verb simple past
  
01_corpus/02_preprocessing/06_pos.txt · Last modified: 2022/06/27 09:21 by 127.0.0.1

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki