User Tools

Site Tools


01_corpus:02_preprocessing:06_pos

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
01_corpus:02_preprocessing:06_pos [2020/04/17 11:09] – [Italian] simone01_corpus:02_preprocessing:06_pos [2022/06/27 09:21] (current) – external edit 127.0.0.1
Line 6: Line 6:
 The whole French corpus has been annotated with [[https://team.inria.fr/almanach/fr/melt/|MElt]] (Modified French TreeBank) using the tag set [[http://french-postaggers.tiddlyspot.com/|CC Tagset]]. Available annotations are "mftb_pos" (for part of speech) and "mftb_lem" (for the lemma). The following tags are used: The whole French corpus has been annotated with [[https://team.inria.fr/almanach/fr/melt/|MElt]] (Modified French TreeBank) using the tag set [[http://french-postaggers.tiddlyspot.com/|CC Tagset]]. Available annotations are "mftb_pos" (for part of speech) and "mftb_lem" (for the lemma). The following tags are used:
  
-  * ADJ adjective +  * ''ADJ'' adjective 
-  * ADJWH interrogative adjective +  * ''ADJWH'' interrogative adjective 
-  * ADV adverb +  * ''ADV'' adverb 
-  * ADVWH interrogative adverb +  * ''ADVWH'' interrogative adverb 
-  * CC coordinating conjunction +  * ''CC'' coordinating conjunction 
-  * CLO object clitic pronoun +  * ''CLO'' object clitic pronoun 
-  * CLR reflexive clitic pronoun +  * ''CLR'' reflexive clitic pronoun 
-  * CLS subject clitic pronoun +  * ''CLS'' subject clitic pronoun 
-  * CS subordinating conjunction +  * ''CS'' subordinating conjunction 
-  * DET determiner +  * ''DET'' determiner 
-  * DETWH interrogative determiner +  * ''DETWH'' interrogative determiner 
-  * ET foreign word +  * ''ET'' foreign word 
-  * I interjection +  * ''I'' interjection 
-  * NC common noun +  * ''NC'' common noun 
-  * NPP proper noun +  * ''NPP'' proper noun 
-  * P preposition +  * ''P'' preposition 
-  * P+D preposition+determiner amalgam +  * ''P+D'' preposition+determiner amalgam 
-  * P+PRO prepositon+pronoun amalgam +  * ''P+PRO'' prepositon+pronoun amalgam 
-  * PONCT punctuation mark +  * ''PONCT'' punctuation mark 
-  * PREF prefix +  * ''PREF'' prefix 
-  * PRO full pronoun +  * ''PRO'' full pronoun 
-  * PROREL relative pronoun +  * ''PROREL'' relative pronoun 
-  * PROWH interrogative pronoun +  * ''PROWH'' interrogative pronoun 
-  * V indicative or conditional verb form +  * ''V'' indicative or conditional verb form 
-  * VIMP imperative verb form +  * ''VIMP'' imperative verb form 
-  * VINF infinitive verb form +  * ''VINF'' infinitive verb form 
-  * VPP past participle +  * ''VPP'' past participle 
-  * VPR present participle +  * ''VPR'' present participle 
-  * VS subjunctive verb form+  * ''VS'' subjunctive verb form
  
  
Line 48: Line 48:
 The [[https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/data/stts_guide.pdf|tagset]] uses the following tags: The [[https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/data/stts_guide.pdf|tagset]] uses the following tags:
  
-  * ADJA attributive adjective (including participles used adjectivally)  +  * ''ADJA'' attributive adjective (including participles used adjectivally)  
-  * ADJD predicate adjective; adjective used adverbially  +  * ''ADJD'' predicate adjective; adjective used adverbially  
-  * ADV adverb (never used as attributive adjective)  +  * ''ADV'' adverb (never used as attributive adjective)  
-  * APPR preposition left hand part of double preposition  +  * ''APPR'' preposition left hand part of double preposition  
-  * APPRART preposition with fused article  +  * ''APPRART'' preposition with fused article  
-  * APPO postposition  +  * ''APPO'' postposition  
-  * APZR right hand part of double preposition  +  * ''APZR'' right hand part of double preposition  
-  * ART article (definite or indefinite)  +  * ''ART'' article (definite or indefinite)  
-  * CARD cardinal number (words or figures); also declined  +  * ''CARD'' cardinal number (words or figures); also declined  
-  * FM foreign words (actual part of speech in original language may be appended, e.g. FMADV/ FM-NN)  +  * ''FM'' foreign words (actual part of speech in original language may be appended, e.g. FMADV/ FM-NN)  
-  * ITJ interjection  +  * ''ITJ'' interjection  
-  * KON co-ordinating conjunction  +  * ''KON'' co-ordinating conjunction  
-  * KOKOM comparative conjunction or particle  +  * ''KOKOM'' comparative conjunction or particle  
-  * KOUI preposition used to introduce infinitive clause  +  * ''KOUI'' preposition used to introduce infinitive clause  
-  * KOUS subordinating conjunction  +  * ''KOUS'' subordinating conjunction  
-  * NA adjective used as noun  +  * ''NA'' adjective used as noun  
-  * NE names and other proper nouns  +  * ''NE'' names and other proper nouns  
-  * NN noun (but not adjectives used as nouns)  +  * ''NN'' noun (but not adjectives used as nouns)  
-  * PAV [PROAV] pronominal adverb  +  * ''PAV [PROAV]'' pronominal adverb  
-  * PAVREL pronominal adverb used as relative  +  * ''PAVREL'' pronominal adverb used as relative  
-  * PDAT demonstrative determiner  +  * ''PDAT'' demonstrative determiner  
-  * PDS demonstrative pronoun  +  * ''PDS'' demonstrative pronoun  
-  * PIAT indefinite determiner (whether occurring on its own or in conjunction with another determiner)  +  * ''PIAT'' indefinite determiner (whether occurring on its own or in conjunction with another determiner)  
-  * PIS indefinite pronoun  +  * ''PIS'' indefinite pronoun  
-  * PPER personal pronoun  +  * ''PPER'' personal pronoun  
-  * PRF reflexive pronoun  +  * ''PRF'' reflexive pronoun  
-  * PPOSS possessive pronoun  +  * ''PPOSS'' possessive pronoun  
-  * PPOSAT possessive determiner  +  * ''PPOSAT'' possessive determiner  
-  * PRELAT relative depending on a noun  +  * ''PRELAT'' relative depending on a noun  
-  * PRELS relative pronoun (i.e. forms of der or welcher)  +  * ''PRELS'' relative pronoun (i.e. forms of //der// or //welcher//)  
-  * PTKA particle with adjective or adverb  +  * ''PTKA'' particle with adjective or adverb  
-  * PTKANT answer particle  +  * ''PTKANT'' answer particle  
-  * PTKNEG negative particle  +  * ''PTKNEG'' negative particle  
-  * PTKREL indeclinable relative particle  +  * ''PTKREL'' indeclinable relative particle  
-  * PTKVZ separable prefix  +  * ''PTKVZ'' separable prefix  
-  * PTKZU infinitive particle zu +  * ''PTKZU'' infinitive particle //zu// 
-  * PWS interrogative pronoun  +  * ''PWS'' interrogative pronoun  
-  * PWAT interrogative determiner  +  * ''PWAT'' interrogative determiner  
-  * PWAV interrogative adverb  +  * ''PWAV'' interrogative adverb  
-  * PWAVREL interrogative adverb used as relative  +  * ''PWAVREL'' interrogative adverb used as relative  
-  * PWREL interrogative pronoun used as relative  +  * ''PWREL'' interrogative pronoun used as relative  
-  * TRUNC truncated form of compound  +  * ''TRUNC'' truncated form of compound  
-  * VAFIN finite auxiliary verb  +  * ''VAFIN'' finite auxiliary verb  
-  * VAIMP imperative of auxiliary  +  * ''VAIMP'' imperative of auxiliary  
-  * VAINF infinitive of auxiliary  +  * ''VAINF'' infinitive of auxiliary  
-  * VAPP past participle of auxiliary  +  * ''VAPP'' past participle of auxiliary  
-  * VMFIN finite modal verb  +  * ''VMFIN'' finite modal verb  
-  * VMINF infinitive of modal  +  * ''VMINF'' infinitive of modal  
-  * VMPP past participle of auxiliary  +  * ''VMPP'' past participle of auxiliary  
-  * VVFIN finite full verb  +  * ''VVFIN'' finite full verb  
-  * VVIMP imperative of full verb  +  * ''VVIMP'' imperative of full verb  
-  * VVINF infinitive of full verb  +  * ''VVINF'' infinitive of full verb  
-  * VVIZU infinitive with incorporated zu  +  * ''VVIZU'' infinitive with incorporated //zu//  
-  * VVPP past participle of full verb +  * ''VVPP'' past participle of full verb
  
 As in the French corpus, there are also combined tags such as //VAFIN+PPER// when a personal pronoun is agglutinated to a verb (//hätti// for 'hätte ich'). As in the French corpus, there are also combined tags such as //VAFIN+PPER// when a personal pronoun is agglutinated to a verb (//hätti// for 'hätte ich').
Line 113: Line 113:
  
 The following PoS [[https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/data/italian-tagset.txt|tagset]] was used: The following PoS [[https://www.cis.uni-muenchen.de/~schmid/tools/TreeTagger/data/italian-tagset.txt|tagset]] was used:
-  * ABR abbreviation +  * ''ABR'' abbreviation 
-  * ADJ adjective +  * ''ADJ'' adjective 
-  * ADV adverb +  * ''ADV'' adverb 
-  * CON conjunction +  * ''CON'' conjunction 
-  * DET:def definite article +  * ''DET:def'' definite article 
-  * DET:indef indefinite article +  * ''DET:indef'' indefinite article 
-  * FW foreign word +  * ''FW'' foreign word 
-  * INT interjection +  * ''INT'' interjection 
-  * LS list symbol +  * ''LS'' list symbol 
-  * NOM noun +  * ''NOM'' noun 
-  * NPR name +  * ''NPR'' name 
-  * NUM numeral +  * ''NUM'' numeral 
-  * PON punctuation +  * ''PON'' punctuation 
-  * PRE preposition +  * ''PRE'' preposition 
-  * PRE:det preposition+article +  * ''PRE:det'' preposition+article 
-  * PRO pronoun +  * ''PRO'' pronoun 
-  * PRO:demo demonstrative pronoun +  * ''PRO:demo'' demonstrative pronoun 
-  * PRO:indef indefinite pronoun +  * ''PRO:indef'' indefinite pronoun 
-  * PRO:inter interrogative pronoun +  * ''PRO:inter'' interrogative pronoun 
-  * PRO:pers personal pronoun +  * ''PRO:pers'' personal pronoun 
-  * PRO:poss possessive pronoun +  * ''PRO:poss'' possessive pronoun 
-  * PRO:refl reflexive pronoun +  * ''PRO:refl'' reflexive pronoun 
-  * PRO:rela relative pronoun +  * ''PRO:rela'' relative pronoun 
-  * SENT sentence marker +  * ''SENT'' sentence marker 
-  * SYM symbol +  * ''SYM'' symbol 
-  * VER:cimp verb conjunctive imperfect +  * ''VER:cimp'' verb conjunctive imperfect 
-  * VER:cond verb conditional +  * ''VER:cond'' verb conditional 
-  * VER:cpre verb conjunctive present +  * ''VER:cpre'' verb conjunctive present 
-  * VER:futu verb future tense +  * ''VER:futu'' verb future tense 
-  * VER:geru verb gerund +  * ''VER:geru'' verb gerund 
-  * VER:impe verb imperative +  * ''VER:impe'' verb imperative 
-  * VER:impf verb imperfect +  * ''VER:impf'' verb imperfect 
-  * VER:infi verb infinitive +  * ''VER:infi'' verb infinitive 
-  * VER:pper verb participle perfect +  * ''VER:pper'' verb participle perfect 
-  * VER:ppre verb participle present +  * ''VER:ppre'' verb participle present 
-  * VER:pres verb present +  * ''VER:pres'' verb present 
-  * VER:refl:infi verb reflexive infinitive +  * ''VER:refl:infi'' verb reflexive infinitive 
-  * VER:remo verb simple past+  * ''VER:remo'' verb simple past
  
01_corpus/02_preprocessing/06_pos.1587114548.txt.gz · Last modified: 2022/06/27 09:21 (external edit)

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki