Appendix E. Part-of-Speech Tags
Note: For a mapping of the part-of-speech tags that appear in this appendix to the tags used in the Penn Treebank Project, see the tab-delimited .csv files (one per language) distributed along with this Application Developer's Guide in the penn_treebank subdirectory. The RBL-JE Korean part-ofspeech tags conform to the Penn Treebank standard.
E.1. Arabic POS Tags
| Tag | Description | Example |
|---|---|---|
| ABBREV | abbreviation | ا ف ب |
| ADJ | adjective | اَلَأمْرِيكِيّ، عَرَبِي |
| ADV | adverb | هُنَاكَ، ثُمَّ |
| ATMENTION | at-mention | @basistechnology |
| CONJ | conjunction | وَ |
| CV | verb (imperative) | أَضِفْ |
| DEM_PRON | demonstrative pronoun | هٰذَا |
| DET | determiner | لِل |
| email address | [email protected] | |
| EMO | emoji or emoticon | :-) |
| EOS | end of sentence | ! ؟ . |
| EXCEPT_PART | exception particle | إلا |
| FOCUS_PART | focus particle | أما |
| FUT_PART | future particle | سَوْفَ |
| HASHTAG | hashtag | # الات |
| INTERJ | interjection | آه |
| INTERROG_PART | interrogative particle | هَلْ |
| IV | verb (imperfect) | يَكْتُبُ، يَأْكُلُ |
| IV_PASS | verb (passive imperfect) | يُضَافُ، يُشَارُ |
| NEG_PART | negative particle | لَن |
| NON_ARABIC | not Arabic script | a b c |
| NOUN | noun | طَائِرْ، كُمْبِيُوتَرْ، بَيْتْ |
| NOUN_PROP | proper noun | دْ طَوُنِي، مُحَم |
| NO_FUNC | unknown part of speech | |
| NUM | numbers (Arabic-Indic numbers, Latin, | 14 ، أَرْبَعَة عَشَرْ، ١ |
| PART | particle | تُهَا، إِي أَ |
| PREP | preposition | أَمَامْ، فِي |
| PRONOUN | pronoun | هُوَ |
| PUNC | punctuation | () ؛.،؟: |
| PV | perfective verb | كَانَت، قَالَ |
| PV_PASS | passive perfective verb | أُعْتَبَر |
| RC_PART | resultative clause particle | فَلَمَا |
| REL_ADV | relative adverb | حَيْثُ |
| REL_PRON | relative pronoun | انِ ذِي، اَلل |
| SUB_CONJ | subordinating conjunction | إِذَا، إِذ |
| URL | URL | http://www.basistech.com/ |
| VERB_PART | verbal particle | لَقَدْ |