@@ -70,7 +70,7 @@ Any pre-existing information, other than tokenisation, will be overwritten. Ther
### Running UDPipe on partly annotated files
-Suppose your corpus is already tokenized, annotated for MWEs, and annotated for morphology (LEMMA, UPOS and FEATS columns) but not for syntax (HEAD and DEPREL columns). Otherwise, you can use UDPipe in a custom way:
+Suppose your corpus is already tokenized, annotated for MWEs, and annotated for morphology (LEMMA, UPOS and FEATS columns) but not for syntax (HEAD and DEPREL columns). You can use UDPipe in a custom way:
1. Convert your .cupt files into .conllu (deleting the last column):
- for every file, say input-001.cupt, run the following command from the PARSEME utilities repo, indicating your language code with the `--lang` option: