... | ... | @@ -66,7 +66,7 @@ Since the corpus is large, you may want to parallelise this process by running s |
|
|
|
|
|
Suppose your corpus is already tokenized and annotated for MWEs in the .cupt (or .folia) format but lacks morphosyntactic annotation. To add it with UDPipe, proceed as above, this time passing your .cupt files to the parser:
|
|
|
`utilities/lang-leaders/pre-annot/run_udpipe.sh MODELPATH input-001.cupt input-002.cupt ...`
|
|
|
Any pre-existing information, other than tokenisation, will be overwritten. Therefore, if you already have part of the morphosyntactic annotation (e.g. UPOS tags) which you want to keep, run UDPipe in a customized way (see below).
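
Since existing annotation is overwritten, it may be worth keeping a copy of the original files before running the parser. The following is a minimal sketch, assuming a POSIX-like shell with `cp` and `mkdir`; the helper name `backup_cupt` and the directory names are hypothetical and not part of the repository. The text does not say whether `run_udpipe.sh` modifies its input files in place, so this is only a precaution.

```shell
#!/usr/bin/env bash
set -eu

# backup_cupt SRC_DIR BACKUP_DIR
# Copies every .cupt file found in SRC_DIR into BACKUP_DIR, preserving
# timestamps and permissions. Hypothetical helper for illustration only.
# Fails (non-zero exit) if SRC_DIR contains no .cupt files.
backup_cupt() {
    local src="$1" dst="$2"
    mkdir -p "$dst"
    cp -p "$src"/*.cupt "$dst"/
}

# Example usage (paths are placeholders):
#   backup_cupt . backup-before-udpipe
#   utilities/lang-leaders/pre-annot/run_udpipe.sh MODELPATH input-001.cupt input-002.cupt ...
```

With the copies in place, the original MWE-annotated files can be compared against the parser output afterwards, for instance with `diff`.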
|
|
|
|
|
|
### Running UDPipe on partly annotated files
|
|
|
|
... | ... | |