... | ... | @@ -10,26 +10,30 @@ Quick links: |
|
|
|
|
|
## Annotation: FLAT
|
|
|
|
|
|
The annotation of corpora, in most languages, uses the central PARSEME annotation platform. Below you will find the link to the platform and the user guide. If you do not have an account on FLAT yet, you have to ask the core organisers to create one for you and your team.
|
|
|
|
|
|
* [FLAT annotation platform](http://mwe.phil.hhu.de/): the PARSEME instance of [FLAT](https://github.com/proycon/flat), developed by Maarten van Gompel and hosted at the University of Düsseldorf.
|
|
|
* [FLAT user guide](https://docs.google.com/document/d/1zd_VhXQTel_IRVQ_u6s2wvJttwBHdDIk5YtWDMa3QW4/edit#) for PARSEME annotation
|
|
|
|
|
|
## File formats and conversions: utilities
|
|
|
|
|
|
* [PARSEME utilities](https://gitlab.com/parseme/utilities/): a repository containing useful scripts for corpus management, including parsemetsv<->cupt conversion, adjudication, consistency checks, and corpus statistics. LLs may need to run some of these scripts with the help of core organizers
|
|
|
* [PARSEME utilities](https://gitlab.com/parseme/utilities/): a repository containing useful scripts for corpus management, including parsemetsv<->CUPT conversion, adjudication, consistency checks, and corpus statistics. LLs may need to run some of these scripts with the help of core organizers.
|
|
|
* [CUPT format](http://multiword.sourceforge.net/cupt-format/): Description of the PARSEME version of extended [CoNLL-U format](https://universaldependencies.org/format.html), defined jointly with [Universal Dependencies](http://universaldependencies.org/). The generic meta-format extending CoNLL-U is called [CoNLL-U Plus](https://universaldependencies.org/ext-format.html).
|
|
|
|
|
|
## Morphosyntactic annotations with UDPipe
|
|
|
|
|
|
## Consistency checks scripts
|
|
|
|
|
|
PARSEME provides scripts to increase the consistency of annotations. Their use is described on the LL's guide to [enhance existing corpora](Enhancing-existing-corpora). They can be found in the [PARSEME utilities](https://gitlab.com/parseme/utilities/) repository.
|
|
|
|
|
|
## Error mining: Grew-match
|
|
|
|
|
|
* [Grew-match](http://match.grew.fr/?corpus=PARSEME-EN): an online query tool on annotated data.
|
|
|
|
|
|
## Gitlab data repositories
|
|
|
|
|
|
* [Development Gitlab space](https://gitlab.com/parseme/sharedtask-data-dev) (for authorised users): contains development versions of the corpora, double-aligned corpora for IAA calculation, system results from previous editions, various scripts for ST organizers (automating system evaluation, publishing the results, running IAA). In 2020, we would like to experiment moving the development version of language corpora to dedicated gitlab repositories.
|
|
|
* [Description of PARSEME repositories](https://docs.google.com/document/d/1Wkx7bWTR04TXFVypPKy-qYi4ugc_034BtfskDeLDoGU/). This document may require updates, please send us a message if you find any inconsistency.
|
|
|
* [Development Gitlab space](https://gitlab.com/parseme/sharedtask-data-dev) (for authorised users): contains development versions of the corpora, double-aligned corpora for IAA calculation, system results from previous editions, various scripts for ST organizers (automating system evaluation, publishing the results, running IAA). In 2020, we gradually move the development version of language corpora to dedicated gitlab repositories, keeping in this repository only organisation data (preliminary results, IAA data, internal scripts)
|
|
|
* [Description of PARSEME repositories](https://docs.google.com/document/d/1Wkx7bWTR04TXFVypPKy-qYi4ugc_034BtfskDeLDoGU/). This document may require updates and its content should be slowly moved here. Please send us a message if you find any inconsistency.
|
|
|
|
|
|
## Guidelines editions and example editing
|
|
|
|
... | ... | |