
initial solution to issue-#104

Martin Rypar requested to merge issue-#104 into master

A few notes on the initial solution for issue-#104:

1. The explanatory text on line 862 of txt_parser.py may need to be corrected or extended.

2. The solution could perhaps be simplified. Currently it should be able to parse restrictions that are split by whitespace (e.g. restriction: ~ se), even though I have not found any such case in the pdt-vallex data file (each one is written together as a single restriction). Since I am still unsure whether such cases could occur in the future, I kept the more robust version.

3. When generating the new pdt-vallex file with the new XSL stylesheet (see https://svn.ms.mff.cuni.cz/trac/vallex/changeset/7484), one change occurred compared to my previous file:

prev:
* KÁŽÍCÍ (v-w1375)
: id: v-w1375f1
~ A: kážící
+ PAT(4;obl) ADDR(;opt)
-synon: kázat
-example: kážící vodu.PAT
-status: reviewed

new:
* KÁZAT (v-w1375)
: id: v-w1375f1
~ A: kázat
+ PAT(4;obl) ADDR(;opt)
-synon: kázat
-example: kážící vodu.PAT
-status: reviewed

So it seems that the new file contains an extra KÁZAT entry in place of KÁŽÍCÍ (KÁZAT (v-w1374) is present in both files).
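
A change like the one in note 3 could be spotted automatically by comparing the headword attached to each lexeme id in the two generated files. The following is only a minimal sketch, not part of this merge request: it assumes header lines have the shape `* HEADWORD (id)` as in the excerpts above, and the function names `headwords_by_id` and `changed_headwords` are hypothetical.

```python
import re

# Assumed shape of a lexeme header line, e.g. "* KÁZAT (v-w1375)".
# This pattern is inferred from the excerpts above, not taken from txt_parser.py.
_HEADER = re.compile(r"^\*\s+(.+?)\s+\(([^()\s]+)\)\s*$")


def headwords_by_id(text):
    """Map each lexeme id to the headword found on its header line."""
    result = {}
    for line in text.splitlines():
        match = _HEADER.match(line)
        if match:
            result[match.group(2)] = match.group(1)
    return result


def changed_headwords(old_text, new_text):
    """Return {id: (old_headword, new_headword)} for ids whose headword differs."""
    old = headwords_by_id(old_text)
    new = headwords_by_id(new_text)
    return {
        lexeme_id: (old[lexeme_id], new[lexeme_id])
        for lexeme_id in old.keys() & new.keys()
        if old[lexeme_id] != new[lexeme_id]
    }
```

Calling `changed_headwords(old_text, new_text)` on the contents of the previous and the new file would then list every id, such as v-w1375, whose headword changed between the two.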

Illustration (mouseover over 'se'): Screenshot_from_2020-09-29_21-31-20
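
To illustrate the robustness discussed in note 2, here is a minimal sketch of a parser that accepts a restriction whether or not whitespace separates its parts. It is not the code in txt_parser.py: the assumed syntax (a `~` marker followed by a value) is only a guess based on the example `~ se`, and `parse_restriction` is a hypothetical name.

```python
import re

# Hypothetical syntax: a leading "~" marker, optional whitespace, then the
# restriction value. The real syntax handled by txt_parser.py may differ.
_RESTRICTION = re.compile(r"^\s*(~)\s*(\S+)\s*$")


def parse_restriction(text):
    """Return (marker, value) for both '~se' and '~ se', or None if no match."""
    match = _RESTRICTION.match(text)
    if match is None:
        return None
    return match.group(1), match.group(2)
```

If whitespace-separated restrictions turn out never to occur, the `\s*` between the marker and the value could be dropped, which is the simplification note 2 alludes to.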

Edited by Martin Rypar
