Splitting script: discard meta-data other than source_sent_id and text
Currently meta-data (sometimes) includes information about document/paragraph structure, as well as some FLAT annotation information. These should not be preserved in the resulting train/test .cupt files.