PDT-C 2.0 presents a uniformly annotated, genre-diversified Czech corpus of nearly 4 million tokens. It includes meaning representations and inter-sentential phenomena like coreference and discourse relations, and is accompanied by fully compatible lexicons. The resource is available under a CC BY-NC-SA licence.
Prague Dependency Treebank Consolidated 2.0 Released