A minimal, pure Python library to interface with CoNLL-U format files.
Updated Dec 5, 2025 - Python
End-to-end integration of HuggingFace's models for sequence labeling.
A number of command-line tools for working with FoLiA (Format for Linguistic Annotation). Includes validators, converters, visualisers, and more.
A package for manipulating Universal Dependencies trees
Simple script to parse text with spaCy and print the output in CoNLL-U format.
ACoLi CoNLL libraries: Several tools for processing, manipulating and transforming TSV formats (CoNLL-RDF, CoNLL-Merge, CQP4RDF)
A Python3 package for extracting syntactic complexity measures from CoNLL-U annotations.
A Python toolkit for working with CoNLL-U files, Universal Dependencies treebanks, and annotated corpora.
Toolkit that simplifies corpus processing
High-performance toolkit for querying linguistic dependency parses
Repository for the paper "Exploring Non-Verbal Predicates in Semantic Role Labeling: Challenges and Opportunities"
A pipeline for machine translation (using OPUS-MT models) of parliamentary text collections in 30+ languages (ParlaMint corpora). The pipeline includes parsing TEI XML and CoNLL-U files, linguistic processing with the Stanza pipeline, machine translation, and word alignment with the Eflomal tool.
Small bilar packages
"Galahad". Goal: enable linguists to experiment with different taggers and use the result in other INT products
C++20 library (and Python bindings) for RegenT text simplification
A modern, embeddable query engine for corpus linguistics.