Parsing CoNLL-U files with NLTK

1.1k Views Asked by At

I know there are CoNLL-U parsers in Python. I would just like to get confirmation that NLTK does not have a native routine to parse CoNLL-U (or other CoNLL formats with dependency syntax).

Looking at the code, it seems HEAD and DEP are not among the permitted column types of conll. This is very unexpected because CoNLL-U is very popular nowadays, dependency syntax has been a core feature of many CoNLL formats since about 15 years, and this gap is not documented anywhere, so I'm pretty sure I'm overlooking something.

1

There are 1 best solutions below

2
On