Loading the treebank corpus with Brown's tagset

471 Views Asked by pg2455 At 14 June 2025 at 23:46

I have the WSJ treebank corpus from NLTK. I want to load it with the tagset of the Brown corpus. Is that possible?

import nltk
wsj = nltk.corpus.treebank.tagged_sents(tagset='universal')  # universal tags
wsj2 = nltk.corpus.treebank.tagged_sents()  # treebank-specific tags

b3000 On 23 July 2015 at 16:13

According to the discussion in this thread, it is not possible.

So far, NLTK can only map specific tagsets to the universal tagset. Maybe one of the solutions suggested in the discussion can help:

This is apparently not supported in NLTK yet, but see Dan Zeman's Interset tool or my script at https://gist.github.com/nschneid/6476715
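If only a rough conversion is needed, one workaround is to relabel the treebank tags yourself with a lookup table. The sketch below is a minimal illustration, not an NLTK feature: PTB_TO_BROWN, retag_sentence and retag_corpus are hypothetical names, and the table covers only a handful of tags where the two tagsets line up fairly directly. Brown is finer-grained than the Penn Treebank tagset in places (for example, forms of "be", "have" and "do" get their own tags), so a complete conversion cannot be done from the tag alone and would also need to look at the word.

```python
# Hypothetical, deliberately incomplete Penn Treebank -> Brown tag table.
# Only a few tags with a fairly direct counterpart are listed (assumption:
# an approximate mapping is acceptable for these categories).
PTB_TO_BROWN = {
    "NN": "NN",     # singular common noun
    "NNS": "NNS",   # plural common noun
    "NNP": "NP",    # singular proper noun
    "NNPS": "NPS",  # plural proper noun
    "JJ": "JJ",     # adjective
    "JJR": "JJR",   # comparative adjective
    "JJS": "JJT",   # superlative adjective
    "CC": "CC",     # coordinating conjunction
    "CD": "CD",     # cardinal number
}


def retag_sentence(tagged_sent, mapping=PTB_TO_BROWN, unknown="UNK"):
    """Return a copy of one tagged sentence with its tags replaced.

    Tags missing from the mapping become `unknown` instead of being guessed.
    """
    return [(word, mapping.get(tag, unknown)) for word, tag in tagged_sent]


def retag_corpus(tagged_sents, mapping=PTB_TO_BROWN, unknown="UNK"):
    """Lazily retag an iterable of tagged sentences."""
    for sent in tagged_sents:
        yield retag_sentence(sent, mapping, unknown)


# Small hand-written sample in the (word, tag) format that tagged_sents() returns.
sample = [
    ("Pierre", "NNP"), ("Vinken", "NNP"), ("will", "MD"),
    ("join", "VB"), ("the", "DT"), ("board", "NN"),
]
print(retag_sentence(sample))
```

With NLTK installed and the treebank sample downloaded, the same function could be applied as retag_corpus(nltk.corpus.treebank.tagged_sents()). Tags left as "UNK" show where the table would need extending, or where a word-level rule is required.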
