Please suggest me a downloadable English corpus that contains informal, playful words such as 'gonna', 'LOL' and 'wanna'

2

There are 2 best solutions below

0
On BEST ANSWER

Use 'NetLingo'. They have a rich content :)

0
On

I don't know such lexicon but you can try to do this, alternatively:

  • Get the vocabulary V1 of Twitter or other web and chat corpus.
  • Get the vocabulary V2 of literary corpus.

The lexicon you want might be V1 \ V2 i.e. all the words of V1 which are not in V2.

Using Python, NLTK provides corpora (see nltk.corpus.webtext). Moreover, as @mbatchkarov said in the comments: Twitter is full of informal language.