Stanford CoreNLP train model from text file like englishPCFG.ser.gz

636 Views Asked by At

I am new to Stanford CoreNLP, Initially I have worked with Moses project. So far I have worked with the demo file ParserDemo2 and everything worked fine using englishPCFG.caseless.ser.gz model. I need to create my own model, from the text English monolingual corpus which I have.

So far I have searched and found that I need to create a TreeBank and use method trainFromTreebank in LexicalizedParser class.

I am really confused how to do this.

Can you provide some information or point me to the documentation on how to do so?

1

There are 1 best solutions below

6
Jon Gauthier On

The Stanford Parser FAQ answers: "Can I train the parser?"

It's probably easiest to start with a vanilla PCFG model and then work your way up with state-splitting, etc. to more complex models. See "Can I just use the parser as a vanilla PCFG parser?"