I need a part-of-speech tagger that returns not just the single best tag sequence for a given sentence, but the n-best tag sequences, ordered by probability. For 'time flies like an arrow', for example, it could return both NN VBZ IN DT NN and NN NNS VBP DT NN. I need to train the tagger on my own tag set and example sentences, and I would like to be able to engineer my own features over the sentence. If one of the NLTK taggers had this functionality, that would be great, but any tagger that I can interface with my Python code would do. Thanks in advance for any suggestions.
NLTK (or other) Part of speech tagger that returns n-best tag sequences
224 views. Asked by user3449494.
I would recommend having a look at spaCy. From what I have seen, it does not return the top-n tags by default, but it supports custom pipeline components, which is where such behaviour could be added.
There is also an issue on GitHub where exactly this is discussed, with some suggestions on how to implement it relatively quickly.
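To make the n-best idea itself concrete, here is a minimal sketch in plain Python. It is neither spaCy's nor NLTK's API: it assumes a simple HMM-style model given as dictionaries of log-probabilities, and the function name `n_best_viterbi`, the `FLOOR` value for unseen events, and all the toy numbers are made up for illustration. Instead of keeping one best path per tag at each position, as ordinary Viterbi does, it keeps the top n and follows backpointers for each of them at the end.

```python
import heapq
import math

FLOOR = math.log(1e-6)  # assumed log-probability for unseen events (hypothetical smoothing)

def n_best_viterbi(words, tags, log_start, log_trans, log_emit, n=2):
    """Return up to n (log_prob, tag_sequence) pairs, best first."""
    if not words:
        return []
    # beams[i][tag] holds up to n entries (score, prev_tag, prev_rank), best first.
    beams = [{
        t: [(log_start.get(t, FLOOR) + log_emit[t].get(words[0], FLOOR), None, None)]
        for t in tags
    }]
    for word in words[1:]:
        prev = beams[-1]
        column = {}
        for tag in tags:
            emit = log_emit[tag].get(word, FLOOR)
            # Extend every kept path ending in every previous tag.
            candidates = [
                (score + log_trans[p].get(tag, FLOOR) + emit, p, rank)
                for p in tags
                for rank, (score, _, _) in enumerate(prev[p])
            ]
            column[tag] = heapq.nlargest(n, candidates, key=lambda c: c[0])
        beams.append(column)

    # Pick the n best final entries and follow their backpointers.
    finals = [
        (score, tag, rank)
        for tag in tags
        for rank, (score, _, _) in enumerate(beams[-1][tag])
    ]
    results = []
    for score, tag, rank in heapq.nlargest(n, finals, key=lambda c: c[0]):
        seq = []
        for column in reversed(beams):
            seq.append(tag)
            _, tag, rank = column[tag][rank]
        results.append((score, seq[::-1]))
    return results

def logs(table):
    """Convert a dict of probabilities to log-probabilities."""
    return {k: math.log(v) for k, v in table.items()}

# Toy model with invented probabilities, only to exercise the function.
tags = ["NN", "NNS", "VBZ", "VBP", "IN", "DT"]
start = logs({"NN": 0.8, "DT": 0.2})
trans = {
    "NN": logs({"VBZ": 0.5, "NNS": 0.3, "NN": 0.2}),
    "NNS": logs({"VBP": 0.7, "IN": 0.3}),
    "VBZ": logs({"IN": 0.6, "DT": 0.4}),
    "VBP": logs({"DT": 0.8, "IN": 0.2}),
    "IN": logs({"DT": 0.7, "NN": 0.3}),
    "DT": logs({"NN": 0.9, "NNS": 0.1}),
}
emit = {
    "NN": logs({"time": 0.5, "arrow": 0.5}),
    "NNS": logs({"flies": 1.0}),
    "VBZ": logs({"flies": 1.0}),
    "VBP": logs({"like": 1.0}),
    "IN": logs({"like": 1.0}),
    "DT": logs({"an": 1.0}),
}

for log_p, seq in n_best_viterbi("time flies like an arrow".split(), tags, start, trans, emit, n=2):
    print(round(math.exp(log_p), 5), " ".join(seq))
```

A plain HMM like this does not cover the feature-engineering requirement from the question. The same top-n bookkeeping should carry over to any tagger that scores a sequence as a sum of per-position and tag-to-tag terms (a linear-chain CRF, for instance), but that part is an assumption and not shown here.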