I have a database and API for hindi wordnet. I want to access this wordnet from NLTK python, so as to use NLTK Wordnet functions with our wordnet. Is there any way to add our own wordnet into NLTK? Or Are there any tools for Word Sense Disambiguation in Hindi (that can work with any Language Wordnet with some modifications) (which gives most suitable sense from wordnet)?
I have a database and API for hindi wordnet. I want to access this wordnet from NLTK python. Is there any way to add our own wordnet into NLTK?
1.2k Views Asked by Aniruddha Tammewar At
1
There are 1 best solutions below
Related Questions in PYTHON
- new thread blocks main thread
- Extracting viewCount & SubscriberCount from YouTube API V3 for a given channel, where channelID does not equal userID
- Display images on Django Template Site
- Difference between list() and dict() with generators
- How can I serialize a numpy array while preserving matrix dimensions?
- Protractor did not run properly when using browser.wait, msg: "Wait timed out after XXXms"
- Why is my program adding int as string (4+7 = 47)?
- store numpy array in mysql
- how to omit the less frequent words from a dictionary in python?
- Update a text file with ( new words+ \n ) after the words is appended into a list
- python how to write list of lists to file
- Removing URL features from tokens in NLTK
- Optimizing for Social Leaderboards
- Python : Get size of string in bytes
- What is the code of the sorted function?
Related Questions in NLTK
- Removing URL features from tokens in NLTK
- Django webapp (on an Apache2 server) hangs indefintely when importing nltk in views.py
- Stanford Entity Recognizer (caseless) in Python Nltk
- How to Train an Input File containing lines of text in NLTK Python
- Python child process silently crashes when issuing an HTTP request
- 'NoneType' object has no attribute 'kill_cursors' when nltk is imported
- NLTK - Get and Simplify List of Tags
- Check if items in list a are found in list b and return list c with matching indexes of list b in Python
- Extract word from a list of synsets in NLTK for Python
- Python NLTK pos_tag not returning the correct part-of-speech tag
- Using WordNet-Affect with NLTK
- Check the similarity between two words with NLTK with Python
- How to remove a custom word pattern from a text using NLTK with Python
- Printing Simplified Corpus to Json File
- NLTK: Package Errors? punkt and pickle?
Related Questions in WORDNET
- How can I make a hash table for all relations in wordNet and cPickle them?
- Extract word from a list of synsets in NLTK for Python
- Using WordNet-Affect with NLTK
- jwnl unable to install database
- How to tag monosemous words
- Is RiTa framework supporting Android?
- How to find the path length between 2 senses?
- Word of a particular domain from Wordnet
- lemmatize plural nouns using nltk and wordnet
- Implement a semantic web search engine using Arabic wordnet
- Find Synonyms using Arabic Wordnet in java
- Writing a function that lemmatizes all words in a sentence by considering their POS tags
- Python - WordNet NLTK KeyError
- How to count noun's hyponyms that does not have hyponyms with NLTK and WordNet?
- What is pos of `r` or `s` in Wordnet via NLTK
Related Questions in HINDI
- google speech recognition api in hindi
- Unable to copy exact hindi content from pdf
- Change locale in android app (onto Hindi)
- Weird characters while displaying hindi text in android
- BreakIterator in Android counts character wrongly
- How to detect if a string contains hindi (devnagri) in it with character and word count
- Tcpdf Hindi sentence display issue
- How to set UTF-8 encoding in solr-php client to store hindi content in solr
- I want to upload file containing Hindi language words and English words to phpMyadmin but getting errors
- AS3 - Hindi font half letters getting converted
- Unknown fonts in mysql database
- Hindi font is not showing in 32 bit machine. (windows 7) what could be the possible reason?
- Supporting various languages (Hindi, Arabic, Tamil) in Android
- Display hindi html in android webview
- QT/QML hindi font support
Related Questions in WSD
- Word sense disambiguation using WEKA
- Can I use WMI to manage WSD devices (specifically printers)?
- I have a database and API for hindi wordnet. I want to access this wordnet from NLTK python. Is there any way to add our own wordnet into NLTK?
- entity recognition and disambiguation api
- WS Discovery vs. SSDP - Pros and Cons
- how can I get lesk similarity score for all the senses of given words using ws4j library?
- C# wia + network scanner How do I connect?
- Using WSDCreateHttpAddress
- What other inputs are there to Word Sense Disambiguation task?
- How to get a list of devices enumerated in Explorer network using C++
- List of stopwords for NLP
- "An error occurred when verifying security for the message." WSDL Zeep
- Is there any way to extract the sense_id or sense_key of a particular word from semcor-tagged sentences?
- Soap Unable to connect from one server yet the same file connects from another
- How to extract feature in Hindi Word Sense Disambiguation task
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
If you look in your nltk_data folder, you'll see that wordnet like every other NLTK corpus is just a bunch of plain-text files. So, there must be a way to format your Hindi wordnet the same way as the NLTK one to use the functions. Here is the excerpt from the nltk.corpus.reader.wordnet object where these files are being read:
I suppose you don't really need to generate all these files but more importantly have to use the "index.sense" file for Word Sense Disambiguation. This is not generated by NLTK but have to be pre-processed before that or must be coming with your Hindi wordnet in the following format - http://wordnet.princeton.edu/wordnet/man/senseidx.5WN.html.
After you've done all steps I would just go to ../nltk/corpus/reader/wordnet.py and either create a copy of it where you can change the root and filenames and maybe some other dependencies but still use the functionality OR change what you need within existing classes (not recommended).
P.S. A little of googling gave me the link to http://www.cs.utexas.edu/~rashish/cs365ppt.pdf, which references a bunch of other sources on the subject.