I'm looking for a dataset with the Mandarin pronunciation of every Chinese character in bopomofo and/or pinyin. It also needs to be open source, so that I can copy it into my own code bases.
Where can I find Chinese character bopomofo/pinyin data?
Asked by Nathan Breit
NallaN
This is a bit of a late entry, but I was searching for the same thing last year and ended up compiling my own character/bopomofo database from a number of different data sets. I have put enough work into it to call it my own, so you should check it out! It's part of a RubyGem I made to sort by bopomofo (I had a system that would not let me change the database collation settings): https://github.com/nallan/a-b-chi
It sounds like you might be looking for the Unihan Database. The Unihan Database is maintained by the Unicode Consortium.
For an example, here is the data for 爱.
Here is the description of the organization and content of the Unihan Database. Be sure to read that to understand what the data is referring to.
If this is the information you want, you can download the ZIP archive that contains all this data.
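As a rough sketch of how the downloaded data could be read: the files in the archive are plain UTF-8 text with one tab-separated record per line (code point, field name, value), and comment lines start with `#`. The Python below assumes the Mandarin readings live under the `kMandarin` field in a file named `Unihan_Readings.txt`. The function name `parse_unihan_readings` and the hand-written sample lines are only illustrative.

```python
def parse_unihan_readings(lines, field="kMandarin"):
    """Map each character to its list of readings for one Unihan field."""
    readings = {}
    for line in lines:
        line = line.strip()
        # Skip blank lines and the "#" comment header.
        if not line or line.startswith("#"):
            continue
        parts = line.split("\t")
        if len(parts) != 3:
            continue
        code_point, key, value = parts
        if key != field:
            continue
        # "U+7231" -> "爱"
        char = chr(int(code_point[2:], 16))
        # A field may hold several space-separated readings.
        readings[char] = value.split()
    return readings


# Hand-written sample lines in the same shape as the real file.
sample = [
    "# Sample lines in Unihan data file format",
    "U+4E2D\tkCantonese\tzung1 zung3",
    "U+4E2D\tkMandarin\tzhōng",
    "U+7231\tkMandarin\tài",
]
mandarin = parse_unihan_readings(sample)
print(mandarin)

# With the real file (assumed name), something like this should work:
# with open("Unihan_Readings.txt", encoding="utf-8") as f:
#     mandarin = parse_unihan_readings(f)
```

Passing any iterable of lines keeps the parser independent of how the archive was downloaded or unpacked.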
The Unihan Database doesn't have Bopomofo (Zhuyin) pronunciations, but it has Pinyin readings. Converting Pinyin to Zhuyin is a mechanical, syllable-by-syllable mapping, and there are a lot of online tools that can do it for you.
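If you would rather do the conversion in your own code, here is a minimal sketch of the mapping in Python. It is not taken from any particular tool, and the name `pinyin_to_zhuyin` is hypothetical. It assumes one tone-marked Pinyin syllable as input, treats an unmarked syllable as neutral tone, and returns `None` for anything it does not recognise (for example interjections such as "hm" or "ng").

```python
import unicodedata

UE = "\u00fc"  # ü
# Combining marks used for Pinyin tones 1-4 after NFD normalisation.
TONE_MARKS = {"\u0304": 1, "\u0301": 2, "\u030c": 3, "\u0300": 4}
# Zhuyin tone signs: tone 1 is unmarked, then ˊ ˇ ˋ.
TONE_SIGNS = {1: "", 2: "\u02ca", 3: "\u02c7", 4: "\u02cb"}
NEUTRAL = "\u02d9"  # ˙, written before the syllable

INITIALS = {
    "b": "ㄅ", "p": "ㄆ", "m": "ㄇ", "f": "ㄈ", "d": "ㄉ", "t": "ㄊ",
    "n": "ㄋ", "l": "ㄌ", "g": "ㄍ", "k": "ㄎ", "h": "ㄏ", "j": "ㄐ",
    "q": "ㄑ", "x": "ㄒ", "zh": "ㄓ", "ch": "ㄔ", "sh": "ㄕ", "r": "ㄖ",
    "z": "ㄗ", "c": "ㄘ", "s": "ㄙ",
}
FINALS = {
    "a": "ㄚ", "o": "ㄛ", "e": "ㄜ", "\u00ea": "ㄝ", "ai": "ㄞ", "ei": "ㄟ",
    "ao": "ㄠ", "ou": "ㄡ", "an": "ㄢ", "en": "ㄣ", "ang": "ㄤ", "eng": "ㄥ",
    "er": "ㄦ", "i": "ㄧ", "ia": "ㄧㄚ", "io": "ㄧㄛ", "ie": "ㄧㄝ",
    "iao": "ㄧㄠ", "iou": "ㄧㄡ", "ian": "ㄧㄢ", "in": "ㄧㄣ",
    "iang": "ㄧㄤ", "ing": "ㄧㄥ", "iong": "ㄩㄥ", "u": "ㄨ", "ua": "ㄨㄚ",
    "uo": "ㄨㄛ", "uai": "ㄨㄞ", "uei": "ㄨㄟ", "uan": "ㄨㄢ", "uen": "ㄨㄣ",
    "uang": "ㄨㄤ", "ueng": "ㄨㄥ", "ong": "ㄨㄥ", UE: "ㄩ", UE + "e": "ㄩㄝ",
    UE + "an": "ㄩㄢ", UE + "n": "ㄩㄣ",
}
# Pinyin spells these finals in a shortened form after an initial.
CONTRACTED = {"iu": "iou", "ui": "uei", "un": "uen"}
# After these initials a written "i" is not a separate Zhuyin symbol.
EMPTY_RIME = ("zh", "ch", "sh", "r", "z", "c", "s")


def pinyin_to_zhuyin(syllable):
    """Convert one tone-marked Pinyin syllable to Zhuyin, or return None."""
    tone = 5  # no tone mark found: assume neutral tone
    letters = []
    for ch in unicodedata.normalize("NFD", syllable.lower()):
        if ch in TONE_MARKS:
            tone = TONE_MARKS[ch]
        else:
            letters.append(ch)
    # Recompose so that u + diaeresis becomes a single ü again.
    base = unicodedata.normalize("NFC", "".join(letters))

    # Undo the y/w spellings used for syllables without an initial.
    if base.startswith("yu"):
        base = UE + base[2:]
    elif base.startswith("yi"):
        base = base[1:]
    elif base.startswith("y"):
        base = "i" + base[1:]
    elif base.startswith("wu"):
        base = base[1:]
    elif base.startswith("w"):
        base = "u" + base[1:]

    if base[:2] in INITIALS:
        initial = base[:2]
    elif base[:1] in INITIALS:
        initial = base[:1]
    else:
        initial = ""
    final = base[len(initial):]

    # After j, q, x a written "u" stands for ü.
    if initial in ("j", "q", "x") and final.startswith("u"):
        final = UE + final[1:]
    final = CONTRACTED.get(final, final)

    if initial in EMPTY_RIME and final == "i":
        body = INITIALS[initial]
    elif final in FINALS:
        body = INITIALS.get(initial, "") + FINALS[final]
    else:
        return None
    return NEUTRAL + body if tone == 5 else body + TONE_SIGNS[tone]


for reading in ["ài", "zhōng", "xué", "nǚ"]:
    print(reading, pinyin_to_zhuyin(reading))
```

A rule-based split into initial, final and tone keeps the tables small. A full lookup table of every valid syllable would be a reasonable alternative if you need to reject invalid combinations.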
As for licensing issues, the Unihan Database data files have a liberal copyright notice. So, you shouldn't run into any problems using that data in your own software.