Choosing character sets and collations for combined Latin / Cyrillic language data
How should I configure a MySQL database in phpMyAdmin to store both Latin and Cyrillic data in the same table, for a multi-language application?
Asked by Supreme
When you create your database, you can choose a default...
You give a command like this:
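For instance, a statement along these lines creates a database whose default character set and collation are Unicode. The database name `multilang_app` is a placeholder for illustration, not something taken from the question.

```sql
-- Hypothetical database name; substitute your own.
-- Tables created in this database inherit these defaults
-- unless they specify their own.
CREATE DATABASE multilang_app
  DEFAULT CHARACTER SET utf8mb4
  DEFAULT COLLATE utf8mb4_unicode_ci;
```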
phpMyAdmin has a dialog box that prompts you for those values.
(MySQL loves to brag about its Swedish roots: before version 8.0, its serverwide defaults are the latin1 character set and the latin1_swedish_ci collation. So be aware you might have to override the defaults. If I were Swedish I would brag too.)
Then, you can, if you wish, override those choices for each table or even for each column of a table.
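As a rough sketch of what such overrides can look like (the table and column names below are made up for illustration):

```sql
-- Hypothetical table: the table-level default covers the text columns,
-- while one column overrides it with a narrower character set.
CREATE TABLE articles (
  id       INT UNSIGNED NOT NULL AUTO_INCREMENT PRIMARY KEY,
  title_en VARCHAR(255) NOT NULL,  -- inherits utf8mb4 from the table
  title_ru VARCHAR(255) NOT NULL,  -- inherits utf8mb4 from the table
  sku      VARCHAR(32) CHARACTER SET ascii COLLATE ascii_bin NOT NULL
) DEFAULT CHARACTER SET utf8mb4 COLLATE utf8mb4_unicode_ci;

-- Convert every text column of an existing table (and its default).
-- Note that this also converts the sku column overridden above.
ALTER TABLE articles
  CONVERT TO CHARACTER SET utf8mb4 COLLATE utf8mb4_unicode_ci;
```

Latin and Cyrillic text can share the same column; separate columns per language are used here only to show that both inherit the same table default.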
The character set is the most important of these choices, because the data you put into tables will be stored in that character set. If your application is a new start, you should pick the character set utf8mb4. In any case you should pick a Unicode character set like utf8, keeping in mind that MySQL's utf8 stores at most three bytes per character and so cannot hold characters outside the Basic Multilingual Plane, such as emoji. Unicode is capable of representing almost all known natural languages with a single character set, including English, Spanish, Russian and other languages written in Cyrillic, Magyar, Hebrew, Turkish, Greek, Arabic, and East Asian languages. See here for a description of the various character sets.
https://dev.mysql.com/doc/refman/5.6/en/charset-unicode-sets.html
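To make the point concrete, here is a small sketch that stores Latin and Cyrillic text in the same column. The table name `greetings` is hypothetical, and the example assumes the client really sends UTF-8 text, which is why the connection character set is declared first.

```sql
-- Tell the server that this connection sends and expects utf8mb4.
SET NAMES utf8mb4;

-- Hypothetical temporary table, dropped automatically when the session ends.
CREATE TEMPORARY TABLE greetings (
  id   INT UNSIGNED NOT NULL AUTO_INCREMENT PRIMARY KEY,
  lang CHAR(2) NOT NULL,
  body VARCHAR(100) NOT NULL
) DEFAULT CHARACTER SET utf8mb4 COLLATE utf8mb4_unicode_ci;

INSERT INTO greetings (lang, body) VALUES
  ('en', 'Hello'),
  ('es', 'Señor'),
  ('ru', 'Привет');

-- CHAR_LENGTH counts characters, LENGTH counts bytes.
-- Hello: 5 chars / 5 bytes; Señor: 5 / 6; Привет: 6 / 12.
SELECT lang, body, CHAR_LENGTH(body) AS chars, LENGTH(body) AS bytes
FROM greetings
ORDER BY id;
```

If the byte counts come back larger than these, the text was probably encoded twice on its way in, which usually points to a mismatch between the connection character set and what the application actually sends.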
The collation governs how text is sorted and compared in searches. MySQL offers many case-insensitive collations. This is really cool for natural language text, because a search for a word finds it regardless of capitalization.
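A quick way to see the effect, as a sketch that only compares string literals and assumes a utf8mb4 connection:

```sql
SET NAMES utf8mb4;

-- The same comparison under a case-insensitive and a binary collation.
-- ci_match is 1 (the strings are treated as equal), bin_match is 0.
SELECT
  'москва' = 'МОСКВА' COLLATE utf8mb4_unicode_ci AS ci_match,
  'москва' = 'МОСКВА' COLLATE utf8mb4_bin        AS bin_match;
```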
You should pick utf8mb4_unicode_ci for a new start, or utf8_unicode_ci. That should serve you well unless you have very specific linguistic details to deal with. (Spanish, for example, handles Ñ as a separate letter rather than as an accented variant of N. To get that right you need to use the utf8mb4_spanish_ci or utf8_spanish_ci collation.)
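The Spanish case can be checked directly. This sketch compares two literals under both collations, again assuming a utf8mb4 connection:

```sql
SET NAMES utf8mb4;

-- Under the general Unicode collation, n and ñ compare as equal;
-- under the Spanish collation, ñ is a distinct letter.
-- unicode_match is 1, spanish_match is 0.
SELECT
  'n' = 'ñ' COLLATE utf8mb4_unicode_ci AS unicode_match,
  'n' = 'ñ' COLLATE utf8mb4_spanish_ci AS spanish_match;
```

The same `COLLATE` clause can be attached to a single column, or to an `ORDER BY` expression in one query, so a mostly multilingual table can still sort one language by its own rules.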