I'm trying to count occurences of all letters in Turkish alphabet in a MySQL database.
When I try to count letter "a" like this, I get correct result :
while($nt=mysql_fetch_array($rt))
{
$mystring = $nt["word"];
for($i = 0; $i < strlen($mystring) ; $i++)
{
if($mystring[$i] == 'a')
{
$a++;
}
}
}
When I replace "a", with "ç" I get zero. I already added this code :
$bd = mysql_connect($mysql_hostname, $mysql_user, $mysql_password) or die("database unavailable");
mysql_set_charset('utf8', $bd);
How can I fix my code for Turkish characters? Thanks.
In UTF-8
ç
is encoded as two bytes (C3 A7
), therefore byte-by-byte comparison won't work. Considersubstr_count
:or use a unicode-aware function like this:
This assumes that your string are actually UTF-8, if this is not the case (hint:
var_dump(rawurlencode($str))
), check your DB and connection settings (see the linked thread).