How to Find & Select NON English Characters in Hive Table

23 Views Asked by At

How to find Table records which are having Non-English (Spanish/Chinese/Japanese/..etc) characters in multiple columns. I tried below query but I couldn't able to find exact result. Can someone please help me on this.

Query:

SELECT 
      tbl1.id                 AS id
    , tbl1.first_name         AS first_name
    , tbl1.second_name        AS second_name
FROM emp_table tbl1                                                         
WHERE ((tbl1.first_name RLIKE '[^a-zA-Z\\d\\s:]') OR (tbl1.second_name RLIKE '[^a-zA-Z\\d\\s:]')) 
;

Eg:

id  first_name  second_name
1   尚            爱华
2   Shané        Olum�
3   Abc           Xyz
4   PQR           RST

Expected Output would be ID :

id  first_name  second_name
1   尚            爱华
2   Shané        Olum�
0

There are 0 best solutions below