When I have an "order by" clause inside of a hive query, for example:
SELECT *
FROM categories
ORDER BY category_name
The results will be sorted as all the capital letters first and then all the lower letters. I need some table constraint or configuration to enforce the below behavior. A session sorting with UPPER/LOWER won't help.
Current results:
AAA
KKK
ZZZ
aaa
bbb
yyy
Expected results:
aaa
AAA
bbb
KKK
yyy
ZZZ
Is there any configuration which enforces hive to sort the data Alphabetical sorting first?
Within sql it's a collation. Within oracle it's LTS.
What is the right configuration for this kind of expected sorting results, and where to set it?
How about just using
lower()
?Note: this will be arbitrary about the case of the result. Because lower-case letters come after upper case in all modern collations, you could do: