I want to build an app that recognizes which emojis appear on a wallpaper.
For instance, the app will receive this image as input:
And as output it should return an array with the names of the recognized emojis:
[
"Smiling Face with Sunglasses",
"Grinning Face with Smiling Eyes",
"Kissing Face with Closed Eyes"
]
Of course, the names of these emojis will come from the file names of the training images. For example, this file:
It will be called Grinning_Face_with_Smiling_Eyes.jpg
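To make the naming scheme concrete, here is a minimal Python sketch of how a label could be derived from a training file name. The helper name `label_from_filename` is hypothetical, and it assumes that underscores in the file name stand for spaces in the emoji name:

```python
from pathlib import Path


def label_from_filename(filename: str) -> str:
    """Turn a training-image file name into a human-readable label.

    Assumption: underscores in the file name stand for spaces.
    """
    # Path(...).stem drops any directory part and the file extension.
    return Path(filename).stem.replace("_", " ")


print(label_from_filename("Grinning_Face_with_Smiling_Eyes.jpg"))
# prints: Grinning Face with Smiling Eyes
```

The same mapping would apply to the book covers if their files follow the same naming convention.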
I would like to use Amazon Rekognition Custom Labels or Google AutoML Vision, but they require a minimum of 10 training images per emoji. I can only provide one image of each emoji, because there are no other variants: they are flat 2D graphics ;)
Now my question is: What should I do? How can I get around this requirement? Which service should I choose?
PS. In the real business case, the AI has to recognize book covers instead of emojis. There, too, only one 2D image is available per book cover.