I'm starting new project, this project should take two or more languages in the same audio as an input and outputs the speech in one language transcript. any thoughts how I can achieve this project
I tried wav2vec ,Nivida Nemo and deepspeech but no luck.