How to implement real-time transcription with speaker identification for Google Meet

685 Views Asked by Gulzar Ali At 18 April 2023 at 07:07

I'm developing a tool that can automatically join a Google Meet session, record the audio, and generate real-time notes that are aware of who is speaking. The tool should be able to identify speakers and accurately associate their spoken words with their names.

Is there an official Google API available for this purpose, or are there any other recommended approaches to achieve this functionality?

I attempted to implement this using Google Cloud Speech-to-Text, but I found that the service requires the meeting to be pre-recorded before it can transcribe the audio. Additionally, the speaker recognition was not satisfactory, because we can't get the actual speaker names. I have also tried to scrape the Google Meet captions, but that does not seem to be a reliable solution. I want something like webkitSpeechRecognition, but with identification of speakers.


There is 1 best solution below

user2207488 On 22 November 2023 at 17:26

Is there an official Google API available for this purpose, or are there any other recommended approaches to achieve this functionality?

Looks like part of the problem might be addressed by this new Google Meet API, though it's still in preview: https://developers.google.com/meet/api/guides/overview
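
On the missing speaker names mentioned in the question: diarization output typically labels speakers with numeric tags rather than names, so a separate mapping step is needed whichever transcription service is used. Below is a minimal sketch of that step only, assuming word-level results that each carry a speaker tag. The Word class, the attribute_words function, and the names mapping are hypothetical and not part of any Google API.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Tuple


@dataclass
class Word:
    """Hypothetical word-level result: the text plus a diarization tag."""
    text: str
    speaker_tag: int  # numeric label from diarization, not a real identity


def attribute_words(
    words: Iterable[Word], names: Dict[int, str]
) -> List[Tuple[str, str]]:
    """Group consecutive words by speaker and attach a display name.

    `names` is an assumed, user-supplied mapping from speaker tag to name.
    """
    lines: List[Tuple[str, List[str]]] = []
    for word in words:
        # Fall back to a generic label when the tag has no known name.
        name = names.get(word.speaker_tag, f"Speaker {word.speaker_tag}")
        if lines and lines[-1][0] == name:
            # Same speaker as the previous word: extend the current line.
            lines[-1][1].append(word.text)
        else:
            # Speaker changed: start a new line.
            lines.append((name, [word.text]))
    return [(name, " ".join(parts)) for name, parts in lines]


if __name__ == "__main__":
    sample = [Word("hello", 1), Word("everyone", 1), Word("hi", 2)]
    for speaker, text in attribute_words(sample, {1: "Alice"}):
        print(f"{speaker}: {text}")
```

The mapping itself has to come from somewhere else (for example, labelling each tag by hand after the first few utterances); this sketch does not solve that part.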
