I am using Twilio Programmable Video and trying to pipe a remote participant's audio in real time to the Google Cloud Media Translation client.
There is sample code showing how to use the Google Cloud Media Translation client with a microphone here.
What I am trying to accomplish is that, instead of using a microphone and node-record-lpcm16, I want to pipe what I am getting from Twilio's AudioTrack to the Google Cloud Media Translation client. According to this doc,
Tracks represent the individual audio, data, and video media streams that are shared within a Room.
Also, according to this doc, an AudioTrack contains an audio MediaStreamTrack. I am guessing this can be used to extract the audio and pipe it somewhere else.
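For reference, this is roughly how I get hold of the track today (a sketch using twilio-video.js on an already-connected Room; `handleAudioTrack` is just a placeholder callback I made up for whatever consumes the extracted track):

```javascript
// Listen for remote tracks on a connected twilio-video.js Room and hand the
// underlying MediaStreamTrack of any audio track to a callback.
// handleAudioTrack is a hypothetical callback, not a Twilio API.
function listenForRemoteAudio(room, handleAudioTrack) {
  room.on('trackSubscribed', (track) => {
    if (track.kind === 'audio') {
      // RemoteAudioTrack exposes the raw MediaStreamTrack here
      handleAudioTrack(track.mediaStreamTrack);
    }
  });
}
```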
What's the best way of tackling this problem?
Twilio developer evangelist here.
With the MediaStreamTrack you can compose it back into a MediaStream object and then pass it to a MediaRecorder. When you start the MediaRecorder it will receive dataavailable events, each of which contains a chunk of audio in the webm format. You can then pipe those chunks elsewhere to do the translation. I wrote a blog post on recording using the MediaRecorder, which should give you a better idea how the MediaRecorder works, but you will have to complete the work to stream the audio chunks to the server to be translated.
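Putting that together, here's a rough sketch of the browser side. Assumptions: you have a WebSocket connection to your own server endpoint that relays the chunks on to the Media Translation API, and the 500ms timeslice and `audio/webm` mime type are just example values you can tune:

```javascript
// Wrap a MediaStreamTrack in a MediaStream, record it with MediaRecorder,
// and forward each webm chunk over a WebSocket as it becomes available.
// mediaStreamTrack: the audio MediaStreamTrack from the Twilio AudioTrack.
// socket: an open WebSocket to your relay server (hypothetical endpoint).
function streamTrackToServer(mediaStreamTrack, socket, timesliceMs = 500) {
  // Compose the track back into a MediaStream so MediaRecorder can consume it
  const stream = new MediaStream([mediaStreamTrack]);
  const recorder = new MediaRecorder(stream, { mimeType: 'audio/webm' });

  // Each dataavailable event carries roughly timesliceMs worth of webm audio
  recorder.addEventListener('dataavailable', (event) => {
    if (event.data.size > 0 && socket.readyState === WebSocket.OPEN) {
      socket.send(event.data);
    }
  });

  // Ask the recorder to emit a chunk every timesliceMs milliseconds
  recorder.start(timesliceMs);
  return recorder; // call recorder.stop() when the participant disconnects
}
```

Note that the chunks arrive as webm/opus, so on the server you will still need to decode or transcode them into whatever encoding the Media Translation API expects before forwarding.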