Is it possible to get an approximate duration of each word in an audio file? The closest thing (for audio files from youtube videos) is to download the captions file as an srt
. The srt
will then have the duration for each sentence in the video.
I was wondering if it is possible to somehow get the duration for each word in a sentence. Maybe not accurate but something around that ?