I have set up an IVR in Twilio Studio with a voice prompt where the user states some information. I would like the IVR to produce either a transcript of that audio or a media file of the full audio (which I will use elsewhere). I can't find this functionality in the docs, and I'm wondering whether there is a workaround or another solution.
Edit for more context - I want to ask the user to say their street address when prompted (which I will then use to look up their account and scheduled appointments).
Currently I can only set logic for things like "audio contains/does not contain X", but I want to implement more advanced logic and need the entirety of the user's answer.
A good starting point for transcription is here: https://www.twilio.com/docs/glossary/what-is-call-transcription.
When using Gather with input set to speech, the "action" URL is the endpoint where you will receive the transcribed text after the caller finishes speaking.
The "partialResultCallback" attribute is the URL where you receive the transcription in real time, as soon as each piece is processed.
Here is some C# based on the example for gathering speech:
The /completed endpoint will receive the transcribed text after the input is finished or times out.
The /partial endpoint will receive the transcribed text as soon as it is available.
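As a rough illustration of the TwiML involved, here is a minimal sketch that builds a Gather with both URLs. It uses Python with only the standard library rather than a Twilio helper library, so the function name `build_gather_twiml` and the prompt wording are my own assumptions; only the `input`, `action`, and `partialResultCallback` attributes come from Twilio's Gather verb.

```python
import xml.etree.ElementTree as ET


def build_gather_twiml(action_url="/completed", partial_url="/partial"):
    """Return TwiML that asks the caller to speak and posts the results.

    The helper name and the prompt text are illustrative assumptions.
    """
    response = ET.Element("Response")
    gather = ET.SubElement(
        response,
        "Gather",
        {
            "input": "speech",                     # collect speech, not keypad digits
            "action": action_url,                  # receives the final transcription
            "partialResultCallback": partial_url,  # receives interim transcriptions
        },
    )
    ET.SubElement(gather, "Say").text = "Please say your street address."
    # Spoken only if the Gather ends without any input from the caller.
    ET.SubElement(response, "Say").text = "Sorry, I did not hear anything."
    return ET.tostring(response, encoding="unicode")


if __name__ == "__main__":
    print(build_gather_twiml())
```

Whatever web framework you use would return this string with an XML content type from the URL that Twilio requests when the call reaches this step.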
You would then need to receive the data from either the completed or partial endpoint and generate further TwiML to continue the call using that information.
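To make that step concrete, here is a hedged sketch of what the /completed handler could do with the form-encoded body Twilio posts. `SpeechResult` and `Confidence` are the parameters Twilio sends to the action URL; the function name `handle_completed`, the 0.5 confidence cutoff, and the `/voice` retry URL are assumptions for illustration only.

```python
from urllib.parse import parse_qs
from xml.sax.saxutils import escape


def handle_completed(body, min_confidence=0.5, retry_url="/voice"):
    """Turn the POST body sent to the action URL into follow-up TwiML.

    min_confidence and retry_url are hypothetical values, not Twilio defaults.
    """
    form = parse_qs(body)
    speech = form.get("SpeechResult", [""])[0].strip()
    try:
        confidence = float(form.get("Confidence", ["0"])[0])
    except ValueError:
        confidence = 0.0

    if not speech or confidence < min_confidence:
        # Nothing usable was transcribed: apologise and send the caller back.
        return (
            "<Response><Say>Sorry, I did not catch that.</Say>"
            f"<Redirect>{escape(retry_url)}</Redirect></Response>"
        )

    # Escape the transcript so characters like & cannot break the XML.
    return (
        f"<Response><Say>You said {escape(speech)}. "
        "Looking up your account.</Say></Response>"
    )


if __name__ == "__main__":
    print(handle_completed("SpeechResult=123+Main+Street&Confidence=0.92"))
```

The full transcript is available in `speech` at this point, so any custom logic (address lookup, appointment search) can run before the response is built.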
As a caution, you may get more accurate results, and possibly a better user experience, by asking for their phone number (which is only digits) or by using the phone number from the caller ID to look up the account. Voice-to-text is more accurate for numbers than for free-form words. What you are seeking may not be possible in real time with a high level of accuracy, because street names contain such a wide variety of words that they are hard to transcribe reliably. Alternatively, you could combine the caller ID with the transcribed speech to achieve a higher level of accuracy.
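One way to picture the combined approach is a fuzzy match that trusts the transcript more when the caller ID already points at an account. This is only a sketch: the account records, the `find_account` helper, and both thresholds are made up for illustration, and the matching uses Python's `difflib` rather than anything provided by Twilio.

```python
from difflib import SequenceMatcher

# Hypothetical account data; in practice this would come from your own database.
ACCOUNTS = [
    {"phone": "+15550100001", "address": "123 Main Street"},
    {"phone": "+15550100002", "address": "42 Oak Avenue"},
]


def normalize(text):
    """Lower-case the text, drop simple punctuation, and collapse whitespace."""
    cleaned = text.lower().replace(".", "").replace(",", "")
    return " ".join(cleaned.split())


def find_account(caller_id, spoken_address, accounts):
    """Return the best-matching account, or None if nothing is close enough."""
    by_phone = [a for a in accounts if a["phone"] == caller_id]
    # A caller ID hit narrows the search, so a looser address match is accepted.
    candidates, threshold = (by_phone, 0.6) if by_phone else (accounts, 0.8)

    best, best_score = None, 0.0
    for account in candidates:
        score = SequenceMatcher(
            None, normalize(spoken_address), normalize(account["address"])
        ).ratio()
        if score > best_score:
            best, best_score = account, score
    return best if best_score >= threshold else None


if __name__ == "__main__":
    print(find_account("+15550100001", "123 main st", ACCOUNTS))
```

Here the `From` parameter of the incoming request would supply `caller_id`, and `SpeechResult` would supply `spoken_address`. The thresholds would need tuning against real transcripts.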