We currently use voicexml and https://www.plumvoice.com/ to get voice recordings which are then sent to our backend server for processing similar to the example mentioned in the docs for recording user input: https://www.plumvoice.com/docs/dev/developer_reference:tutorial
<?xml version="1.0"?>
<vxml version="2.0">
<form>
<record name="myrecording" type="audio/x-wav" beep="true">
<prompt>
Please record a message after the beep.
</prompt>
<filled>
You just recorded the following message:
<value expr="myrecording"/>
<submit next="submitrecording.php" namelist="myrecording"
method="post" enctype="multipart/form-data"/>
</filled>
</record>
</form>
</vxml>
This works fine and gives a wav file at the end of user input. Is there a way to get the user input as an Audio Stream as the user speaks instead of a file at the end?
Rather than reinventing the wheel, you can use FFMPEG, advertised as “A complete, cross-platform solution to record, convert and stream audio and video.”
you can select option say no latency "-preset ultrafast -tune zerolatency" or capture from Logitec C930 camera "-i /dev/video0" or video file "-i your_file_location"
One example I can give is how I stream my webcam with sound to an online server
Dont be scared, you dont need all the option. Just take the Input and output and encoding standard you are good to go.