Although Whisper’s transcription is highly accurate, there is always jargon (such as “GPT”) or a non-standard spelling that leaves the transcript flawed. For example, “Dave Prior” is a podcast host, but the transcription spells his last name “Pryor.” What are some ways to improve the transcription?
How to improve Whisper speech to text
Asked by Lance Kind
There are three usual ways to improve Whisper’s transcription:
I suggest the order above is one of increasing difficulty. If Whisper is having trouble with your accent or with how you say acronyms, then fine-tuning will be the best solution. The first two options are nice because the prompts can be built dynamically.
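As a rough illustration of building a prompt dynamically, here is a minimal sketch that turns a list of known spellings into a short prompt string. The helper name `build_whisper_prompt`, the "Glossary:" wording, and the `max_terms` cap are my own assumptions, not anything Whisper requires. The commented-out lines at the end assume the open-source `openai-whisper` package, whose `transcribe` method accepts an `initial_prompt` argument, and the file name `episode.mp3` is hypothetical.

```python
def build_whisper_prompt(terms, max_terms=50):
    """Join known spellings into one short prompt, dropping blanks and duplicates.

    `max_terms` is an arbitrary cap (an assumption) to keep the prompt short.
    """
    seen = set()
    unique = []
    for term in terms:
        cleaned = term.strip()
        key = cleaned.lower()  # compare case-insensitively
        if cleaned and key not in seen:
            seen.add(key)
            unique.append(cleaned)
    if not unique:
        return ""
    return "Glossary: " + ", ".join(unique[:max_terms]) + "."


# Terms could come from show notes, a guest list, or a project glossary.
prompt = build_whisper_prompt(["Dave Prior", "GPT", "dave prior ", ""])
print(prompt)

# Passing the prompt to Whisper (not run here; needs the model and an audio file):
# import whisper
# model = whisper.load_model("base")
# result = model.transcribe("episode.mp3", initial_prompt=prompt)
```

The prompt only nudges the model toward those spellings, so it is worth checking the output for the names you care about.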