No punctuation in dialogue box and not able to correctly slow down the automatic speech recognition #966
-
Hey there! It looks like you haven't connected your GitHub account to your Deepgram account. You can do this at https://community.deepgram.com - being verified through this process will allow our team to help you in a much more streamlined fashion.
-
It looks like we're missing some important information to help debug your issue. Would you mind providing us with the following details in a reply?
-
I'm sorry to hear that you're having trouble with the automatic speech recognition speed and punctuation in the transcribed text. Let's address these issues one at a time.

For the punctuation issue, Deepgram's Dictation feature might be what you need. It converts spoken dictation commands into their corresponding punctuation marks. To enable Dictation, set dictation=true in the request URL, together with punctuate=true, as in this request:

curl \
--request POST \
--header 'Authorization: Token YOUR_DEEPGRAM_API_KEY' \
--header 'Content-Type: audio/wav' \
--data-binary @youraudio.wav \
--url 'https://api.deepgram.com/v1/listen?dictation=true&punctuate=true'

As for the automatic speech recognition speed, it's hard to provide a specific solution without more details. However, you might want to check the audio input rate and make sure it is compatible with Deepgram's requirements. If the audio input rate is too high, the speech may be processed too quickly.

If you're still having trouble, could you please provide more details about your implementation? For example, it would be helpful to know how you're using the Deepgram API (are you using the SDK or making direct API calls?) and to see any relevant code snippets. This will help us better understand your situation and provide more targeted assistance.
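If you are making direct API calls from Python rather than from the command line, the curl request above could be sketched roughly as follows. This is only a minimal illustration using the standard library, not official SDK code: the helper names `build_listen_url`, `wav_sample_rate`, and `transcribe_wav` are made up for this example, and the only parameters used are the two from the curl URL. The `wav_sample_rate` helper is included because it is a quick way to see what rate your audio was actually recorded at before you start adjusting anything; it assumes an uncompressed PCM WAV file.

```python
import urllib.parse
import urllib.request
import wave

# Endpoint taken from the curl example in this thread.
DEEPGRAM_LISTEN_URL = "https://api.deepgram.com/v1/listen"


def build_listen_url(dictation: bool = True, punctuate: bool = True) -> str:
    """Return the listen URL with the two query parameters from the curl example."""
    params = {
        "dictation": str(dictation).lower(),
        "punctuate": str(punctuate).lower(),
    }
    return f"{DEEPGRAM_LISTEN_URL}?{urllib.parse.urlencode(params)}"


def wav_sample_rate(path: str) -> int:
    """Read the sample rate in Hz from a PCM WAV file's header (hypothetical helper)."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def transcribe_wav(path: str, api_key: str) -> bytes:
    """POST a WAV file, mirroring the curl flags, and return the raw response body."""
    with open(path, "rb") as audio:
        request = urllib.request.Request(
            build_listen_url(),
            data=audio.read(),
            headers={
                "Authorization": f"Token {api_key}",
                "Content-Type": "audio/wav",
            },
            method="POST",
        )
    with urllib.request.urlopen(request) as response:
        return response.read()
```

Calling `wav_sample_rate("youraudio.wav")` first and including the value it returns in your reply would also give us one of the details we need to look into the speed issue.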
-
I'm using the Deepgram API as the STT solution for my chat interface, but I have been running into two problems. Problem 1 is the absence of punctuation in the transcribed text. I've tried fixing it in the configuration settings, but I'm still having the issue. Problem 2 is with the automatic speech recognition: the user's speech input is being registered too fast. I've tried adjusting the audio input rate, but the results were not optimal. Is there anyone who could help me with this?