1,451 questions with Azure AI Speech tags

Sort by: Updated
0 answers

Error while trying to train a 202240228 Whisper Large v2 baseline model

When trying to train a custom speech model using a dataset containing an audio file and its transcript, the model failed to train due to an internal error. Can anyone provide any insights on how to troubleshoot this issue?

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-03T08:53:22.2033333+00:00
Engineering 0 Reputation points
commented 2024-05-30T08:04:15.4066667+00:00
YutongTie-MSFT 46,996 Reputation points
2 answers

Azure speech to text batch stucked on "Running" status and no percentage

this is the request: "azureRequest": { "displayName": "job_title...", "description": "job_title...", "locale": "it-it", "contentUrls": [ "{url of a wave…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-29T07:44:05.1833333+00:00
Fabrizio Barone 0 Reputation points
answered 2024-05-30T07:05:57.2166667+00:00
Fabrizio Barone 0 Reputation points
0 answers

TTS繁體中文國語發音錯誤

「重考」發音應該是 ㄔㄨㄥˊ ㄎㄠˇ 「假期」發音應該是 ㄐㄧㄚˋ ㄑㄧˊ TTS 是收費服務,因此請儘快修正。 謝謝

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-26T17:34:31.5566667+00:00
疼目職人 0 Reputation points
commented 2024-05-30T04:11:55.0866667+00:00
YutongTie-MSFT 46,996 Reputation points
0 answers

Handling connection errors in Speech SDK

Hi, we are using Speech SDK (version 1.35.0, C++) for "speech to text". We use SpeechRecognizer->StartKeywordRecognitionAsync. While running the application, we lose connection sometimes and sometimes internet connection is okay, but we get…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-29T13:16:12.9766667+00:00
Jasmin Hadzajlic 0 Reputation points
commented 2024-05-29T17:50:10.1533333+00:00
romungi-MSFT 42,966 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Sample Data for different styles of Custom Neural Voices (happy, excited, sad).

I could find individual utterances for neutral speech, questions, and exclamations here: https://github.com/Azure-Samples/Cognitive-Speech-TTS/blob/master/CustomVoice/Sample%20Data/Individual%20utterances%20%2B%20matching%20script/SampleScript.txt To…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-28T14:56:18.8666667+00:00
PAVAGEAU Perrine 20 Reputation points
accepted 2024-05-29T14:49:07.5+00:00
PAVAGEAU Perrine 20 Reputation points
1 answer One of the answers was accepted by the question author.

Do we need to close/suspend built-in AI voices (Ava, Andrew, Emma, Brian, etc) after using them to create a file in Audio Content Creation?

Hello, I understand that Custom Neural Voices need to be suspended after use due to their per-hour pricing. Do we also need to suspend anything after using Microsoft's built-in AI voices? I couldn't find specific information on this and want to avoid…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-29T09:04:31.9+00:00
PAVAGEAU Perrine 20 Reputation points
accepted 2024-05-29T14:47:46.8766667+00:00
PAVAGEAU Perrine 20 Reputation points
1 answer One of the answers was accepted by the question author.

How to estimate the time needed to train a custom STT model?

Hey! I'm thinking about fine-tuning a STT model with Audio + human-labeled transcript data in Speech Studio. However, as I read through the docs, I can see that "If you switch to a base model that supports customization with audio data, the training…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-28T15:54:02.83+00:00
Bruno Goncalves Vaz (P) 20 Reputation points
accepted 2024-05-29T08:55:29.6566667+00:00
Bruno Goncalves Vaz (P) 20 Reputation points
0 answers

How can I make Microsoft consider adding Faroese language to Speech Services

I need text-to-speech services for Faroese in Speech Services. How would I go about getting Microsoft to consider this request? Is there any way for me myself to train a custom voice, for a language that doesn't yet exist in Microsoft's repository of…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-27T22:55:11.9533333+00:00
68046286 0 Reputation points
commented 2024-05-29T05:35:59.46+00:00
dupammi 7,480 Reputation points Microsoft Vendor
0 answers

How do you do pronunciation

Recently I had a script for a programming video, and I needed the word GUID, or goo id. I tried typing many different ways, and the only way I could get the word GUID, was to type goo hid, and use an audio editor and get rid of the H sound. Azure Speech…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-25T06:46:27.2966667+00:00
Data Juggler 181 Reputation points
commented 2024-05-29T05:01:47.5866667+00:00
navba-MSFT 17,980 Reputation points Microsoft Employee
0 answers

400 Bad request using whisper with AzureCliCredentials

I'm trying to use Whisper using the AzureCliCredential and i always get an error as follow { code: 'Request is badly formated', message: 'Resource Id is badly formed: NA' } my very simple code is : import * as fs from "fs"; import {…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
Azure OpenAI Service
Azure OpenAI Service
An Azure service that provides access to OpenAI’s GPT-3 models with enterprise capabilities.
2,325 questions
asked 2024-05-23T21:20:18.4833333+00:00
Julien C 0 Reputation points
commented 2024-05-29T04:48:12.3366667+00:00
navba-MSFT 17,980 Reputation points Microsoft Employee
0 answers

training with mixed language in custom-stt(English & Korean)

Hi, I am working on training korean custom-stt, but in the training data , there are a few english words mixed in it. Some of them are well processed and accepted as train data but others get rejected such as winder, insulator, gripper, rewinding. below…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2022-10-14T06:11:55.737+00:00
VPA 21 Reputation points
commented 2024-05-28T23:25:13.9066667+00:00
Elias Salazar Zeledon (Manpower Costa Rica S A) 0 Reputation points Microsoft Vendor
1 answer One of the answers was accepted by the question author.

Can I re-train an already deployed custom voice model with newly added data without undergoing the entire training time again (approximately 24 hours)?

Here’s the context: We set up a voice talent, added training data, trained the model, and deployed it. We've now updated the dataset with more audios and transcripts, increasing the number of utterances from 1300 to 1500. When I try to train this voice…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-27T12:15:25.9833333+00:00
PAVAGEAU Perrine 20 Reputation points
accepted 2024-05-28T08:53:37.8633333+00:00
PAVAGEAU Perrine 20 Reputation points
0 answers

Speech recognition service is not working correctly

Hi, I'm using your speech service to recognize phrases spoken by a user in real time and evaluate their pronunciation. However, I am facing the following issues If I pass the reference text and set EnableMiscue =true, then all the wrong words the user…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-27T20:35:08.7633333+00:00
Miroslav 0 Reputation points
edited a comment 2024-05-28T05:33:18.8566667+00:00
navba-MSFT 17,980 Reputation points Microsoft Employee
1 answer One of the answers was accepted by the question author.

Why is the Isabella Multilingual voice available only in Clipchamp?

Hello, I noticed that the Isabella Multilingual voice for Thai Text to Speech is available in Clipchamp but not in Audio Content Creation. I'm interested in using this voice for my projects. I was wondering if there are any specific reasons why this…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-27T04:40:01.55+00:00
i'm MariOhn 61 Reputation points
accepted 2024-05-27T09:36:07.3633333+00:00
i'm MariOhn 61 Reputation points
1 answer

How to output transcription on a word-level

With the provided callback function, the text is outputted as described by you, either after a short pause or after a maximum of 15 seconds. Is it possible to output word by word so that the text can be seen while speaking? def…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-17T08:41:50.08+00:00
Sophie 0 Reputation points
commented 2024-05-27T08:53:18.7966667+00:00
Gowtham CP 1,970 Reputation points
1 answer

Azure TTS batch synthesis activity logs

Hi there, we're using Azure speech synthesis (batch, since we have content over 10mins). In the Azure Portal, I can see metrics for my speech resource but I can't see any records of past jobs. Is there any way to see these? Thanks, Tim

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-21T18:11:34.5866667+00:00
Tim Schmidt 0 Reputation points
commented 2024-05-27T05:58:41.07+00:00
navba-MSFT 17,980 Reputation points Microsoft Employee
0 answers

Bug Report: Mispronunciation of Welsh Contraction "i’w" in Azure Neural TTS

Subject: Bug Report: Mispronunciation of Welsh Contraction "i’w" in Azure Neural TTS Description: The Azure Neural TTS system is mispronouncing the Welsh contraction "i’w." Instead of producing the correct pronunciation…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-16T14:22:18.8166667+00:00
Verbari LLC 0 Reputation points
commented 2024-05-27T05:56:24.0566667+00:00
navba-MSFT 17,980 Reputation points Microsoft Employee
1 answer

here i cannot find To create a custom avatar endpoint, follow these steps: Sign in to Speech Studio. Navigate to Custom Avatar > Your project name > Train model.

i cannot find custom avatar key after sign in to the speech studio .

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-16T11:23:22.97+00:00
Praveen Jaganivasan 0 Reputation points
commented 2024-05-27T04:55:52.14+00:00
santoshkc 5,255 Reputation points Microsoft Vendor
1 answer One of the answers was accepted by the question author.

Inquiry Regarding Azure AI Speech Error

Dear Azure Support Team I recently encountered an issue while using Azure AI Speech service with recordings from the VoiceMemo app on iPhone. Specifically, when attempting to process recordings of approximately 30 minutes in length, I received the…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-15T12:18:25.3266667+00:00
y.ashibe 25 Reputation points
edited a comment 2024-05-27T04:06:55.3133333+00:00
navba-MSFT 17,980 Reputation points Microsoft Employee
2 answers One of the answers was accepted by the question author.

Speech Studio Audio Content Creation (x) Content Format and Audio Export Fail

I discovered https://speech.microsoft.com/portal, audio creation tile. (I think it should be the first one and described as "interactive batch TTS web interface.") I uploaded a file named test.txt, which has two paragraphs. For decades now,…

Azure AI Speech
Azure AI Speech
An Azure service that integrates speech processing into apps and services.
1,451 questions
asked 2024-05-18T19:20:53.55+00:00
ivo welch 40 Reputation points
commented 2024-05-27T01:39:15.4733333+00:00
dupammi 7,480 Reputation points Microsoft Vendor