[Feature request]: more information on voices, child and multilingual voices #786

Open
Felwin opened this issue Jan 25, 2024 · 7 comments
Labels
enhancement New feature or request text-to-speech

Comments

@Felwin

Felwin commented Jan 25, 2024

When using speechSynthesizer.getVoicesAsync, the result does not indicate which voices are child voices.

I have to consult the documentation manually to filter the child voices out of my list (some users might want the opposite):
https://learn.microsoft.com/en-us/azure/ai-services/speech-service/language-support?tabs=tts

It would be great if this information were available directly, like the gender.

It would also be great to know which locales a multilingual voice supports without having to refer to the documentation.
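
Until the SDK exposes this, one possible workaround is a hand-maintained lookup. The sketch below is only an illustration under assumptions: `VoiceLike`, `CHILD_VOICE_SHORT_NAMES`, and `partitionByChild` are hypothetical names that are not part of the SDK, and the set of child voices has to be copied by hand from the language support page linked above.

```typescript
// Minimal shape of the fields used here; the SDK's voice objects expose more.
interface VoiceLike {
  shortName: string;
  locale: string;
}

// Hypothetical, hand-maintained list copied from the language support page.
// "en-US-AnaNeural" is only an example entry; verify it against the docs.
const CHILD_VOICE_SHORT_NAMES: ReadonlySet<string> = new Set([
  "en-US-AnaNeural",
]);

// Split a voice list into child and non-child voices by short name.
function partitionByChild<T extends VoiceLike>(
  voices: readonly T[],
  childNames: ReadonlySet<string> = CHILD_VOICE_SHORT_NAMES,
): { children: T[]; others: T[] } {
  const children: T[] = [];
  const others: T[] = [];
  for (const voice of voices) {
    (childNames.has(voice.shortName) ? children : others).push(voice);
  }
  return { children, others };
}
```

The voice list returned by getVoicesAsync could probably be passed in directly, assuming each entry exposes a `shortName`. Returning both halves covers users who want only child voices as well as those who want to exclude them.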

@Felwin Felwin added the bug Something isn't working label Jan 25, 2024
@glharper glharper added enhancement New feature or request and removed bug Something isn't working labels Jan 25, 2024
@glharper glharper assigned yulin-li and unassigned glharper Jan 25, 2024
@yulin-li
Contributor

@Kerry-LinZhang could you help to take a look?

@Felwin
Author

Felwin commented Jan 26, 2024

From my understanding, the SDK calls the text to speech REST API. The documentation appears to be here:
https://learn.microsoft.com/en-us/azure/ai-services/speech-service/rest-text-to-speech?tabs=streaming

According to the sample response, the "SecondaryLocaleList" field is what would need to be returned in order to get the supported locales for a multilingual voice.

I'm not sure about the child voices; I would need to run the request and look at the entry for one of them.

Is there a reason why the SDK does not return all the data available from the REST API?
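
As a rough sketch of what reading that field could look like, the snippet below parses a voices list payload and collects the locales of each multilingual voice. It rests on assumptions: `RestVoice`, `supportedLocales`, and `multilingualVoices` are hypothetical helper names, `SecondaryLocaleList` is taken to be an optional array of locale strings as in the documented sample response, and the payload uses made-up voice names rather than real service output.

```typescript
// Subset of one entry in the voices list response, as named in the REST docs.
// SecondaryLocaleList is treated as optional and as an array of locale strings;
// both are assumptions to check against a real response.
interface RestVoice {
  ShortName: string;
  Locale: string;
  SecondaryLocaleList?: string[];
}

// All locales a voice claims to support: primary first, then secondary ones.
function supportedLocales(voice: RestVoice): string[] {
  const secondary = voice.SecondaryLocaleList ?? [];
  return [voice.Locale, ...secondary.filter((l) => l !== voice.Locale)];
}

// Voices that list at least one secondary locale.
function multilingualVoices(voices: readonly RestVoice[]): RestVoice[] {
  return voices.filter((v) => (v.SecondaryLocaleList?.length ?? 0) > 0);
}

// Illustrative payload with made-up voice names, not real service output.
const payload = `[
  {"ShortName": "xx-XX-ExampleMultilingualNeural", "Locale": "en-US",
   "SecondaryLocaleList": ["de-DE", "fr-FR"]},
  {"ShortName": "xx-XX-ExampleNeural", "Locale": "en-US"}
]`;

const restVoices = JSON.parse(payload) as RestVoice[];
for (const v of multilingualVoices(restVoices)) {
  console.log(v.ShortName, supportedLocales(v).join(", "));
}
```

In practice the JSON would come from the voices list request described in the REST documentation linked above, so this only helps callers who are willing to bypass the SDK for voice metadata.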

@Kerry-LinZhang

Hi @Felwin, thanks for the feedback; it is well received. I will bring it to our team for further discussion and keep you updated on the progress.

@yulin-li yulin-li assigned Kerry-LinZhang and unassigned yulin-li Apr 14, 2024
@pelaez87

Is there any estimate of when the SecondaryLocaleList property will be available in the SDK?
Right now, multilingual voices just don't seem multilingual when using the SDK: it provides only one language code, so from the SDK's point of view there is no way to tell that a voice supports more languages.

@Kerry-LinZhang

> Is there any estimate of when the SecondaryLocaleList property will be available in the SDK? Right now, multilingual voices just don't seem multilingual when using the SDK: it provides only one language code, so from the SDK's point of view there is no way to tell that a voice supports more languages.

@yulin-li could you please help to track this?

@Kerry-LinZhang

Dear customers, we are following up with the feature PM on this request.

@Kerry-LinZhang

Hi @Felwin, we are adding tags to the new AI Studio voice gallery. APIs will be planned later.


5 participants