I want to do speech <-> text with mixed-language inputs.
Initially only Chinese & English, but eventually more language pairs. Vast majority of speech will be English, but small amounts of Chinese will be included. The application is kind of a "conversational verbal dictionary":
speech-to-text with mixed-language input: "How do you say 猫?"
text-to-speech with mixed-language input: "The English word for 猫 is Cat." I would want this to be spoken with the voice/accent of a native English speaker.
I noticed that the text-to-speech demo at this URL can handle sentences like this IF you choose the "Chinese-CN", "Chinese-HK", or "Chinese-TW" accent, but not if you choose any of the "English-*" accents. This doesn't work for me because I need a native English-speaking accent ...
Bing Speech to Text does mixed language support for a small set of vocabulary for Chinese/english. But this is a small set and we do not have plans to expand to other languages this year.
Text To Speech is only available for the voices published.
Related
While using google translate api for english to hindi translation, we are trying to fetch translated roman hindi for an english search query, searching for suitable method.
Currently, the transliteration from English en to Hindi hi, which means that the result will the translated text with only Latin characters, is not supported. As per the comments, from English shoes the expected output would be joote. However, the output is जूते.
There is an already open Feature request within Google's Issue tracker, here, addressing the Transliteration for the Cloud Translation API. Thus, you can click on the +1 button and leave a comment so the Googlers know this issue is also relevant to you. Lastly, I should point out that this platform is used to track bugs and feature requests made by the costumers.
I'm sending the following text in spanish: "Paciente P2, Posición 6" to the voice "Raul, Apollo" but it seems that it's ignoring the accent of the word "Posición"
The language of "es-MX, Raul, Apollo" is Spanish (Mexico), so I suggest you try using any of the Spanish (Spain) language voices like "es-ES, Pablo, Apollo" for example.
You can find the list of all the available voices, including their (locale, languages, gender, service name mapping) and the supported languages, including their (acoustic adaptation, language adaptation, pronunciation adaptation) in the documentation here
Is it possible and if so how to respond within a skill with different languages? For example I'm developing a skill for the German skill store which reads various texts from the internet. Those can be any in language and I can determine the language when I'm about to emit the response.
From what I can see the SSML subset Alexa implements does not specify the language in which the response is given. But Alexa's own Kindle skill is able to read me eBooks in either German or English (perhaps Amazon's own skills are special).
As said in other answers the right way is to use the <lang> tag in SSML. However since the english voice do not speak German it is quite weird. The right solution is to change the voice using <voice> tag.
Here is an example in German
<speak>
<voice name="Hans"><lang xml:lang="de-DE">Ich bin ein Berliner</lang></voice>.
I am a Berliner.
</speak>
It is described in this doc https://developer.amazon.com/fr/docs/custom-skills/speech-synthesis-markup-language-ssml-reference.html#examplefrench-content-in-an-english-skill
It looks like this is not possible at the moment: https://forums.developer.amazon.com/questions/55086/specify-output-language-per-intent.html
You can use the <lang> tag in SSML for this.
Here is an example in German.
<speak>
<lang xml:lang="de-DE">Mein Luftkissenfahrzeug ist voller Aale</lang>.
Hello in the default language.
</speak>
Here is a list of supported Amazon Polly languages for Alexa.
Is it possible to get transcription of word translated by Google API?
Same as in translate.google.com right under the textarea.
I need translation and transcription of single words only, like in dictionary, so may be Google offers some dictionary API, simpler than translation but with transcription?
Look at this picture, second red arrow points to transcription of word "transcription"
Google Translation API doesn't give you the variants of translation of a single word and doesn't give to you the text to speech functionality. Translate API can detect the language automatically and respond the translated text.
How to retrieve a content from FreeBase.com in Italian language?
Now, I can only see content from eng.wikipedia.org.
A lot of the information in Freebase is language independent, but if you want Italian language strings, you can get them by specify that it your queries (the default is English if no language is specified). There's an example app here
For things which are sourced from Wikipedia, the English language Wikipedia forms the basis, so if something only exists in the Italian Wikipedia, it won't be included. For things which are in both, you can get links to both the English article and the Italian article (and you can't get the Italian name directly from Wikipedia or look things up by it).
I don't know how you are making the call, but add this parameter to it:
lang=/lang/it