SuttaCentral

Wanted 🕵️‍♀ : Translator for SC-Voice interface

sc-voice
Tags: #<Tag:0x00007f7893ce3958>

#121

Adding Portuguese/Cristiano is easy in principle (just update words/voices.json). However, it is logistically complicated as Anagarika Sabbamitta mentions. We are paying for 30GB of disk storage currently and the disk is 40% full. To give you an idea of our disk needs, Angutara Nikaya in Pali today takes up 1.8GB. :open_mouth:

The website translations themselves are cheap and easy. They require minimal disk space. It is the TTS storage that consumes major disk. To store the speech for all the available languages on a single server would probably require almost 1000GB or 1TB. That would be quite costly for use to support. But supporting different web interface languages is easy and fun!

Because of the difficulty of TTS, we’re proceeding cautiously with the TTS portion of new languages. As Anagarika mentions, we are starting first with Deutsch and now, thanks to your team’s help, we are also exploring Portuguese. Each AWS Polly voice we add requires lots of careful editing to adjust pronunciation. For example, you’ve probably noticed that Gabriel_L has already made many corrections to the Portugues voices for words like “arahant”. Each AWS Polly voice has its own quirks. Fixes for Ricardo will probably not work for Cristiano. :see_no_evil:

In the longer term, we need to work out a strategy for multilingual Voice. Although we might store everything on one huge server, perhaps that is not the way to proceed. Instead, it may make more sense to have local servers handle local languages. For example, we can host EU languages on an EU server or servers. And we can host Asian languages in data centers that will minimize latency. We could host the Voice Portuguese server in Brazil. This is quite doable. In fact, Aminah herself has created her own Voice server that we are using for staging. If the Portuguese team would like to explore this, we have instructions for AWS Installation. In this way, the Portuguese team could dive deep into Voice Portugues with their own servers using a custom fork of Voice. We’re currently looking at Bilara integration, so your translations would also be hosted on those servers.

The logic for this is in public/js/scv-singleton.js:

var navLang = g.navigator && g.navigator.language;

For testing, I set my browser to Deutsch (for example) and then CTRL_SHIFT_R for a hard refresh of the website => Alles wird zu unserem Vergnügen auf Deutsch gezeigt


#122

Ah, like that.
Confirmed, setting my preferred language to Romanian, makes it appear in Romanian by default.


#123

Danish interface is now on the staging server, --> Alt vises på dansk til vores fornøjelse! :smile:


#124

I can help for the Vietnamese translation :vietnam:
On a side note, I’m still editing the Vietnamese audio, it’s just that I have less free time at the moment


#125

Great, Phineas, thank you! Here is your file to translate.

If you have any questions, please check this thread; some issues have already been discussed here. If there is still something unclear, feel free to ask anytime.

And no worries for the Vietnamese audios: When they are done they are done, and we will happily include them into the Voice Wiki. :pray:


#126

Voice v1.8.11 is ready for staging with the Portuguese fixes as well as the initial Vietnamese file. I was saddened to discover that AWS Polly has no Vietnamese voice. We may have to rely on other TTS services such as Microsoft. I’m thinking about breaking out the Voice TTS adapter as its own Github NodeJs project so that others can help with such efforts. It would be useful to have an open-source TTS package that presented a unified API for all languages independent of the EBTs.

Translation in general is quite a painstaking process. You are all helping the Voice web page with localizations for specific languages. However, as you’ve noticed, not all languages will have EBT translations in Voice. We currently are working on having Voice display German segmented translations. For some languages there may not yet be any segmented or even unsegmented translations available for Voice to display. Furthermore, Voice currently only supports English searches. Anagarika Sabbamitta and I are working to support German word search and are making great progress, but it will be some time before we can support Voice search for all languages. Voice search relies entirely on segmented translations and does not work with unsegmented translations. Nevertheless, it is quite exciting to see the languages of the world appear as shown above.

Thank you all!
:pray:


#127

It’s done for the most part. But I find myself unable to translate the motto “Dedicated to the dark bound for light”. I’ll ask someone else to help and create the pull request once it’s done.
Anyone who can translate that is welcome :wink:


#128

You can always rephrase it in English before giving it a try. My Portugese translation reads more along the lines of “For those in darkness and going toward the light.”


#129

I suspect you may already know this, but in case not … the phrase dark bound for light is translated by Thích Minh Chȃu as sống trong bóng tối, hướng đến ánh sáng.


#130

Set translation language to English; click on the motto “Dedicated to the dark bound for light”; see what suttas come up; look into the Vietnamese translation of these suttas --> this is your motto.

This motto is a link, and when clicking it it should start a search for these words. Therefore they need to be the same wording as actually found in the Vietnamese translations. Otherwise the search can’t be successful.

In the case of Portuguese too, the wording should be exactly the same as in the new segmented translation as found in the suttas.


#131

Yes, as soon as we have that segmented translation available it will be adjusted. :+1:


#132

Thank you very much @Dhammanando @sabbamitta @Gabriel_L
I submitted the translation :slight_smile:


#133

Thanks to you, Voice has another new face:

Bildschirmfoto%20vom%202019-10-17%2022-22-50

:white_check_mark: :heart:


#134

Hi friends

@Invo @Jarek @Robbie @erlendne

Just asking—and by no means wanting to push anybody—where you are at with your respective translations? We are considering another release before the end of this year and are wondering whether your translations would be ready to be added. Please just let us know. Thank you! :sunflower:


#135

Hello there :slight_smile:

I’m sorry for my delays, too much stuff happening lately.
I’ll take a look at Jarek work hopefully after the weekend. I’ll give it one day to check if something can be corrected, but I’m not sure if the translation will be good enough to be published, since we are both not really good at IT stuff. But at least it could be something to work on for someone more specialised.


#136

Usually what happens is that someone sees it and then comes up with suggestions for improvement… but this won’t happen when it’s not published. :grin:


#137

Bokmål is the most commonly used variant of Norwegian and that is the one used in the existing suttas translated by Kåre Lien. I will use the same in my translation.


#138

Thanks for the reminder. I must confess I forgot about this. Will start today and should have it done in a few weeks. _/_


#139

Very good, thank you! :heart:


#140

Hi. Once I got started I couldn’t help myself and finished it straight away :smile:
no.zip (3.1 KB)
Please let me know when it is active on the staging server and how I can test it there to verify that the translation was correct. Some of the lines was a little hard since I didn’t know the context.