SuttaCentral Voice Assistant

karl_lew · December 2, 2018, 3:44pm

@sabbamitta, @aminah. As I look at this, I see that indeed this is AI in pain. Those segments are HUGE and AWS is definitely complaining. The code has two ways of shortening the text presented to AWS to observe its text limitation:

it reduces the size of the XML without loss of information.
it eliminates the XML which forced AWS to treat every word separately to avoid HIndi phrasing. The loss of intelligibility therefore arises out of Aditi’s attempts to apply Hindi phrasing to Pali.

The second measure is rarely necessary, since it is draconian and desperate, and reduces intelligibility. And that is the sad result we hear today in MN12 segment 31. I have a hope that AWS may raise its limit in the future, which would allow us to avoid using measure #2. For this reason I have moved this bug out of v0.9.1 and into the SCV Backlog. We thereby acknowledge the bug but have no practical solution at this time. Long segments will be less intelligible until this is fixed. If this issue continues to arise and be disruptive, we may have to research a costly solution in advance of any action taken by AWS. That would be a PM decision.

sabbamitta · December 2, 2018, 3:50pm

What does PM stand for?

karl_lew · December 2, 2018, 3:55pm

HRH @Aminah, long may she reign as Product Manager.

The Product Manager is the voice of the customer. The PM prioritizes user requirements and requests. In this way, dispute ceases amongst engineers except on implementation issues. This role is the heart of SCRUM. The normal term is “Product Owner”, but given Aminah’s current modest shyness ( ), she might feel uncomfortable with that. I really do think of her as the Product Owner.

And your own role I would characterize as Quality Assurance Lead of the development team. You have identified more issues than any of us. Indeed, we have trouble keeping up with you!

I’m just in charge of the “how”. It’s best that I do just this because I have been known to tilt at windmills.

sabbamitta · December 2, 2018, 3:58pm

Thanks for explaining. I struggle a bit with all these acronyms. PM for example stands also for a private message here on this forum…

karl_lew · December 2, 2018, 4:01pm

@Aminah, you are now the PO. Congratulations!

sabbamitta · December 2, 2018, 4:05pm

Only now I see this. This is the first time anyone assigns a title to me for being faultfinding…

karl_lew · December 2, 2018, 4:11pm

Good QA Leads have a natural instinct and awareness of inconsistencies. I have worked with many such people. You too have that natural ability. And you actually don’t find fault. You find inconsistent behavior. Then you ask if it is intentional. Finding inconsistencies is much much harder than finding fault. It is harder because it requires immense awareness of internal consistency and a capacity to understand and absorb the new.

sabbamitta · December 2, 2018, 4:13pm

Oh thank you, this is quite a compliment!

Aminah · December 2, 2018, 7:34pm

Excellent! As per your PM (private message) with the PM (HRH) from my perspective you already accomplished the primary reason for even having a v0.9.1 (fixing the “no translation” bug) and everything after that is “cheery on top” and anything big after that most definitely ought to be pushed to the backlog as you’ve done here.

You know I’m a staunch republican (UK sense), right?

Likewise, it’s the first time anyone has asked me to be a bossy-boots! I’ve decided it’s alright to wear the title because all I seem to have to do is say “okay, that sounds like a good idea” and just let Karl get on with building is incredible app.

Most deserve, and most emphatically seconded!

sabbamitta · December 3, 2018, 7:37pm

The word saṅkhittena appears repeatedly in AN 4.192, and Aditi again clips the ṅ, just like in saṅkhāra.

karl_lew · December 3, 2018, 7:44pm

Very good. I hope it shall be fixed when I publish the fix this week.

sabbamitta · December 3, 2018, 7:54pm

AN 8.6, in the introductory section, segment 3:

Usually in the Pali, numbers are not pronounced, but in the English they are. In this phrase

6. Worldly Conditions (2nd)

Amy correctly pronounces the “6”, but forgets the “2” and only says “nd” instead.

karl_lew · December 3, 2018, 8:14pm

Nice one! Found and fixed for 2nd, 3rd, 4th in v0.9.1.

karl_lew · December 4, 2018, 8:09pm

SuttaCentral Voice Assistant v0.9.1 Release at 20:20 UTC COMPLETED

V0.9.1 (Dec. 3-7, 2018)

Features

Dedication on home page provides examples of searches
“Examples” button provides 3 random search links
“Play Sutta” button on each search result
Show SuttaPlayer time estimate
Auto-play sutta after launch sound.

Bugs

Handle ṁ, ṃ, ṅ with the same phoneme
Another potential bug: *saṅkhāra" sounds like sakhāra—I can’t hear the ṅ at all.
Progress bar doesn’t halt when playing search result quotes
Untranslated segment causes playing to stop.
Searching for “AN 5.179” returns " No suttas found", while “AN5.179” (no space) finds the sutta.
In MN 12 section 2 segment 214 occurs the word naehibhaddantik

Release notes
Release plan

And special thanks to @sabbamitta for her discerning ear that misses nothing.

Up next we’ll be tackling Release v1.0.0. Bhante Sujato has a Pali pronunciation boot camp that we have to run Aditi through. After v1.0.0, we shall switch to maintenance/research mode for a while investigating German translation voice support. However, if you have ideas for features, please do let @Aminah know. Aminah is the SuttaCentral Voice Product Owner.

Aminah · December 4, 2018, 8:29pm

Utterly fantastic! Thanks, Karl!

sabbamitta · December 4, 2018, 8:39pm

Going to try it out!

karl_lew · December 4, 2018, 8:42pm

I cleared out the sound cache and you will probably experience some slowness on suttas you have previously played. They should play at the normal speed the second play. We made so many changes that all the existing sounds in the cache are probably useless, so I cleared the cache.

Aminah · December 4, 2018, 9:54pm

So that things don’t get too disorderly, Karl and I agreed that we’d close this thread after the release of v0.9.1.

Fear not! There’s a new thread here:

I’m going to put an auto close for 20hrs so as not to deprive anyone the opportunity to offer Karl hearty congratulations for his accomplishments so far (although, really I can understand if you’re all too busy listen to the suttas to even do that ).

Again, deep thanks to both Karl and all supporting contributors!

anon87721581 · December 5, 2018, 4:00am

Thank you soooooo much karl. The merit you are making can’t be small.

karl_lew · December 5, 2018, 3:27pm

I have a hope.
A tiny tiny hope.
I have a wish.
A tiny tiny wish.

That we might all listen together.
Whenever.
Whereever.
With whomever.

And that one day,
…all together…
We might recite in unison, without dispute.
With only metta in our hearts.

Thank you Bhante Sujato for making this possible.
Thank you all for making this possible.