After the huge success of our first Common Voice event, we organized another Common Voice event for the Bengali language. This time the event was somewhat bigger: 70 people came.
As at our last event, we first gave a demonstration of Common Voice. Then we shared our criteria for writing sentences.
Here are some criteria to help participants write their own sentences:
- Ideally, each sentence should take about 5 seconds to read, and no more than 10 seconds. So aim for sentences of around 5 to 10 words.
- One- or two-word sentences are also OK, but not for all sentences! Try to have a mix of short and medium-sized sentences, but try to keep them all under 10 seconds.
- Try to use as many different words as possible. This will help the machine to enrich its vocabulary.
- Sentences with punctuation (e.g. !, ?) are also great to have, but do not stress if you can’t think of any.
- If your language uses any special symbols (e.g. â, ü, ß, š), that’s great. Using them actually helps the machine to distinguish different sounds.
- Try to include proper nouns (first names, street names, places, etc.).
- Numbers are fine, but please spell out the number rather than writing the digits (e.g. “five hundred twenty-seven” rather than “527”).
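For anyone who later types up the handwritten sentences, a few of these criteria can be checked automatically. The snippet below is only a rough sketch in Python, not an official Common Voice tool: the function names `check_sentence` and `short_sentence_share` and the 10-word limit are my own assumptions, taken from the guidelines above.

```python
# Rough sketch: check typed-up sentences against a few of the criteria above.
# All names and thresholds here are assumptions made for illustration.

MAX_WORDS = 10    # assumed limit, from the "5 to 10 words" guideline
SHORT_WORDS = 2   # "one- or two-word" sentences count as short


def check_sentence(sentence):
    """Return a list of guideline problems found in one sentence."""
    problems = []
    words = sentence.split()
    if not words:
        problems.append("empty")
    if len(words) > MAX_WORDS:
        problems.append("too long")
    # str.isdigit() is also True for Bengali digits such as "৫"
    if any(ch.isdigit() for ch in sentence):
        problems.append("contains digits")
    return problems


def short_sentence_share(sentences):
    """Return the fraction of sentences that have only one or two words."""
    if not sentences:
        return 0.0
    short = sum(1 for s in sentences if 0 < len(s.split()) <= SHORT_WORDS)
    return short / len(sentences)
```

Calling `check_sentence` on each typed sentence returns an empty list when none of these checks fail, and `short_sentence_share` gives a quick feel for whether a batch has too many very short sentences. Criteria such as varied vocabulary or proper nouns still need a human eye.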
Then we distributed sheets of white paper. The participants started writing down sentences with full enthusiasm.
Once again, a few Mozilla volunteers (Mahay Alam Khan from Bangladesh, Shiladittya Paul, Shouvik Mukherjee, Debraj from India) and I are still typing up those sentences for digital submission. Special thanks go to you guys.
This time, too, the event was a huge success. Thanks, Mozilla, for again giving us a nice opportunity to engage a general audience in this event. Finally, as a professional Bengali linguist, it’s a pleasure for me to work on this project.
Hopefully next time we can organize another big event.
Sorry for the late blog post. Due to personal problems, I could not find time to write about this event sooner.
As we all know, this year Mozilla organized a two-day collaborative hackathon around the world, the 2018 Global Sprint, which was really fun. As part of this global event, we also organized a small event in Barasat, Kolkata on Common Voice for the Bengali language.
As we all know, one of Mozilla’s biggest initiatives is teaching machines how real people speak: that’s the Common Voice project. During this Global Sprint, they asked the world to help launch Common Voice in new languages by writing or collecting non-English sentences for people to read. We were glad to take part for the Bengali language.
I organized this event in a different way. First, I invited members of the general public who are not tech-savvy. Some kids were also present. In total, 25 people came. Then I gave a brief introduction to Common Voice and asked each person to write at least 50 sentences. Each person came up with 50 new sentences, writing them down on paper. A few people wrote double that. We got more than 1,500 sentences from the whole event.
Then a few Mozilla volunteers (Mahay Alam Khan, SM Sarwar Nobin from Bangladesh, Shiladittya Paul, Shouvik Mukherjee, Sayantan Dutta from India) and I typed up those sentences for digital submission. Special thanks go to you guys.
Participants were highly inspired by this event. It was a great success. Thanks, Mozilla, for giving us a nice opportunity. I am a professional Bengali linguist, and it’s a pleasure to work on this project.