Since the first conference in 1999, there have been 18 Tamil Internet Conferences, and we have seen tremendous progress in the science and engineering behind Indian language computing. In this post, I am sharing the notes I took during TIC 2019, which is underway at Anna University. [Disclaimer: These notes are NOT meant to be comprehensive; they are just my notes from (only) the sessions I attended, so please treat them as such.] In the keynote, Hon’ble Minister Mr “Mafoi” Pandiarajan spoke at length about his department’s work on developing one of the large…

I was invited to give a guest talk at the annual Tamil Internet Conference 2019, which is underway at Anna University and is organized by INFITT along with Tamil Virtual Academy and others. My talk, with demos, was aptly titled “செயல் விளக்கம்” (Demonstration) and covered tools for writing and coding in Tamil. I demonstrated the following: Google Voice Typing in Tamil on your PC (கூகுளின் குரல்வழித் தமிழில் உள்ளிடல் வசதி). How to use Tesseract, the open-source OCR engine, to convert scanned pages in Tamil to PDF with the Embedded Text (in Tamil) feature enabled, so that search in Tamil and copy…
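The Tesseract step mentioned above can be sketched roughly as follows. This is only a minimal sketch, not the exact procedure shown in the talk: it assumes the `tesseract` command-line tool and its Tamil language data (`tam`) are installed, and the helper names `build_tesseract_pdf_command` and `make_searchable_pdf` are hypothetical, made up for illustration.

```python
import shutil
import subprocess


def build_tesseract_pdf_command(image_path, output_base, lang="tam"):
    """Build the command line asking Tesseract for a searchable PDF.

    The trailing "pdf" is Tesseract's output config; the result is
    written to <output_base>.pdf with the recognized text embedded.
    """
    return ["tesseract", image_path, output_base, "-l", lang, "pdf"]


def make_searchable_pdf(image_path, output_base, lang="tam"):
    """Run Tesseract on one scanned page (hypothetical helper)."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract is not installed or not on PATH")
    subprocess.run(
        build_tesseract_pdf_command(image_path, output_base, lang),
        check=True,  # raise if Tesseract reports an error
    )
    return output_base + ".pdf"


if __name__ == "__main__":
    # "page.png" is a placeholder file name, not one from the talk.
    print(" ".join(build_tesseract_pdf_command("page.png", "page")))
```

Calling `make_searchable_pdf("page.png", "page")` on a machine with Tesseract and the Tamil data installed should leave a `page.pdf` whose text can be searched and copied.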

Learning anything new is NOT easy. And doing it yourself is tough when, for years, you have had a team that could do it faster and better than you. I am talking about myself (learning and) writing code in the Python programming language. A few weeks ago, I wrote (mostly copy ‘n’ paste) a couple of snippets in Python to do Speech to Text and Text to Voice for the Tamil language; the blog post is here. I followed that with a small program to OCR the Tamil text in a given image and then machine-translate it to English. The…

Using Python is fun. In a matter of minutes, I was able to build a rudimentary speech-to-text recognition app for Tamil, and after that, the reverse: speaking a given sentence of Tamil text out loud. Speech to Text: I got the base code from here. You need to install the SpeechRecognition package (available through pip) and PyAudio (available through pip on Linux, but on Windows you need to install the appropriate package from here). Text to Speech: When I posted the above code, I got a reader request for code that does the reverse – to…
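For readers who want a feel for the speech-to-text part, here is a minimal sketch using the SpeechRecognition package named above. It is not the code from the original post: the function name `transcribe_tamil` is hypothetical, and the language code `ta-IN` and the choice of the free Google Web Speech recognizer are assumptions.

```python
TAMIL_LANGUAGE_CODE = "ta-IN"  # assumed language code for Tamil (India)


def transcribe_tamil(wav_path=None, language=TAMIL_LANGUAGE_CODE):
    """Return recognized Tamil text from a WAV file or the microphone.

    Imported lazily so this file can be loaded even when the
    SpeechRecognition package is not installed.
    """
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    if wav_path is None:
        # Microphone input is the part that needs PyAudio.
        with sr.Microphone() as source:
            audio = recognizer.listen(source)
    else:
        with sr.AudioFile(wav_path) as source:
            audio = recognizer.record(source)

    try:
        # Sends the audio to Google's web speech service (needs internet).
        return recognizer.recognize_google(audio, language=language)
    except sr.UnknownValueError:
        # The service could not make out any speech.
        return ""


if __name__ == "__main__":
    print(transcribe_tamil())  # speak a sentence in Tamil
```

Passing a file path, as in `transcribe_tamil("sample.wav")` (a placeholder name), skips the microphone entirely, which is handy on machines where PyAudio is hard to install.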

For more than two decades, having computer translation, voice recognition and handwriting recognition in their mother tongue(s) has been a dream for many Indian language speakers, including Tamilians. Partnering with institutions around the world, INFITT members have been working on this and have presented many technical papers on the subject at its annual conferences, such as TI2010, when Unicode got the Tamil Nadu Government’s approval. [This year’s conference is happening at Anna University, Chennai, from September 20-22, 2019.] Even when support for local languages is in the devices, how effectively is it being used? That’s a discussion…