Tag Archives: Bollywood Voice Songs Search

Key learnings from building a “voice search” for hindi film songs

A couple of posts ago, I talked about the idea that “audio search” makes so much sense for a music app. We have been working behind the scenes looking at voice to speech technologies and evaluating them with a view to offer voice search in our app “Filmi Filmy”.

We are happy to report that we were completely wrong when we first thought of this – Since all of the song titles are entered in English but represent Hindi words phonetically eg: “O mere dil ke chain”, “Gata rahe mera dil” we think that we can use a voice to speech engine to take user inputs, turn them into phonetic English and use the English text as the search keys.

It turns out that is it much more elegant and natural to take the voice input “O mere dil ke chain”, render it as the hindi string “ओ मेरे दिल के चैन” and search for the hindi string in the database. One significant advantage to this is that it reduces the complexity of the phonetics completely. It does not matter if the “ke” is spelled as “key” anymore as in Hindi it will always be spelled as “के”.

The challenge of course is getting a database of film song titles entered in Hindi. Nearly all song databases have English transliterated titles – and may we add- not two of them spell the same song the same way. A healthy inheritance from English led and US led software is that from YouTube to the home grown Gaana nearly all the songs are in English.

We are happy to report that fortunately a bit of innovation and tons of persistence can solve this problem (we may not have a huge cash chest at Pariksha but we are certainly not short on tech coolness). One of our engineers figured out a way to use existing open-source tools to build hindi equivalents of the titles.

The results are spectacular, to say the least. Consider for example this song search using voice search with hindi titles v/s text search with English phrases below:

Text Search With English Phrases                                                Voice Search with Hindi Titles

We need to do a bit more work on the hindi song titles and improve the error handling on the search and this should be ready for public use. Now consider the scenario we had described earlier – Imagine slumping in a car after a long day and with no energy to type to search, all you have to do is say the song and voila the app will play it on your phone, ear-phone or connected blue-tooth speaker. Dare we say, it is not long before this will be a reality!

Welcome to the world of Geekoo

ParikshaLabs is an innovative R&D oriented software service provider with competencies and focus on Enterprise Level RIA, Online Video, Adobe platforms, Mobile Application development, and QA Testing Services. The journey of ParikshaLabs started in Sep 2007. In this short span of time we have covered numerous milestones.

Our mascot “Geekoo” believes that high end R&D can help deliver high ROI. Geekoo has deep technical consulting experience and programming expertise. He has designed and executed global project management practices and attracted top programming talent from the industry for ParikshaLabs. He is smart, young, and dynamic. He represents our culture and will be face of our blog.

Geekoo will share industry trends, technology trends and our achievements and various other technical and programming experiences while executing projects. Geekoo will cover verticals such as Enterprise Level RIA, Online Video, Adobe platforms, Mobile Application development, and QA Testing Services. And of course collaborative distributed software development.

We hope you will find Geekoo’s ideas useful in your area of business. Geekoo wants to make ParilshaLabs a software service provider of your choice.