Skip Links

Network World

  • Social Web 
  • Email 
  • Close

IBM develops speech recognition in Indian language

By John Ribeiro , IDG News Service , 08/16/2007

IBM’s India Research Laboratory (IRL) has developed a speech recognition software for Hindi, one of the key languages in India.

The software has both commercial applications and social applications such as bridging the digital divide, Daniel Dias, director of the lab, said in a telephone interview on Thursday.

The Indian government and other local agencies have been promoting the use of local languages in computing, but the development of an input device for Indian languages has proven to be quite difficult. The Devnagri script used in Hindi has over 40 basic characters, and some 12 modifiers to the characters that are represented above or below the basic characters.

There are a number of keyboards available for the Devnagri script, but to input one character of the script, the user has to punch a combination of keys, said Ashish Verma, a senior researcher at IRL, and the lead on this project.

HP’s lab in India has designed a touch-pad, which it calls the "gesture keyboard", which uses a combination of tapping and gestures to handle Hindi. The touch-pad has the basic characters and numbers of the Devnagri script on it. The character with the required modifier can be input into the computer by specific user gestures when tapping the basic character with a pen-based input device.

IBM, by contrast, has opted for a technology based on speech recognition, both because it is simpler, and also because it can be used by the large number of Indians who are semi-literate or not familiar with a computer keyboard. The dictionary, developed by IRL for the speech recognition system, has over 75,000 words in Hindi, with a provision to add new words, Verma said.

One of the challenges in developing a speech recognition system for Hindi was that words in the language are often pronounced quite differently in various parts of the country. "We had to come up with multiple pronunciations for a given word in Hindi, and include them in the dictionary, and get them recognized (by the system) in the testing phase," Verma said.

The core technology, developed by IRL, can be used in PC applications such as data entry, letter-writing, sending emails, as well as to speech-enable automated teller machines (ATM), kiosks and other devices, Dias said. The software can  also be used for issuing commands to the computer, and for interactive voice response (IVR) applications in telephony, he added. As the software supports Unicode it can be integrated with a number of word processing and e-mail applications including from Microsoft, Verma said.

Partner Content
CA logo

CA Network & Voice Resource Center

Comprehensive Network & Voice Management Visit CA Network & Voice Management Resource Center and get insights into industry best practices, information that helps you to address your challenges.

CA Network & Voice Management Resource Center

whitepaper

Managing Voice Over IP for Successful Convergence

Voice over IP (VoIP) has much to offer in cost savings but some customers have concerns about VoIP call quality compared to the quality of traditional voice services. This white paper will help you learn how to take the right steps so that voice quality is assured.

Managing VoIP for Successful Convergence

whitepaper

The Changing Face of Network Management

Managing your network is serious business. This paper discusses the benefits of integrating configuration change-awareness into your network fault management solution

Download Whitepaper

Comment
Login
Forgot your account info?
Add comment
Anonymous comments subject to approval. Register here for member benefits.
Have a NetworkWorld account? Log in here. Register now for a free account.

Videos

rssRss Feed
Get instant email notification when white papers, webcasts, executive guides are added to our library. Stay informed and up-to-date with the latest on IT Technologies with Network World's Resource Alerts.