IBM’s India Research Laboratory (IRL) has developed speech recognition software for Hindi, one of India's major languages.
The software has both commercial applications and social applications such as bridging the digital divide, Daniel Dias, director of the lab, said in a telephone interview on Thursday.
The Indian government and other local agencies have been promoting the use of local languages in computing, but developing an input device for Indian languages has proven difficult. The Devanagari script used for Hindi has more than 40 basic characters and some 12 modifiers, which are written above or below the basic characters.
A number of keyboards are available for the Devanagari script, but entering a single character requires pressing a combination of keys, said Ashish Verma, a senior researcher at IRL and the lead on the project.
HP’s lab in India has designed a touch-pad, called the "gesture keyboard," that uses a combination of tapping and gestures to handle Hindi. The touch-pad carries the basic characters and numerals of the Devanagari script. A character with the required modifier is entered by making a specific gesture while tapping the basic character with a pen-based input device.
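The base-plus-modifier structure described above is also how Unicode stores Devanagari text: a basic character is followed by a separate code point for its modifier. The short Python sketch below illustrates that encoding only. It is not taken from IBM's or HP's software, and the `compose` helper is a hypothetical name introduced for the example.

```python
import unicodedata

# One basic character and two modifiers (dependent vowel signs).
KA = "\u0915"      # DEVANAGARI LETTER KA
SIGN_I = "\u093F"  # DEVANAGARI VOWEL SIGN I
SIGN_U = "\u0941"  # DEVANAGARI VOWEL SIGN U


def compose(base: str, modifier: str) -> str:
    """Hypothetical helper: the base character followed by its modifier,
    which is the order in which Unicode text stores the pair."""
    return base + modifier


for sign in (SIGN_I, SIGN_U):
    syllable = compose(KA, sign)
    # Each syllable is rendered as one visual unit but holds two code points.
    print(syllable, [unicodedata.name(ch) for ch in syllable])
```

Either approach to input, a key combination or a tap plus a gesture, ultimately has to produce such a two-code-point sequence for each modified character.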
IBM, by contrast, has opted for a technology based on speech recognition, both because it is simpler and because it can be used by the large number of Indians who are semi-literate or unfamiliar with a computer keyboard. The dictionary IRL developed for the speech recognition system has more than 75,000 Hindi words, with a provision for adding new words, Verma said.
One of the challenges in developing a speech recognition system for Hindi was that words in the language are often pronounced quite differently in various parts of the country. "We had to come up with multiple pronunciations for a given word in Hindi, and include them in the dictionary, and get them recognized (by the system) in the testing phase," Verma said.
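A dictionary that stores several pronunciations per word, as Verma describes, can be pictured as a mapping from each word to a list of phoneme sequences. The Python sketch below is a minimal illustration under stated assumptions, not IRL's design. The `PronunciationLexicon` class is hypothetical, and the phoneme strings are ad hoc romanizations chosen for the example rather than the phone set of any real recognizer.

```python
from collections import defaultdict


class PronunciationLexicon:
    """Hypothetical lexicon mapping each word to one or more pronunciations."""

    def __init__(self):
        self._variants = defaultdict(list)    # word -> list of phoneme tuples
        self._by_phonemes = defaultdict(set)  # phoneme tuple -> set of words

    def add(self, word, phonemes):
        """Register one pronunciation for a word; duplicates are ignored."""
        key = tuple(phonemes)
        if key not in self._variants[word]:
            self._variants[word].append(key)
        self._by_phonemes[key].add(word)

    def variants(self, word):
        """Return all pronunciations stored for a word."""
        return list(self._variants.get(word, []))

    def lookup(self, phonemes):
        """Return the words that match a recognized phoneme sequence."""
        return sorted(self._by_phonemes.get(tuple(phonemes), set()))


lexicon = PronunciationLexicon()
# Illustrative entries: the word "वह" with a formal and a colloquial pronunciation.
lexicon.add("वह", ["v", "a", "h"])
lexicon.add("वह", ["v", "o"])

print(lexicon.variants("वह"))
print(lexicon.lookup(["v", "o"]))
```

Because both variants point back to the same written word, the recognizer can output one spelling regardless of which pronunciation the speaker used.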
The core technology, developed by IRL, can be used in PC applications such as data entry, letter writing and e-mail, as well as to speech-enable automated teller machines (ATMs), kiosks and other devices, Dias said. The software can also be used for issuing commands to the computer and for interactive voice response (IVR) applications in telephony, he added. Because the software supports Unicode, it can be integrated with a number of word processing and e-mail applications, including those from Microsoft, Verma said.