<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>IVR World &#187; Automatic Speech Recognition</title>
	<atom:link href="http://www.ivrsworld.com/tag/automatic-speech-recognition/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.ivrsworld.com</link>
	<description>IVR, Interactive Voice Response System, CTI Applications</description>
	<lastBuildDate>Fri, 10 Feb 2012 09:25:15 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Top 3 reasons not to use ASR in IVR Systems in India</title>
		<link>http://www.ivrsworld.com/ivr-tips/top-3-reasons-not-to-use-asr-in-ivr-systems-in-india/</link>
		<comments>http://www.ivrsworld.com/ivr-tips/top-3-reasons-not-to-use-asr-in-ivr-systems-in-india/#comments</comments>
		<pubDate>Wed, 10 Nov 2010 06:30:37 +0000</pubDate>
		<dc:creator>Uttam Pegu</dc:creator>
				<category><![CDATA[ivr tips]]></category>
		<category><![CDATA[asr]]></category>
		<category><![CDATA[Automatic Speech Recognition]]></category>
		<category><![CDATA[DTMF Keypress]]></category>
		<category><![CDATA[india]]></category>
		<category><![CDATA[ivr developer]]></category>
		<category><![CDATA[ivr in India]]></category>
		<category><![CDATA[ivr in rural india]]></category>
		<category><![CDATA[IVR Vendor]]></category>
		<category><![CDATA[Keypress]]></category>
		<category><![CDATA[Mobile user]]></category>
		<category><![CDATA[voice portal]]></category>

		<guid isPermaLink="false">http://www.ivrsworld.com/?p=1617</guid>
		<description><![CDATA[Most probably, the debate on merits and demerits of using ASR ( Automatic Speech Recognition) would continue for some time to come, but in case of India, there are three compelling reason avoid using ASR where simple, old, robust and universally understood &#8216;keypress&#8217; can do. While ASR can surely speed up and increase interactivity of [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Most probably, the debate on merits and demerits of using ASR ( Automatic Speech Recognition) would continue for some time to come, but in case of India, there are three compelling reason avoid using ASR where simple, old, robust and universally understood &#8216;keypress&#8217; can do. While ASR can surely speed up and increase interactivity of an IVR system, in Indian scenario, it can be a big nuisance, specially for IVR vendors providing IVR to be used nationwide. </p>
<p>Here are my reasons:-</p>
<p><strong>1. Poor Recognition due to background noise</strong></p>
<p>  With due respect to all ASR engines, India is a loud country with lot of noises all around! So, with lot of background noise, it is not possible to recognise words or phrases with high degree of accuracy. Indian cities are noisy with lot of noise pollution all around. Even Indian houses are not well ventilated. Indian villages are crowed with various domestic animal who create lot of sounds. </p>
<p>Because of this background noises, good ASR engines also fail to perform properly and end up with poor accuracy. But users will never appreciate the &#8216;genuine&#8217; technical problems behind this, but the blame will fall squarely on IVR developer or IVR Vendor.<br />
<span id="more-1617"></span><br />
<strong>2. Languages and Accent</strong></p>
<p>  India is home to 1652 languages. Out of these, 29 languages are spoken by more than more than 1 million people  and another 122 are spoken by more than 10,000. It has officially recognised 22 languages. English is widely spoken and understood all around the country but India also has more than 250 million illiterate people.<br />
So with this statistics, one can easily figure out the changes in accents for speaking even English numerals, forget about English words or Phrases!  </p>
<p>  This lack of common accent for speaking out any word or phrase prevent any ASR engine detecting them accurately. But, everyone who has dialed into an IVR system, already know how to press a key from the telephone instrument or the mobile. So, in order to wider use of IVR as well as accuracy, ASR should be avoided.</p>
<p><strong>3. Poor Mobile Signal</strong></p>
<p>Most of the recent telephone users in India are mobile phone users. Apart from big cities, large areas as well as large pockets of cities too, have poor network signal and keeps fluctuating rapidly. Due to which, two human may be able to somehow communicate, but in case of IVR, there chances that it would miss some voices, part of voice which are vital for recognising a word or phrase by ASR. </p>
<p> So, even the best ASR would fail to detect words accurately resulting in caller dissatisfaction, call abandonment which will defeat the very reason for using an IVR system. </p>
<p>While there would be  remedies, tricks and techniques to address these problems, but these are top reasons if some one wants to avoid using ASR in India! </p>
]]></content:encoded>
			<wfw:commentRss>http://www.ivrsworld.com/ivr-tips/top-3-reasons-not-to-use-asr-in-ivr-systems-in-india/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>IVR Menu System</title>
		<link>http://www.ivrsworld.com/ivr/ivr-menu-system/</link>
		<comments>http://www.ivrsworld.com/ivr/ivr-menu-system/#comments</comments>
		<pubDate>Tue, 02 Feb 2010 06:01:13 +0000</pubDate>
		<dc:creator>Uttam Pegu</dc:creator>
				<category><![CDATA[ivr]]></category>
		<category><![CDATA[asr]]></category>
		<category><![CDATA[Automatic Speech Recognition]]></category>
		<category><![CDATA[database integration]]></category>
		<category><![CDATA[DTMF]]></category>
		<category><![CDATA[dynamic ivr menu]]></category>
		<category><![CDATA[dynamic menu]]></category>
		<category><![CDATA[ivr menu]]></category>
		<category><![CDATA[ivr menu options]]></category>
		<category><![CDATA[ivr menu system]]></category>
		<category><![CDATA[static ivr menu]]></category>
		<category><![CDATA[TTS]]></category>
		<category><![CDATA[voice over artist]]></category>
		<category><![CDATA[Voice Prompts]]></category>
		<category><![CDATA[voice user interface]]></category>
		<category><![CDATA[vui design]]></category>
		<category><![CDATA[welcome prompt]]></category>

		<guid isPermaLink="false">http://www.ivrsworld.com/?p=1235</guid>
		<description><![CDATA[Like any other computer software, IVR too work inputs of responses provided by the user. Though, a computer application software has widely used input devices like keyboard, mouse, voice, even hand movement, IVR applications have limited, only two types of inputs or responses. One is telephone keypad which has 12 keys and automatic speech recognition! [...]]]></description>
			<content:encoded><![CDATA[<p></p><p>Like any other computer software, IVR too work inputs of responses provided by the user. Though, a computer application software has widely used input devices like keyboard, mouse, voice, even hand movement, IVR applications have limited, only two types of inputs or responses. One is telephone keypad which has 12 keys and automatic speech recognition!</p>
<p>Another specific difference between a normal computer application software and a IVR software is, IVR can provide the options sequentially in voice while computer application software can software options in various menus, buttons, drop down menu, quick links using related graphics etc. So IVR provides its options or tasks it can perform to a caller in series of voice announcement along with key option to press or word, phrase to speak for choosing the tasks. This is known as <strong>IVR menu</strong>.</p>
<p>With advent of hosted IVR services, IVR application has become designing of this menu system only, while the integration of CTI hardware with various PBX systems have become standard modules provided by hosted IVR service provider.</p>
<p><span id="more-1235"></span>Irrespective of direction of call of the IVR ( inbound IVR or outbound IVR), menu starts as soon as the call is picked up. Normally, IVR starts with a &#8220;welcome prompt&#8221; typically welcoming the caller with greetings and quick short brief about the service, organisation behind the IVR.</p>
<p>IVR menus can be divided into two types depending on how user give commands to IVR application or how user responds to its menu options, DTMF(DualTone Multiple Frequency) or Telephone driven and ASR (Automatic Speech Recognition) driven.  Both have advantage and disadvantages and are widely used. ( <a href="http://www.ivrsworld.com/general/the-big-question-voice-enabled-menu-or-dtmf-pressed-menu/">Click here</a> for more about DTMF driven menu or ASR driven menu)</p>
<p>Apart from above two types of IVR menus, it can also be divided another two types depending on how menu options are generated.</p>
<ol>
<li> Menu options can be recorded using a voice artist in studio and played back.</li>
<li> Menu options can be generated using TTS (Text to Speech) converters.</li>
</ol>
<p>Again, choices in a menu, may be static or dynamic. When the choices in a menu are fixed for the IVR system, than the menu can be called as static menu while if the menu options changes dynamically depending on caller id, time of the day etc., than the menu can be called dynamic. For example:-</p>
<p>In a call center IVR, if a registered customer dials, he may be given different options while an unregistered customer would be presented a different options in the IVR menu.</p>
<p>Many people do not like TTS generated menus as one can make out the difference between human voice and TTS generated voice, specially in India, it has mainly to do with accent of the voice. TTS can be very useful in dynamic menu as well as presenting data retrieved from database where playing recorded voice may not be possible all the time.</p>
<p>Usability of an IVR depends mainly on the design of IVR menu. For more about usability one may check this : <a href="http://www.ivrsworld.com/advanced-ivrs/usability-guidelines-of-ivr-systems/">Usability of IVR systems</a>.</p>
<p>An IVR user is interacts with IVR through it menu only. So it is very important to design IVR menu which is easily understandable to first time callers. Previously people used Visio like tools to design IVR menu. But now a days, almost all IVR development tools come with powerful, feature rich visual IVR menu designer giving DYI(Do It Yourself) feature for IVR customers.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.ivrsworld.com/ivr/ivr-menu-system/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Automatic Speech Recognition in voice portal</title>
		<link>http://www.ivrsworld.com/automatic-speech-recognition/speech-recognition-in-voice-portal/</link>
		<comments>http://www.ivrsworld.com/automatic-speech-recognition/speech-recognition-in-voice-portal/#comments</comments>
		<pubDate>Mon, 09 Nov 2009 09:27:09 +0000</pubDate>
		<dc:creator>Uttam Pegu</dc:creator>
				<category><![CDATA[Automatic Speech Recognition]]></category>
		<category><![CDATA[Speech Recognition]]></category>
		<category><![CDATA[voice portal]]></category>
		<category><![CDATA[asr]]></category>
		<category><![CDATA[DTMF]]></category>
		<category><![CDATA[dtmf dialing]]></category>
		<category><![CDATA[Indian Telecom]]></category>
		<category><![CDATA[ivr]]></category>
		<category><![CDATA[mrcp]]></category>
		<category><![CDATA[Pulse dialing]]></category>
		<category><![CDATA[speech enabled IVR menu]]></category>
		<category><![CDATA[tag]]></category>
		<category><![CDATA[Telecom]]></category>
		<category><![CDATA[Telecom news India]]></category>
		<category><![CDATA[tone dialing]]></category>
		<category><![CDATA[touch tone]]></category>
		<category><![CDATA[voice recognition]]></category>
		<category><![CDATA[Voice VAS]]></category>
		<category><![CDATA[words]]></category>

		<guid isPermaLink="false">http://www.ivrsworld.com/?p=1056</guid>
		<description><![CDATA[What is speech recognition or automatic speech recognition ? Automatic speech recognition can be defined as a process by which spoken words are converted to text. Many people may confuse with voice recognition as speech recognition. Voice recognition is recognising the speaker of the voice, while automatic speech recognition recognise the words spoken. While speech [...]]]></description>
			<content:encoded><![CDATA[<p></p><p><strong>What is speech recognition or automatic speech recognition ?</strong><br />
      Automatic speech recognition can be defined as a process by which spoken words are converted to text. Many people may confuse with voice recognition as speech recognition. Voice recognition is recognising the speaker of the voice, while automatic speech recognition recognise the words spoken. While speech recognition is mainly used for menu selection, voice recognition is used for checking authorised user.</p>
<p><strong>How automatic speech recognition can be used in voice portal ?</strong><br />
   Voice portal is a large IVR application which provides many value added services to the voice portal user. Normally, voice portals are operated by telecom operators to provide information of their services, activating deactivating services, utility services like matrimonial, job search, listening to music and download, jokes etc. Since voice portals have large information to be provided to a caller, it needs to give out a long list of options. But the DTMF digits or touch tones on a mobile phone or telephone instrument is limited. So automatic speech recognition is used as an alternative to touch tone or DTMF input for selecting a menu option in the voice portal or IVR.</p>
<p><strong>Advantages of Automatic Speech Recognition over DTMF in Voice Portal: </strong><br />
  1. Caller can choose a menu option quickly without having to listen to boring long list of menus. IVR can become truly interactive with its caller using same human voice.<br />
 <span id="more-1056"></span> 2. Using an IVR was always a problem as one he has to remove it from his ear to look at the keypad and press a key. This can be again quite irritating for a voice portal user. Implementation of ASR completely remove this problem.<br />
  3. Change of menu or introduction of new options a menu, or shuffling of options in a menu becomes easier as menu options are the words most commonly used by everybody for a particular option. Using a touchtone or DTMF menu, any option was assigned to a DTMF key or a number. With ASR, it is assigned to the word itself.<br />
  4. Not many people are aware of # and * keys as they are hardly used in normal operation while IVR seems to use them for some options for very long menus. ASR does not face this kind of problem.</p>
<p>So, one may find it very useful in a developer point of view to use ASR in voice portal as it gives more flexibility for designing menu and presentation of data. But as user point of view it may not that user friendly as touch tone. This mainly due to lack of 100% accuracy in recognising the word spoken by the caller. Wrong detection of words by ASR may result in very awkward menu options and information fetching. One way of tackling this problem is to confirm a user of his chosen option in the menu with a YES or NO for every menu option. But it would be too irritating for a user! A voice portal is normally a premium service where irritating caller is the biggest crime one can commit!</p>
<p><strong>Problems of Speech recognition<br />
</strong>1. Accuracy of recognition<br />
   Accuracy of recognition can be thought as converting words spoken by a user accurately to ts corresponding text.<br />
   As per information available ( by Googling and visiting ASR engine provider websites), almost all ASR engine has high accuracy for detecting two words, YES and NO. Apart from them, other words like numbers, date of births etc. has lower accuracy in recognising. For example, accuracy of recognition of numbers 1,2,3 etc vary from 87 to 91% depending on ASR engine. Accuracy of recognising other natural English language is fare much worse! As per report, recognising departments in company is maximum 85%!<br />
  Though accuracy can be improved by training of ASR, but it is not practical at all and does not serve a voice portal.</p>
<p><strong>2. Different accent<br />
</strong>  Using ASR in a big country where people speak different languages or same language with different accents,speech recognition accuracy is bound to fare worse.</p>
<p><strong>3. Confirming YES/NO again and again is irritating<br />
</strong> People may design IVR very intelligently to confirm for any doubtful word recognition by YES or NO, but it is still irritating for many people and it slows down the time for fetching an information. In a voice portal where caller pays by minute of usage, it may not work in the interest of voice portal users.</p>
<p>Still, automatic speech recognition (ASR) is  used widely in voice portal now a days and here are the few services in voice portal where ASR may work well :-</p>
<p>1. Contests<br />
2. Classifieds<br />
3. City wise weather information</p>
]]></content:encoded>
			<wfw:commentRss>http://www.ivrsworld.com/automatic-speech-recognition/speech-recognition-in-voice-portal/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

