<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://blogs.msdn.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>Keyword spotting</title><link>http://blogs.msdn.com/sprague/archive/2007/04/25/keyword-spotting.aspx</link><description>Lots of buzz about using SR technology to pick up keywords in an audio stream. See Robert Scoble's demo of Nexidia or this post by Eduardo Olvera. The basic idea has been around for years, especially if you believe the NSA has been listening in on phone</description><dc:language>en-US</dc:language><generator>CommunityServer 2.1 SP1 (Build: 61025.2)</generator><item><title>re: Keyword spotting</title><link>http://blogs.msdn.com/sprague/archive/2007/04/25/keyword-spotting.aspx#2300241</link><pubDate>Fri, 27 Apr 2007 18:04:07 GMT</pubDate><guid isPermaLink="false">91d46819-8472-40ad-a661-2c78acb4018c:2300241</guid><dc:creator>Drew Lanham</dc:creator><description>&lt;p&gt;I can agree a single word may not necessarily be representative of the content of a discussion. In our experience, however, the spoken word -- when looked at holistically -- can be an excellent indicator of content and context. To be clear, Nexidia is not proposing using a single word as an indictor of the appropriate ads to run. &amp;nbsp;In the video example, our technology has categorized the content (without the benefit of tags or metadata), determined the frequency of the word and the relationship to other words in time (e.g. camcorder within 10 seconds of video and or home movie). &amp;nbsp;Once content and context have been refined, this information is then passed to the ad server in order to be combined with other information (demographics, geography, search history, etc..) to allow the ad server to serve the highest value advertisement to the user at that point in time. The timing of the word was to illustrate accuracy but also that we could control timing of the ad to correspond with the content to increase frequency of the ad or ads. &amp;nbsp;The timing and frequency of the ad is important because it isn’t intrusive to the experience (e.g. frontroll ad), so there is no harm to the user in it being changed throughout playback. &lt;/p&gt;
&lt;p&gt;Nexidia is not just word spotting, we are rendering the spoken word content fully searchable in over 33 languages. &amp;nbsp;Our technology is used to render tens of thousands of hours of audio content searchable every day in call centers, legal, media and government applications. &amp;nbsp;We add an analytics layer on this capability to extract actionable knowledge from this otherwise unusable volume of unstructured data. &amp;nbsp;Using a single, dual core dual processor box, we index audio or video searchable at 340 &amp;nbsp;faster than real time. &amp;nbsp;This equates to 8,000 hours of content per day per box. As a result of this kind of efficiency, the cost of the infrastructure is not a barrier to getting started experimenting in the promising spaces of media search, categorization and contextual ad targeting in audio and video. &amp;nbsp;&lt;/p&gt;</description></item></channel></rss>