<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Wordchillies &#187; latent semantic indexing</title>
	<atom:link href="http://www.papertip.com/blog/tag/latent-semantic-indexing/feed" rel="self" type="application/rss+xml" />
	<link>http://www.papertip.com/blog</link>
	<description>Letters &#38; Words!</description>
	<lastBuildDate>Fri, 06 Aug 2010 10:42:26 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>LSI – The Search Engine Think-Tank</title>
		<link>http://www.papertip.com/blog/lsi-%e2%80%93-the-search-engine-think-tank.html</link>
		<comments>http://www.papertip.com/blog/lsi-%e2%80%93-the-search-engine-think-tank.html#comments</comments>
		<pubDate>Fri, 06 Nov 2009 06:07:01 +0000</pubDate>
		<dc:creator>Adam</dc:creator>
				<category><![CDATA[Content & Google]]></category>
		<category><![CDATA[Essence of Content!]]></category>
		<category><![CDATA[Letters & Words]]></category>
		<category><![CDATA[Trivial Tips!]]></category>
		<category><![CDATA[Words in Advertising]]></category>
		<category><![CDATA[applied semantics]]></category>
		<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[competitive search terms]]></category>
		<category><![CDATA[content creation]]></category>
		<category><![CDATA[ideal search result]]></category>
		<category><![CDATA[keyword analysis]]></category>
		<category><![CDATA[latent semantic indexing]]></category>
		<category><![CDATA[LSI]]></category>
		<category><![CDATA[search platforms]]></category>
		<category><![CDATA[search process]]></category>

		<guid isPermaLink="false">http://www.papertip.com/blog/?p=184</guid>
		<description><![CDATA[LSI is different, a kind of ‘Artificial Intelligence’ really. Given the speed at which web-pages are being added to the net every day, and their sheer number, it is definitely a challenging task for any search engine to provide an ideal search result.]]></description>
			<content:encoded><![CDATA[<p><strong><img class="alignleft size-medium wp-image-185" src="http://www.papertip.com/blog/blog/wp-content/uploads/2009/11/web1-web4-300x200.jpg" alt="web1-web4" width="300" height="200" />Latent Semantic Analysis </strong>is primarily used by Google to detect spam, where a keyword is inserted excessively in order to fool the search engines into giving a page a higher ranking for that keyword. This was indeed achievable by writing meaningless templates with rotating synonyms, into which software inserted a keyword many times over. Thousands of pages targeting a specific keyword could be generated each hour, and people were making thousands each day using Adsense.</p>
<p>The principles of LSI for determining the <a href="http://www.papertip.com/web-services/seo-content"><strong>content of web-pages</strong></a> were first used by a small company called Oingo. Later renamed Applied Semantics, the company developed a search system for determining the relevance of <a href="http://www.papertip.com/web-services/site-content"><strong>page content</strong></a> for specific advert placement, and called it Adsense. Applied Semantics was bought by Google in April 2003, and its technology replaced Google's own system, which was still in development. Adsense was not developed by Google but purchased, and a good bargain at that.</p>
<p>However, the general opinion of LSI is slightly off the mark. Rather than just an algorithm bolted on to the search engine, LSI is better understood as a concept. Tie the phrase ‘Artificial Intelligence’ to LSI, and we see it in a better light.</p>
<p>When we search for Tiger Woods, the search does not simply put forth pages related to the separate keywords ‘tiger’ and ‘woods’; instead, what we get is what we call ‘relevance feedback’: the search presents a number of pages relating to golf. LSI is the most potent tool available to the search engines, as of now.</p>
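<p>To make the idea concrete, here is a minimal sketch of LSI-style matching on a toy corpus. Everything in it is an assumption made for illustration: the five tiny "pages", the function names, and the choice of two latent dimensions. It builds the latent space with a plain power iteration in place of a library SVD routine, and it is in no way a description of how any real search engine works. The point it shows is that a query for "woods" can score highly against a golf page that does not contain that word at all.</p>

```python
import math

# Toy corpus (hypothetical): three golf pages and two animal pages.
DOCS = ["woods golf", "golf masters", "woods masters", "cat jungle", "cat stripes"]


def term_vector(text, vocab):
    """Count how often each vocabulary term occurs in the text."""
    words = text.split()
    return [float(words.count(term)) for term in vocab]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def top_eigenpairs(matrix, k, iters=200):
    """Top-k eigenpairs of a symmetric matrix via power iteration with deflation."""
    n = len(matrix)
    work = [row[:] for row in matrix]
    pairs = []
    for _ in range(k):
        vec = [1.0 + i for i in range(n)]  # arbitrary non-zero start vector
        for _step in range(iters):
            nxt = [dot(row, vec) for row in work]
            norm = math.sqrt(dot(nxt, nxt))
            vec = [x / norm for x in nxt]
        value = dot(vec, [dot(row, vec) for row in work])
        pairs.append((value, vec))
        # Deflate: subtract the component just found so the next pass finds the next one.
        work = [[work[i][j] - value * vec[i] * vec[j] for j in range(n)] for i in range(n)]
    return pairs


def lsi_similarities(query, docs, k=2):
    """Cosine similarity between the query and each document in a k-dimensional latent space."""
    vocab = sorted({w for d in docs for w in d.split()})
    doc_vecs = [term_vector(d, vocab) for d in docs]
    # Document-by-document Gram matrix; its eigenvectors are the right singular vectors.
    gram = [[dot(a, b) for b in doc_vecs] for a in doc_vecs]
    basis = []
    for value, vec in top_eigenpairs(gram, k):
        sigma = math.sqrt(value)
        # Left singular vector: weighted sum of the document vectors, scaled by 1/sigma.
        basis.append([sum(vec[j] * doc_vecs[j][t] for j in range(len(docs))) / sigma
                      for t in range(len(vocab))])

    def project(text):
        tv = term_vector(text, vocab)
        return [dot(u, tv) for u in basis]

    def cosine(a, b):
        denom = math.sqrt(dot(a, a) * dot(b, b))
        return dot(a, b) / denom if denom else 0.0

    q = project(query)
    return [cosine(q, project(d)) for d in docs]


def shared_keywords(query, doc):
    """Number of distinct words the query and the document have in common."""
    return len(set(query.split()).intersection(doc.split()))


if __name__ == "__main__":
    sims = lsi_similarities("woods", DOCS)
    for doc, sim in zip(DOCS, sims):
        print(f"{doc!r}: shared keywords={shared_keywords('woods', doc)}, latent similarity={sim:.2f}")
```

<p>In this toy setup the page "golf masters" shares no keyword with the query "woods", yet it lands next to it in the latent space because both words keep company with the same neighbours. The animal pages stay far away. A real collection is far noisier, so treat this as an illustration of the principle only.</p>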
<p>LSI-enabled search platforms are more effective and make better sense of a query because they do not focus only on a bunch of keywords. Conventional search engines were unable to give good results because they based their results on ‘keyword’ analysis alone.</p>
<p>Keyword-only engines were therefore unable to cope with:</p>
<ul>
<li><strong>Similar words with different meanings</strong> – and we have a whole lot of them.</li>
<li><strong>Words similar in meaning but spelled differently</strong> – sickness/vomiting.</li>
<li><strong>Singular and plural forms of words</strong> – die/dice, man/men.</li>
<li><strong>Branches of words from the same root</strong> – like ‘bath’, ‘bathe’, ‘bathing’, or ‘bathed.’</li>
</ul>
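<p>The last two items in the list can be illustrated with a small sketch. The suffix list and the lookup table of irregular plurals below are hypothetical and deliberately crude, nothing like a production stemmer. They only show why an exact keyword comparison treats ‘bath’ and ‘bathing’ as unrelated, and how a simple normalisation step brings them together.</p>

```python
# Hypothetical lookup for irregular plurals; the entries come from the list above.
IRREGULAR = {"men": "man", "dice": "die"}
# Crude suffix list, checked in order; a real stemmer uses far more careful rules.
SUFFIXES = ("ing", "ed", "es", "e", "s")


def normalize(word):
    """Map a word to a rough base form so related forms compare equal."""
    word = word.lower()
    if word in IRREGULAR:
        return IRREGULAR[word]
    for suffix in SUFFIXES:
        stem = word[: len(word) - len(suffix)]
        # Only strip a suffix if at least three letters remain.
        if word.endswith(suffix) and len(stem) >= 3:
            return stem
    return word


def exact_match(query, page_word):
    """Keyword-only comparison: the two words must be identical."""
    return query.lower() == page_word.lower()


def normalized_match(query, page_word):
    """Comparison after reducing both words to a rough base form."""
    return normalize(query) == normalize(page_word)


if __name__ == "__main__":
    for form in ("bath", "bathe", "bathing", "bathed"):
        print(form, exact_match("bath", form), normalized_match("bath", form))
    print("men", exact_match("man", "men"), normalized_match("man", "men"))
```

<p>Note what this does not solve: words with the same spelling but different meanings, and pairs such as sickness/vomiting, look no more alike after normalisation than before. Those cases need a view of the surrounding context, not of the word alone.</p>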
<p>LSI is different, a kind of ‘Artificial Intelligence’ really. Given the speed at which web-pages are being added to the net every day, and their sheer number, it is definitely a challenging task for any search engine to provide an ideal search result.</p>
<p>The fact is that LSI fits this slot in the search process, and it can widely &amp; wisely enhance the quality of any search engine that uses it.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.papertip.com/blog/lsi-%e2%80%93-the-search-engine-think-tank.html/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
