<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Porter Stemming algorithm for search</title>
	<atom:link href="http://www.codediesel.com/php/porter-stemming-in-search/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.codediesel.com/php/porter-stemming-in-search/</link>
	<description>/* PHP &#38; MySQL Journal */</description>
	<lastBuildDate>Fri, 27 Jan 2012 16:39:42 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
	<item>
		<title>By: Porter Stemming algorithm for search : CodeDiesel</title>
		<link>http://www.codediesel.com/php/porter-stemming-in-search/comment-page-1/#comment-1202</link>
		<dc:creator>Porter Stemming algorithm for search : CodeDiesel</dc:creator>
		<pubDate>Sat, 02 May 2009 12:09:10 +0000</pubDate>
		<guid isPermaLink="false">http://www.codediesel.com/?p=724#comment-1202</guid>
		<description>[...] Read more here: Porter Stemming algorithm for search : CodeDiesel [...]</description>
		<content:encoded><![CDATA[<p>[...] Read more here: Porter Stemming algorithm for search : CodeDiesel [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sameer Borate&#8217;s Blog: Porter Stemming algorithm for search : Dragonfly Networks</title>
		<link>http://www.codediesel.com/php/porter-stemming-in-search/comment-page-1/#comment-1199</link>
		<dc:creator>Sameer Borate&#8217;s Blog: Porter Stemming algorithm for search : Dragonfly Networks</dc:creator>
		<pubDate>Thu, 30 Apr 2009 04:50:51 +0000</pubDate>
		<guid isPermaLink="false">http://www.codediesel.com/?p=724#comment-1199</guid>
		<description>[...] a recent post to his blog Sameer looks at implementing a Stemming algorithm to search an array of words. It uses this library (as written by Richard [...]</description>
		<content:encoded><![CDATA[<p>[...] a recent post to his blog Sameer looks at implementing a Stemming algorithm to search an array of words. It uses this library (as written by Richard [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sameer Borate&#8217;s Blog: Porter Stemming algorithm for search : WebNetiques, LLC : Website Developers in Minneapolis, MN</title>
		<link>http://www.codediesel.com/php/porter-stemming-in-search/comment-page-1/#comment-1198</link>
		<dc:creator>Sameer Borate&#8217;s Blog: Porter Stemming algorithm for search : WebNetiques, LLC : Website Developers in Minneapolis, MN</dc:creator>
		<pubDate>Thu, 30 Apr 2009 04:48:02 +0000</pubDate>
		<guid isPermaLink="false">http://www.codediesel.com/?p=724#comment-1198</guid>
		<description>[...] a recent post to his blog Sameer looks at implementing a Stemming algorithm to search an array of words. It uses this library (as written by Richard [...]</description>
		<content:encoded><![CDATA[<p>[...] a recent post to his blog Sameer looks at implementing a Stemming algorithm to search an array of words. It uses this library (as written by Richard [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ivo Jansch</title>
		<link>http://www.codediesel.com/php/porter-stemming-in-search/comment-page-1/#comment-1197</link>
		<dc:creator>Ivo Jansch</dc:creator>
		<pubDate>Wed, 29 Apr 2009 22:32:32 +0000</pubDate>
		<guid isPermaLink="false">http://www.codediesel.com/?p=724#comment-1197</guid>
		<description>This is indeed a very useful algorithm. It&#039;s one of the algo&#039;s I use in http://flackr.net to match tweets. Since tweets have only a limited amount of words, comparing them by porterstem helps combine those that belong together.

@guy: there is generally no need to output them. Like sameer said it&#039;s not possible, but regardless of that you generally don&#039;t need it. You use it internally to do the search comparisons; what you display are the actual search terms as entered by the user and the search results.</description>
		<content:encoded><![CDATA[<p>This is indeed a very useful algorithm. It&#8217;s one of the algo&#8217;s I use in <a href="http://flackr.net" rel="nofollow">http://flackr.net</a> to match tweets. Since tweets have only a limited amount of words, comparing them by porterstem helps combine those that belong together.</p>
<p>@guy: there is generally no need to output them. Like sameer said it&#8217;s not possible, but regardless of that you generally don&#8217;t need it. You use it internally to do the search comparisons; what you display are the actual search terms as entered by the user and the search results.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Naz</title>
		<link>http://www.codediesel.com/php/porter-stemming-in-search/comment-page-1/#comment-1194</link>
		<dc:creator>Naz</dc:creator>
		<pubDate>Sun, 26 Apr 2009 15:39:10 +0000</pubDate>
		<guid isPermaLink="false">http://www.codediesel.com/?p=724#comment-1194</guid>
		<description>This algorithm has a chance to match word about 60% at least php version.
For example days of the week or months</description>
		<content:encoded><![CDATA[<p>This algorithm has a chance to match word about 60% at least php version.<br />
For example days of the week or months</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: sameer</title>
		<link>http://www.codediesel.com/php/porter-stemming-in-search/comment-page-1/#comment-1191</link>
		<dc:creator>sameer</dc:creator>
		<pubDate>Thu, 23 Apr 2009 05:03:42 +0000</pubDate>
		<guid isPermaLink="false">http://www.codediesel.com/?p=724#comment-1191</guid>
		<description>What you are trying to achieve is the reverse of Porter Stemming. Basically you want to generate inflections from the stem word which is not possible using this algorithm, as it is a simple algorithm without any dictionary lookup. To generate the word inflections as given by you will require a stemmer with a dictionary and understanding of some grammar.

One possibility I could think of is to use a plain dictionary with the above algorithm and extract those words that reduce down to a particular stem.  This words would be than the inflections of the original stem. For e.g to find the inflections of the word &#039;work&#039; we would pass all the words starting with &#039;work&#039; (or &#039;w&#039; to make it simpler) through the stemmer and only select those that reduce down to &#039;work&#039;. Its a crude method though, devoid of any understanding of verbs.

Or go with &lt;a target=&quot;_blank&quot; href=&quot;http://www.nltk.org/Home&quot; rel=&quot;nofollow&quot;&gt;NLTK&lt;/a&gt;.</description>
		<content:encoded><![CDATA[<p>What you are trying to achieve is the reverse of Porter Stemming. Basically you want to generate inflections from the stem word which is not possible using this algorithm, as it is a simple algorithm without any dictionary lookup. To generate the word inflections as given by you will require a stemmer with a dictionary and understanding of some grammar.</p>
<p>One possibility I could think of is to use a plain dictionary with the above algorithm and extract those words that reduce down to a particular stem.  This words would be than the inflections of the original stem. For e.g to find the inflections of the word &#8216;work&#8217; we would pass all the words starting with &#8216;work&#8217; (or &#8216;w&#8217; to make it simpler) through the stemmer and only select those that reduce down to &#8216;work&#8217;. Its a crude method though, devoid of any understanding of verbs.</p>
<p>Or go with <a target="_blank" href="http://www.nltk.org/Home" rel="nofollow">NLTK</a>.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Guy Patterson</title>
		<link>http://www.codediesel.com/php/porter-stemming-in-search/comment-page-1/#comment-1189</link>
		<dc:creator>Guy Patterson</dc:creator>
		<pubDate>Wed, 22 Apr 2009 17:34:16 +0000</pubDate>
		<guid isPermaLink="false">http://www.codediesel.com/?p=724#comment-1189</guid>
		<description>How would you output the stems? In other words, based off your example, how would I output: 

blog
blogging
blogged
blogger

tip
tips
tipped
tipping
tipper

late
later
latest

work
worker
workers
working

Or is this even a possibility? 

Thanks, 

Guy
https://www.nullamatix.com/pubkey.txt</description>
		<content:encoded><![CDATA[<p>How would you output the stems? In other words, based off your example, how would I output: </p>
<p>blog<br />
blogging<br />
blogged<br />
blogger</p>
<p>tip<br />
tips<br />
tipped<br />
tipping<br />
tipper</p>
<p>late<br />
later<br />
latest</p>
<p>work<br />
worker<br />
workers<br />
working</p>
<p>Or is this even a possibility? </p>
<p>Thanks, </p>
<p>Guy<br />
<a href="https://www.nullamatix.com/pubkey.txt" rel="nofollow">https://www.nullamatix.com/pubkey.txt</a></p>
]]></content:encoded>
	</item>
</channel>
</rss>

