<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Web scraping tutorial</title>
	<atom:link href="http://www.codediesel.com/php/web-scraping-in-php-tutorial/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.codediesel.com/php/web-scraping-in-php-tutorial/</link>
	<description>/* PHP &#38; MySQL Journal */</description>
	<lastBuildDate>Fri, 27 Jan 2012 16:39:42 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
	<item>
		<title>By: Pearls</title>
		<link>http://www.codediesel.com/php/web-scraping-in-php-tutorial/comment-page-1/#comment-1989</link>
		<dc:creator>Pearls</dc:creator>
		<pubDate>Sat, 29 May 2010 12:49:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.codediesel.com/?p=347#comment-1989</guid>
		<description>Nice post. I will bookmark it.</description>
		<content:encoded><![CDATA[<p>Nice post. I will bookmark it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: JenniC</title>
		<link>http://www.codediesel.com/php/web-scraping-in-php-tutorial/comment-page-1/#comment-1919</link>
		<dc:creator>JenniC</dc:creator>
		<pubDate>Mon, 10 May 2010 19:57:40 +0000</pubDate>
		<guid isPermaLink="false">http://www.codediesel.com/?p=347#comment-1919</guid>
		<description>Nice discussion.

I use web scraping on pages from our own website. I use this script.


http://www.biterscripting.com/helppages/SS_WebPageToText.html


It&#039;s pretty simple to use.</description>
		<content:encoded><![CDATA[<p>Nice discussion.</p>
<p>I use web scraping on pages from our own website. I use this script.</p>
<p><a href="http://www.biterscripting.com/helppages/SS_WebPageToText.html" rel="nofollow">http://www.biterscripting.com/helppages/SS_WebPageToText.html</a></p>
<p>It&#8217;s pretty simple to use.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: sameer</title>
		<link>http://www.codediesel.com/php/web-scraping-in-php-tutorial/comment-page-1/#comment-1899</link>
		<dc:creator>sameer</dc:creator>
		<pubDate>Tue, 04 May 2010 05:20:14 +0000</pubDate>
		<guid isPermaLink="false">http://www.codediesel.com/?p=347#comment-1899</guid>
		<description>Hi Sushil!

Web scraping is not hacking. Web scraping is just a technique to gather text information from a web page, just like web robots do. Many sites these days provide APIs to access information from their sites. 

Regarding the legal issues, it depends on the particular sites &#039;Terms &amp; conditions&#039;. Check the &#039;Legal Issues&#039; section of this article :

http://en.wikipedia.org/wiki/Web_scraping

If you want to make a career in scraping, do it in a broad way. That is also learn about text processing, text mining. These are all related fields. But you will have to market your skills yourself. Most companies don&#039;t advertise for such skills, even though they may need them. 

The best way to get started in scraping is to learn Perl and Regular Expressions. You can do web scraping in PHP, but its like digging a hole with a screwdriver; it can be done, but it will take a hell longer then if you would have used a shovel. Simple scraping in PHP is ok, but for complex things nothing beats Perl. There are loads of libraries in Perl for the same.

Even if after some time your career in scraping does not pan out, you will have learned a whole lot about text processing and regular expressions, which can be invaluable in any software field.</description>
		<content:encoded><![CDATA[<p>Hi Sushil!</p>
<p>Web scraping is not hacking. Web scraping is just a technique to gather text information from a web page, just like web robots do. Many sites these days provide APIs to access information from their sites. </p>
<p>Regarding the legal issues, it depends on the particular sites &#8216;Terms &#038; conditions&#8217;. Check the &#8216;Legal Issues&#8217; section of this article :</p>
<p><a href="http://en.wikipedia.org/wiki/Web_scraping" rel="nofollow">http://en.wikipedia.org/wiki/Web_scraping</a></p>
<p>If you want to make a career in scraping, do it in a broad way. That is also learn about text processing, text mining. These are all related fields. But you will have to market your skills yourself. Most companies don&#8217;t advertise for such skills, even though they may need them. </p>
<p>The best way to get started in scraping is to learn Perl and Regular Expressions. You can do web scraping in PHP, but its like digging a hole with a screwdriver; it can be done, but it will take a hell longer then if you would have used a shovel. Simple scraping in PHP is ok, but for complex things nothing beats Perl. There are loads of libraries in Perl for the same.</p>
<p>Even if after some time your career in scraping does not pan out, you will have learned a whole lot about text processing and regular expressions, which can be invaluable in any software field.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sushil</title>
		<link>http://www.codediesel.com/php/web-scraping-in-php-tutorial/comment-page-1/#comment-1898</link>
		<dc:creator>Sushil</dc:creator>
		<pubDate>Mon, 03 May 2010 20:08:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.codediesel.com/?p=347#comment-1898</guid>
		<description>Hi...

I am listen about the web scraping from my friend but i have some queries please explain this

Is web scraping is a part of hacking or hacking is possible using scraping

Is it legal Procedure ?

What&#039;s the career and future scope in this.</description>
		<content:encoded><![CDATA[<p>Hi&#8230;</p>
<p>I am listen about the web scraping from my friend but i have some queries please explain this</p>
<p>Is web scraping is a part of hacking or hacking is possible using scraping</p>
<p>Is it legal Procedure ?</p>
<p>What&#8217;s the career and future scope in this.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sid</title>
		<link>http://www.codediesel.com/php/web-scraping-in-php-tutorial/comment-page-1/#comment-1829</link>
		<dc:creator>Sid</dc:creator>
		<pubDate>Mon, 12 Apr 2010 02:55:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.codediesel.com/?p=347#comment-1829</guid>
		<description>Excellent post.. I&#039;m looking for some help with scraping javascript. I&#039;ve been trying to use YQL, but in vain. Could anybody please help me out...</description>
		<content:encoded><![CDATA[<p>Excellent post.. I&#8217;m looking for some help with scraping javascript. I&#8217;ve been trying to use YQL, but in vain. Could anybody please help me out&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Subramanyam Srikanth</title>
		<link>http://www.codediesel.com/php/web-scraping-in-php-tutorial/comment-page-1/#comment-1704</link>
		<dc:creator>Subramanyam Srikanth</dc:creator>
		<pubDate>Thu, 04 Mar 2010 05:13:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.codediesel.com/?p=347#comment-1704</guid>
		<description>Nice post. Can any one tell me how can we scrap through javascript since i need it.</description>
		<content:encoded><![CDATA[<p>Nice post. Can any one tell me how can we scrap through javascript since i need it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: funkuncut</title>
		<link>http://www.codediesel.com/php/web-scraping-in-php-tutorial/comment-page-1/#comment-1674</link>
		<dc:creator>funkuncut</dc:creator>
		<pubDate>Sun, 14 Feb 2010 10:28:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.codediesel.com/?p=347#comment-1674</guid>
		<description>Awesome!  this is super useful</description>
		<content:encoded><![CDATA[<p>Awesome!  this is super useful</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joe Duggins</title>
		<link>http://www.codediesel.com/php/web-scraping-in-php-tutorial/comment-page-1/#comment-1379</link>
		<dc:creator>Joe Duggins</dc:creator>
		<pubDate>Sun, 30 Aug 2009 03:46:07 +0000</pubDate>
		<guid isPermaLink="false">http://www.codediesel.com/?p=347#comment-1379</guid>
		<description>This was really helpful.  I just started using this set of php classes, and I&#039;m happy to have a good spot to begin, good work.</description>
		<content:encoded><![CDATA[<p>This was really helpful.  I just started using this set of php classes, and I&#8217;m happy to have a good spot to begin, good work.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: PHP Coder</title>
		<link>http://www.codediesel.com/php/web-scraping-in-php-tutorial/comment-page-1/#comment-1350</link>
		<dc:creator>PHP Coder</dc:creator>
		<pubDate>Wed, 19 Aug 2009 14:48:33 +0000</pubDate>
		<guid isPermaLink="false">http://www.codediesel.com/?p=347#comment-1350</guid>
		<description>Tips: if you write web scraper, its likely that the scraper function will repeats. for example: fetching page1.html, page2.html, ..., page10.html. in that case, set enough delay between request or you&#039;ll be kicked off the site as they see you as spam.</description>
		<content:encoded><![CDATA[<p>Tips: if you write web scraper, its likely that the scraper function will repeats. for example: fetching page1.html, page2.html, &#8230;, page10.html. in that case, set enough delay between request or you&#8217;ll be kicked off the site as they see you as spam.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Akshay</title>
		<link>http://www.codediesel.com/php/web-scraping-in-php-tutorial/comment-page-1/#comment-1284</link>
		<dc:creator>Akshay</dc:creator>
		<pubDate>Mon, 13 Jul 2009 05:13:46 +0000</pubDate>
		<guid isPermaLink="false">http://www.codediesel.com/?p=347#comment-1284</guid>
		<description>I have created an easy to use web scraper in the form of a WordPress plugin. It uses cURL and phpQuery (for parsing). It also provides with some output functions like clear, find and replace, output (text / html) and caching and error handling capabilities. Here&#039;s the link - http://wordpress.org/extend/plugins/wp-web-scrapper/</description>
		<content:encoded><![CDATA[<p>I have created an easy to use web scraper in the form of a WordPress plugin. It uses cURL and phpQuery (for parsing). It also provides with some output functions like clear, find and replace, output (text / html) and caching and error handling capabilities. Here&#8217;s the link &#8211; <a href="http://wordpress.org/extend/plugins/wp-web-scrapper/" rel="nofollow">http://wordpress.org/extend/plugins/wp-web-scrapper/</a></p>
]]></content:encoded>
	</item>
</channel>
</rss>

