<?xml version="1.0" encoding="UTF-8"?><rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" > <channel><title>Comments on: Scraping Google Front Page Results</title> <atom:link href="http://kbeezie.com/view/scraping-google-results/feed/" rel="self" type="application/rss+xml" /><link>http://kbeezie.com/view/scraping-google-results/</link> <description>There&#039;s no place like ::1</description> <lastBuildDate>Mon, 18 Apr 2011 01:47:04 +0000</lastBuildDate> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.3.2</generator> <item><title>By: rocker_e</title><link>http://kbeezie.com/view/scraping-google-results/#comment-853</link> <dc:creator>rocker_e</dc:creator> <pubDate>Thu, 30 Sep 2010 06:22:58 +0000</pubDate> <guid isPermaLink="false">http://kbeezie.com/?p=85#comment-853</guid> <description>JayJay : I tried adding &amp;num=100  but still get only the first page of results...how can I get all 100 in the array?Thanks!</description> <content:encoded><![CDATA[<p>JayJay : I tried adding &amp;num=100  but still get only the first page of results&#8230;how can I get all 100 in the array?</p><p>Thanks!</p> ]]></content:encoded> </item> <item><title>By: JayJay</title><link>http://kbeezie.com/view/scraping-google-results/#comment-615</link> <dc:creator>JayJay</dc:creator> <pubDate>Mon, 26 Jul 2010 17:45:33 +0000</pubDate> <guid isPermaLink="false">http://kbeezie.com/?p=85#comment-615</guid> <description>RE: Frank @ frankmacdonald.co.ukI tried your keyword (tiger woods latest update) with your code and returned &quot;Fatal error: Call to a member function find() on a non-object in /home/linkdir/public_html.....&quot;  Though I added + to the query to fill in the spaces between the keywords. eg: tiger+woods+latest+update as google doesn&#039;t seem to like the spaces in the string (I&#039;ve had no problems with the other SE&#039;s with spaces) If you add the plus to the spaces and run with$h3 = $g-&gt;find(&#039;h3.r&#039;, 0); $s = $g-&gt;find(&#039;div.s&#039;, 0); $a = $h3-&gt;find(&#039;a&#039;, 0);from the original script you should get a result :) without the plus added to your keyword spaces you return an empty array.</description> <content:encoded><![CDATA[<p>RE: Frank @ frankmacdonald.co.uk</p><p>I tried your keyword (tiger woods latest update) with your code and returned &#8220;Fatal error: Call to a member function find() on a non-object in /home/linkdir/public_html&#8230;..&#8221;  Though I added + to the query to fill in the spaces between the keywords. eg: tiger+woods+latest+update as google doesn&#8217;t seem to like the spaces in the string (I&#8217;ve had no problems with the other SE&#8217;s with spaces) If you add the plus to the spaces and run with</p><p>$h3 = $g-&gt;find(&#8216;h3.r&#8217;, 0);<br /> $s = $g-&gt;find(&#8216;div.s&#8217;, 0);<br /> $a = $h3-&gt;find(&#8216;a&#8217;, 0);</p><p>from the original script you should get a result <img src='http://kbeezie.net/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> without the plus added to your keyword spaces you return an empty array.</p> ]]></content:encoded> </item> <item><title>By: JayJay</title><link>http://kbeezie.com/view/scraping-google-results/#comment-614</link> <dc:creator>JayJay</dc:creator> <pubDate>Mon, 26 Jul 2010 17:14:43 +0000</pubDate> <guid isPermaLink="false">http://kbeezie.com/?p=85#comment-614</guid> <description>Smooth bit of coding there :) I&#039;ve had the Simple HTML DOM Parser class on the back-end of  my server for a while now and I&#039;ve been looking for some direction for it&#039;s use. The code works like a charm and you can extend http://www,google,com/search?hl=en&amp;q=  by adding (&amp;num=100) to get the first 100 results. eg: http://www,google,com/search?num=100&amp;hl=en&amp;q=</description> <content:encoded><![CDATA[<p>Smooth bit of coding there <img src='http://kbeezie.net/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> I&#8217;ve had the Simple HTML DOM Parser class on the back-end of  my server for a while now and I&#8217;ve been looking for some direction for it&#8217;s use. The code works like a charm and you can extend <a href="http://www,google,com/search?hl=en&#038;q=" rel="nofollow">http://www,google,com/search?hl=en&#038;q=</a> by adding (&amp;num=100) to get the first 100 results. eg: <a href="http://www,google,com/search?num=100&#038;hl=en&#038;q=" rel="nofollow">http://www,google,com/search?num=100&#038;hl=en&#038;q=</a></p> ]]></content:encoded> </item> <item><title>By: Frank</title><link>http://kbeezie.com/view/scraping-google-results/#comment-291</link> <dc:creator>Frank</dc:creator> <pubDate>Thu, 10 Dec 2009 03:16:07 +0000</pubDate> <guid isPermaLink="false">http://kbeezie.com/?p=85#comment-291</guid> <description>heh, i fixed it.replace:$h3 = $g-&gt;find(&#039;h3.r&#039;, 0); $s = $g-&gt;find(&#039;div.s&#039;, 0); $a = $h3-&gt;find(&#039;a&#039;, 0);with:$h3 = $g-&gt;find(&#039;h3.r&#039;, 0); $s = $g-&gt;find(&#039;div.s&#039;, 0); if($h3 != &quot;&quot;){ $a = $h3-&gt;find(&#039;a&#039;, 0); }if you dont understant why this is needed cheak out the recent chages google have made to some of there results pages by searching for: tiger woods latest update</description> <content:encoded><![CDATA[<p>heh, i fixed it.</p><p>replace:</p><p>$h3 = $g-&gt;find(&#8216;h3.r&#8217;, 0);<br /> $s = $g-&gt;find(&#8216;div.s&#8217;, 0);<br /> $a = $h3-&gt;find(&#8216;a&#8217;, 0);</p><p>with:</p><p>$h3 = $g-&gt;find(&#8216;h3.r&#8217;, 0);<br /> $s = $g-&gt;find(&#8216;div.s&#8217;, 0);<br /> if($h3 != &#8220;&#8221;){<br /> $a = $h3-&gt;find(&#8216;a&#8217;, 0);<br /> }</p><p>if you dont understant why this is needed cheak out the recent chages google have made to some of there results pages by searching for: tiger woods latest update</p> ]]></content:encoded> </item> <item><title>By: Frank</title><link>http://kbeezie.com/view/scraping-google-results/#comment-290</link> <dc:creator>Frank</dc:creator> <pubDate>Thu, 10 Dec 2009 02:45:01 +0000</pubDate> <guid isPermaLink="false">http://kbeezie.com/?p=85#comment-290</guid> <description>Hi mate,Sadly i have to report that this script no longer works due to some changes google have made to thier pages. A new ajax element has been added also contained in li.g. if you hit a page with the new ajax bit the script fails. It would be good if you could help fix the script so it works again because its  good one :)</description> <content:encoded><![CDATA[<p>Hi mate,</p><p>Sadly i have to report that this script no longer works due to some changes google have made to thier pages. A new ajax element has been added also contained in li.g. if you hit a page with the new ajax bit the script fails. It would be good if you could help fix the script so it works again because its  good one <img src='http://kbeezie.net/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /></p> ]]></content:encoded> </item> <item><title>By: Frank</title><link>http://kbeezie.com/view/scraping-google-results/#comment-154</link> <dc:creator>Frank</dc:creator> <pubDate>Sun, 08 Nov 2009 19:24:36 +0000</pubDate> <guid isPermaLink="false">http://kbeezie.com/?p=85#comment-154</guid> <description>Hello,This is very nicely done. It might be worth adding proxy functionality so people dont get blocked by google. I was wondering if you could explain to me a little how I would go about using a foreach loop to put the data into a database table with the colums TITLE, LINK &amp; DESCRIPTION.</description> <content:encoded><![CDATA[<p>Hello,</p><p>This is very nicely done. It might be worth adding proxy functionality so people dont get blocked by google. I was wondering if you could explain to me a little how I would go about using a foreach loop to put the data into a database table with the colums TITLE, LINK &amp; DESCRIPTION.</p> ]]></content:encoded> </item> </channel> </rss>
