[nycphp-jobs] Spider, parsing, Google API and MySQL Freelance Project
Kevin Lee
kevin at did-it.com
Wed May 11 20:13:22 EDT 2005
OK, so here's what I need coded as a freelance project. My team is too
busy for these little side projects.
Let me know how much it would cost to develop this. UI is not that
important, I'm just looking for an easy way to get to the collected data.
Request for Quote Search Status Spider
Preliminary Project Specification
Project:
The idea of this project is to monitor the status of several domains in the
top search engines. We want to be able to see changes in both the number of
pages from a site that are in an engine (Site command) or the number of
pages the engine sees linking into the domain (Link to linkdomain commands)
1) Allow admin user to enter a URL to be monitored going forward
2) Every day, at a random PM time collect the following information on
each domain (perhaps at different times if we think the spider would get
shut off) and store it in a DB:
a. Google # of results for a site:URL search
i. For
example searching google for the query < site: phpbuilder.com > and
then parsing out the number of results and store that number in the DB for
that date and domain
b. Yahoo # of results for a site:URL search
c. MSN # of results for a site:URL search
d. Google # of results for a link:URL search
e. Yahoo # of results for a linkdomain:URL search
f. MSN # of results for a link:URL search
3) Allow for the selection of one or more URLs and results to be
displayed in a chart/table. (perhaps a list with checkboxes, as I dont
anticipate the list getting to be more than 100 domains)
a. Daily
i.
Last 30 days
ii. Or
select date range up to 100 days)
b. Weekly (average)
c. Monthly (average)
Optional: (provide cost)
4) Allow for the selection of one or more URLs and results to be
displayed in bar graphs.
Notes:
You may want to use the Google API if it provides the number count of results.
<http://www.googleduel.com/apiexample.php>http://www.googleduel.com/apiexample.php
I have a Google API license key or you can get one for yourself free.
Other search engines may require a spider be used to access the SERP
(search engine results page) in order to parse out the # we are looking for.
Kevin Lee
Executive Chairman and Co-Founder, DID-IT.COM, http://www.did-it.com/
Leaders in Search Engine Marketing since 1996
Ranked #1 Search-Specific Agency by Jupiter Research
Phone: 212-631-0157 Mobile: 917-842-0150 Fax 646-837-0725
200 West 60th St. Suite 21-F, NYC 10023
http://www.kevinlee.net/ Bio, articles and Conference speaking engagements
http://www.sempo.org/ Join SEMPO, the Search Engine Marketing Professional
Organization
http://www.clickz.com/experts/author/index.php/60853_all (my ClickZ columns)
More information about the jobs
mailing list