Posted on Jan 26, 2012 / Est. budget $ 50 / Project closed
*Note* - I'm new here but not new to outsourcing. You can see my same job posted here: https://www.odesk.com/jobs/Software-Developer-Needed-Website-Parser-Tool_~~70d96a0d6724a897 to see my profile and feedback.
I just selected random programming languages because it required me to but I'm really not that picky as long as it works on Windows OS.
I'm looking to get a simple tool made that will allow me to load a list of website URL's into the software, the tool with then parse through the URL's and find text on the page that is common with all of the sites in the list.
For example if I load 10 website URL's into the software and all the websites have, "powered by wordpress" in the footer, the software would pull that text as a common attribute all the sites have. If they all have the word, "please login" and "powered by joomla" it would show those as being common among the loaded URL's.
This tool will help me put together the footprints I need to scrape certain types of websites from Google.
It would be nice if the software could show percentages of how often a word or set of words show up in the list of URL's.
If I load a list of 100 URLS, and 75% of the URL's happen to have "powered by wordpress" it could show that. I'm not sure what the best way to display the information would be but we can talk about that further if you're interested in this small project.
I don't think any software like this has been done but I think it will be very useful for anyone involved in automated link building and SEO.