Monday, July 27, 2009
AP Would Rather Fuss Than Do The Really Simple Thing That Would Totally Solve Its Problem
The AP has made a lot of noise recently about how “news articles should not turn up on search engines and Web (sic) sites without permission”. What any webmaster knows is that a simple robots.txt file in the root directory will eliminate all “article stealing” by search engines.
So, how does the AP handle this? Based on AP.org, all their stories seem to be hosted on “hosted.ap.org”.
Take a look at their robots.txt file.