[mythtv-users] visit from googlebot

chris at cpr.homelinux.net chris at cpr.homelinux.net
Tue Oct 3 18:38:59 UTC 2006


On Tue, Oct 03, 2006 at 09:05:29AM -0400, Peter Watkins wrote:
> On Mon, Oct 02, 2006 at 01:36:43PM -0400, Michael T. Dean wrote:
> > So, the best thing to do is keep the Google bot off your website.
> No doubt.

Don't the spiders just follow links?  Someone posted urls Google 
had found for MythWeb installations running on non-standard ports, 
but how do you suppose Google found them?  It's probably because 
people put MythWeb on a non-standard port and then put a convenient 
link to the non-standard port from their server's normal pages.

The best solution is to use .htaccess to require authentication 
and/or use deny/allow rules to limit MythWeb usage to known IP 
addresses, but if you can't do that then the least you could do is 
NOT put a link to your MythWeb installation out there where the 
bots can find it....

FWIW, if you're going to use .htaccess and require passwords, a 
good follow-up would be to install fail2ban, which is a daemon 
which watches the logs for authorization failures and firewalls 
offending IP addresses.  That way you keep out the bots and also 
block malicious users.



More information about the mythtv-users mailing list