[Python-Dev] Webstats for www.python.org et al. (original) (raw)

Thomas Wouters thomas at python.org
Wed Mar 1 23:58:57 CET 2006


On 3/1/06, Jack Diederich <jack at performancedrivers.com> wrote:

On Wed, Mar 01, 2006 Brett Cannon wrote: > On 2/28/06, Fredrik Lundh <fredrik at pythonware.com> wrote: > > Thomas Wouters wrote: > > > > > I added webstats for all subsites of python.org: > > > > > > http://www.python.org/webstats/ > > > > what's that "Java/1.4.203" user agent doing? (it's responsible for > > 10% of all hits in january/february, and 20% of the hits today...) > > Most likely a crawler. > Youch, if I'm reading it right it consumed fully half of the bandwidth for today on python.org. And what 1.6 million pages did it spider on the site last month? Something smells broken.

Well, here's a hint. The file almost all of them are retrieving is /topics/xml/dtds/xbel-1.0.dtd. They're all being redirected to pyxml.sf.net, though. It's a lot of hits, but www.python.org doesn't serve any actual pages, so the actual load is not that big (at least, not for us :-) It skewes the statistics somewhat, maybe I should ignore the whole /topics/xml tree in the stats.

-- Thomas Wouters <thomas at python.org>

Hi! I'm a .signature virus! copy me into your .signature file to help me spread!



More information about the Python-Dev mailing list