[Python-Dev] stats.py (was 'summing a bunch of numbers ') (original) (raw)

Chad Netzer cnetzer@mail.arc.nasa.gov
21 Apr 2003 15:07:28 -0700


On Mon, 2003-04-21 at 05:30, Guido van Rossum wrote:

Since it already exists as a 3rd party package, we should definitely not try to duplicate the effort. Then the question is, is it enough to point to the 3rd party package or does it deserve to be incorporated into the core? We can't go and incorporate every useful 3rd party package into the core

True. I just happen to be of the opinion that a statistics package is the single most practical and useful mathematics pakage that can be added to a language, after basic linear algebra (which isn't in the core... Hmmm)

OTOH, having it in the core, with decent documentation, might prevent naive wannabe-statisticians like myself from misremembering how standard deviation is implemented, or when to use it. :-)

My concern, like yours, is that this kind of thing is probably reimplemented a LOT (at least the simple stats functions). If we adopoted it, I would actually favor keeping it fairly lightweight (although t-tests and even ANOVA should go in). Heavyweight users could always download a separate add on package.

Of course, the stats.py package (and it's SciPy cousin) DOES seem to be well maintained, so perhaps the issue is just making sure those that might need it can easily download and install it (ie. promote it to distributions, give it proper promotion on Vaults of Parnassus, mirror it, etc.).

My personal preference would be make it standard (as I propably would like NumPy to become). I like to use it in my unittests. :) That may not be the consensus, though.

--

Chad Netzer (any opinion expressed is my own and not NASA's or my employer's)