[Python-Dev] Iterating over marshal/pickle (original) (raw)

Tim Lesher tlesher at gmail.com
Mon Oct 9 16:52:44 CEST 2006


Both marshal and pickle allow multiple objects to be serialized to the same file-like object.

The pattern for deserializing an unknown number of serialized objects looks like this:

objs = [] while True: try: objs.append(marshal.load(fobj)) # or objs.append(unpickler.load()) except EOFError: break

This seems like a good use case for an generator:

def some_name(fobj): while True: try: yield marshal.load(fobj) # or yield unpickler.load() except EOFError: raise StopIteration

  1. Does this seem like a reasonable addition to the standard library?
  2. Where should it go, and what should it be called?

From an end-user point of view, this "feels" right:

import pickle u = pickle.Unpickler(open('picklefile')) for x in u: print x

import marshal for x in marshal.unmarshalled(open('marshalfile')): print x

But I'm not hung up on the actual names or the use of sequence semantics in the Unpickler case.

Incidentally, I know that pickle is preferred over marshal, but some third-party tools (like the Perforce client) still use the marshal library for serialization, so I've included it in the discussion

Tim Lesher <tlesher at gmail.com>



More information about the Python-Dev mailing list