1.12 Unzipping Simple List-Like Objects


Credit: gyro funch

1.12.1 Problem

You have a sequence and need to pull it apart into a number of pieces.

1.12.2 Solution

There's no built-in unzip counterpart to zip, but it's not hard to code our own:

def unzip(p, n):     """ Split a sequence p into a list of n tuples, repeatedly taking the     next unused element of p and adding it to the next tuple.  Each of the     resulting tuples is of the same length; if p%n != 0, the shorter tuples     are padded with None (closer to the behavior of map than to that of zip).         Example:         >>> unzip(['a','b','c','d','e'], 3)         [('a', 'd'), ('b', 'e'), ('c', None)]     """     # First, find the length for the longest sublist     mlen, lft = divmod(len(p), n)     if lft != 0: mlen += 1     # Then, initialize a list of lists with suitable lengths     lst = [[None]*mlen for i in range(n)]     # Loop over all items of the input sequence (index-wise), and     # Copy a reference to each into the appropriate place     for i in range(len(p)):         j, k = divmod(i, n)    # Find sublist-index and index-within-sublist         lst[k][j] = p[i]       # Copy a reference appropriately     # Finally, turn each sublist into a tuple, since the unzip function     # is specified to return a list of tuples, not a list of lists     return map(tuple, lst)

1.12.3 Discussion

The function in this recipe takes a list and pulls it apart into a user-defined number of pieces. It acts like a sort of reverse zip function (although it deals with only the very simplest cases). This recipe was useful to me recently when I had to take a Python list and break it down into a number of different pieces, putting each consecutive item of the list into a separate sublist.

Preallocating the result as a list of lists of None is generally more efficient than building up each sublist by repeated calls to append. Also, in this case, it already ensures the padding with None that we would need anyway (unless length(p) just happens to be a multiple of n).

The algorithm that unzip uses is quite simple: a reference to each item of the input sequence is placed into the appropriate item of the appropriate sublist. The built-in function divmod computes the quotient and remainder of a division, which just happen to be the indexes we need for the appropriate sublist and item in it.

Although we specified that unzip must return a list of tuples, we actually build a list of sublists, and we turn each sublist into a tuple as late in the process as possible by applying the built-in function tuple over each sublist with a single call to map. It is much simpler to build sublists first. Lists are mutable, so we can bind specific items separately; tuples are immutable, so we would have a harder time working with them in our unzip function's main loop.

1.12.4 See Also

Documentation for the zip and divmod built-ins in the Library Reference.



Python Cookbook
Python Cookbook
ISBN: 0596007973
EAN: 2147483647
Year: 2005
Pages: 346

flylib.com © 2008-2017.
If you may any questions please contact us: flylib@qtcs.net