The following code should do what I want, but it consumes 10 GB of RAM by the time the loop is only 20% complete.
# In [4]: type(pd)
# Out[4]: pandas.sparse.frame.SparseDataFrame
import numpy as np
import pandas

# Note: 'pd' here is my SparseDataFrame, not the pandas module.
memid = np.unique(pd.Member)

# Build one sub-frame per member, then assemble them into a Panel.
pan = {}
for mem in memid:
    pan[mem] = pd[pd.Member == mem]
goal = pandas.Panel(pan)
I created a GitHub issue here: https://github.com/wesm/pandas/issues/663
I'm pretty sure I identified a circular reference between NumPy ndarray views causing a memory leak. Just committed a fix:
https://github.com/wesm/pandas/commit/4c3916310a86c3e4dab6d30858a984a6f4a64103
Can you install from source and let me know if that fixes your problem?
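For what it's worth, something like the following can be used to watch peak memory while re-running the loop, to see whether the growth goes away. This is only a rough sketch: the resource module is Unix-only, ru_maxrss units vary by platform (kilobytes on Linux, bytes on macOS), and the print interval is arbitrary.

import resource

def peak_rss_mb():
    # ru_maxrss is in kilobytes on Linux (bytes on macOS), so this is approximate
    return resource.getrusage(resource.RUSAGE_SELF).ru_maxrss / 1024.0

pan = {}
for i, mem in enumerate(memid):
    pan[mem] = pd[pd.Member == mem]
    if i % 100 == 0:
        print("%d/%d  peak RSS %.1f MB" % (i, len(memid), peak_rss_mb()))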
BTW, you might try using SparsePanel instead of Panel, because Panel will convert all of the sub-DataFrames to dense form.
Lastly, you might consider using groupby as an alternative to the O(N * M) chopping-up of the SparseDataFrame. It's even shorter:

pan = dict(pd.groupby('Member'))
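Putting the two suggestions together, the construction collapses to a couple of lines. A sketch, assuming the dict of per-member sub-frames is accepted by the SparsePanel constructor the same way Panel accepts it:

import pandas

# groupby yields (member, sub-frame) pairs, so dict() rebuilds the same mapping
# the explicit loop produced, without scanning the frame once per member
pan = dict(pd.groupby('Member'))

# keep the sub-frames sparse instead of densifying them in a regular Panel
goal = pandas.SparsePanel(pan)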