How to efficiently melt multiple columns using the module melt in Pandas?

2k Views Asked by At

The objective is to unpivot the following table

  Activity General  m1   t1  m2   t2  m3   t3
0       P1      AA  A1  TA1  A2  TA2  A3  TA3
1       P2      BB  B1  TB1  B2  TB2  B3  TB3

into the following format

    Activity   General M Task
0   P1 AA A1  TA1 
1   P1 AA A2  TA2 
2   P1 AA A3  TA3 
3   P2 BB B1  TB1 
4   P2 BB B2  TB2
5   P2 BB B3  TB3

Based on some reading, the module melt can be used to achieved the desired objective.

import pandas as pd
from pandas import DataFrame
list_me = [['P1','AA','A1','TA1','A2','TA2','A3','TA3'],
           ['P2', 'BB', 'B1', 'TB1', 'B2', 'TB2', 'B3', 'TB3']]

df = DataFrame (list_me)
df.columns = ['Activity','General','m1','t1','m2','t2','m3','t3']   
melted_form=pd.melt(df, id_vars=['Activity','General'],var_name='m1',value_name='new_col')

However, most of the example found on the net was solely to tackle single column. I am thinking of using for loop to loop the m1 m2 and m3 and merge the result concurrently. This is because, in actually, the pair of m_i and t_i is at the range of hundreds (where i is the index)

But, I wonder there are more efficient approach than looping.

p.s. I had tried suggestion as in the OP, but, it does not give the intended output

1

There are 1 best solutions below

4
On BEST ANSWER

If I understand your question, you could use pd.wide_to_long :

    (pd.wide_to_long(df, 
                    i=["Activity", "General"], 
                    stubnames=["t", "m"], j="number")
    .set_axis(["Task", "M"], axis="columns")
    .droplevel(-1).reset_index()
     )

    Activity    General Task    M
0      P1       AA      TA1     A1
1      P1       AA      TA2     A2
2      P1       AA      TA3     A3
3      P2       BB      TB1     B1
4      P2       BB      TB2     B2
5      P2       BB      TB3     B3