Create new key based on relationship between two columns

Question

I'm trying to add a key for all related instances between two columns, then create a GroupID. The logic will be:

1. Check all instances of ID2 linked to ID1.
2. Check all instances of ID1 linked to the ID2 values found in (1).
3. Repeat until all relationships are found.

Accepted Answer

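Let us try with networkx:

    import networkx as nx

    # Treat each (ID1, ID2) row as an edge of an undirected graph
    G = nx.from_pandas_edgelist(df, 'ID1', 'ID2')
    # Each connected component is one group of related IDs
    l = list(nx.connected_components(G))
    # Map every node to the index of its component
    L = [dict.fromkeys(y, x) for x, y in enumerate(l)]
    d = {k: v for d in L for k, v in d.items()}
    df['new'] = df['ID1'].map(d)
    df
    Out[302]:
      ID1  ID2  new
    0   A    1    0
    1   A    2    0
    2   B    1    0
    3   B    3    0
    4   C    4    1
    5   C    5    1
    6   D    2    0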
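If networkx is not available, the same grouping can probably be reproduced with a small union-find (disjoint-set) over plain Python tuples. This is a swapped-in technique, not the one the answer uses. It is a minimal sketch: the function name `group_ids` is hypothetical, the sample rows are copied from the output above, and group numbers are assigned in order of first appearance, which may not match the numbering networkx produces on other data.

```python
# Sketch: group rows whose ID1/ID2 values are transitively linked,
# using a union-find (disjoint-set) instead of networkx.
# `group_ids` is an illustrative name, not part of any library.

def group_ids(pairs):
    """Return one group number per (id1, id2) pair, numbered by first appearance."""
    parent = {}

    def find(node):
        # Register unseen nodes as their own root.
        parent.setdefault(node, node)
        # Walk up to the root of this node's set.
        root = node
        while parent[root] != root:
            root = parent[root]
        # Path compression: point every node on the way directly at the root.
        while parent[node] != root:
            parent[node], node = root, parent[node]
        return root

    for id1, id2 in pairs:
        # Tag each value with its column so that an ID1 value can never
        # collide with an equal ID2 value (an assumption about the data).
        a, b = find(("ID1", id1)), find(("ID2", id2))
        if a != b:
            parent[a] = b  # union the two sets

    labels = {}
    result = []
    for id1, _ in pairs:
        root = find(("ID1", id1))
        # The first time a root is seen it gets the next free group number.
        result.append(labels.setdefault(root, len(labels)))
    return result


# Sample rows taken from the answer's output.
rows = [("A", 1), ("A", 2), ("B", 1), ("B", 3), ("C", 4), ("C", 5), ("D", 2)]
groups = group_ids(rows)
print(groups)
```

With a DataFrame, something like `df['new'] = group_ids(list(zip(df['ID1'], df['ID2'])))` should attach the result as a column, assuming the rows are passed in their original order.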
