Pandas – Duplicate Rows and Slice String

Question

I'm trying to create duplicate rows during a dataframe on conditions. For example, I have this Dataframe. And I would like to get the following output: Answer For pandas 0.25+ is possible use DataFrame.explode with splitted values by Series.str.split and for remark column list comprehension with filtering: And we get the following result:

Accepted Answer

For pandas 0.25+ is possible use DataFrame.explode with splitted values by Series.str.split and for remark column list comprehension with filtering:students = df["student"].str.split(", ")df = df.assign(name=students, remark=students).explode("name").reset_index(drop=True)df["remark"] = [    "with " + ", ".join(x for x in r if x != n) if len(r) > 1 else ""    for n, r in zip(df["name"], df["remark"])]print (df)And we get the following result:   team                   student    name                    remark0     a                    Ursula  Ursula                          1     b             Hayfa, Martin   Hayfa               with Martin2     b             Hayfa, Martin  Martin                with Hayfa3     c                      Kato    Kato                          4     d          Tanek, Ava, Pyto   Tanek            with Ava, Pyto5     d          Tanek, Ava, Pyto     Ava          with Tanek, Pyto6     d          Tanek, Ava, Pyto    Pyto           with Tanek, Ava7     e                      Aiko    Aiko                          8     f                    Hunter  Hunter                          9     g  Josiah, Derek, Uma, Nell  Josiah     with Derek, Uma, Nell10    g  Josiah, Derek, Uma, Nell   Derek    with Josiah, Uma, Nell11    g  Josiah, Derek, Uma, Nell     Uma  with Josiah, Derek, Nell12    g  Josiah, Derek, Uma, Nell    Nell   with Josiah, Derek, Uma

Advertisement

Answer