NaN values in pivot_table index cause loss of data

Question

Here is a simple DataFrame:

Pivot method 1

The data can be pivoted to this:

Downside: data in the 2nd row is lost because df['b'][1] == None.

Pivot method 2

Downside: column b is lost.

How can the two methods be combined so that column b and the 2nd row are kept like so:

More generally: How can I…

Accepted Answer

Use set_index and unstack to perform the pivot:

    df = df.set_index(['a', 'b', 'c']).unstack('c')

This is essentially what pandas does under the hood for pivot. The stack and unstack methods are closely related to pivot, and can generally be used to perform pivot-like operations that don't quite conform to the built-in pivot functions.

The resulting output:

                    d
    c              c1   c2   c3
    a  b
    a1 optional1  1.0  NaN  NaN
    a2 NaN        NaN  2.0  NaN
    a3 optional3  NaN  NaN  3.0
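For anyone who wants to try this end to end, here is a minimal sketch. The original DataFrame is not preserved in the text above, so the frame below is an assumption reconstructed from the output shown in the answer (columns a, b, c, d, with a missing value in b on the second row); the name result is also just illustrative.

```python
import pandas as pd

# Assumed input, reconstructed from the answer's output: 'b' is missing in the 2nd row.
df = pd.DataFrame({
    "a": ["a1", "a2", "a3"],
    "b": ["optional1", None, "optional3"],
    "c": ["c1", "c2", "c3"],
    "d": [1, 2, 3],
})

# Move a, b and c into the index, then pivot level 'c' out into the columns.
# No groupby is involved here, so the row whose 'b' is missing stays in the result.
result = df.set_index(["a", "b", "c"]).unstack("c")

print(result)
```

One thing to keep in mind: unstack does not aggregate, so this approach assumes each (a, b, c) combination appears at most once. With duplicate combinations pandas raises an error rather than silently combining rows.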
