I have a Python DataFrame with a datetime column that has inconsistent format, and would like it to be all one format. The DataFarme contains 199622 rows, so this is not an exhaustive sample:
Example of DataFrame Column as an object type:
JavaScript
x
11
11
1
Date
2
0 6/14/22
3
1 6/1/22
4
2 5/23/22
5
3 11/17/21
6
4 11/9/21
7
5 May 4, 2021 6:02 PM
8
6 April 29, 2021 10:06 AM
9
7 March 31, 2021 2:04 PM
10
11
The desired transformed output would be a DateFrame with a date type column formatted:
JavaScript
1
10
10
1
Date
2
0 2022-06-14
3
1 2022-06-01
4
2 2022-05-23
5
3 2021-11-17
6
4 2021-11-09
7
5 2021-05-04
8
6 2021-04-29
9
7 2021-03-31
10
Is it possible to create a function that does this transformation for both sets of formats in a single column?
Advertisement
Answer
This should work.
JavaScript
1
3
1
df['Date'] = pd.DatetimeIndex(df['Date'])
2
df['Date'] = df['Date'].dt.strftime('%Y-%m-%d')
3