Split a data frame in Python based on the shape of one parameter

Question

I have a data frame like the following:

In this data frame there are many repeated rows; for example, the first row is repeated more than 1000 times, and likewise for the other rows. When I plot the time distribution, I get a figure that shows the frequency of the time parameter.

My question is

Accepted Answer

Note: the code below uses target as the column name. Either write time (your column's name) in place of target, or rename your column to target.

    import pandas as pd

    def calRows(df, x, y):
        # Only the target values up to the cutoff x are considered
        df1 = pd.DataFrame(df.target[df.target <= x])
        minCount = len(df1)
        targets = df1.target.unique()
        for i in targets:
            # Number of rows that have this target value
            count = len(df1[df1.target == i])
            if minCount > count:
                minCount = count
        # Never take more than y rows per target value
        if minCount > y:
            minCount = int(y)
        return minCount

Pass your data frame, the x-intercept of the graph (the largest target value to keep), and the y-intercept of the graph (the most rows to keep per target value) to calRows(df, x, y). It returns the number of rows to take for each target value.

    rows = calRows(df, 6, 75)
    print(rows)

takeFeatures(df, rows, x) takes the data frame, rows (the result of the first function), and the x-intercept of the graph, and returns the final data frame.

    def takeFeatures(df, rows, x):
        df1 = df[df.target <= x]
        targets = df1.target.unique()
        samples = []
        for i in targets:
            targeti = df1[df1.target == i]
            # Randomly pick the same number of rows for every target value
            samples.append(targeti.sample(rows))
        # Concatenate once at the end; fall back to an empty frame if nothing matched
        finalDf = pd.concat(samples) if samples else pd.DataFrame(columns=df.columns)
        return finalDf

Calling the takeFeatures() function:

    final = takeFeatures(df, rows, 6)
    print(final)

Your final data frame will have the values that you expected in the graph, and plotting it will give you the graph you expected.
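For comparison, the same idea (keep only target values up to x, then take an equal number of rows per value, capped at y) can probably be written more compactly with a groupby. This is only a sketch, not the answer's own code: the function name balance_by_target, its column and seed parameters, and the demo data are made up for illustration, and it assumes pandas 1.1 or newer, where DataFrameGroupBy.sample is available.

```python
import pandas as pd

def balance_by_target(df, x, y, column="target", seed=0):
    """Keep rows with df[column] <= x and sample the same number of rows per value.

    Hypothetical helper: the name and the column/seed parameters are illustrative.
    """
    kept = df[df[column] <= x]
    if kept.empty:
        return kept.copy()
    # Smallest group size among the kept values, capped at y
    rows = int(min(kept[column].value_counts().min(), y))
    # Draw `rows` random rows from every group of identical values
    return kept.groupby(column).sample(n=rows, random_state=seed)

# Made-up demo data: value 9 lies beyond the cutoff, value 2 is the rarest kept value
demo = pd.DataFrame({
    "target": [1] * 5 + [2] * 3 + [3] * 4 + [9] * 2,
    "value": range(14),
})

balanced = balance_by_target(demo, x=6, y=75)
print(balanced["target"].value_counts().sort_index())
```

With the demo data, every kept target value ends up with three rows, because 2 occurs only three times and the cap of 75 is never reached. Passing a fixed seed makes the sample reproducible; pass a different seed (or None) for a different draw.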

Answer