Tag: pandas

How to merge rows in a Dataframe based on a previous row?

I have a sequentially ordered dataframe that represent two events measured over time – the measurements are the start and end times of the event. They should be ordered in an ABABAB sequence, but in some cases I may have consecutive events of the same type (i.e. ABABAABABB). I am looking for a way to check the event label (A

Group by 2 columns with calculation of quantile of 3rd numerical column

pandas python python-3.x

I have a df below as: I have code below that calculates the % of each mealtype for each day How can I tweak this code to give me the quantiles – p50 and p90 of the oz column, but grouped by mealtype day and mealtype as well? Thanks! Answer You can try this or output

How to use pd.data_range() with a frequency of X minutes/hours/seconds?

date datetime numpy pandas python

I need to create a frequent date range with pandas date_range(). This works well with frequency=… parameter. But sometimes my code needs these frequent ranges in longer frequencys. for example 4 Hours or 5 minutes instead of one. How can I do that with pd.date_range(first_X_datetime, last_X_datetime, freq=frequency)? If there is not a more efficient way, my idea would be to

Setting column to true/false based on comparison of two other columns in pandas?

pandas python

I have the following dataframe and I want to compare column value and predicted, if they match then I want to set the value of a column “provided” to False. I’m having difficulty doing this. Here’s my data: I want a new column to just have a True/False if value and predicted match. I tried this but to no avail:

How to reindex a dataframe post splitting a row w.r.t a column?

pandas python

I have the dataframe with two columns namely Content which contains the text, and one more column named Coords which is a list of tuples. Each tuple containing the meta info of each word of the text. I want to split the row such as each row can have a word, and its corresponding tuple, and the line number to

How to convert json dataframe to normal dataframe?

json pandas python

I have a dataframe which has lots of json datas inside. for example : There are two types of data.Strain sensor and acceleration sensor. I want to parse these json datas and convert to normal form. I just need data part of json objects.At result I should have 4 columns for every values in Data. I tried json_normalize but I

Calculate rolling average for all columns pandas

dataframe pandas python rolling-average

I have the below dataframe: I want to replace the NaN values with the 3 month rolling average. How should I got about this? Answer If you take NaNs as 0 into your means, can do: This will give you:

How to change column value with pandas .apply() method

pandas python python-3.x

I stumbled upon an issue while trying to take data from a CSV file, assemble a key and then creating a new CSV file with only the necessary data. Example data: ID1 Data1 Data2 Price1 Color Key ID2 Data3 Price2 12345/6 950/000 Pd950 996 G 4/20017/6 4/20017/6 950/000 1108 12345/6 333/000 Pd333 402 G 4/20017/6 4/20017/6 333/000 501 12345/6 500/000

Add column to existing panas dataframe with values as ‘Top’ and ‘Bottom’

dataframe pandas python

I want to create a column in the existing dataframe with values as ‘Top’ and Bottom’, catch is, size of the dataframe changes according to calculations. For example: I will always have even number of rows. Please suggest a solution, thanks! Answer I don’t know exactly how your data is, but you can try something like this: Hypothetical data: Creating

Pandas compare and sum values between two DataFrame with different size

dataframe pandas python

Suppose I have two Dataframes with different sizes: to which I have: and: Now I want to add a third column to df1 say total_volume, where it is the summation of the volume that lie between individual row of xlow and xup of df1. I can do this using: we can check the value of say the second row as: