Tag: pandas

Change columns that contain string and replace rows

How could I change all columns that have “change_” in the column name? For these columns I want to then conditionally replace rows. Want in general code for a larger dataset Expected Output: Answer Use df.filter to filter out columns and then use df.replace:

Converting a multindex dataframe to a nested dictionary [closed]

data-science dataframe multi-index pandas python

Closed. This question needs debugging details. It is not currently accepting answers. Edit the question to include desired behavior, a specific problem or error, and the shortest code necessary to reproduce the problem. This will help others answer the question. Closed 2 years ago. Improve this question I hav…

How to make clusters of Pandas data frame?

dataframe numpy pandas python

I am trying to make a cluster of the following pandas data frame and trying to give the names. E.g – “Personal Info” is cluster name and it consist of (PERSON,LOCATION,PHONE_NUMBER,EMAIL_ADDRESS,PASSPORT,SSN, DRIVER_LICENSE) and also addition of there Counts. which will be 460. Clusters: for…

Pandas – Adding individual Z-Scores to each row based on an id

dataframe pandas python

So I have a Pandas data-frame with a game_id, player_id, and playtime column. I would like to add a z-score rating for each row to find how much from the norm, in terms of playtime, they are for each given game. How would I go through and add each one of these scores to a new column for the data-frame?

Convert timeseries csv in Python

csv data-science pandas python

I want to convert a CSV file of time-series data with multiple sensors. This is what the data currently looks like: The different sensors are described by numbers and have different numbers of axes. If a new activity is labeled, everything below belongs to this new label. The label is in the same column as th…

Splitting a Column using Pandas

pandas python

I am trying to split the following column using Pandas: (df name is count) to this desired outcome: I have tried removing the word ‘POINT’, and both the brackets. But then I am met with an extra white space at the beginning of the column. I tried using: But it was not removing the white space. I w…

Find string in data frame and store new values in a new column

pandas python

I am creating a script that takes a csv file which columns organisation and columns name are unknown. However I know that only one of the column contains some values in which the str ‘rs’ and ‘del’ appears. I need to create an extra column (called ‘Type’) and store ‘d…

Append two dataframes indexed with DateTime that have overlapping elements, keeping the overlaping elements of the second

append dataframe pandas python

Problem I have two dataframes that I want to merge. They both have a DateTime index and the measured values overlap for the last two hours of the first and the first two hours of second. I want to …

Estimate a linear trend in every row across multiple columns in order to project the next value

pandas python python-3.x scikit-learn

I have five columns of historic data which I’d like to find a linear trend across the columns in every row to project the next value in year 2021/22. The historic data is stored in a data frame as follows: – Index 2016/17 2017/18 2018/19 2019/20 2020/21 0 14.53 13.75 13.03 16.05 15.15 1 14.52 13.7…

Pandas Dataframe keep rows where date is between two dates (seperate columns)

dataframe pandas python python-3.x

I have a dataframe that looks similar to this : And then I have a list of dates. For example: list = [2020-01-03, 2020-02-04] I would like to keep only the rows of the dataframe where the dates are in between the From column and the To column. So, after transformation I would have the following dataframe. Fir…