Tag: pandas

Pandas: Cannot address column from previously merged multi level data frame

After a data frame aggregation with group by, I’m trying to “flatten” the headers into one level to properly export the data as CSV: The output looks like this: If I call the data frame directly, I get different information: Output: It seems like pandas merged the column names, but I still have …
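The excerpt's own code and output are not shown, so the following is only a minimal sketch of one common way to flatten two-level column headers after a group-by aggregation. The frame, the column names `group` and `value`, and the `_` separator are all assumptions made for illustration.

```python
import pandas as pd

# Hypothetical data; these column names are placeholders, not the asker's.
df = pd.DataFrame({
    "group": ["a", "a", "b"],
    "value": [1, 2, 3],
})

# Aggregating with a list of functions produces two-level column headers,
# e.g. ("value", "sum") and ("value", "mean").
agg = df.groupby("group").agg({"value": ["sum", "mean"]})

# Join each (level 0, level 1) tuple into a single string header.
agg.columns = ["_".join(col) for col in agg.columns]

# Move the group key out of the index so it becomes a regular CSV column.
flat = agg.reset_index()
```

After this, `flat.to_csv(...)` writes a single header row, and each column can be addressed by its joined name.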

How can I make a: for i in range work with a float [closed]

pandas python

Closed. This question is not reproducible or was caused by typos. It is not currently accepting answers. This question was caused by a typo or a problem that can no longer be reproduced. While similar questions may be on-topic here, this one was resolved in a way less likely to help future readers. Closed 1 y…

Aggregate data with two conditions

aggregate pandas python

I have a data frame that looks something like this: What I would like to do is aggregate the data if the dates are the same – but only if the name is different. So the above data frame should actually become: Currently I am almost doing it with: However, this will also aggregate the ones where the name …

How to iterate over dataframe such that all rows which have a specific column value in common are saved to their respective files?

csv dataframe pandas python

This question was a little hard for me to phrase, so please help edit it so that it makes more sense (if necessary). Problem Statement: I want all the rows which have a specific column value in common saved to the same file. Example Code: I want to do something like this. Say I have a dataframe…
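The asker's example code is cut off, so here is a hedged sketch of the general idea: group the rows by the shared column value and write each group to its own CSV file. The column names `key` and `val`, the temporary output directory, and the file naming scheme are assumptions.

```python
import os
import tempfile

import pandas as pd

# Hypothetical frame; "key" stands in for the column whose values are shared.
df = pd.DataFrame({"key": ["x", "y", "x"], "val": [1, 2, 3]})

# Assumed output location: a fresh temporary directory.
out_dir = tempfile.mkdtemp()

written = []
# groupby yields (key value, sub-frame) pairs, one per distinct key.
for key, group in df.groupby("key"):
    path = os.path.join(out_dir, f"{key}.csv")
    group.to_csv(path, index=False)
    written.append(path)
```

If the key values can contain characters that are not valid in file names, they would need to be sanitized before building the path.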

Python pandas replace based on partial match with list item

pandas python

I have a large three-column dataframe of this form: And a long list with entries like this: I want to replace the entries in the Shaperef column of the df with a value from the list if the full Shaperef string matches any part of any list item. If there is no match, the entry is not changed. Desired output:

Using groupby and querying that group

dataframe pandas pandas-groupby python

I have a dataframe that I would like to group by one column (dadate) and then query another column (Place) to count only those with the value 1. The above is what I have tried, which fails with the error “‘DataFrameGroupBy’ object has no attribute ‘query’”. Answer: Create the Boolean Series the…
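The answer text is truncated, so this is only one plausible reading of it: build a Boolean Series first, then group that Series by the date column and sum it. The column names `dadate` and `Place` come from the question; the sample values are made up.

```python
import pandas as pd

# Made-up sample rows using the column names from the question.
df = pd.DataFrame({
    "dadate": ["2020-01-01", "2020-01-01", "2020-01-02"],
    "Place": [1, 2, 1],
})

# True where Place equals 1; summing Booleans counts the True values.
is_first = df["Place"].eq(1)

# Group the Boolean Series by the date column and count matches per date.
counts = is_first.groupby(df["dadate"]).sum()
```

This avoids calling `query` on the `DataFrameGroupBy` object, which has no such method.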

Python Rank with non numeric columns

pandas pandas-groupby python

I’m trying to find a way to do nested ranking (row number) in Python that is equivalent to the following in TSQL: I have a table that looks like this: Looking for the Python equivalent to: The output should be: I’ve tried to use rank() and groupby() but I keep running into a problem of No numeric types t…
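The TSQL statement is not shown in the excerpt. Assuming it is a row-number-per-partition query, a rough pandas sketch is to sort and then use `cumcount`, which works on non-numeric columns. The columns `grp` and `name` are hypothetical.

```python
import pandas as pd

# Hypothetical table with only string columns.
df = pd.DataFrame({
    "grp": ["a", "a", "b", "a"],
    "name": ["z", "x", "y", "y"],
})

# Sort by the partition column, then by the ordering column.
df = df.sort_values(["grp", "name"])

# cumcount numbers rows within each group starting at 0, so add 1.
df["row_number"] = df.groupby("grp").cumcount() + 1
```

Unlike `rank()`, `cumcount` does not need to aggregate or compare numeric values, so it does not depend on the column dtypes.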

How to find churned customers on a monthly basis? Python Pandas

churn pandas python

I have a large customer dataset with fields like Customer ID, Service ID, Product, etc. There are two ways we can measure churn: at a Customer-ID level, if the entire customer leaves, and at a Service-ID level, where maybe they cancel 2 out of 5 services. The data looks like this, and as we can see Alligator…

reduce() to merge if there are blank DataFrame

dataframe pandas python reduce

I want to use the reduce() function to merge data. However, sometimes some of the dataframes df1 to df8 might be blank (but at least one dataframe is not blank), and I do not want to detect which ones. For example, this time df1 to df7 are blank and only df8 is non-blank. Next time df1, df2, df5 are non-blank. Ho…
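A minimal sketch of one possible approach, assuming the frames share a key column (here called `id`, a hypothetical name) and that an outer merge is wanted: drop the empty frames with a filter, then reduce over what is left. Only four frames are shown for brevity.

```python
from functools import reduce

import pandas as pd

# Hypothetical inputs: some frames are empty, at least one is not.
df1 = pd.DataFrame()
df2 = pd.DataFrame({"id": [1, 2], "a": [10, 20]})
df3 = pd.DataFrame()
df4 = pd.DataFrame({"id": [1, 2], "b": [30, 40]})

# Keep only the non-empty frames, without naming which ones they are.
frames = [df for df in (df1, df2, df3, df4) if not df.empty]

# Merge the remaining frames pairwise on the assumed key column.
merged = reduce(
    lambda left, right: pd.merge(left, right, on="id", how="outer"),
    frames,
)
```

If only one frame is non-empty, `reduce` returns that frame unchanged, so no special case is needed.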

Create and fill a DataFrame column based on conditions

pandas python

I have a DataFrame and I need to create a new column and fill the values according to how many words in a list of words are found in a text. I’m trying the code below: This code actually creates a new column, but fills all the rows with the last ‘count_found_words’ of the loop. Is there a right …
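The asker's code is not shown. A common cause of every row receiving the last loop value is assigning a scalar to the whole column inside a loop. A hedged sketch that computes the count per row instead is below; the word list, the `text` column name, and the whitespace tokenization are assumptions.

```python
import pandas as pd

# Hypothetical word list and text column.
words = ["apple", "pear"]
df = pd.DataFrame({"text": ["apple and pear", "only apple", "nothing"]})


def count_found_words(text):
    """Count how many entries of `words` occur as whole tokens in `text`."""
    tokens = set(text.lower().split())
    return sum(word in tokens for word in words)


# apply calls the function once per row, so each row gets its own count.
df["count_found_words"] = df["text"].apply(count_found_words)
```

Splitting on whitespace is a simplification; punctuation attached to a word would prevent a match.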