Tag: pandas-groupby

Order categories in a grouped bar in matplotlib

categories matplotlib pandas-groupby python

I am trying to plot a groupby-pandas-dataframe in which I have a categorical variable by which I would like to order the bars. A sample code of what I am doing: Which plots: However, I would like to plot within each category the order to be Jan, Feb, March. Any help on how to achieve this would be a appreciated.

conditional groupby and update column – python, pandas, groupby

pandas pandas-groupby python

i have a df which I want to add a column that shows the student who is place (1) from the group(‘subject’, ‘class’) and update the column after there is a new place (1). code: ╔═════════╦═════════╦═════════╦═══════╗ ║ subject ║ class ║ student ║ place ║ ╠═════════╬═════════╬═════════╬═══════╣ ║ eng ║ Class_4 ║ henry ║ 7 ║ ║ math ║ Class_3 ║

Pandas DataFrame Groupby two columns and get different relation in same keys insert list

dataframe dictionary pandas pandas-groupby python

I have this table : I have to create a dictionary with Head key and values equal to the relations but not repeated and for each value of the relations I have to insert the corresponding tail. example: I don’t really know how to do it. Is there someone who can help me? Second Example Input: the output: I would

How can I find rows in Pandas DataFrame where the sum of 2 rows is greater than some value?

pandas pandas-groupby python

In a dataset like the one below, I’m trying to group the rows by attr_1 and attr_2, and if the sum of the count column exceeds a threshold (in this case 100), I want to keep the original rows. account attr_1 attr_2 count ABC X1 Y1 25 DEF X1 Y1 100 ABC X2 Y2 150 DEF X2 Y2 0 ABC

Python: Calculate week start and week end from daily data in pandas dataframe?

dataframe pandas pandas-groupby python python-3.x

I have a daily dataset for different months. I want to calculate the week start(sunday) and week end(saturday) based on each product type & country and values should be the average for that particular week. SAMPLE result format: I tried with groupby but I’m not able to get week start and end for each product and country. Also Values should

Panda dataframe of distribution of particles: group by ID and find the half flux and the half flux radius

dataframe dataset pandas pandas-groupby python

I am using Panda dataframe; I have a distribution of particles, their distance from the center of the distribution, and the associated fluxes. I want to find the total flux enclosed in the “half flux radius” (or “half light radius”), which is the radius that encloses half of the flux, by definition. I make you an example and then I

Get Top N items per month in pandas

pandas pandas-groupby python

I have the following dataframe (which is a pretty reduced sample from my original one). I’m trying to get the top 2 ids for each year AND month. So, for example, the idea was to obtain the below df. My main problem here, is to get the Top n along with the dates, because the nlargest method applies to a

Plotting row based data

pandas pandas-groupby python

I have some data that looks something like this: What I would like to do is to plot the values of the header dates, but grouped by id and period. So essentially this would become 6 line plots, with the x-axis given as the dates. However, maybe I’m just tired, but this data set is weirdly put together imo, so

how to add new row into each group of groupby in PANDAS , one of the value of that row is sum of values of each groups

pandas pandas-groupby python

let’s say I have a data frame like this I wanted to add a new row into each group of groupby(by=[‘eff_date’,’mdl_cd’,’ast_cd’]) in which column value for eff_date,mdl_cd and ast_cd will remain same but for prop_cd value become Hlds and value value column become sum of value of that group e.g. for first group value of value column will be (-0.1234+0.5123+-0.7612)