I am trying to apply a function that returns a specific date in a specific format, but I am struggling to apply this function to a new pandas dataframe column. Here's what I have so far: The following error arises: KeyError: datetime.datetime(2021, 2, 1, 0, 0) The expected output would be a pandas dataframe column whose row values are the set_date output. How
Tag: pandas
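The question's set_date body is not shown, so the version below is a hypothetical stand-in that formats a timestamp as a string; the column names are also assumptions. The sketch shows the likely fix for the KeyError: pass the function to Series.apply rather than using its result (a datetime) as a column label.

```python
import pandas as pd

# Hypothetical stand-in for the question's set_date: format a timestamp.
def set_date(value):
    return pd.Timestamp(value).strftime("%Y-%m-%d")

# Assumed toy data; the question's real frame is not shown.
df = pd.DataFrame({"raw": pd.to_datetime(["2021-02-01", "2021-03-15"])})

# Apply element-wise. Something like df[set_date(x)] would raise
# KeyError: datetime.datetime(...) because the result is looked up
# as a column label instead of being assigned.
df["formatted"] = df["raw"].apply(set_date)
```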
Python NumPy: only integer scalar arrays can be converted to a scalar index
Please help me resolve this error; it may be a duplicate, but I could not adapt the existing answers to my code. ERROR dataset Answer I think you might want to select your X columns slightly differently, e.g.
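The question's dataset and column names are not shown, so the frame below is an assumed toy example. A common cause of this TypeError is indexing the underlying NumPy array with column labels; selecting on the DataFrame with a list of labels, as the answer suggests, avoids it.

```python
import pandas as pd

# Assumed toy dataset; real column names are not shown in the question.
dataset = pd.DataFrame({"a": [1, 2, 3], "b": [4, 5, 6], "y": [0, 1, 0]})

# Indexing the raw .values array with label lists raises
# "only integer scalar arrays can be converted to a scalar index".
# Select columns on the DataFrame first, then take .values if needed.
X = dataset[["a", "b"]].values
y = dataset["y"].values
```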
Pandas dataframe (getting one value without the index)
From a pandas dataframe, I want just the value, not the index. Returns: Adding .to_string() Returns: I do not want to loop; how can I just get: Answer Always check the docs for the method you are using:
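The question's data is not shown, so this sketch uses an assumed one-row frame. A boolean filter returns a Series, and printing a Series always shows its index; to get the bare scalar, pull it out positionally with .iloc[0], or with .item() when exactly one element remains.

```python
import pandas as pd

# Assumed toy data standing in for the question's frame.
df = pd.DataFrame({"name": ["alice"]})

# Filtering returns a Series; printing it would show the index too.
s = df.loc[df["name"] == "alice", "name"]

value = s.iloc[0]  # first element by position
same = s.item()    # works when the Series has exactly one element
```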
Pandas Column join list values
Using Python and Pandas, I have a couple of columns that contain lists, and I want to convert each list to a string. Example: I have the following value in a record, in the field ev_connector_types: [u'ACME', u'QUICK_CONNECT'] I'd like the value for that record to show: ACME, QUICK_CONNECT, without the list brackets, the 'u', and the quotes. I've tried. But I'm
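Since each cell holds an actual Python list, str.join applied per row produces the desired comma-separated string. The frame below is a minimal sketch with the field name from the question and assumed values.

```python
import pandas as pd

df = pd.DataFrame({
    "ev_connector_types": [["ACME", "QUICK_CONNECT"], ["CHADEMO"]],
})

# Join each list into one comma-separated string,
# dropping the brackets, u-prefixes, and quotes.
df["ev_connector_types"] = df["ev_connector_types"].apply(", ".join)
```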
Pandas group by unique ID and Distinct date per unique ID
The title may be confusing: I have a dataframe that displays user_id sign-ins during the week. My goal is to display the de-duped ID along with the de-duped dates per employee, in order to get a count of the number of days each user uniquely signed in during the week. So I've been trying to enforce a rule to make sure I'm
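The question's frame is not shown; assuming columns named user_id and sign_in_date, a groupby with nunique counts the distinct sign-in days per user in one step, with no manual de-duplication.

```python
import pandas as pd

# Assumed column names and toy data; the question's frame is not shown.
df = pd.DataFrame({
    "user_id": [1, 1, 1, 2, 2],
    "sign_in_date": ["2021-02-01", "2021-02-01", "2021-02-02",
                     "2021-02-01", "2021-02-01"],
})

# One row per user with the count of distinct sign-in days.
days = (df.groupby("user_id")["sign_in_date"]
          .nunique()
          .reset_index(name="n_days"))
```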
Filter Pandas MultiIndex over all First Levels Columns
Trying to find a way to efficiently filter all entries under both top-level columns based on a filter defined for only one of the top-level columns. Best explained with the example below and the desired output. Example DataFrame Create filter for multiindex dataframe Desired output: Answer You can simplify the solution by reshaping the DataFrame with DataFrame.stack, with
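The question's frame and filter are not shown, so the shape below (two top-level groups sharing subcolumns x and y, filtering on x) is an assumption. The stack-based idea from the answer: move the top column level into the index so the filter is written once, apply it, then unstack back.

```python
import pandas as pd

# Assumed MultiIndex layout: top-level groups A/B, subcolumns x/y.
cols = pd.MultiIndex.from_product([["A", "B"], ["x", "y"]])
df = pd.DataFrame([[1, 5, 1, 7],
                   [2, 6, 3, 8]], columns=cols)

# Stack the top column level into the index, filter on x once for
# every group, then unstack that level back into the columns.
stacked = df.stack(0)
filtered = stacked[stacked["x"] > 1].unstack(level=1)
```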
How to groupby 2 columns but order descending by count()
I have a dataframe and want to group by 2 columns, which works fine. Now the grouped dataframe is sorted by the CustomerID values, but I want to sort it by the count(), so that I have the Sektor, then the CustomerIDs, with the CustomerIDs that occur most often at the top, i.e. descending. Expected output should be:
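A minimal sketch using the column names from the question (Sektor, CustomerID) with assumed toy values: take the group sizes, turn them into a column, then sort_values by the count descending within each Sektor.

```python
import pandas as pd

# Assumed toy data; column names follow the question.
df = pd.DataFrame({
    "Sektor": ["Retail", "Retail", "Retail", "Tech", "Tech"],
    "CustomerID": [7, 7, 3, 9, 9],
})

# Group, count, then sort by count descending within each Sektor.
counts = (df.groupby(["Sektor", "CustomerID"])
            .size()
            .reset_index(name="count")
            .sort_values(["Sektor", "count"], ascending=[True, False]))
```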
Uncommon rows based on a column in pandas
Suppose I have two dataframes: and I want to use the second df as a reference and drop from df1 those rows that exist in df2, so the result would be I tried: but this gives me the following: Answer Use Series.isin with an inverted mask (~) in boolean indexing, which works well if you only need to test one column: If you need to test
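The answer's single-column case can be sketched as follows; the frames and the column name id are assumptions, since the question's data is not shown.

```python
import pandas as pd

# Assumed toy frames; the question's data is not shown.
df1 = pd.DataFrame({"id": [1, 2, 3, 4]})
df2 = pd.DataFrame({"id": [2, 4]})

# Keep only the rows of df1 whose id does NOT appear in df2:
# isin builds the membership mask, ~ inverts it.
result = df1[~df1["id"].isin(df2["id"])]
```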
Optimizing a standard deviation function Pandas Numpy Python
The std pandas function below calculates the standard deviation of every nth value defined by number. So it would take the values of PC_list with indexes [0,1,2,3,4,5] and calculate the standard deviation, then the indexes [1,2,3,4,5], and so on until the end of PC_list. I am trying to optimize the code by trying to make it
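Reading the example as a standard deviation over each suffix ([0..5], then [1..5], ...), the loop can be replaced with a vectorized expanding window over the reversed series; the values in PC_list are assumed, since the question's data is not shown.

```python
import pandas as pd

# Assumed toy values; the question's PC_list is not shown.
PC_list = pd.Series([1.0, 2.0, 4.0, 7.0, 11.0, 16.0])

# std of the suffix starting at each index, vectorized:
# reverse, take the expanding std, then reverse back so that
# position i holds std(PC_list[i:]).
suffix_std = PC_list[::-1].expanding().std()[::-1]
```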
Return a list if the column contains a string
I would like to check if the Names column contains any of the strings in kw. If yes, return the list. Here is the data: I've tried: But it returns: I am expecting an output like: Answer You were very close:
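The question's data and kw list are not shown, so both are assumed below. The usual approach: join the keywords into a regex alternation and pass it to Series.str.contains, then collect the matching names as a list.

```python
import pandas as pd

# Assumed toy data and keyword list; the question's are not shown.
df = pd.DataFrame({"Names": ["apple pie", "banana", "cherry tart"]})
kw = ["pie", "tart"]

# "pie|tart" matches rows containing any keyword.
mask = df["Names"].str.contains("|".join(kw))
matches = df.loc[mask, "Names"].tolist()
```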