Tag: pandas

How to read a file with a semi colon separator in pandas

I a importing a .csv file in python with pandas. Here is the file format from the .csv : here is how get it : Now when I print the file I get that : And so on… So I need help to read the file and split the values in columns, with the semi color character ;. Answer read_csv

How to get output of pandas .plot(kind=’kde’)?

pandas python

When I plot density distribution of my pandas Series I use Is it possible to get output values of this plot? If yes how to do this? I need the plotted values. Answer There are no output value from .plot(kind=’kde’), it returns a axes object. The raw values can be accessed by _x and _y method of the matplotlib.lines.Line2D object

How do I create test and train samples from one dataframe with pandas?

dataframe pandas python python-2.7

I have a fairly large dataset in the form of a dataframe and I was wondering how I would be able to split the dataframe into two random samples (80% and 20%) for training and testing. Thanks! Answer I would just use numpy’s randn: And just to see this has worked:

pandas apply function that returns multiple values to rows in pandas dataframe

apply dataframe iterable-unpacking pandas python

I have a dataframe with a timeindex and 3 columns containing the coordinates of a 3D vector: I would like to apply a transformation to each row that also returns a vector but if I do: I end up with a Pandas series whose elements are tuples. This is beacause apply will take the result of myfunc without unpacking it.

Check if a value exists in pandas dataframe index

ipython pandas python

I am sure there is an obvious way to do this but cant think of anything slick right now. Basically instead of raising exception I would like to get True or False to see if a value exists in pandas df index. What I have working now is the following Answer This should do the trick

How to do/workaround a conditional join in python Pandas?

conditional-statements dataframe join pandas python

I am trying to calculate time-based aggregations in Pandas based on date values stored in a separate tables. The top of the first table table_a looks like this: Here is the code to create the table: The second table, table_b, looks like this: and the code to create it is: I want to be able to get the sum of

Merging two dataframes in pandas without column names (new to pandas)

pandas python

Short explanation: If you have duplicate column names in your data, be sure to rename one column when you read the file. If you have NaN etc in your data, remove those. Then merge using correct answer below. Probably a pretty simple question. I have two datasets that I read in using pandas.read_csv(). My data is in two separate csv.

How to write DataFrame to postgres table

pandas postgresql python sqlalchemy

There is DataFrame.to_sql method, but it works only for mysql, sqlite and oracle databases. I cant pass to this method postgres connection or sqlalchemy engine. Answer Starting from pandas 0.14 (released end of May 2014), postgresql is supported. The sql module now uses sqlalchemy to support different database flavors. You can pass a sqlalchemy engine for a postgresql database (see