How to calculate the expectation value for a given probability distribution

Question

I am writing a program to determine the expectation value, expectation of the X^2 and E(X - X_avg)^2. I have written a program like so: The dataset that I am using is: Expected: E(X) = 16 E(X^2) = 276 E(X- X_avg)^2 =20 Actual: Answer Your problem is the step 1, so I took the liberty of rewriting it: df: The

Accepted Answer

Your problem is the step 1, so I took the liberty of rewriting it:# Step 1.1: read csv in the right wayprobabilityCSV = open('probability.csv')df = pd.read_csv(probabilityCSV)df["P"] = df.P.str.split("/", expand=True)[0].astype(int) / df.P.str.split("/", expand=True)[1].astype(int)df:    X   P0   8   0.1250001   12  0.1666672   16  0.3750003   20  0.2500004   24  0.083333The second step is right:# Step 2: convert dataframe to ndarryX = df['X'].to_numpy()p = df['P'].to_numpy()X, p:(array([ 8, 12, 16, 20, 24]), array([0.125     , 0.16666667, 0.375     , 0.25      , 0.08333333]))After this you correctly defined the function:def expected_value(values, weights):    return np.sum((np.dot(values,weights))) / np.sum(weights)You can use this function to compute E(X), E(X^2) and E(X - X_avg)^2. In particular:expected_value(X,p)# returns E(X) = 16.0expected_value(X**2, p)# returns E(X^2) = 276.0expected_value((X-X.mean())**2, p)# returns E(X - X_avg)^2 = 20.0The error has occurred because your df["P"] column is a string column.

Advertisement

Answer