
Tag: cross-validation

Gaussian Process Regression: tune hyperparameters based on validation set

In the standard scikit-learn implementation of Gaussian Process Regression (GPR), the hyperparameters (of the kernel) are chosen based on the training set. Is there an easy-to-use implementation of GPR (in Python) where the hyperparameters (of the kernel) are chosen based on a separate validation set? Or cross-validation would also be a nice alternative for finding suitable hyperparameters (that are …
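A minimal sketch of one way to do this with scikit-learn itself, assuming an RBF kernel and an illustrative grid of length scales: setting optimizer=None freezes the kernel hyperparameters inside each fit, so an outer GridSearchCV selects them by cross-validation instead of the marginal-likelihood fit.

```python
# Sketch: choose the RBF length scale by cross-validation instead of the
# default marginal-likelihood optimisation. Data and grid values are illustrative.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF
from sklearn.model_selection import GridSearchCV

rng = np.random.RandomState(0)
X = rng.rand(100, 1)
y = np.sin(6 * X).ravel() + 0.1 * rng.randn(100)

# optimizer=None keeps the kernel hyperparameters fixed inside each fit,
# so the outer search is what actually chooses them.
gpr = GaussianProcessRegressor(optimizer=None, alpha=1e-2)
param_grid = {"kernel": [RBF(length_scale=l) for l in (0.05, 0.1, 0.5, 1.0)]}
search = GridSearchCV(gpr, param_grid, cv=5, scoring="neg_mean_squared_error")
search.fit(X, y)
print(search.best_params_, search.best_score_)
```

For a single fixed validation set rather than K folds, the same pattern works by passing a PredefinedSplit (or an explicit list of train/validation index pairs) as the cv argument.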

In Leave One Out Cross Validation, How can I Use `shap.Explainer()` Function to Explain a Machine Learning Model?

Background of the problem: I want to explain the outcome of machine learning (ML) models using SHapley Additive exPlanations (SHAP), which is implemented in the shap library of Python. As a parameter of the function shap.Explainer(), I need to pass an ML model (e.g. XGBRegressor()). However, in each iteration of the Leave One Out Cross Validation (LOOCV), the ML model …
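One possible pattern, sketched below rather than taken from the question, is to fit a fresh model in each LOOCV fold, build a shap.Explainer from that fold's fitted model, and explain only the held-out row. The dataset, subsampling, and XGBRegressor settings are illustrative assumptions.

```python
# Sketch: one model and one shap.Explainer per LOOCV fold, explaining only
# the held-out sample. Dataset and model settings are illustrative.
import numpy as np
import shap
from sklearn.datasets import load_diabetes
from sklearn.model_selection import LeaveOneOut
from xgboost import XGBRegressor

X, y = load_diabetes(return_X_y=True)
X, y = X[:50], y[:50]                      # keep the LOOCV loop small

rows = []
for train_idx, test_idx in LeaveOneOut().split(X):
    model = XGBRegressor(n_estimators=100).fit(X[train_idx], y[train_idx])
    explainer = shap.Explainer(model)      # new explainer for this fold's model
    explanation = explainer(X[test_idx])   # SHAP values for the held-out row
    rows.append(explanation.values[0])

shap_values = np.vstack(rows)              # shape: (n_samples, n_features)
```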

Cross Validation with coco data format json files

I am a newbie ML learner trying semantic image segmentation on Google Colab with COCO data format JSON and lots of images on Google Drive. Update: I borrowed this code as a starting point, so my code on Colab is pretty much like this: https://github.com/akTwelve/tutorials/blob/master/mask_rcnn/MaskRCNN_TrainAndInference.ipynb. I am splitting an exported JSON file into 2 JSONs (train/validate with 80/20 …
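A rough sketch of that 80/20 split by image, keeping each image's annotations with it; the input and output file names are placeholders, not the asker's actual paths.

```python
# Sketch: split a COCO-format annotation file 80/20 by image, keeping each
# image's annotations with it. File names are placeholders.
import json
import random

with open("annotations.json") as f:
    coco = json.load(f)

images = list(coco["images"])
random.Random(0).shuffle(images)           # fixed seed for a reproducible split
cut = int(0.8 * len(images))

for out_name, split_images in (("train.json", images[:cut]),
                               ("val.json", images[cut:])):
    ids = {img["id"] for img in split_images}
    subset = dict(coco)                    # copy info/licenses/categories as-is
    subset["images"] = split_images
    subset["annotations"] = [a for a in coco["annotations"]
                             if a["image_id"] in ids]
    with open(out_name, "w") as f:
        json.dump(subset, f)
```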

GridSearchCV.best_score not same as cross_val_score(GridSearchCV.best_estimator_)

Consider the following grid search: grid = GridSearchCV(clf, parameters, n_jobs=-1, iid=True, cv=5) grid_fit = grid.fit(X_train1, y_train1) According to Sklearn’s documentation, grid_fit.best_score_ returns “The mean cross-validated score of the best_estimator_”. To me that would mean that the average of cross_val_score(grid_fit.best_estimator_, X_train1, y_train1, cv=5) should be exactly the same as grid_fit.best_score_. However, I am getting a 10% difference …
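A sketch of an apples-to-apples comparison: grid_fit.best_score_ averages the fold scores computed during the search, so a follow-up cross_val_score call only matches it if both use exactly the same splits and the best parameters are re-fit per fold. The dataset and parameter grid below are illustrative assumptions.

```python
# Sketch: pin the CV splits so the grid search and the follow-up check score
# the best parameters on exactly the same folds.
from sklearn.base import clone
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import GridSearchCV, KFold, cross_val_score
from sklearn.tree import DecisionTreeClassifier

X_train1, y_train1 = load_breast_cancer(return_X_y=True)
cv = KFold(n_splits=5, shuffle=True, random_state=0)   # same splitter everywhere

grid = GridSearchCV(DecisionTreeClassifier(random_state=0),
                    {"max_depth": [3, 5, None]}, cv=cv, n_jobs=-1)
grid_fit = grid.fit(X_train1, y_train1)

# clone() so each fold re-fits the best parameters instead of reusing the
# estimator that was already refit on all of X_train1.
scores = cross_val_score(clone(grid_fit.best_estimator_),
                         X_train1, y_train1, cv=cv)
print(grid_fit.best_score_, scores.mean())             # should now agree
```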

Cross validation with grid search returns worse results than default

I’m using scikit-learn in Python to run some basic machine learning models. Using the built-in GridSearchCV() function, I determined the “best” parameters for different techniques, yet many of these perform worse than the defaults. I include the default parameters as an option, so I’m surprised this would happen. For example: This is the same as the defaults, except max_depth …
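A sketch of one way to check this: put the defaults explicitly in the grid, then score both the tuned model and an untouched default model on the same held-out test set, so any remaining gap reflects split-to-split variance rather than the search itself. The estimator and grid values here are illustrative assumptions.

```python
# Sketch: include the defaults in the grid and compare tuned vs. default on
# the same held-out test set. Estimator and grid values are illustrative.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

params = {"max_depth": [None, 5, 10],      # None and 100 are the defaults
          "n_estimators": [100, 300]}
grid = GridSearchCV(RandomForestClassifier(random_state=0), params, cv=5)
grid.fit(X_tr, y_tr)

default = RandomForestClassifier(random_state=0).fit(X_tr, y_tr)
print("tuned:", grid.score(X_te, y_te),
      "default:", default.score(X_te, y_te),
      "best params:", grid.best_params_)
```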
