Comments

Model Evaluation and Refinement

 


In sample evaluation tells us how our model performance using the training dataset, but how about using an unseen dataset(testing dataset).

we split our data into two part: in-sample dataset (training), out-sample dataset (testing dataset)

Generalization Performance: 

it generalize or measure how our model predict using unseen dataset 


1
2
3
...
# split into train test sets
X_train, X_test, y_train, y_test = train_test_split(X, y, ...)

Overfitting, Underfitting and Model Selection

underfitting: when the model is too simple to fit the data
overfitting: when the model fit the noise rather that the function

Ridge regression is a regression that is employed in a Multiple regression model when Multicollinearity occurs. Multicollinearity is when there is a strong relationship among the independent variables. Ridge regression is very common with polynomial regression

Ridge Regression:

it prevent overfitting(when you have multi independent variable or features) , 
Share on Google Plus

About Inas AL-Kamachy

    Blogger Comment
    Facebook Comment

0 Comments:

Post a Comment