August 2021 - Mohan M A

y = mx + c

y – Dependent variable(DV)

x – Independent variable(IV)

c – constant

AI Context for the day – 6th August 2021

Training set : A set which is used to train a model.

Test set : A set which is a subset of training set and is used to test the trained model.

Additional resources:

Amazon

from sklearn.impute import SimpleImputer

We are importing the simple imputer from scikit library.

imputer = SimpleImputer(missing_values=np.nan, strategy=’mean’)

The strategy used is mean, where the missing data is replaced by average of the values.

imputer.fit(X[:, 1:3])

The number of columns on which the above function is used is 2 i.e between 1 and 3.

X[:, 1:3] = imputer.transform(X[:, 1:3])

The new column is again assigned to the 2 columns.

dataset = [library alias name].read_csv(‘file.csv’)

x = dataset.iloc[:,:-1].values

y = dataset.iloc[:, -1].values

In the above lines, read_csv and iloc are functions.

[:,:-1] is slicing range of the values.

Syntax

import [library] as []