Choose the Right Estimator for Your Machine Learning Task

Getting Started:

Machine learning tasks can be broadly classified into 3 broad categories: supervised, unsupervised, and reinforcement learning, depending on the presence of target class or business constraints.

(Source), Scikit-learn cheat-sheet to choose the right estimator

1. Classification:

The training data should at least have a few hundred of instances to train a robust model. For the classification tasks, the target class should be discrete.

(Source), Scikit-learn cheat-sheet to choose the right estimator for classification tasks

2. Regression:

Scikit-learn package offers the implementation of various regression estimators for a dataset with continuous target class labels.

(Source), Scikit-learn cheat-sheet to choose the right estimator for regression tasks

3. Clustering:

For unsupervised tasks, the scikit-learn package offers various clustering packages.

(Source), Scikit-learn cheat-sheet to choose the right estimator for clustering tasks

4. Dimension Reduction:

Dimension reduction techniques are used to reduce the dimensionality of the data, by projecting the existing data into a new space or dimension. Scikit-learn package offers various dimension reduction techniques discussed below.

(Source), Scikit-learn cheat-sheet to choose the right estimator for dimension reduction tasks

Conclusion:

In this article, we have discussed a cheat sheet or set of rules that can be followed to choose the right estimator for your dataset. The above-discussed rules or machine learning algorithms are restricted to scikit-learn packages.

 

Exit mobile version