Think of this as the one-stop shop, dictionary, and directory for your machine learning algorithms. In this post, you'll find up to 100 algorithms, along with infographics (where available) to help you know when to use each one.
Scikit-Learn Algorithm Cheat-Sheet
First up are the first 25 algorithms, which come from the Scikit-Learn cheat sheet below. If you click the image, you'll be taken to an interactive version of the same graphic. I suggest bookmarking that site, as it makes it easy to remember the algorithms and when best to use each one.
Included in this infographic are:
- SVC
- Ensemble Classifiers
- Naive Bayes
- KNeighbors Classifier (kNN)
- Kernel Approximation
- Linear SVC
- SGD Classifier
- SGD Regressor
- Elastic Net
- Lasso
- SVR(kernel='rbf')
- Ensemble Regressors
- Ridge Regression
- SVR(kernel='linear')
- Spectral Clustering
- GMM
- KMeans
- MiniBatch KMeans
- MeanShift
- VBGMM
- Randomized PCA
- Isomap
- Spectral Embedding
- LLE
- Kernel Approximation
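To give a feel for how the infographic is meant to be read, here is a minimal sketch of its flowchart as a Python function. It reflects my reading of the cheat sheet and keeps only the first suggestion on each branch (the graphic offers fallbacks when the first choice does not work). The function name `first_suggestion` and its parameters are made up for illustration and are not part of scikit-learn.

```python
def first_suggestion(n_samples, goal, *, labeled=True,
                     categories_known=True, few_features_matter=False):
    """Return the first algorithm the cheat sheet points to.

    goal is one of "category", "quantity", or "explore".
    All names here are hypothetical; thresholds follow my reading
    of the flowchart (50, 10K, and 100K samples).
    """
    if n_samples <= 50:
        return "get more data"

    if goal == "category":
        if labeled:
            # Classification branch
            return "Linear SVC" if n_samples < 100_000 else "SGD Classifier"
        if categories_known:
            # Clustering branch, number of clusters known
            return "KMeans" if n_samples < 10_000 else "MiniBatch KMeans"
        # Clustering branch, number of clusters unknown
        return "MeanShift or VBGMM" if n_samples < 10_000 else "no suggestion"

    if goal == "quantity":
        # Regression branch
        if n_samples >= 100_000:
            return "SGD Regressor"
        if few_features_matter:
            return "Lasso or Elastic Net"
        return "Ridge Regression or SVR(kernel='linear')"

    if goal == "explore":
        # Dimensionality reduction branch
        return "Randomized PCA"

    raise ValueError(f"unknown goal: {goal!r}")


if __name__ == "__main__":
    print(first_suggestion(5_000, "category"))
    print(first_suggestion(5_000, "category", labeled=False))
    print(first_suggestion(500_000, "quantity"))
```

Treat the result as a starting point only; the interactive graphic remains the reference for which algorithm to try next.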
SAS: The Machine Learning Algorithm Cheat Sheet
You can also find many of the same algorithms on SAS's machine learning cheat sheet, but it includes 15 that are different, which I have listed below. The SAS website (click the pic) also gives great descriptions of how, when, and why to use each algorithm.
- Gradient Boosting Tree
- Random Forest
- Neural Network
- k-modes
- Hierarchical
- DBSCAN
- Gaussian Mixture Model
- Latent Dirichlet Allocation
- Principal Component Analysis
- Singular Value Decomposition
- Linear SVM
- Kernel SVM
- Decision Tree
- Logistic Regression
- Linear Regression
Microsoft Azure Machine Learning Algorithm Cheat Sheet
Microsoft Azure's cheat sheet is the simplest cheat sheet by far. Even though it is simple, Microsoft was still able to pack a ton of information into it. Microsoft also made its algorithm sheet available to download. You can find the next 20 algorithms below.
- One-Class SVM (Anomaly Detection)
- PCA-based Anomaly Detection
- Ordinal Regression
- Poisson Regression
- Fast Forest Quantile Regression
- Bayesian Linear Regression
- Neural Network Regression
- Decision Forest Regression
- Boosted Decision Tree Regression
- Multiclass Logistic Regression
- Multiclass Neural Network
- Multiclass Decision Forest
- Multiclass Decision Jungle
- One-v-all Multiclass
- Two-Class SVM
- Two-Class Average Perceptron
- Two-Class Logistic Regression
- Two-Class Bayes Point Machine
- Two-Class Decision Forest
- Two-Class Boosted Decision Tree
- Two-Class Decision Jungle
- Two-Class Locally Deep SVM
- Two-Class Neural Network
A special thanks to the Data Science Interns (Rahim, Rabeez, Tooba, Hunaid, Arslan, and Tarun) at Data Science Dojo who helped me put all this together.
This is a companion discussion topic for the original entry at https://blog.datasciencedojo.com/p/7769e39b-fd09-4b6a-b864-eef53aa6d664/