Overview

Learning Types

Supervised learning involves training a model on labeled data, where each data point is associated with a target label or category. The model then uses this labeled data to learn the patterns and relationships between the input text and the target labels.
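As a rough illustration of learning from labeled text, the sketch below fits a small multinomial Naive Bayes classifier with add-one smoothing using only the Python standard library. The function names (`train_naive_bayes`, `predict`) and the four training sentences are made up for this example; real systems would normally rely on an established library and far more data.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(examples):
    """Count documents per label and words per label from (text, label) pairs."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in examples:
        label_counts[label] += 1
        for word in text.lower().split():
            word_counts[label][word] += 1
            vocab.add(word)
    return label_counts, word_counts, vocab


def predict(model, text):
    """Return the label with the highest log-probability for the text."""
    label_counts, word_counts, vocab = model
    total_docs = sum(label_counts.values())
    best_label, best_score = None, float("-inf")
    for label, n_docs in label_counts.items():
        score = math.log(n_docs / total_docs)  # log prior
        total_words = sum(word_counts[label].values())
        for word in text.lower().split():
            if word not in vocab:
                continue  # skip words never seen during training
            count = word_counts[label][word]
            # Add-one (Laplace) smoothing avoids log(0) for unseen pairs.
            score += math.log((count + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy data: each text comes with a human-provided label.
labeled = [
    ("great movie loved it", "positive"),
    ("wonderful acting great plot", "positive"),
    ("terrible movie hated it", "negative"),
    ("boring plot awful acting", "negative"),
]
model = train_naive_bayes(labeled)
print(predict(model, "great acting"))
print(predict(model, "awful boring movie"))
```

The model only ever sees the labeled pairs; everything it knows about "positive" and "negative" comes from those target labels.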

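Unsupervised learning trains a model on unlabeled data, so the model has to discover structure in the data, such as clusters or topics, on its own. It is useful when no labeled data is available or when the number of categories or topics is not known in advance.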

Semi-supervised learning combines supervised and unsupervised approaches. A model is first trained on a small amount of labeled data and then used to classify the remaining unlabeled data. The model's predictions on that unlabeled data can then be fed back as additional training examples to improve its classification performance.

Semi-supervised learning is useful when labeled data is scarce or expensive to obtain.
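The train, label, retrain loop described above can be sketched as a simple self-training routine. This is a minimal illustration, not a reference implementation: it assumes one-dimensional numeric inputs, a nearest-centroid classifier, and a hypothetical `min_margin` confidence threshold, all chosen only to keep the example short.

```python
def fit_centroids(points, labels):
    """Return the mean of the points that carry each label."""
    sums, counts = {}, {}
    for x, y in zip(points, labels):
        sums[y] = sums.get(y, 0.0) + x
        counts[y] = counts.get(y, 0) + 1
    return {y: sums[y] / counts[y] for y in sums}


def predict_with_margin(centroids, x):
    """Return the closest label and how much closer it is than the runner-up."""
    ranked = sorted(centroids, key=lambda y: abs(x - centroids[y]))
    best, runner_up = ranked[0], ranked[1]
    margin = abs(x - centroids[runner_up]) - abs(x - centroids[best])
    return best, margin


def self_train(points, labels, unlabeled, min_margin=2.0, max_rounds=10):
    """Repeatedly pseudo-label confident points and refit on the larger set."""
    points, labels, pool = list(points), list(labels), list(unlabeled)
    for _ in range(max_rounds):
        centroids = fit_centroids(points, labels)
        confident, remaining = [], []
        for x in pool:
            label, margin = predict_with_margin(centroids, x)
            if margin >= min_margin:
                confident.append((x, label))
            else:
                remaining.append(x)
        if not confident:
            break  # nothing new to learn from
        for x, label in confident:
            points.append(x)
            labels.append(label)
        pool = remaining
    return fit_centroids(points, labels), pool


# Hypothetical data: two labeled points and a larger unlabeled pool.
centroids, still_unlabeled = self_train(
    points=[1.0, 9.0],
    labels=["low", "high"],
    unlabeled=[0.5, 2.0, 3.0, 4.5, 6.5, 7.5, 8.0, 10.0],
)
print(centroids)
print(still_unlabeled)
```

Points the model is unsure about (here, the value near the midpoint between the two groups) stay unlabeled rather than being forced into a class, which limits the risk of reinforcing early mistakes.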

Self-supervised learning is a form of unsupervised learning where the data provides the supervision. In other words, the model learns to predict certain parts of the input data from other parts of the same input data. It does not require explicit labels provided by humans, hence the term self-supervised.

In the context of language models, self-supervision is typically implemented by predicting parts of a sentence when given other parts. For example, given the sentence "The cat is on the __," the model would be trained to predict the missing word ("mat," in this case).
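To make the idea concrete, the sketch below turns raw sentences into (masked sentence, target word) training pairs and fits a deliberately naive predictor that counts which word follows each pair of preceding words. The three-sentence corpus, the `__` mask token, and the helper names are assumptions for illustration; actual language models learn from the same kind of pairs with neural networks rather than counts.

```python
from collections import Counter, defaultdict

MASK = "__"


def masked_examples(sentence):
    """Yield (masked sentence, hidden word) pairs, one per word position."""
    words = sentence.split()
    for i, target in enumerate(words):
        masked = words[:i] + [MASK] + words[i + 1:]
        yield " ".join(masked), target


def left_context(tokens, i):
    """Return the two tokens before position i, padded at the sentence start."""
    padded = ["<s>", "<s>"] + tokens
    return tuple(padded[i:i + 2])


# Hypothetical unlabeled corpus: no human annotation is involved.
corpus = [
    "the cat is on the mat",
    "the dog is on the mat",
    "the cat is on the sofa",
]

# "Training": count which word filled the blank after each two-word context.
fills = defaultdict(Counter)
for sentence in corpus:
    for masked, target in masked_examples(sentence):
        tokens = masked.split()
        fills[left_context(tokens, tokens.index(MASK))][target] += 1


def predict_blank(masked_sentence):
    """Guess the masked word from the two words preceding the blank."""
    tokens = masked_sentence.split()
    candidates = fills.get(left_context(tokens, tokens.index(MASK)))
    return candidates.most_common(1)[0][0] if candidates else None


print(list(masked_examples("the cat is on the mat"))[-1])
print(predict_blank("the cat is on the __"))
```

Every training target here is a word taken from the input itself, which is what makes the setup self-supervised.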

Algorithms

Supervised

Algorithm	Type
Naive Bayes	Classification
Logistic Regression	Classification
K-Nearest Neighbor (KNN)	Classification
Random Forest	Classification/Regression
Support Vector Machine (SVM)	Classification/Regression
Decision Tree	Classification/Regression
Simple Linear Regression	Regression
Multivariate Regression	Regression
Lasso Regression	Regression

Unsupervised

Algorithm	Type
K-Means Clustering	Clustering
DBSCAN Algorithm	Clustering
Principal Component Analysis	Dimensionality Reduction
Independent Component Analysis	Dimensionality Reduction
Frequent Pattern Growth	Association
Apriori Algorithm	Association
Z-score Algorithm	Anomaly Detection
Isolation Forest Algorithm	Anomaly Detection

Semi-Supervised

Algorithm	Type
Self-Training	Classification/Regression
Co-Training	Classification/Regression

Reinforcement

Algorithm	Type
Policy Optimization	Model-Free
Q-Learning	Model-Free
Learn the Model	Model-Based
Given the Model	Model-Based
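
As one example of a model-free method from the reinforcement table, here is a minimal tabular Q-learning sketch. The five-state chain environment, its reward of 1.0 at the rightmost state, and the hyperparameter values are assumptions invented for this illustration. The agent never sees the transition rules directly; it only observes the outcomes of `step`.

```python
import random

N_STATES = 5      # states 0..4; state 4 is terminal
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Deterministic chain: reward 1.0 only when the last state is reached."""
    nxt = max(state - 1, 0) if action == 0 else state + 1
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done


def q_learning(episodes=500, alpha=0.5, gamma=0.9, seed=0):
    """Learn action values from sampled transitions only."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Explore uniformly at random; Q-learning is off-policy, so it
            # can still learn the greedy policy's values from this behavior.
            action = rng.choice(ACTIONS)
            nxt, reward, done = step(state, action)
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
    return q


q = q_learning()
# Greedy action for each non-terminal state.
policy = [max(ACTIONS, key=lambda a: q[s][a]) for s in range(N_STATES - 1)]
print(policy)
```

On this chain the learned greedy policy should choose action 1 (move right) in every non-terminal state, since that is the shortest route to the only reward.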