An early model often used for classification tasks.
Support Vector Machine (SVM) is a classification algorithm that works by separating data into categories by finding the best “decision boundary” (or hyperplane). SVM models are robust and work well for complex, high-dimensional datasets.
The model supports both linear and non-linear tasks. Common use cases include classifying ad copy sentiment (a type of sentiment-based classification) and audience segmentation. SVM is also listed as an example algorithm for implementing Search Intent Classification and Page Type Identification projects.
Sources & References
Explore other ML Models & Algorithms terms
B
BERT (Bidirectional Encoder Representations from Transformers)
The foundational language model used for transformer-based embeddings in BERTopic.
B
BERTopic
An unsupervised machine learning approach for topic modeling that generates interpretable topics and performs dynamic…
B
BERTopic
An unsupervised machine learning approach for topic modeling that generates interpretable topics and performs dynamic…
B
BIRCH (Balanced Iterative Hierarchical Based Clustering)
A hierarchical clustering method efficient for large datasets and time series.
B
Boyer-Moore
An exact string-matching algorithm and one of the best-known pattern recognition algorithms.
C
c-TF-IDF
Class-based Term Frequency-Inverse Document Frequency; used by BERTopic for clearer topic representation and selection of…
D
DBSCAN
Density-Based Spatial Clustering of Applications with Noise; groups data points based on density. Useful for…
D
Decision Tree
An early, simple model for classification or regression.
D
Distance-based matching
Fuzzy matching methods focusing on "edit distance" rather than exact spelling.
D
DistilBERT (Refined Query Semantic Class Classifier)
A fine-tuned BERT model used for semantic class classification based on queries.
E
Encoder Model
A machine learning model used in Google's two-step process for building and maintaining the Knowledge…
F
Fuzzy Matching / Fuzzy String Matching
A string similarity assessment approach, typically relying on character distance rather than semantics, used to…
