Bigram

A sequence of two adjacent words.

A bigram is a contiguous sequence of two adjacent words found within a text. Along with N-grams (single words), bigrams are analyzed to understand the core semantic patterns within a set of keywords or queries. Tools like KeyBERT are used to extract the single most important N-gram and the most important Bigram from a keyword, providing a compact label representing the query’s semantic core. Visualizing the relationships between core N-grams and Bigrams can reveal key insights into a dataset, such as high-volume or high-intent phrases.

Explore other Core Concepts (AI/ML) terms

A
AI Overview
AI-generated summaries of highly informational, low-intent queries, offering quick answers to users, or generally, a…
A
Artificial Intelligence (AI)
The overarching concept related to the design and study of intelligent systems. Early systems relied…
A
Augmented Search Queries
Queries that expand or modify the original user query to improve search accuracy and relevance…
B
Bag of Words
A type of semantic representation of data, which can be extracted from page contents.
D
Deep Learning
A part of machine learning; Generative AI models like ChatGPT and LLM-based chatbots fall within…
D
Dimensionality Reduction
A process that reduces data, such as high-dimensional vectors, for visualization while preserving semantic structure…
E
Embedding
A numerical representation capturing the meaning of a document or data. Also referred to as…
E
Entity
A representation of real-world objects (people, products, places, concepts) that hold value from an SEO…
E
Entity Attribute (EAV Model)
Defining properties or characteristics of an entity (e.g., location, niche) used in the EAV model…
E
Entity Attribute Variable (EAV Model)
The concept encompassing entities, their attributes, and the specific values (variables) associated with those attributes.
E
Entity Variables (EAV)
Specific values an entity attribute can take (e.g., London, Paris for the Location attribute).
F
Feature Extraction
The process of converting entities into numerical representations based on term importance (e.g., using TF-IDF).