Glossary The AI SEO
The AI SEO
vocabulary.
40 terms used in GEO, AEO, and AI search visibility — defined precisely, without filler.
Precise definitions matter in GEO and AEO work because the same words mean different things depending on who's using them — "AI SEO" covers everything from prompt engineering to structured data, and conflating them leads to strategy that addresses the wrong problem. This glossary defines the 40 terms that appear most frequently in AI search visibility engagements, using the meanings they carry in practice rather than the meanings content marketers have attached to them for traffic purposes.
- AEO
- Answer Engine Optimisation is the practice of structuring content so that AI-powered answer engines — ChatGPT, Perplexity, Google AI Overviews — retrieve and cite it when responding to user queries. AEO focuses on the supply side: making your content available, parseable, and credible enough for AI systems to excerpt. Unlike traditional SEO, which optimises for blue-link rankings, AEO optimises for direct inclusion in generated answers. Core tactics include structured FAQ sections, self-contained definition passages, clear entity disambiguation, and schema markup.
- AI Overview
- AI Overview is Google's feature that displays a generated AI-written summary at the top of search results for qualifying queries. It draws from indexed web pages and synthesises a direct answer, often citing sources with inline links. AI Overviews appear for informational and research-oriented queries, and their citation patterns differ significantly from organic ranking factors — content that ranks #5 can appear in an AI Overview while a #1 result does not. Optimising for AI Overview appearances is a core component of GEO strategy.
- AI SEO
- AI SEO refers to search engine optimisation strategies designed to achieve visibility inside AI-generated answers, not just traditional blue-link results. It encompasses GEO, AEO, entity optimisation, passage citation readiness, and the use of AI tools (Claude, ChatGPT) to accelerate SEO execution. AI SEO is the umbrella category: it includes both getting cited by AI search tools and using AI tools to perform SEO more efficiently. The discipline is evolving rapidly as AI search interfaces — ChatGPT web search, Perplexity, Google AI Overviews — grow in commercial traffic share.
- Answer box
- An answer box is a search engine results page element that displays a direct answer to a query above traditional organic results — also referred to as position zero. Answer boxes pre-date AI Overviews and typically display a short extract from a single page along with its URL. They are triggered by questions and definitional queries, and are generated by algorithms parsing page content for a concise, citation-worthy answer. Answer box optimisation — writing content with explicit, structured definitions — overlaps significantly with AEO content strategy.
- Answer engine
- An answer engine is a search interface that responds to queries with a synthesised answer rather than a list of links. ChatGPT, Perplexity, Google AI Overviews, and Bing Copilot are answer engines. The distinction from a traditional search engine matters strategically: users of answer engines expect a finished response, which changes how content needs to be written to be cited. Answer engines select sources based on retrieval relevance, authority signals, and content clarity — not just backlink counts. The term is used interchangeably with "AI search" in many contexts.
- Brand mention
- A brand mention is any reference to a brand, person, or organisation in online content — with or without a hyperlink. Unlinked brand mentions are increasingly recognised as a relevance and authority signal by both Google and AI retrieval systems. For GEO purposes, brand mentions in high-authority, frequently-crawled sources (industry publications, reputable directories, Q&A forums) contribute to citation frequency in AI-generated answers. Tracking brand mentions across AI search outputs is a primary KPI in GEO engagements alongside citation share.
- Citation share
- Citation share is the percentage of AI-generated responses for a defined set of target queries that include a specific brand as a cited source. It is the GEO equivalent of keyword ranking position: instead of asking "where do we rank on Google?", citation share asks "how often does ChatGPT or Perplexity cite us when someone asks about our category?" A brand with 0% citation share is invisible to AI search; a brand with 30% citation share appears in roughly one in three relevant AI responses. Improving citation share is the primary goal of an AEO/GEO engagement.
- Citability
- Citability describes how readily a piece of content can be extracted and cited by an AI language model or retrieval system without requiring surrounding context. High-citability content is self-contained: a passage that answers a question fully within a few sentences, without requiring the reader to understand the broader document to make sense of it. Writing for citability means front-loading definitions, avoiding demonstrative pronouns that reference earlier paragraphs, and structuring answers as standalone units. It is a content writing discipline, not just a technical SEO task.
- Content silo
- A content silo is a site architecture pattern that groups topically related pages together through internal linking and URL structure, creating a clear thematic cluster that search engines recognise as authoritative on a subject. Silos prevent topical dilution by ensuring that pages about the same topic reinforce each other's relevance signals through mutual linking, rather than linking randomly across the site. A well-constructed content silo for "Shopify SEO" would include a pillar page, supporting articles on subtopics, and internal links flowing upward to the pillar — all with no cross-category links that would dilute topical focus.
- Core Web Vitals
- Core Web Vitals are Google's three primary user experience metrics used as ranking signals: Largest Contentful Paint (LCP, measuring load speed), Interaction to Next Paint (INP, measuring responsiveness), and Cumulative Layout Shift (CLS, measuring visual stability). Pages that fail Core Web Vitals thresholds are at a ranking disadvantage and may be excluded from certain SERP features. Core Web Vitals are measured from real user data (CrUX dataset) and lab data (Lighthouse), and are reported in Google Search Console. They are a technical SEO prerequisite, not a content or authority factor.
- Crawl budget
- Crawl budget is the number of pages Googlebot crawls and indexes on a site within a given time period, determined by crawl rate limit (server capacity) and crawl demand (importance and freshness signals). Sites with large page counts — Shopify stores with thousands of product variants, programmatic SEO builds with millions of pages — need to manage crawl budget by blocking low-value URLs via robots.txt, canonicalising duplicates, and ensuring high-value pages are internally linked prominently. Wasted crawl budget on thin or duplicate pages reduces how frequently important pages are recrawled and reindexed.
- E-E-A-T
- E-E-A-T stands for Experience, Expertise, Authoritativeness, and Trustworthiness — the four quality dimensions Google's Search Quality Raters use to evaluate page quality. The first E (Experience) was added in December 2022 to distinguish content from someone with direct, first-hand experience from content that merely claims expertise. E-E-A-T is not a direct ranking algorithm; it is a framework raters use to assess whether Google's algorithms are surfacing quality content. For GEO/AEO, E-E-A-T signals — author credentials, entity disambiguation, verifiable citations — are particularly important because AI models use similar trust proxies when selecting sources to cite.
- Entity
- An entity in SEO is any clearly defined person, place, organisation, concept, or thing that can be unambiguously identified and distinguished from others with the same or similar names. Google's Knowledge Graph is built around entities: it understands that "Jatin Lokwani" is a specific person (AI SEO specialist, India) rather than generic text. Entity-based SEO focuses on creating clear, unambiguous signals about who or what a brand is, what it is associated with, and how it relates to other known entities — rather than simply accumulating keyword instances. Entity clarity is foundational to GEO: AI models must be able to identify a brand with confidence to cite it.
- Entity graph
- An entity graph is the network of relationships between entities as understood by a search engine or AI model. In Google's Knowledge Graph, every entity connects to others: Jatin Lokwani (person) is connected to Digital with Jatin (organisation), GEO (topic), Ahmedabad (location), and so on. A stronger entity graph means more connections, more confirmation signals, and greater model confidence when deciding whether to cite a brand. Building an entity graph for SEO means creating and strengthening these associations through structured data, entity mentions in authoritative sources, and consistent information across the web.
- Featured snippet
- A featured snippet is a special SERP element that displays an extracted passage, list, or table from a web page above organic results, directly answering the query without requiring the user to click through. Featured snippets are pulled algorithmically based on content clarity, relevance, and structure — pages that directly answer the query with a well-formatted, concise passage are most likely to be selected. Featured snippet optimisation (using structured headings, direct answers immediately below H2s, numbered or bulleted lists) is closely related to AEO content structure, as both target AI system extraction of self-contained answer passages.
- GEO
- Generative Engine Optimisation (GEO) is the practice of making a brand's content get cited inside AI-generated answers from tools like ChatGPT, Perplexity, Google AI Overviews, and Bing Copilot. The term distinguishes this discipline from traditional SEO (which targets Google's blue-link rankings) and from AEO (which focuses on content structure). GEO encompasses strategy, entity optimisation, citation building, source authority development, and tracking. A brand with strong GEO is consistently cited by AI search tools when users ask questions in its category — independently of where the brand ranks in traditional search results.
- Generative AI
- Generative AI refers to AI systems that produce new content — text, images, code, audio — by learning patterns from training data and generating outputs based on prompts. In the search and SEO context, generative AI specifically refers to large language models (LLMs) like GPT-4, Claude, and Gemini that generate written responses to user queries. These models underpin AI search features: Google AI Overviews, ChatGPT web search, and Perplexity all use generative AI to synthesise answers from retrieved source material. The rise of generative AI in search is the primary driver for GEO and AEO as distinct disciplines.
- Grounding
- Grounding in AI search refers to the process by which an AI language model connects its generated answer to specific, retrievable source documents — rather than generating responses purely from training data. A grounded AI response includes citations that users can verify; an ungrounded response relies on what the model "knows" from pre-training. Grounding is what makes citation share a meaningful metric: grounded AI systems (Perplexity, ChatGPT with web search, Google AI Overviews) actively retrieve web content and attribute sources. Content optimised for citability is more likely to be selected as a grounding source.
- Internal linking
- Internal linking is the practice of linking pages within the same website to one another, distributing authority signals, clarifying site architecture, and communicating topical relationships to search engines. Strategic internal linking passes PageRank to high-priority pages, creates crawl paths that ensure important content is discovered and indexed, and reinforces topical silos by keeping thematically related pages connected. For GEO, internal linking to entity-defining pages (brand page, founder page, key service pages) strengthens entity graph signals. Poor internal linking is one of the most common and highest-impact technical SEO issues found in audits.
- Knowledge graph
- A knowledge graph is a structured database of entities and their relationships, used by search engines to understand the real-world people, places, organisations, and concepts mentioned in content and queries. Google's Knowledge Graph powers features like the Knowledge Panel and informs how AI Overviews understand brand context. When a brand has a strong presence in Google's Knowledge Graph — confirmed entity relationships, consistent NAP data, Wikipedia disambiguation, Wikidata entry — it is easier for AI models to identify and cite the brand with confidence. Building knowledge graph presence is a core component of entity-focused GEO strategy.
- Knowledge panel
- A knowledge panel is the information box that appears in Google search results when a user searches for a recognised entity — a person, brand, organisation, or place. Knowledge panels are populated from Google's Knowledge Graph and signal that Google has sufficient entity confidence to display structured information. Having a knowledge panel is not required for GEO success, but it is a strong signal of entity establishment. Brands without a knowledge panel can still build entity clarity through structured data, consistent external citations, and Wikipedia disambiguation — the same signals that eventually trigger knowledge panel creation.
- llms.txt
- llms.txt is a proposed web standard — a plain-text file placed at the root of a website (e.g., /llms.txt) that provides a concise, structured summary of a site's content for large language models. Analogous to robots.txt for traditional crawlers and sitemap.xml for indexing, llms.txt is designed to make it easier for AI systems to understand a site's purpose, key pages, and content hierarchy without crawling every URL. Adoption is early-stage but growing, and having a well-structured llms.txt file is a forward-compatible signal for AI retrieval systems. See the full specification at llmstxt.org.
- LSI keywords
- LSI (Latent Semantic Indexing) keywords are terms semantically related to a primary keyword that help search engines understand the full context and topical scope of a piece of content. The concept originates from information retrieval research but is widely used in SEO to mean "related terms that signal topical coverage." A page about "Shopify SEO" that also covers collection page optimisation, product schema, and canonical tags sends broader topical signals than a page that repeats the single phrase. Modern use of the term overlaps with semantic SEO — using the vocabulary of a topic comprehensively rather than targeting exact-match phrases.
- Named entity recognition
- Named entity recognition (NER) is the NLP task of identifying and classifying named entities in text — people, organisations, locations, dates, products — into predefined categories. Search engines use NER to understand what a piece of content is about beyond keywords: a page that mentions "Jatin Lokwani" (person), "Ahmedabad" (location), and "GEO" (concept) gives multiple entity signals that contribute to how the content is indexed and associated in the knowledge graph. For SEO, being aware of NER means writing content with explicit entity names and clear entity relationships rather than relying on pronouns and implicit references.
- NLP SEO
- NLP SEO (Natural Language Processing SEO) refers to optimisation practices that align content with how search engines and AI models parse natural language — including entity recognition, semantic relationship mapping, passage-level relevance scoring, and sentiment analysis. Google's MUM and BERT models process content using NLP techniques, which means content that reads naturally, uses topically complete vocabulary, and structures information clearly performs better than content written for exact-match keyword frequency. NLP SEO is less a distinct tactic and more a lens for evaluating whether content will be understood correctly by machine readers.
- Passage citation
- Passage citation is the mechanism by which an AI answer engine selects a specific passage from a web page to cite in a generated response — not the page as a whole, but a discrete excerpt that directly answers the query. Google's Passage Indexing (launched 2021) indexes individual passages independently of full page authority; AI models do the same. For AEO, this means every paragraph of a page can be a citation opportunity: if a passage is self-contained, clearly structured, and directly answers a likely query, it can be cited even if the overall page would not rank for that query. Writing for passage citation is the most granular AEO content technique.
- Perplexity
- Perplexity is an AI-powered answer engine that responds to queries with synthesised, grounded answers citing retrievable web sources. Unlike ChatGPT's base model, Perplexity always performs live web retrieval before generating a response, making it more citation-predictable for GEO purposes. Perplexity is growing in use among research-oriented and professional users, and its citation patterns tend to favour clearly structured, authoritative domain-level sources. Monitoring Perplexity citation share alongside ChatGPT and Google AI Overviews is standard practice in a GEO tracking setup.
- Programmatic SEO
- Programmatic SEO is the practice of generating large numbers of search-optimised pages at scale from structured data sources — templates, databases, or APIs — rather than writing each page manually. Common applications include location pages ("Plumber in [City]"), product pages at scale, and comparison pages ("X vs Y"). Effective programmatic SEO requires rigorous template quality control, unique data per page to avoid thin content penalties, and careful crawl budget management. It is distinct from AI-generated content at scale: the architecture is data-driven, not AI-written, though AI tools can help generate content variants within the template.
- RAG
- RAG (Retrieval-Augmented Generation) is an AI architecture in which a language model's response is grounded by first retrieving relevant documents from a knowledge base or the live web, then generating a response conditioned on those retrieved documents. Most AI search tools — Perplexity, ChatGPT with web search, Google AI Overviews — use a RAG-like architecture. Understanding RAG matters for GEO: your content must first be retrieved (crawlability, indexing, topical relevance) before it can be cited. A brand invisible to the retrieval stage cannot appear in the generation stage, regardless of content quality.
- Robots.txt
- Robots.txt is a plain-text file at the root of a website (e.g., /robots.txt) that instructs web crawlers which pages or directories they should not crawl. It is the first file most crawlers request when visiting a site. Misconfigured robots.txt is a common cause of indexing failures — accidentally blocking Googlebot from crawling CSS, JavaScript, or key page templates. For AI search, robots.txt also affects whether AI crawlers (GPTBot, ClaudeBot, PerplexityBot) can access your content; blocking these crawlers removes the site from consideration as a citation source in live-retrieval AI search tools.
- Schema markup
- Schema markup is structured data added to a page's HTML using the Schema.org vocabulary — typically in JSON-LD format — that explicitly tells search engines and AI systems what a page is about, what type of content it contains, and how its elements relate to each other. Schema types include Article, FAQ, HowTo, Product, Person, Organisation, BreadcrumbList, and many others. Correct schema markup improves eligibility for rich results in Google and strengthens entity signals for AI citation systems. It is one of the highest-leverage technical SEO tasks available because it makes implicit page information explicit and machine-readable.
- Search intent
- Search intent (also: user intent, query intent) is the underlying goal a user has when performing a search query — classified as informational (learning), navigational (finding a specific site), commercial (researching before buying), or transactional (ready to buy/act). Matching content to search intent is the single most important on-page SEO factor: a page optimised for the wrong intent will struggle to rank even with strong technical SEO and high authority. For AEO specifically, informational and commercial-investigational intents are the most citation-relevant — these are the query types that trigger AI Overviews and Perplexity synthesised answers.
- Semantic SEO
- Semantic SEO is the practice of optimising content for meaning and contextual relevance rather than exact-match keyword frequency. It involves using the full vocabulary of a topic, covering related concepts and entities, structuring content to reflect relationships between ideas, and aligning with how search engine NLP models parse language. A semantically rich page about "GEO" would cover generative AI, citation share, AI Overviews, entity clarity, and AEO — demonstrating genuine topical coverage rather than just repeating "generative engine optimisation." Semantic SEO is closely related to topical authority and is the content-side foundation for both traditional and AI search visibility.
- Source authority
- Source authority is the degree to which an AI system or search engine trusts a domain or page as a credible, reliable source for a given topic. It is influenced by backlink profile, brand mention frequency in authoritative sources, E-E-A-T signals, topical focus, and citation history. AI search tools prioritise high-authority sources during retrieval — a brand cited frequently in industry publications and referenced by trusted entities is more likely to appear in AI-generated answers than a brand with strong keyword rankings but weak external authority signals. Building source authority is a long-term citation-building and PR activity, not a purely technical task.
- Structured data
- Structured data is any information presented in a standardised, machine-readable format — most commonly JSON-LD using Schema.org vocabulary for web SEO purposes. It provides explicit context to search engines and AI systems about what content represents, reducing ambiguity that algorithms would otherwise need to infer. Structured data and schema markup are often used interchangeably, though structured data is the broader category (it includes microdata and RDFa formats as well as JSON-LD). Implementing structured data is a core technical SEO task and one of the most direct signals you can give AI retrieval systems about your brand's entity type and authority.
- Topical authority
- Topical authority is the degree to which a website is recognised by search engines as a comprehensive, credible source on a specific subject. It is built by covering a topic in depth — addressing the full range of related subtopics, questions, and entities — rather than producing isolated pieces of content. Google's Helpful Content and algorithm updates increasingly reward topical depth over broad coverage: a site that covers GEO, AEO, entity SEO, citation tracking, and schema markup in depth will outrank a general marketing site that touches each topic superficially. Topical authority is both a ranking factor and a prerequisite for sustained AI citation share.
- Topical depth
- Topical depth refers to how comprehensively a piece of content covers its subject — addressing not just the primary question but related sub-questions, edge cases, definitional nuances, and entity connections that a truly knowledgeable resource would include. Topical depth differs from length: a 500-word piece with high topical depth covers the essential and non-obvious dimensions of a topic; a 3,000-word piece with low topical depth repeats the same basic information at length. For AI citation, topical depth signals that a source is worth citing for a broad range of related queries — which is why deep, well-organised content earns disproportionate citation share relative to shallow pages with more backlinks.
- Topic cluster
- A topic cluster is a content architecture model in which a single pillar page covers a broad topic comprehensively, and multiple cluster pages cover specific subtopics in depth, with all cluster pages linking back to the pillar and the pillar linking to each cluster. Topic clusters create hierarchical topical authority: the pillar benefits from the depth of the cluster pages, and the clusters benefit from the authority of the pillar. First formalised by HubSpot, the model aligns with how search engines evaluate topical coverage and is one of the clearest frameworks for building content silo architecture that produces measurable ranking results.
- Vector search
- Vector search is a method of information retrieval in which both queries and documents are represented as high-dimensional numerical vectors, and retrieval is performed by finding documents whose vector representations are most similar to the query vector — a process called nearest-neighbour search. Unlike keyword search (which matches exact terms), vector search captures semantic similarity: a query about "how to rank in ChatGPT" can retrieve documents about GEO and AI citation share even if those exact words don't appear. AI search tools use vector search in their retrieval stage (RAG architectures), which is why content that covers a topic's full semantic field is more likely to be retrieved than content optimised for a single exact phrase.
- Zero-click search
- A zero-click search is a search query that is resolved without the user clicking through to any website — because the answer is displayed directly in the SERP (via featured snippet, knowledge panel, answer box, or AI Overview). Zero-click rates have grown significantly with the expansion of Google's rich SERP features and AI Overviews: a study by SparkToro and Datos found that more than half of Google searches in 2023 ended without a click. For SEO strategy, zero-click search creates a tension: optimising for featured snippets and AI Overviews increases brand visibility and citation share, but may reduce direct click-through traffic — making citation tracking a more relevant KPI than pure organic traffic in many GEO engagements.
A
B
C
E
F
G
I
K
L
N
P
R
S
T
V
Z
Put the vocabulary to work.
Understanding GEO and AEO terms is step one. Getting cited in ChatGPT and AI Overviews is the actual goal.