GEO Research 2026 | what we're seeing in AI citations

GEO (Generative Engine Optimisation) and AI Search refer to the same thing – how businesses appear in AI-powered search tools like ChatGPT, Perplexity, and Google AI. We use both terms interchangeably.

Why we're curating GEO research

Generative Engine Optimisation is moving rapidly. There's a lot of noise. Not enough data shared openly. We think businesses deserve research they can actually rely on.

Known & Cited doesn't run its own research programme yet. Instead, we curate high-quality GEO research from international sources – academic studies, industry analysis, and original observations from our audits – and share what it means for your business. This page brings together what the community is learning and what we've observed in our measurement work.

We combine published research with findings from our AVS audits. Our goal is to help you understand the GEO landscape – what's working, what's uncertain, and where the field is still figuring things out. We're honest about confidence levels. Where research is solid, we'll say so. Where it's emerging or contested, we'll flag that too.

We want to start our own data-led international research programme – measuring how AI citations work across countries and languages – but that's future work. For now, this is our curation hub. Bookmark it.

Featured research · Added 9 June 2026

State media control measurably shapes what LLMs say

A peer-reviewed Nature paper shows that the upstream information environment, including state-shaped media, propagates measurably into LLM outputs, and that the effect varies by the language used to ask the question. Demonstrated in depth for China, with correlational evidence across many languages. This is the serious, citable foundation under everything we do.

Source: Waight, H., Yang, E., Yuan, Y. et al., “State media control influences large language models”, Nature (2026), DOI 10.1038/s41586-026-10506-7 · Also: Nature News & Views companion

What the paper found

→Bias varies by language. Across six studies, LLMs show a stronger pro-government valence in the languages of countries with lower media freedom than in the languages of countries with higher media freedom.
→The mechanism is the training data. A five-word-gram similarity analysis of the CulturaX corpus found 3.1 million Chinese-language documents (1.64%) matching state-coordinated media, a measurable channel from the information environment into the model.
→Depth for China, breadth across languages. The effect is demonstrated in detail for China and shown correlationally across many languages. Hold the claim at what the evidence supports.

K&C commentary

LLMs have quietly become information intermediaries, sitting between people and what they come to believe – much as search engines and news feeds did before them. This paper is among the first large-scale empirical demonstrations that the upstream information environment, including state-shaped media, propagates measurably into those systems. The significance is less "AI can be biased" (long known) and more that the specific bias of a national information ecosystem can be traced into the model, and that it varies by the language used.

The affected domains are broad. Anyone relying on LLMs for cross-border information – journalism, international business, policy analysis, due diligence, translation, market research, education – is exposed to language-dependent variance they may not know is there. The risk is sharpest precisely where verification is hardest: questions about countries and institutions where independent information is scarce.

The longer-term implication is a strategic one, and it is uncomfortable. If model outputs can be shaped by what dominates the information environment, then states and powerful institutions now have a fresh incentive to flood that environment – not only to influence people directly, but to influence the AI systems that increasingly mediate what people learn. That is a structural concern for information integrity, for AI governance, and for the regulation of how training data is sourced and disclosed. It reframes content as something with a very long downstream tail: today's published text is tomorrow's training data is the day-after's machine-generated answer.

Section 4: Relevance for PR & Communications

Treat LLM outputs as situated, not neutral. The most direct lesson for communicators is that an LLM answer is an artefact of an information environment, not a view from nowhere. When a team uses ChatGPT, Gemini or Claude to research a market, summarise sentiment about a country or institution, or draft material for an international audience, the answer carries the fingerprints of whatever text dominated the relevant corpus. For PR – a discipline whose entire job is shaping and reading information environments – that is both a professional insight and a daily operational risk.

Language is now a variable in reputation, not just a translation step. The finding that the same question yields differently-valenced answers in different languages has immediate consequences for multilingual and multinational practice. Media monitoring, reputation audits and "what does AI say about us" checks need to be run in each relevant language, because the English-language answer may not represent what audiences in other markets are being told by the same tools. A single-language audit gives a false sense of completeness.

This is the serious face of GEO, AEO and LLMO. Generative-, answer- and LLM-engine optimisation rest on the premise that what circulates in the information environment shapes what models surface. This paper is empirical confirmation of that premise – and a sharp illustration of its dark mirror. The legitimate-versus-manipulative distinction matters enormously here: improving genuine relevance and accuracy so that models represent an organisation fairly is legitimate practice; coordinated flooding of the corpus to bend model sentiment is the state-propaganda playbook the paper documents. The same mechanism underwrites both, and practitioners advising on AI visibility should be able to articulate that line clearly.

The information-integrity case now has a peer-reviewed anchor. For teams advising on disinformation, influence operations and information integrity, this is a citable, Nature-grade demonstration that information operations can propagate into AI systems and surface as seemingly neutral answers. It strengthens the argument for provenance, source diversity and multilingual scrutiny in any integrity-focused brief.

The over-trust risk is acute. This connects to the persuasion trap: fluent, confident AI answers are persuasive regardless of whether they are accurate or balanced. A communications professional who pastes an LLM's summary of a foreign government, a contested geopolitical event or an international institution into a brief may be laundering a state-shaped narrative into client-facing work without knowing it. Automation bias makes this worse – the smoother the output, the less it gets challenged.

Opportunities worth naming. There is consultancy and training value in building multilingual "what does AI say about us" audit routines; advising international clients on the geopolitics of AI-mediated reputation; developing review checklists that treat language and source-provenance as first-class checks; and creating training modules that teach teams to interrogate AI outputs on politically or geographically sensitive topics rather than trust them. The governance angle is also live – this is a clean example of why an organisation adopting AI needs someone who holds the whole: a human accountable for understanding where outputs come from and where they could mislead.

Ethically, two cautions stand out. First, do not present AI-generated summaries of contested or geopolitical matters as neutral fact; disclose AI involvement and verify against independent sources. Second – and this is the anointment effect turned on the paper itself – a Nature study is exactly the kind of source that gets over-cited and over-extended. The robust claim is that state media control measurably influences LLM outputs, demonstrated in depth for China and correlationally across many languages. The over-claim would be that every AI answer in a given language is propaganda. Hold the line at what the evidence supports.

Newest research · Added 9 June 2026

Two thirds of Google searches now end without a click

Source: SparkToro · Window: US Google searches, January to April 2026

SparkToro's latest zero-click analysis confirms a decade-long trajectory has hardened. Most Google searches no longer send anyone to a website, and AI Overviews are accelerating the shift wherever they appear.

68.01%

of US Google searches ended with no click, up from 60.45% in 2024

~60%

drop in click-through rate when an AI Overview is present, now on 20%+ of searches

22.9%

fall in “clicks 1X+” (any click to Google, organic or ads) between 2024 and 2026

What it tells us

→The zero-click world is the default now. Across the last decade zero-click searches have climbed from roughly 45% to 68%. This is the baseline, not a blip.
→AI Overviews compound it. Where they appear, they keep the user on the results page and roughly halve the clicks that would have reached a website.

K&C commentary

The zero-click trajectory is structural, not seasonal. If two thirds of US Google searches end without a click, and AI Overviews knock another 60% off click-through when they appear, then measuring how AI answers represent a brand is no longer optional. The traffic that used to land on a brand's website is now resolved on the results page or inside the AI. If you only measure traffic, you only see what is left.

Featured research · Added 13 May 2026

The first proper clock on AI citation

Until now, nobody had a credible number for how long it takes ChatGPT and Claude to start citing newly published content. Josh Blyskal at Profound now does. The dataset is small. The method is honest. The finding is genuinely useful.

Source: Profound, Josh Blyskal on LinkedIn, 11 May 2026 · Method: ~900 newly published marketing pages, observed across ChatGPT and Claude agent logs over a 60-day window, March to May 2026

90%

cited within 37 days

Half

cited within 7 days

6.81

days typical time to first citation

What it tells us

→Half of newly published pages are cited within a week. Not months, not a quarter. A week.
→90% within 37 days. If your page is not getting cited by day 37, the problem is almost certainly upstream of the content. Indexing, retrieval, or signal architecture.
→It is a pipeline benchmark, not a content benchmark. Rodolfo Sabino flagged this in the LinkedIn thread: citation only happens after Google indexes the page and the model’s fan-out retrieves it. The clock measures the whole pipeline.
→Emergent topics get cited faster than mature ones. Garrett Smith’s empirical observation from the same thread. New ground gets covered quickly. Crowded ground takes longer.

Two clocks, one game – K&C commentary

Profound’s data measures first citation. How quickly a page shows up in an AI answer at all. That is one clock, and it is the one PR and content teams have been desperate for.

Our own measurement focuses on a different clock: sustained citation across the volatility window. AI answers wobble. Appearing once is not the same as being part of the answer set. Our tech partner’s data shows around 45% of brands appear only once in a 7-day window on unbranded prompts. So while Profound tells you how fast the door opens, AVS tells you whether it stays open. Same game. Different clocks. You need both.

Source: Profound (Josh Blyskal), LinkedIn, 11 May 2026. Caveats from the thread credited to Rodolfo Sabino (pipeline reframe) and Garrett Smith (emergent topics).

Read our full take on this: AI cites you in 6.81 days, if everything else is already working · the longer blog piece on what the Profound clock changes for PR teams, and why first-citation is only one of two clocks worth watching.

Newest research · Added 5 June 2026

AirOps: four numbers worth keeping

Source: AirOps, a measurement firm, across four separate studies · Note: each number comes from its own piece of work, so treat them as four separate readings, not one stacked case.

AirOps put out a run of studies recently. Four stats stuck with us. Together they describe a market that is far less stable, and far less about your own website, than most businesses assume.

30%

of brands stay visible from one AI answer to the next. Just 20% hold across five runs of the same question.

AirOps volatility report →

more likely to lose AI citations if a page has not been updated in over a year.

AirOps freshness report →

6.5x

more likely to be cited on someone else’s page than on your own.

AirOps citations report →

~50%

of all AI citations come from community sites. Reddit alone turns up in over a fifth of answers.

AirOps 2026 State of AI Search →

What it tells us

→Visibility is jumpy. One snapshot lies. If only 30% of brands stay visible run to run, a single check tells you almost nothing reliable.
→Stale pages fall out. Content that sits untouched quietly drops out of the answers. Freshness is not optional.
→Other people’s pages do the work. Your own website is not where most of the battle is won. Third-party coverage is.
→Communities count, but not everywhere. Community sites are huge in aggregate, yet in your particular category Reddit might not show up at all. Worth checking before you pour effort in.

Why this is exactly how we built AVS – K&C commentary

The volatility number is the whole reason an AVS run is not a single snapshot. We ask upwards of 6,000 prompts over seven days precisely because one reading wobbles. It is the only way we have found to see past the noise and give a client a number they can actually trust.

The freshness finding is why our recommendations carry a freshness rule and we tend to push clients toward a quarterly refresh. The answer is not cheap automated content, which will not cut through. It is real writing, sometimes started by AI and then finessed by good copywriters, kept moving so you do not quietly drop out of the answers without ever knowing why.

And the 6.5x figure is why we score Source Quality as a third of the picture, and why good PR people who get you mentioned in the right third-party places earn their keep. We read these as four separate readings, not one stacked case. But pointed the same way, they describe the job precisely. Be Known. Be Cited.

Research note · Added June 2026

Reddit’s CEO: “no Reddit, no AI”

Source: Search Engine Journal · When: late May 2026

Steve Huffman, who runs Reddit, told a Fast Company summit that large language models (the technology behind ChatGPT and the rest) “would not exist as we know them” without Reddit. He called the platform’s human conversation “modern oil”. He has been backing the claim up commercially: Reddit has signed licensing deals with Google and OpenAI, says it is open for more, and is suing AI firms it believes used its data without paying.

What it tells us

→Reddit is structurally important to the models. It is one of the largest sources of natural human conversation the models were trained on, and it keeps turning up in live answers.
→It is hard to influence, which is the point. That difficulty is exactly why it carries weight, and why it often holds genuine consumer insight worth factoring in.

K&C commentary

Reddit matters, but it is not the be-all and end-all, and your particular category may not lean on it at all. We treat communities as a measurement signal, not a manipulation channel. We check whether your category’s AI answers actually draw on Reddit before recommending any effort there, and we never astroturf, because communities spot it and the bans follow you. Where Reddit does matter, the work is monitoring, genuine participation, and content good enough to be shared.

Research note · Added June 2026

Publishers draw the line: the CMA opt-out and Sulzberger’s “brazen theft”

Sources: CMA, GOV.UK and Press Gazette · When: early June 2026

Two things landed on the publisher side. The UK competition regulator, the CMA, used new powers to force Google to let sites opt out of appearing in its AI summaries without being penalised in normal search, a world-first ruling that the BBC and other big UK names had pushed for. And in Marseille, AG Sulzberger, who runs the New York Times, told a room of the world’s news publishers that AI scraping is “brazen theft” and that they should fight it in court, in parliament, and by setting licensing terms together.

What it tells us

→Real opt-outs now exist. Regulators have given publishers a way out of AI summaries without a search penalty.
→The fight is really about money. Opt-outs and lawsuits are leverage toward commercial licensing deals.

K&C commentary

This looks like a tactic to force commercial agreements. Publishers need the reach, so it is hard to believe they truly want their content kept out of the models. Any publisher without LLM access, or without a commercial deal, will simply matter less and less. We have already seen where this goes: OpenAI struck a deal with Reddit, and Stack Overflow, the site every developer lives on, signed one too. The endgame is not walls. It is contracts.

For almost every other commercial brand, the lesson runs the other way: blocking yourself from AI answers is self-harm. You want to be in the answer your buyer reads, not out of it.

Featured study · Added May 2026

State of Martech 2026 – Brinker’s AEO chapter

Source: Scott Brinker (chiefmartec) and Frans Riemersma (MartechTribe) · Published: May 2026 · Method: 125-page annual report. Survey base of 208 marketing and ops leaders, February 2026. 40% VP+, 61% pure B2B, 36% from tech, 21% professional services. Sponsored by GrowthLoop, Hightouch, Knak, MoEngage, Pega, Progress, SAS.

The official chiefmartec annual. Brinker has spent two decades mapping the marketing technology landscape; this is the report every CMO, vendor and analyst will be quoting for the next twelve months. The 2026 edition does something the field has been waiting for: it formally renames the SEO subcategory of the MartechMap to SEO/AEO/GEO, names the new tool category in print, and frames the measurement gap that AVS was built to close.

What it confirms

→The discipline is real and named. Brinker has formally renamed the SEO subcategory of the MartechMap to SEO/AEO/GEO. The whole industry is being told, in print, that the discipline is changing shape rather than dying.
→63.1% of marketers are publishing AI‑optimised content. Only 13.6% are measuring AI inclusion rate or agent‑referred conversion. The industry is doing the work without checking if it lands. (Page 24 to 25.)
→CMS & Web Experience Management grew 21.4% (504 to 612 products) and Ecommerce Platforms grew 19.9% (547 to 656). Brinker reads it as the website’s job being renegotiated for machines as a first‑class audience.
→Mobile & Web Analytics grew 11.3% after years of stagnation. Brinker’s line: you can’t measure what you can’t see, so you measure harder where you still can.
→The new tool category is named. Brinker calls out AirOps, Bluefish, Daydream, Evertune, Profound and Scrunch, plus Semrush (Adobe‑owned now) and Ahrefs extending in. The product layer is forming. The methodology layer underneath it is still wide open.
→73% of marketers now have a formal generative AI policy, up from 52% in 2024. SAS reports only 8% have full confidence in their broader AI governance readiness. Policy‑rich, infrastructure‑poor.

63.1% / 13.6%

Publishing AI content vs. measuring it

+21.4%

CMS subcategory growth, 2026

800M

ChatGPT weekly active users

15,505

Total martech products tracked

73%

Marketers with a formal genAI policy

What this means for UK businesses – K&C commentary

The 63.1% / 13.6% number is the entire reason a service like AVS exists. Brinker frames the gap as transitional opacity that the tools will solve. We think it’s structural. Most teams will never close it on their own because the work isn’t in the publishing layer (which they own); it’s in the answer layer (which they cannot see). Even with llms.txt files, schema markup, structured FAQs and content rebuilt for machines, you still need somebody running queries, capturing answers, scoring citations, and tracking competitor presence over time. That isn’t a feature your CMS will ship in a release note. It’s a process, run by a person who knows what to look for.

The named tool list (AirOps, Bluefish, Daydream, Evertune, Profound, Scrunch, plus Semrush and Ahrefs) is all US‑based product companies. K&C isn’t there yet, and that’s the honest read. Tooling tells you whether you appeared in a query. It does not tell you whether you appeared in the right queries, whether your competitors appeared more, or whether the citation pattern across a category is moving toward or away from you. That layer (query design, sector benchmarks, scored methodology) is where AVS sits. It’s a different shape of product. You can’t buy it off a shelf yet.

The MartechMap rename matters more than it sounds. Brinker has put the field’s name in print, in the report every CMO and vendor will read for the next year. The category has a shape. The question now is who owns the methodology layer for the businesses Brinker’s tool list won’t reach: UK SMEs, mid‑market B2B, charities, ecommerce operators without a US product team on retainer. That’s the K&C shape. Be Known. Be Cited.

Featured study · Added May 2026

AirOps – The Complete AI Search Playbook for Marketers

Source: AirOps · Published: March 2026 · Method: ~15M data points across AI answers, queries, citations and brand mentions; 12,000+ pages for structural analysis; 21,000+ brands for third-party citation behaviour; 5.5M answers for community-citation patterns

A 17-page report from a US content-operations platform, built on the most rigorous published GEO dataset to date. The AirOps team turned twelve months of citation analysis into a practitioner-facing playbook – the partner piece to their larger 2026 State of AI Search companion dataset with Kevin Indig. The data is the strongest bit. The frameworks and case studies are where it tilts into sales pitch.

What it confirms

→68% of brand mentions in AI search appear in only one model. Cross-model consistency is rare – if you only track one platform, you only see about a third of your visibility picture.
→85% of brand mentions come from third-party sources, not the brand’s own site. Brands are 6.5× more likely to be cited via third-party sources than from their own domain.
→~90% of third-party citations come from listicles, comparisons and review sites. 80% of cited brands sit in the top three of those formats. If you’re not on the key comparison page in your category, you’re effectively invisible for that query.
→Pages with three or more schema types are 13% more likely to be cited. Clear H1→H2→H3 hierarchy gives 2.8× higher citation odds. Lists or tables appear in nearly 80% of ChatGPT citations – vs 29% in Google’s top results.
→70% of AI-cited pages were updated within the past year. Content less than three months old is 3× more likely to be cited. Annual refresh cycles are a year too late.

What this means for UK businesses – K&C commentary

The data is real. The frameworks are sales pitches dressed as research, every featured case study (Carta, Webflow, Chime, Docebo, Klaviyo, LegalZoom) is an AirOps customer, and methodology disclosure is non-existent – no published prompt set, no model versions, no country or language coverage. Worth reading. Worth questioning. Cite the customer numbers as “AirOps reports their customer X saw…”, not as independent benchmarks.

The headline frame is ~15 million data points across 21,000 brands – roughly 700 prompts per brand on average. As industry-wide field reading, that’s plenty. As a measurement of your business, it’s thin and generic. AVS Annual measures 6,000+ prompts per brand, every prompt designed for that brand’s sector and buyers. Roughly 8× the per-brand depth – tailored, not generic.

The product K&C sells is the judgement on top of the data, not the data itself. Anyone with a budget can buy AI search numbers now. The bit you can’t buy off a shelf is somebody who knows what those numbers mean for your buyers, which findings matter, and which battles to pick first. Generic benchmarks tell you which way the field is moving. They don’t tell you what to do.

Read the full K&C deep-dive on the AirOps Playbook →

Featured study · Added May 2026

Seer Interactive – GEO Olympics Study

Source: Seer Interactive · Published: February 2026 · Method: 231,347 LLM responses across six LLMs, mapped to the 2026 Winter Olympics

The most rigorous piece of GEO research the field has produced so far. Seer queried six LLMs across every category in the 2026 Winter Olympics – broadcast partners, sportswear, equipment manufacturers, ticket platforms, host-city hospitality – and measured which brands appeared in AI answers and which did not. The headline finding is a 7.8× outcome gap between brands with strong “signal architecture” and brands without.

What it confirms

→Signal architecture matters. Brands strong on entity authority, third-party validation and community discussion appeared 7.8× more often than brands weak on those layers.
→Wikipedia is the single most-cited source across the dataset – though only viable for businesses already independently notable.
→Reddit is a heavyweight community signal in many categories – category-dependent, and a measurement signal rather than a manipulation channel.
→Roughly 1 in 5 LLM mentions are framed in stale or out-of-date language – meaning the description AI returns of you may not be the description you want.
→The Binary Cliff is real. You are either in the consideration set or you are invisible. Real-world performance does not carry over to AI surfacing on its own.

What this means for UK businesses – K&C commentary

The temptation, reading this study, is to walk away thinking “we need a Wikipedia page and a Reddit campaign.” Resist it. The Olympics is one slice – broadcast media, sportswear, equipment manufacturers, ticket platforms. That slice happens to lean on Wikipedia and Reddit. Most other categories don’t, in the same proportions or at all.

Some sectors live on trade press and analyst reports. Some live on accreditation bodies and government registers. Some live on LinkedIn and founder presence. Some genuinely live on Reddit. The right move is to map your sector’s signal architecture first – work out which sources AI is actually pulling from for queries in your category – and only then decide where to invest.

The method generalises. The specific sources don’t. We’ve written a longer take on what this study means for K&C and our clients – including the bits agencies will pretend they always knew.

Signal architecture – Seer’s frame, and how to use it

Signal architecture is the three-layer model Seer Interactive built their GEO Olympics Study around. It’s a useful frame. It is not a checklist. The layers Seer identified – entity authority (who is the brand?), third-party validation (who validates it?), and community discussion (who talks about it?) – are best treated as a diagnostic lens. Strong on all three, you appear in AI answers. Weak on any of them, you start losing ground. Weak on all three, you fall off Seer’s “Binary Cliff”.

Where this gets misread is on the specific tactics. Wikipedia, for example, is downstream of notability. It is not a tactic you can run. A Wikipedia page is the downstream effect of significant coverage in reliable third-party sources you don’t control – trade press, analyst reports, books, peer-reviewed work. Paid placement doesn’t count. Sponsored content doesn’t count. Your own blog doesn’t count. Your LinkedIn doesn’t count. If you are not already independently notable, do not try to write your own page. The community will spot it, delete it, and the brand will end up on the talk page as a cautionary example. Plenty of household names don’t have a Wikipedia entry and won’t get one until somebody else writes it.

Reddit and equivalent communities are a measurement signal, not a manipulation channel. K&C doesn’t run Reddit campaigns and won’t pretend to. Astroturfing kills brands and the sub-bans follow you. What is on offer is monitoring (what the relevant subs are saying about you), authentic engagement under real names from staff and founders who actually belong in those subs, and earned mention through products and content good enough that people share them organically. If a category doesn’t lean on Reddit, don’t invest there. Most categories don’t.

The diagnostic value is the work. Knowing whether your sector’s AI answers are being shaped by trade press, accreditation bodies, Wikipedia, Reddit, LinkedIn or analyst reports tells you where to put effort. Most businesses skip this step and invest in the wrong place – running content programmes for sources their category’s AI answers don’t pull from, and ignoring the ones that matter.

That mapping is the bit AVS is built to do. Three LLMs, a tailored prompt set for your business, the actual citations behind each answer, scored across our twelve pillars. If you want to know what your category’s signal architecture looks like, that is what the AVS Exec Brief is for – or read our manifesto piece on why AI search is about architecture, not campaigns.

What else do we know so far?

Drawing on our own audit data, published academic research, and analysis from across the GEO landscape, here are the patterns we're seeing. We've tried to be honest about confidence levels – some of this is well-established, some is emerging, and some is informed speculation.

Each LLM behaves differently – and the gap is widening

ChatGPT, Perplexity, Gemini, Claude, and Bing Copilot do not cite the same businesses for the same queries. Our audits consistently show that a business scoring well on one platform can be invisible on another. Perplexity tends to cite more sources explicitly. ChatGPT is more likely to synthesise without attribution. Google's AI Overviews favour content already ranking in traditional search. This means a single-platform GEO strategy is inherently fragile.

Confidence: High – observed consistently across K&C audits. Also noted by Muck Rack and academic GEO research (Georgia Tech, 2024).

Structured, authoritative content correlates with citation

Businesses that produce clear, well-structured content that directly answers common questions tend to be cited more often. This includes FAQ pages, "how it works" explainers, and content that uses schema markup. The Georgia Tech GEO research found that adding citations, quotations from authoritative sources, and statistics to content improved LLM citation rates by up to 40%. Content that reads like it was written for AI extraction – clear, factual, attributable – performs better than content written purely for human engagement.

Confidence: Medium-High – supported by academic research and our audit observations. The exact mechanisms remain unclear.

Third-party presence matters more than your own website

AI models don't just read your website. They've been trained on the entire web. Businesses that appear on trusted third-party domains – industry publications, review sites, Wikipedia, professional directories – tend to be cited more consistently. In our audits, businesses with strong earned media presence almost always outperform businesses that only invest in their own domain, even when the owned content is excellent. PR, in the traditional sense, may be one of the strongest GEO signals.

Confidence: Medium-High – consistent across audits. Also emphasised by Hotwire, C8 Consulting, and the broader PR industry's GEO research.

AI answers fluctuate – and that's normal

Run the same query on ChatGPT today and tomorrow, and you may get different brands cited. LLMs are probabilistic, not deterministic. Citation scores fluctuate weekly. This creates a measurement challenge: a single snapshot can be misleading. Meaningful trends only emerge over quarterly periods. Anyone claiming to offer real-time GEO tracking is measuring noise, not signal. This is why our methodology uses structured query frameworks across multiple time windows.

Confidence: High – fundamental to how LLMs work. Confirmed by every audit we've run.

GEO varies significantly by country – and almost nobody is measuring it

Our multi-country audits show that the same business can be recommended in the UK but invisible in Germany, or cited in the US but not in France. Language matters. Local sources matter. Regional search behaviour patterns influence AI training data. Most GEO services operate in English only, in a single market. This leaves a massive blind spot for any business operating internationally. We believe international GEO research is one of the most underexplored areas in the field.

Confidence: Medium – based on our multi-country audit data. Limited external validation because few others are doing this research.

Business narrative consistency amplifies citation

When a business tells the same story across multiple sources – website, press, industry directories, LinkedIn, review sites – AI models appear to develop a stronger "understanding" of what that business does and who it's for. Businesses with fragmented or contradictory positioning across different channels tend to receive weaker, less specific citations. The implication: GEO is partly a business consistency exercise.

Confidence: Medium – pattern observed in audits, particularly in competitor benchmarking. Causation not established.

What we're tracking

Our ongoing GEO research programme is designed around the questions that matter most to businesses. These are the themes we're actively investigating through our audit data and dedicated research queries.

Cross-LLM citation patterns

How do ChatGPT, Perplexity, Gemini, Claude, and Bing Copilot differ in what they cite and how? Which platforms are most volatile? Which are most consistent? When one platform starts citing a business, do others follow?

International citation variation

How do AI recommendations differ between the UK, US, Germany, France, and other markets? Do localised queries produce fundamentally different business recommendations? How much does language affect citation?

Content signals that lead to citation

What types of content correlate most strongly with AI citation? How important are schema markup, FAQ pages, structured data, and authoritative third-party mentions? Can we isolate individual signals?

Citation velocity and decay

How quickly does new content get picked up by AI platforms? How long does a citation last? Is there a "half-life" for AI visibility? What causes a business to drop out of AI recommendations?

Sector-specific GEO dynamics

Does GEO work differently in professional services vs. consumer businesses? How do B2B and B2C citation patterns differ? Are some sectors more "GEO-ready" than others?

The PR-GEO relationship

How directly does traditional PR activity translate to AI citations? Is earned media the strongest GEO signal? How long after publication does a press mention start appearing in AI answers?

Our approach to curation

We're not an academic institution. We're a commercial business that runs AI visibility audits. We curate GEO research from a specific perspective: we want to understand this landscape well enough to give our clients genuinely useful advice. That means sourcing rigorous work and being honest about what's proven versus what's still emerging.

How we source research

We monitor GEO research from academic institutions, industry platforms, and fellow practitioners. We analyse what's relevant to our clients – how AI citations work, what signals matter, how measurement should work. We combine published findings with observations from our own audit data. We prioritise international research and cross-platform analysis.

How we report findings

Every finding includes a confidence level. "High" means it's been observed consistently and backed by multiple sources. "Medium" means we've seen it in our audit data or industry analysis, but it's not yet settled. "Low" means it's emerging or contested. We're honest about what we actually know.

What we're tracking, not claiming

We don't claim to lead GEO research – that's community work. What we do is analyse high-quality research, measure how it plays out across platforms and countries, and share observations that help you navigate the GEO landscape. This curation is part of our service to clients.

GEO Research

Why we're curating GEO research

State media control measurably shapes what LLMs say

What the paper found

K&C commentary

Section 4: Relevance for PR & Communications

Two thirds of Google searches now end without a click

What it tells us

K&C commentary

agentsfriendly.ai: a readability check for AI visitors

The first proper clock on AI citation

What it tells us

Two clocks, one game – K&C commentary

AirOps: four numbers worth keeping

What it tells us

Why this is exactly how we built AVS – K&C commentary

Reddit’s CEO: “no Reddit, no AI”

What it tells us

K&C commentary

Google’s AI mentions report lands in Search Console

What it tells us

K&C commentary

Publishers draw the line: the CMA opt-out and Sulzberger’s “brazen theft”

What it tells us

K&C commentary

Is being cited even the point? Burson and AMEC weigh in

What it tells us

K&C commentary

State of Martech 2026 – Brinker’s AEO chapter

What it confirms

What this means for UK businesses – K&C commentary

AirOps – The Complete AI Search Playbook for Marketers

What it confirms

What this means for UK businesses – K&C commentary

Seer Interactive – GEO Olympics Study

What it confirms

What this means for UK businesses – K&C commentary

Signal architecture – Seer’s frame, and how to use it

What else do we know so far?

The landscape is moving fast

What we're tracking

Cross-LLM citation patterns

International citation variation

Content signals that lead to citation

Citation velocity and decay

Sector-specific GEO dynamics

The PR-GEO relationship

Our approach to curation

How we source research

How we report findings

What we're tracking, not claiming

The numbers behind GEO – in one place

Want to know what AI says about your business?

See how this research applies to you