What is result_source in ChatGPT?

result_source is an undocumented field in ChatGPT's web-search response stream. It is stamped on every web result ChatGPT retrieves and records which retrieval pipeline or vendor fetched the page. The values observed in our hotel data are labrador (a licensed, quality-gated content tier), bright (Bright Data structured datasets), oxylabs (scraped open web), and null (untagged). A serp value appears in other verticals but never in hotel queries. Across 30,002 tier-tagged hotel citations, 99.85% were labrador.

When did ChatGPT start using result_source?

In our capture history the field first appears on 26 May 2026, with the bright tier arriving 8 June and oxylabs 15 June. The rollout was not a clean switch: the share of citations carrying a tag rose from 0% before late May to a peak of 81.8% in the first week of June, then dropped to 7.2% and recovered to around 44% — a pattern consistent with a staged experiment being dialled in and out.

What is turn_use_case and why does it matter for SEO?

turn_use_case is ChatGPT's classification of a query before it searches, and it decides whether the web is touched at all. For hotel prompts the most common value is text (37.3%), which means no web search: the answer comes from training data. Most of the remainder splits between search (31.8%) and the local/maps pipeline (26.3%). The practical implication: for more than a third of questions, citability is irrelevant because no retrieval happens, and the wording of the query determines whether your content can be surfaced.

Which sources does ChatGPT trust most for hotels?

Within the licensed labrador tier, retrieved is not the same as cited. Official brand sites are cited most often when retrieved — Four Seasons 82% (press site 86%), Hyatt 84%, Hilton 80%, Ritz-Carlton 77%, Marriott 68%. Strong OTAs are mid-pack (Booking.com 60%, Expedia 29%, Hotels.com 23%). Aggregator listicles are retrieved in volume but rarely cited: hotelierschoice.com 2%, thehotelguru.com 1%, luxuryhotel.guru 2%, travelmyth.com 11%.

What is the difference between the labrador, bright, and oxylabs tiers?

They reflect how a page was acquired. labrador is the licensed, quality-gated tier (OTAs, big-brand chains and established editorial) and dominates at 99.85%. bright (Bright Data datasets) skews to structured directories such as Five Star Alliance and Tablet Hotels, plus the websites of individual luxury properties. oxylabs (scraped open web) is the long tail, including social posts and individual hotel sites. An individual hotel's own website or Instagram tends to reach ChatGPT through the scraper tiers, not the licensed labrador feed.

How should hotels optimize for ChatGPT based on this data?

First, make sure the query intent even triggers a web search: more than a third of hotel turns (37.3%) are answered from training data with no retrieval. Second, invest in first-party, brand-owned content: ChatGPT cites official brand sites far more than the aggregator listicles it retrieves and discards. Placement on a listicle gets your page retrieved but rarely cited, whereas a clean, parseable brand page is the highest-leverage asset for the vertical.

June 2026 · AI Search · Retrieval

ChatGPT's hidden `result_source`: how it really sources hotel answers

Name: ChatGPT result_source & turn_use_case — Hotel Search 2026
Creator: Nicolas Sitter
Published: 2026-06-25
License: https://creativecommons.org/licenses/by/4.0/

TL;DR: Every web page ChatGPT retrieves carries an undocumented tag, result_source, naming the pipeline that fetched it. Across 30,002 hotel citations, 99.85% come from one licensed tier (labrador) and the open-web serp tier never appears. A companion field, turn_use_case, decides whether the web is searched at all, and 37.3% of hotel questions never search. Within the licensed tier, ChatGPT cites official brand sites 68–86% of the time when it retrieves them, while discarding most of the aggregator listicles it retrieves.

50,899 ChatGPT captures

30,002 tagged citations

99.85% from labrador tier

37.3% of turns never search

Contents: Summary · 1. The hidden field · 2. The rollout · 3. The search gate · 4. Trusted sources · 5. Per-tier signature · Methodology · FAQ

Executive Summary

ChatGPT runs hotel search on a licensed retrieval tier — and tells you so in a field users never see.

ChatGPT's web-search stream stamps every retrieved page with result_source — the pipeline or vendor that fetched it — and classifies each query with turn_use_case before it decides to search. Neither is documented. We backfilled both across our full ChatGPT hotel capture history: 50,899 captures and 30,002 tier-tagged citations.

The picture is lopsided and actionable. Hotels are sourced almost entirely from the licensed labrador tier (99.85%); the open-web serp baseline never appears. The field is brand new (first seen 26 May 2026) and was rolled out intermittently. And inside the licensed tier, retrieved is not cited: official brand sites win, aggregator listicles get pulled and dropped.

Section 1

A three-tier sourcing system, hidden in plain sight

Every search_result block in ChatGPT's raw response carries a result_source tag next to the publisher attribution and publish date. It takes one of a small set of values — and for hotels, one of them dominates completely.

result_source distribution across 30,002 tier-tagged ChatGPT hotel citations.

result_source	Share of tagged citations	What it is
labrador	99.85%	Licensed / quality-gated content tier
bright	0.14%	Bright Data structured web datasets
oxylabs	0.01%	Scraped open web (Oxylabs is a scraping vendor)
serp	0%	Open-web baseline — never appears for hotels

What it looks like in the raw stream

// labrador — licensed editorial / OTA / brand content
{ ..., "pub_date":1777852800,
       "result_source":"labrador",
       "attribution":"discoverzermatt.com" }

// bright — Bright Data structured dataset (an individual luxury hotel)
{ ..., "result_source":"bright",
       "attribution":"Schweizerhof Zermatt" }

// oxylabs — scraped open web
{ ..., "result_source":"oxylabs",
       "attribution":"Tripadvisor" }

For hotel queries, ChatGPT answers almost entirely from a licensed content tier (labrador, 99.85%). The open-web serp tier that appears in other verticals never shows up: for hotels, the model is reading a curated feed, not the live SERP.
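As a rough check on the table above, the tier shares can be recomputed from per-citation rows. The sketch below is illustrative only: it assumes each citation has already been flattened into a dict with a `result_source` key, and it rebuilds the rows from the bright (42) and oxylabs (3) counts reported under Caveats. The labrador count of 29,957 is derived here as 30,002 minus those 45, on the assumption that no other tier appears among tagged citations. `tier_shares` is a hypothetical helper name.

```python
from collections import Counter

def tier_shares(results):
    """Share of each result_source value among rows that carry a non-null tag."""
    counts = Counter(
        r["result_source"] for r in results if r.get("result_source") is not None
    )
    total = sum(counts.values())
    return {tier: n / total for tier, n in counts.items()}

# Rows rebuilt from reported counts; labrador is derived (30,002 - 42 - 3).
rows = (
    [{"result_source": "labrador"}] * 29957
    + [{"result_source": "bright"}] * 42
    + [{"result_source": "oxylabs"}] * 3
)
shares_pct = {tier: round(share * 100, 2) for tier, share in tier_shares(rows).items()}
print(shares_pct)
```

Under those assumptions the rounded shares match the table: 99.85, 0.14 and 0.01 percent.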

Section 2

A brand-new field — switched on, off, then on again

Because we have citations from before the field existed, the share carrying a tag traces the rollout. It first appears on 26 May 2026, peaks the first week of June, then drops sharply — the signature of a staged experiment, not a permanent flip.

[Figure: share of ChatGPT hotel citations carrying a result_source tag over time, 2026]

The tag jumped from 0% to 82% in two weeks, then fell back to 7%. If you sample ChatGPT's internals on a handful of queries over a few days, you can easily catch a tier “everywhere” one week and “gone” the next. Population-scale capture is what separates a real behaviour from a snapshot artefact.
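One way to trace a rollout like this is to bucket citations by ISO week and measure the share that carry a non-null tag. The sketch below assumes each citation reduces to a `(capture_date, result_source_or_None)` pair. That row shape, the function name and the sample rows are all illustrative, not the study's data or pipeline.

```python
from collections import defaultdict
from datetime import date

def weekly_tagged_share(citations):
    """Map (iso_year, iso_week) to the share of citations with a non-null tag."""
    totals = defaultdict(int)
    tagged = defaultdict(int)
    for day, source in citations:
        iso = day.isocalendar()
        week = (iso[0], iso[1])  # ISO year and ISO week number
        totals[week] += 1
        if source is not None:
            tagged[week] += 1
    return {week: tagged[week] / totals[week] for week in sorted(totals)}

# Synthetic rows for illustration only.
sample = [
    (date(2026, 5, 18), None),
    (date(2026, 5, 19), None),
    (date(2026, 6, 1), "labrador"),
    (date(2026, 6, 2), "labrador"),
    (date(2026, 6, 3), None),
    (date(2026, 6, 4), "labrador"),
]
shares = weekly_tagged_share(sample)
```

Grouping by week rather than by day smooths over days with few captures, which matters when the tag is being switched on and off.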

Section 3

Does your question even reach the web?

Before any retrieval, turn_use_case files the query into a bucket that decides which pipelines fire. For hotels, the single most common bucket is text — which means no web search at all.

turn_use_case distribution across ChatGPT hotel captures.

turn_use_case	Share of turns	Hits the web?
text	37.3%	No — answered from training data
search	31.8%	Yes
local	26.3%	Yes (maps / places)
instant search	3.3%	Yes
instant answers	1.3%	Partial
thinking / unknown	0.1%	—

More than a third of hotel questions never search the web. For those, citability is irrelevant — the answer comes from what the model already memorised. The wording of the query, not just the topic, decides whether your content can be surfaced at all.
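The table can be collapsed into a simple "does it hit the web?" split. This is a minimal sketch using the percentages and web-access labels exactly as printed above. The bucket names are the table's display labels and may not match the raw field values, and `share_by_web_access` is a hypothetical helper.

```python
# Percent of hotel turns per bucket, copied from the table above.
TURN_USE_CASE_SHARE = {
    "text": 37.3,
    "search": 31.8,
    "local": 26.3,
    "instant search": 3.3,
    "instant answers": 1.3,
    "thinking / unknown": 0.1,
}

# Web access per bucket as reported in the table; None means "not stated".
HITS_WEB = {
    "text": "no",
    "search": "yes",
    "local": "yes",
    "instant search": "yes",
    "instant answers": "partial",
    "thinking / unknown": None,
}

def share_by_web_access(shares, access):
    """Sum bucket percentages by web-access label, rounded to one decimal."""
    totals = {}
    for bucket, pct in shares.items():
        label = access[bucket]
        totals[label] = round(totals.get(label, 0.0) + pct, 1)
    return totals

web_split = share_by_web_access(TURN_USE_CASE_SHARE, HITS_WEB)
print(web_split)
```

On these figures roughly 61% of turns run a full web search and 37.3% run none. The printed shares sum to 100.1% because of rounding in the source table.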

Section 4

Retrieved ≠ cited: who ChatGPT actually trusts

These are the top domains ChatGPT retrieves inside the labrador tier, with the share of those retrievals it actually cites. The gap is the whole story.

[Figure: Retrieved vs cited rate for top labrador hotel domains]

Top labrador-tier hotel domains: times retrieved vs share actually cited.

Domain	Retrieved	Cited %	Type
tripadvisor.com	2,209	54%	Review aggregator
marriott.com	1,019	68%	Brand
expedia.com	926	29%	OTA
booking.com	632	60%	OTA
oyster.com	411	20%	Aggregator
hyatt.com	399	84%	Brand
hilton.com	360	80%	Brand
fourseasons.com	321	82%	Brand

Own your brand content. Official brand sites are cited 68–86% of the time when retrieved (Marriott is the low end at 68%; the others sit at 77–86%); aggregator listicles 1–11%. ChatGPT pulls the listicles, then leans on the official source. For hotels, a clean first-party page beats a placement on a “best hotels” roundup.
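To compare publisher types rather than single domains, the table rows can be pooled into a retrieval-weighted cited rate per type. This is a sketch under stated assumptions: cited counts are estimated as retrieved × cited %, so they inherit the rounding of the published percentages, and `cited_rate_by_type` is a hypothetical helper.

```python
from collections import defaultdict

# (domain, times retrieved, cited %, type), copied from the table above.
ROWS = [
    ("tripadvisor.com", 2209, 54, "Review aggregator"),
    ("marriott.com", 1019, 68, "Brand"),
    ("expedia.com", 926, 29, "OTA"),
    ("booking.com", 632, 60, "OTA"),
    ("oyster.com", 411, 20, "Aggregator"),
    ("hyatt.com", 399, 84, "Brand"),
    ("hilton.com", 360, 80, "Brand"),
    ("fourseasons.com", 321, 82, "Brand"),
]

def cited_rate_by_type(rows):
    """Retrieval-weighted cited rate per publisher type (estimated counts)."""
    retrieved = defaultdict(int)
    cited = defaultdict(float)
    for _domain, n_retrieved, cited_pct, kind in rows:
        retrieved[kind] += n_retrieved
        cited[kind] += n_retrieved * cited_pct / 100  # estimated cited count
    return {kind: cited[kind] / retrieved[kind] for kind in retrieved}

rates = cited_rate_by_type(ROWS)
for kind, rate in sorted(rates.items(), key=lambda item: -item[1]):
    print(f"{kind}: {rate:.0%}")
```

Weighting by retrievals keeps a high-volume domain such as marriott.com from being averaged on equal terms with a lower-volume one.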

Section 5

Each tier has a different publisher fingerprint

The three tiers aren't redundant — they map to how a page was acquired, and it shows in what each one carries.

Per-tier publisher signature observed in the ChatGPT hotel data.

Tier	Character	Representative sources
labrador	Licensed / quality-gated — OTAs, big-brand chains, established editorial	tripadvisor.com, marriott.com, booking.com, expedia.com, hyatt.com, hilton.com, fourseasons.com, timeout.com
bright	Bright Data structured datasets — premium directories + individual luxury properties	Five Star Alliance, Tablet Hotels, Forbes Travel Guide, montcervinpalace.ch, tajhotels.com, agoda.com
oxylabs	Scraped open web — long tail incl. social + individual hotel sites	instagram.com, resortpass.com, thebarnett.com

The plumbing tell: an individual hotel's own website or its Instagram reaches ChatGPT through the scraper tiers (bright / oxylabs), not the licensed labrador feed. Big brands and OTAs are licensed in; everyone else is crawled ad-hoc.

Methodology

Study Design

Data Collection

50,899 ChatGPT hotel captures via Bright Data, 25 Dec 2025 – 22 Jun 2026.
result_source and turn_use_case parsed out of the raw SSE response stream and joined to 30,002 flattened per-citation rows.
ChatGPT-only — no equivalent field is emitted by Gemini, Perplexity, Copilot or Google AI Mode.
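The parsing step can be sketched roughly as follows. The sketch assumes each `data:` line of the SSE stream holds one complete JSON object, and it walks the decoded payload recursively because the real nesting of search results is not documented here. The function names and the demo stream (including the `payload` and `items` keys) are made up for illustration.

```python
import json

def iter_sse_payloads(raw_stream):
    """Yield decoded JSON payloads from the data: lines of an SSE stream."""
    for line in raw_stream.splitlines():
        if not line.startswith("data:"):
            continue  # skip event:, id:, comments and blank separators
        body = line[len("data:"):].strip()
        try:
            yield json.loads(body)
        except json.JSONDecodeError:
            continue  # skip sentinels, keep-alives or partial frames

def find_tagged_results(node):
    """Recursively yield every dict that has a result_source key."""
    if isinstance(node, dict):
        if "result_source" in node:
            yield node
        for value in node.values():
            yield from find_tagged_results(value)
    elif isinstance(node, list):
        for item in node:
            yield from find_tagged_results(item)

# Made-up stream: the nesting is hypothetical, only result_source is from the report.
demo = (
    'event: delta\n'
    'data: {"payload": {"items": [{"attribution": "example.com", "result_source": "labrador"}]}}\n'
    '\n'
    'data: {"payload": {"items": [{"attribution": "Example Hotel", "result_source": "bright"}]}}\n'
    '\n'
    'data: not-json\n'
)
tags = [
    hit["result_source"]
    for payload in iter_sse_payloads(demo)
    for hit in find_tagged_results(payload)
]
```

A recursive walk is slower than indexing a known path, but it keeps working if the surrounding structure changes between captures.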

Caveats

One vertical (hotels) and one collection method; tier mix is query-type dependent.
serp never appears for hotels (0 of 30,002) but may surface in other verticals.
The field is ~4 weeks old and intermittent; bright (n=42) and oxylabs (n=3) samples are tiny — directional, not definitive.
Cited % is share-of-retrievals-cited, not a ranking guarantee.
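To see why the small-tier samples are only directional, compare the uncertainty on one observed proportion at n=3 and n=42. The sketch below uses the standard Wilson score interval. The one-third proportion is a hypothetical value chosen for illustration, not a figure from the data.

```python
import math

def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 is roughly 95%)."""
    if n <= 0:
        raise ValueError("n must be positive")
    p = successes / n
    denom = 1 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return max(0.0, centre - half), min(1.0, centre + half)

# Same hypothetical within-tier proportion (one third) at both sample sizes.
lo3, hi3 = wilson_interval(1, 3)      # oxylabs-sized sample, n=3
lo42, hi42 = wilson_interval(14, 42)  # bright-sized sample, n=42
print(f"n=3:  {lo3:.2f} to {hi3:.2f}")
print(f"n=42: {lo42:.2f} to {hi42:.2f}")
```

At n=3 the interval spans most of the range from 0 to 1, so any per-publisher pattern inside the oxylabs tier is best read as a hint.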

Open data. Headline stats and the underlying tables are published as CSV: summary.csv, weekly_rollout.csv, labrador_top_sources.csv, turn_use_case.csv.