AI presents a conversational expertise. We use LLMs via chatbots. However nobody has but checked out how citations and mentions evolve in a dialog.
I analyzed knowledge from the Semrush AI Visibility Toolkit to assessment 20 purchaser journeys throughout 4 completely different verticals to check excessive vs. low reasoning for ChatGPT5.2.
On this evaluation:
- Why excessive reasoning cites an almost completely different net (solely 25.6% area overlap with minimal) and which supply varieties achieve or lose floor.
- Why TOFU content material has a payoff once more: Grands cited on the Downside stage usually tend to persist all the best way to Choice beneath excessive reasoning, and by no means beneath minimal.
- Easy methods to cut up your immediate monitoring by reasoning mode so your AI visibility reporting displays 2 completely different techniques, not an averaged one.
Methodology
Knowledge comes from the Semrush AI Visibility Toolkit, which captures the prompts, citations, and fan-out queries ChatGPT generates per response.
- We ran 100 prompts twice via GPT-5.2, as soon as with minimal reasoning and as soon as with excessive reasoning, for 200 whole responses.
- Prompts span 20 purchaser journeys throughout 4 classes (B2B SaaS, Finance, Client Tech, Well being/Life-style), with 5 phases per journey: Downside, Exploration, Comparability, Validation, Choice.
- Quotation charge is the share of prompts the place the response cited not less than one exterior supply.
- Common quotation counts sources per cited response.
- Fan-out queries are the sub-queries the mannequin fires internally to analysis the immediate earlier than answering, surfaced through the Semrush API.
GPT 5.2’s excessive reasoning cites and searches extra
Flip excessive reasoning on, and the quotation charge jumps from 50% to 68% (+18 proportion factors), the typical sources per response practically doubles (2.6 to 4.5), and fan-out queries go up 4.6x. Excessive reasoning additionally pulls from 173 distinctive domains throughout the check set vs. 127 for minimal; 99 of these domains by no means seem beneath minimal reasoning.


*Quotation Fee is outlined because the share of prompts the place the response cited not less than one exterior supply.


That is grounding at its best. When the mannequin thinks more durable, it depends extra on net search. Reasoning performs a significant function in model visibility, although we don’t know what number of customers activate reasoning vs not.
Question intent is a cleaner proxy than person demographics. Free-tier customers have reasoning entry too, simply rate-limited, and ChatGPT auto-routes onerous prompts to Pondering mode with out the person clicking something. So the query isn’t who can afford reasoning. It’s which prompts set off reasoning robotically.
Multi-criteria comparisons, analysis frameworks, regulatory and compliance questions, and complicated procuring builds are the prompts probably to fireside reasoning no matter plan. Map your viewers by question kind, not by paywall standing.
Excessive reasoning fires extra fan-out queries deeper within the funnel
Customers transfer via problem-solving and buy selections in phases, typically inside the similar dialog. The hole between minimal and excessive reasoning isn’t fixed. It scales with the place the person sits within the journey.
What the 5 phases appear like in apply. Take a purchaser evaluating CRM software program:
- Downside: “How do I do know if my gross sales workforce wants a CRM?”
- Exploration: “What varieties of CRM software program exist for B2B SaaS?”
- Comparability: “HubSpot vs. Salesforce vs. Pipedrive for a 50-person gross sales workforce.”
- Validation: “Is HubSpot definitely worth the value for mid-market B2B?”
- Choice: “How do I get began with HubSpot Gross sales Hub?”
The three patterns maintain throughout all 20 journeys:
- Quotation charge climbs via the funnel beneath each modes, however excessive reasoning closes the early-stage hole most aggressively: +35pp at Downside, solely +5pp at Validation. The mannequin treats early-funnel questions as analysis duties when excessive reasoning is on, whereas it answers-from-memory when it’s off.
- Fan-out queries peak at Comparability. Excessive reasoning fires 24 sub-queries per response there vs. 5.5 for minimal. Choice runs 15.4 vs. 2.6.
- Common citations per response peaks at Comparability (9.8 excessive, 5.8 minimal) and narrows at Choice (4.7 excessive, 2.6 minimal). The mannequin resembles an hourglass throughout funnel phases.


On the mixture degree, minimal reasoning fires 245 search queries throughout 100 prompts. Excessive reasoning fires 1,130. When the mannequin operates with excessive reasoning, it runs a mini investigation per immediate, and a lot of the investigation occurs on the Comparability and Choice phases.


What does a fan-out truly appear like?
A B2B SaaS immediate beneath excessive reasoning evaluating Salesforce, HubSpot, and Pipedrive for a 50-person gross sales workforce breaks into separate queries about API charge limits per vendor, SOC 2 / ISO 27001 compliance, SAML/SSO/SCIM help, webhook structure, OAuth move, developer documentation, enterprise pricing tiers, and change-data-capture help. Every turns into its personal retrieval. The model that wins the reply is the one whose documentation surfaces clear for every sub-query, not the one which ranks for the mum or dad immediate.


The Choice stage has the widest per-response question variance: 0 to 40 fan-out queries on the identical five-stage cohort. The driving force is immediate specificity. Bounded prompts (like “ought to I finance via the vendor at 0% APR or use a financial institution?” or “draft an RFP to three website positioning companies”) run zero queries as a result of the reply’s construction is given. Open-ended product builds (“procuring checklist for a $3,000 house fitness center” or “which journey card ecosystem matches our grocery spending?”) run 28 to 40 queries. The Choice stage isn’t bounded by one kind of query, and the mannequin’s analysis effort tracks what number of levels of freedom the immediate leaves on the desk.
| Stage | Minimal: Avg queries | Excessive: Avg queries |
| Downside | 0.0 | 5.2 |
| Exploration | 0.8 | 2.6 |
| Comparability | 5.5 | 24.1 |
| Validation | 3.4 | 9.1 |
| Choice | 2.6 | 15.4 |
For entrepreneurs: Early-funnel visibility is a reasoning-mode story. In case your consumers use ChatGPT with reasoning on, problem-stage, and exploration-stage content material is in play. In the event that they don’t, you’re successfully invisible till Comparability.
Reasoning impacts how manufacturers seem in a dialog
An LLM session is a dialog, not a single question. The query that it opens up: Does a model cited initially of the journey carry via to the tip? If sure, early-funnel visibility compounds. If not, each stage is a recent combat.
When a model will get cited within the Downside stage (step 1), does it survive to the Choice stage (step 5)? When utilizing minimal reasoning: No. Zero journeys present this sort of persistence. In excessive reasoning: Sure. Model continuity is maintained in 4 journeys throughout all 5 phases.
Inside a single response, excessive reasoning additionally anchors more durable on particular person sources. 51 of 100 high-reasoning responses cite the identical area greater than as soon as in the identical reply, vs. 26 of 100 for minimal. Excessive reasoning quotes a supply repeatedly when it commits to it.
Model mentions inform a softer model of the identical story. In the event you loosen the check from cited area to model named within the reply textual content, persistence reveals up in 3 high-reasoning journeys (HubSpot throughout CRM Choice, American Specific throughout Enterprise Credit score Playing cards, Sony and Canon throughout Mirrorless Digital camera) and a couple of minimal-reasoning journeys (HubSpot, Mercury). Client Tech reveals up right here regardless that it doesn’t present up within the quotation persistence desk. Manufacturers like Sony and Canon are talked about via the dialog with out the mannequin linking out to them, which is its personal type of class dominance and price monitoring individually.


Excessive reasoning builds a constant psychological mannequin of the answer house all through a session. The headline discovering: TOFU prompts have worth. If a model reveals up on the Downside stage, it tends to hold via to Choice. Prime-of-funnel content material isn’t simply model consciousness for AI visibility. It’s a number one indicator of the place the mannequin lands at determination time.
Two extra implications:
- All 4 persistent journeys are in Finance, which suggests persistence rides on the identical authoritative-source content material (regulatory pages, official model websites) that drives the +28pp Finance raise general.
- For entrepreneurs working an account-based or category-creation play, reasoning-mode visibility is the prize. It’s the one mode the place early-funnel content material compounds into selection-stage citations.
Reasoning mode is a separate search engine
The model that wins beneath minimal reasoning isn’t the model that wins beneath excessive reasoning: 3 in 4 cited domains are completely different. The combination of supply varieties is completely different. The phases the place citations seem are completely different.
I’m enthusiastic about two findings particularly from this evaluation:
The primary is measurement. We have to observe low vs. excessive reasoning in our immediate trackers. It’s greatest to keep away from an mixture view as a result of the mechanisms are actually completely different.
Dangerous information: This provides extra effort and price to immediate monitoring. Excellent news: We will make immediate monitoring much more correct.
The second is funnel phases. Within the latest AI Mode user behavior study, I discovered that customers react strongly to shortlists, demonstrating an identical habits seen with Google’s basic search outcomes the place the highest outcome issues most. That outcome made it appear to me that specializing in BOFU prompts that return shortlists is the sport.
Nevertheless, now we all know there may be worth in TOFU prompts due to persistence: Manufacturers that seem early within the purchaser journey can persist right through. One of the best ways to search out that out for your self is to map purchaser journeys and observe your persistence.
This publish first appeared on the creator’s web site and is republished right here with permission.
Contributing authors are invited to create content material for Search Engine Land and are chosen for his or her experience and contribution to the search group. Our contributors work beneath the oversight of the editorial staff and contributions are checked for high quality and relevance to our readers. Search Engine Land is owned by Semrush. Contributor was not requested to make any direct or oblique mentions of Semrush. The opinions they categorical are their very own.
#model #visibility #thinks #more durable

