Blog

From SEO to Survival: The Three Biggest LLM Questions Leaders Can’t Ignore
The buzz around generative AI is impossible to ignore. With McKinsey estimating it could add between $2.6 and $4.4 trillion in value to the global economy each year, it’s no wonder leaders are feeling the pressure to get their strategy right.

But where do you even begin?

We sat down with Dan, one of our in-house experts on LLM SEO, to cut through the noise and map out a practical path forward for any brand navigating this new landscape.

Q1. What’s your advice for a business leader who is just starting to think about what LLMs mean for their company?

Dan: The honest answer? Start with a dose of humility and a lot of measurement. There’s a ton of confident commentary out there, but the truth is, even the people building these models acknowledge they can’t always interpret exactly how an answer is produced. So, treat any strong claims with caution.

Instead of getting caught up in speculation, get a concrete baseline. Ask yourself: for the questions and topics that matter to our business, do the major LLMs mention us? Where do we show up, and how do we rank against our competitors? We call this a “visibility score.” It takes the conversation from abstract theory to a tangible map you can actually work with.
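
As a rough illustration, a visibility score can be as simple as the share of tracked prompts whose answers mention your brand. The sketch below assumes you have already collected the answer text for each prompt; the brand names, sample answers, and the `visibility_score` helper are all hypothetical, and this is one possible scoring scheme rather than a standard formula.

```python
import re
from collections import Counter

def visibility_score(answers, brands):
    """Return, per brand, the share of answers that mention it at least once."""
    hits = Counter()
    for text in answers:
        for brand in brands:
            # Whole-word, case-insensitive match so "Acme" does not match "Acmeville".
            if re.search(rf"\b{re.escape(brand)}\b", text, flags=re.IGNORECASE):
                hits[brand] += 1
    total = len(answers)
    return {brand: (hits[brand] / total if total else 0.0) for brand in brands}

# Hypothetical answers collected from one model for four tracked prompts.
sample_answers = [
    "Popular options include Acme and Globex.",
    "Many teams choose Globex for this.",
    "Acme is a common pick for small businesses.",
    "There is no single best tool.",
]
scores = visibility_score(sample_answers, ["Acme", "Globex", "Initech"])
for brand, share in sorted(scores.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{brand}: {share:.0%}")
```

Running the same prompt set on a schedule, per model, turns this single number into a trend line you can compare against competitors.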

If you’re wondering why this is urgent, two external signals make it crystal clear. First, Gartner predicts that by 2026, traditional search engine volume could drop by 25% as people shift to AI-powered answer engines. That’s a fundamental shift in how customers will discover you.

Second, the investment and adoption curves are only getting steeper. Stanford’s latest AI Index shows that funding for generative AI is still surging, even as overall private investment in AI dipped. Together, these trends tell us that your brand’s visibility inside LLMs is going to matter more and more with each passing quarter.

Q2. Once you know your visibility baseline, what should you do to move the needle?

Dan: Think in two horizons:

The model horizon (slow).

Core LLMs are trained and fine-tuned over long cycles. Influence here is indirect: you need a strong, persistent digital footprint that becomes part of the training corpus. This is where the classic disciplines of SEO, digital PR, and authoritative content publishing still matter. High-quality, well-cited articles, consistent mentions in credible outlets, and technically sound pages are your insurance policy: they make it more likely that when the next model is trained, your brand is part of its “memory.”

The retrieval horizon (fast).

This is where you can act immediately. Most assistants also rely on Retrieval-Augmented Generation (RAG) to pull in fresh sources at query time. The original RAG research showed how retrieval improves factuality and specificity compared to parametric-only answers. That means if you’re not in the sources LLMs retrieve from, you’re invisible, no matter how strong your legacy SEO is.

This is why reverse-engineering how models answer today’s queries gives you a strategic, real-world data point. By mapping which URLs, articles, and publishers are cited in your category, you uncover the blueprint of what LLMs value: the content structures, schemas, and PR signals they consistently lean on.
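
One simple way to start that mapping is to tally the cited URLs by domain. The sketch below is a minimal example under stated assumptions: the URL list is hypothetical, and `top_cited_domains` is an illustrative helper, not part of any particular tool.

```python
from collections import Counter
from urllib.parse import urlparse

def top_cited_domains(cited_urls, n=3):
    """Count citations per domain, treating 'www.example.com' and 'example.com' as one."""
    counts = Counter()
    for url in cited_urls:
        host = urlparse(url).netloc.lower()
        if host.startswith("www."):
            host = host[4:]
        if host:
            counts[host] += 1
    return counts.most_common(n)

# Hypothetical citations collected for one category of prompts.
cited = [
    "https://www.example-reviews.com/best-tools",
    "https://example-reviews.com/pricing-guide",
    "https://en.wikipedia.org/wiki/Example",
    "https://forum.example.org/thread/42",
    "https://en.wikipedia.org/wiki/Another_example",
    "https://example-reviews.com/faq",
]
top = top_cited_domains(cited)
print(top)
```

The domains at the top of this list are the publishers worth studying first for structure, schema, and PR signals.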

From there, your levers become clear:

Digital PR – Ensure your brand is mentioned in trusted publications and industry sources that models are already surfacing.
SEO – Maintain technically flawless pages with schema, structured data, and crawlability, making your content easy for retrieval pipelines.
Content strategy – Match the formats models prefer (lists, tables, FAQs, authoritative explainers), and systematically fill topical gaps.
Analytics – Track citations, rank shifts, and model updates to iterate quickly.
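
To make the schema lever concrete, here is a minimal sketch that builds schema.org `FAQPage` markup as JSON-LD. The questions and answers are hypothetical, `faq_jsonld` is an illustrative helper, and whether a given assistant actually reads this markup is an assumption you would need to test for your own category.

```python
import json

def faq_jsonld(qa_pairs):
    """Build schema.org FAQPage markup (JSON-LD) from (question, answer) pairs."""
    doc = {
        "@context": "https://schema.org",
        "@type": "FAQPage",
        "mainEntity": [
            {
                "@type": "Question",
                "name": question,
                "acceptedAnswer": {"@type": "Answer", "text": answer},
            }
            for question, answer in qa_pairs
        ],
    }
    return json.dumps(doc, indent=2)

# Hypothetical questions and answers for illustration.
markup = faq_jsonld([
    ("What is a visibility score?",
     "The share of tracked prompts where a model mentions your brand."),
    ("How often should we measure it?",
     "On a regular schedule, and again after major model updates."),
])
# Embed the result in the page inside a <script type="application/ld+json"> tag.
print(markup)
```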

Q3. Let’s say you’ve mapped your visibility, identified the gaps, and set your priorities. What do you do on Monday morning?

Dan: This is where you turn your analysis into action with briefs and experiments.

First, audit what the models are already rewarding. Look at the URLs they cite as sources for answers on your key topics. For each one, study its:

Structure: Does it have clear headings, tables, lists, and direct answers to common questions?
Technical setup: How is its metadata, schema, and internal linking structured? Is it easy to crawl?
Depth and coverage: How thoroughly does it cover the topic? Does it include definitions, practical steps, and well-supported claims?

Doing this at scale can be tedious, which is why we use tools like Spotlight to analyse hundreds of URLs at once and find the common patterns.

Next, create a “best-of” content brief. Let’s say for a key topic, ChatGPT and other AIs consistently cite five different listicles. Compare them side-by-side and merge their best attributes into a single master blueprint for your content team. This spec should include required sections, key questions to answer, table layouts, reference styles, and any recurring themes or entities that appear in the high-ranking sources. You’re essentially reverse-engineering success.

Then, fill the gaps the models reveal. If you notice that AI retrieval consistently struggles to find good material on a certain subtopic (maybe the data is thin, outdated, or just not there), create focused content that fills that void. RAG systems tend to favour sources that are trustworthy, specific, and easy to break into digestible chunks. The original RAG research points the same way: answers grounded in retrieved passages were more specific and factual than answers generated from the model’s parameters alone.
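
To see what “easy to break into digestible chunks” can mean in practice, the sketch below splits a page into one chunk per heading and flags sections that exceed a size budget. It is an illustration only: real retrieval pipelines chunk in many different ways, and the `#` heading convention, the 500-character budget, and the sample page are all assumptions.

```python
def chunk_by_heading(text, max_chars=500):
    """Split text into (heading, body, fits_budget) chunks, one per '#' heading."""
    chunks = []
    heading, lines = "Introduction", []
    for line in text.splitlines():
        if line.startswith("#"):
            # A new heading closes the previous chunk, if it has any body text.
            if lines:
                chunks.append((heading, " ".join(lines)))
            heading, lines = line.lstrip("#").strip(), []
        elif line.strip():
            lines.append(line.strip())
    if lines:
        chunks.append((heading, " ".join(lines)))
    # Flag chunks that exceed the budget; those sections may need splitting.
    return [(h, body, len(body) <= max_chars) for h, body in chunks]

# Hypothetical page content for illustration.
page = """# What is a visibility score?
It is the share of tracked prompts where a model mentions your brand.

# How do I measure it?
Collect answers for a fixed prompt set.
Count the answers that mention each brand.
"""
chunks = chunk_by_heading(page)
for heading, body, fits in chunks:
    print(heading, "|", len(body), "chars |", "ok" if fits else "too long")
```

A page whose sections each answer one question under a descriptive heading produces chunks that still make sense when read in isolation.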

Finally, instrument everything and track your progress. Treat this like a product development cycle:

Track how your new and updated content performs over time in model answers and citations.
Tag your content by topic, format, and schema so you can see which features are most likely to get you included in an AI’s answer.
Keep an eye out for confounding variables, like major model updates or changes to your own site, and make a note of them.
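
The tagging step above can be sketched as a small report: for each tag, what share of pages carrying it were cited at least once? The page records and the `citation_rate_by_tag` helper below are hypothetical, and a higher rate for a tag is a correlation to investigate, not proof that the feature caused the citation.

```python
from collections import defaultdict

def citation_rate_by_tag(pages):
    """For each tag, return the share of tagged pages that were cited at least once."""
    cited = defaultdict(int)
    total = defaultdict(int)
    for page in pages:
        for tag in page["tags"]:
            total[tag] += 1
            if page["citations"] > 0:
                cited[tag] += 1
    return {tag: cited[tag] / total[tag] for tag in total}

# Hypothetical tracking data: one record per page, with its tags and citation count.
pages = [
    {"url": "/guide-a", "tags": ["faq", "table"], "citations": 3},
    {"url": "/guide-b", "tags": ["faq"], "citations": 0},
    {"url": "/list-c", "tags": ["listicle", "table"], "citations": 1},
    {"url": "/post-d", "tags": ["listicle"], "citations": 0},
]
rates = citation_rate_by_tag(pages)
for tag, rate in sorted(rates.items()):
    print(f"{tag}: {rate:.0%} of pages cited")
```

Recording the date of each model update or site change next to these snapshots makes it easier to separate real effects from confounders.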

This is critical because the landscape is shifting fast. That Gartner forecast suggests your organic traffic mix is going to change significantly. By reporting on your LLM visibility alongside classic SEO metrics, you can keep your stakeholders informed and aligned. You should get into a rhythm of constant experimentation. The AI Index and McKinsey reports both point to rapid, compounding change. Run small, fast tests: tweak your content structure, add answer boxes and tables, tighten up your citations, and see what moves the needle. Think of 2025 as the year you build your playbook, so that by 2026 you’re operating from a position of strength, not starting from scratch.

Closing Thoughts

Winning visibility in LLMs is about adapting to a fundamental shift in how people access knowledge and how machines assemble information. The path forward starts with three simple questions: Where do you stand today? Which levers can you pull right now? And how do you turn those levers into measurable experiments?

The data is clear: the value on the table is enormous, your competitors are already moving, and the centre of gravity for discovery is shifting toward answer engines. The brands that build evidence-based content systems and learn to iterate in this new environment will gain a durable advantage as the market resets.

Evidence & Sources
- $2.6–$4.4T in Annual Value: McKinsey estimates generative AI could add this much value per year across 63 different use cases. Source: McKinsey & Company, “The economic potential of generative AI: The next productivity frontier,” June 2023. https://www.mckinsey.com/capabilities/mckinsey-digital/our-insights/the-economic-potential-of-generative-ai-the-next-productivity-frontier
- Search is Shifting to Answers: Gartner forecasts that traditional search engine volume will drop by about 25% by 2026 as users move to AI chatbots and agents. Source: Gartner, “Gartner Predicts Search Engine Volume Will Drop 25 Percent by 2026,” April 2024. https://www.gartner.com/en/newsroom/press-releases/2024-04-17-gartner-predicts-search-engine-volume-will-drop-25-percent-by-2026-due-to-ai-chatbots-and-other-virtual-agents
- Enterprise Adoption is Real: IBM’s Global AI Adoption Index reports that 42% of large companies have already deployed AI, with another 40% in the exploration or experimentation phase. Source: IBM, “Global AI Adoption Index 2023,” January 2024. https://newsroom.ibm.com/2024-01-17-IBM-s-Global-AI-Adoption-Index-2023-Finds-AI-Adoption-is-Steady,-But-Barriers-to-Entry-Remain-for-the-40-of-Organizations-Still-on-the-Sidelines
- GenAI Investment Keeps Surging: Stanford HAI’s 2024 AI Index Report found private investment in generative AI soared in 2023, reaching $25.2 billion—nearly 8 times the investment level of 2022. Source: Stanford University, “Artificial Intelligence Index Report 2024,” April 2024. https://aiindex.stanford.edu/report/
- Why RAG Matters: The original Retrieval-Augmented Generation research showed that models produce more specific and factual answers when they can pull in fresh, retrieved sources—a foundational concept for any near-term brand visibility strategy. Source: Lewis, et al., “Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks,” May 2020. https://arxiv.org/abs/2005.11401

October 3, 2025

What Content Types Do LLMs Prefer? A Data-Driven Analysis

Key Question: Can we tell what type of content LLMs prefer? For example, are LLMs likely to prefer content that has a combination of video, images, reviews, etc.? We analyzed over 1.2 million citations from 8 different LLMs to find out.

Methodology

This analysis is based on data from Spotlight’s database, which tracks how different LLMs cite content in their responses. We analyzed:

1,684 source analyses from Gemini 2.0 Flash, examining detailed content characteristics
1.2+ million response links from 8 different LLMs (ChatGPT, Gemini, Perplexity, Claude, Copilot, Grok, AIO, and AIMode)
Content preferences across visual elements, structure, depth, and source types

The Universal Content Preferences

Our analysis reveals that LLMs have remarkably consistent preferences when it comes to content types. Here’s what we found across all models:

95.13% of analyzed content contains images

90.62% of content uses bullet points or lists

78.80% of content includes visual data (images/videos)

74.76% of content shows author credentials

LLM-Specific Content Preferences

ChatGPT: The Wikipedia Champion

Total Citations: 290,493

Top Preference: Wikipedia dominates with 20,309 citations (7% of all ChatGPT citations)

Key Insight:

ChatGPT shows the highest preference for .org domains (10.29%) and academic sources, suggesting a preference for authoritative, well-sourced content.

Content Type Breakdown:

Guide/Tutorial content: 12.45%
Blog content: 11.23%
Listicle format: 12.19%

Perplexity: The Social Media Enthusiast

Total Citations: 445,176 (highest among all LLMs)

Top Preference: Reddit dominates with 13,614 citations

Key Insight:

Perplexity shows the strongest preference for user-generated content and social platforms, with Reddit, YouTube, and Google Play Store being top sources.

Content Type Breakdown:

Blog content: 17.95%
Guide/Tutorial content: 14.66%
Listicle format: 9.10%

Gemini: The Google Ecosystem Expert

Total Citations: 328,134

Top Preference: Google Play Store with 3,745 citations

Key Insight:

Gemini heavily favors Google’s own properties and services, with Google Play, YouTube, and Google’s AI search being top sources.

Content Type Breakdown:

Guide/Tutorial content: 14.89%
Blog content: 16.87%
Listicle format: 9.31%

Claude: The UK-Focused Specialist

Total Citations: 460 (smallest dataset)

Top Preference: Wise.com with 26 citations

Key Insight:

Claude shows a strong preference for UK-based financial services and consumer advice sites, with 37.61% of citations from .co.uk domains.

Content Type Breakdown:

Guide/Tutorial content: 23.70%
Blog content: 22.17%
Listicle format: 15.22%

Copilot: The E-commerce Expert

Total Citations: 10,450

Top Preference: Amazon with 568 citations

Key Insight:

Copilot shows the strongest preference for e-commerce platforms, with Amazon, Walmart, and Target being top sources.

Content Type Breakdown:

Listicle format: 14.99%
Blog content: 13.07%
Guide/Tutorial content: 11.03%

Grok: The X (Twitter) Native

Total Citations: 2,566

Top Preference: X.com (formerly Twitter) with 732 citations

Key Insight:

Grok shows the highest preference for .com domains (81.49%) and heavily favors its parent company’s platform, X.com.

Content Type Breakdown:

Blog content: 12.98%
Guide/Tutorial content: 10.68%
Listicle format: 5.07%

Content Characteristics That Matter Most

Based on 1,684 source analyses from Gemini 2.0 Flash, these are the characteristics that appear most often in LLM-cited content:

Characteristic	Percentage	What This Means
Images Present	95.13%	Visual content is nearly universal in cited content
Uses Bullet Points	90.62%	Structured, scannable content is preferred
Visual Data (Images/Videos)	78.80%	Multimedia content is highly valued
Author Credentials	74.76%	Credibility and expertise matter
Uses Opinions	64.85%	Subjective insights are valued alongside facts
Corporate Website	61.28%	Official brand sources are heavily cited
Signs of Agenda	60.27%	Content with clear purpose/intent is preferred
Fresh Content	57.78%	Recent information is valued
Highlighted Keywords	48.34%	SEO-optimized content performs well
FAQ Sections	35.39%	Question-and-answer format is effective

The Content Depth Sweet Spot

Our analysis reveals that LLMs prefer content that’s neither too shallow nor too deep:

71.08% of cited content is “moderate” depth

Only 4.28% of cited content is classified as “in-depth,” while 5.29% is “surface-level.” This suggests that LLMs prefer content that provides substantial information without being overwhelming.

Visual Content: The Universal Language

Visual content appears to be the most consistent preference across all LLMs:

95.13% of cited content contains images
10.45% contains videos
78.80% has some form of visual data

The average cited content contains 9.3 sections and 83 paragraphs, with an average length of 2,820 characters.

Domain Preferences by LLM

Each LLM shows distinct domain preferences that reflect its training and purpose:

LLM	Top Domain Preference	% of Citations	Characteristic
ChatGPT	en.wikipedia.org	7.0%	Academic, authoritative
Perplexity	reddit.com	3.1%	User-generated, social
Gemini	play.google.com	1.1%	Google ecosystem
Claude	wise.com	5.7%	UK financial services
Copilot	amazon.com	5.4%	E-commerce focused
Grok	x.com	28.5%	Social media native
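
The “% of Citations” column can be reproduced from the totals and top-domain counts reported in the per-model sections above. The short check below uses only those published figures; the variable names are illustrative.

```python
# (total citations, top-domain citations) as reported earlier in this post.
reported = {
    "ChatGPT": (290_493, 20_309),
    "Perplexity": (445_176, 13_614),
    "Gemini": (328_134, 3_745),
    "Claude": (460, 26),
    "Copilot": (10_450, 568),
    "Grok": (2_566, 732),
}

# Share of each model's citations that go to its single most-cited domain.
shares = {
    model: round(100 * top / total, 1)
    for model, (total, top) in reported.items()
}
for model, share in shares.items():
    print(f"{model}: {share}%")
```

The sample sizes differ by roughly three orders of magnitude (460 citations for Claude versus 445,176 for Perplexity), so the shares for the smaller datasets are likely to be less stable.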

Key Takeaways

Visual content is essential: 95% of cited content contains images, making visual elements nearly universal in LLM-preferred content.
Structure matters: Over 90% of cited content uses bullet points or lists, indicating a strong preference for scannable, organized information.
Moderate depth wins: 71% of cited content is “moderate” depth – not too shallow, not too deep.
Credibility counts: 75% of cited content shows author credentials, emphasizing the importance of expertise.
LLMs have distinct personalities: Each LLM shows unique preferences reflecting its training and purpose (ChatGPT loves Wikipedia, Perplexity favors Reddit, etc.).
Corporate content dominates: 61% of cited content comes from corporate websites, suggesting official brand sources are highly valued.

Practical Implications for Content Creators

Based on this analysis, here’s what content creators should focus on to improve their chances of being cited by LLMs:

1. Visual Content Strategy

Include images in 95%+ of your content
Consider adding videos to 10%+ of content
Ensure visual elements support and enhance the text

2. Content Structure

Use bullet points and lists extensively (90%+ of content)
Organize content into clear sections (average 9.3 sections)
Keep paragraphs manageable (average 83 paragraphs per piece)

3. Authority and Credibility

Showcase author credentials and expertise
Support claims with empirical evidence where possible
Cite your sources

4. Content Depth

Aim for “moderate” depth – comprehensive but not overwhelming
Target 2,000-3,000 characters per piece
Balance thoroughness with accessibility

5. Platform-Specific Optimization

For ChatGPT: Focus on authoritative, well-sourced content similar to Wikipedia
For Perplexity: Create engaging, social-friendly content that sparks discussion
For Gemini: Optimize for Google’s ecosystem and services
For Claude: Consider UK-focused content and financial services
For Copilot: Focus on e-commerce and product-related content

Final Thoughts

While LLMs show distinct preferences based on their training and purpose, there are universal content characteristics that improve citation likelihood across all models. Visual content, structured presentation, moderate depth, and clear authority signals appear to be the most important factors for LLM citation success.

As AI continues to evolve and new models emerge, understanding these preferences becomes crucial for content creators looking to optimize for AI visibility. The data shows that the future of content optimization isn’t just about search engines—it’s about understanding how AI models consume and cite information.

This analysis is based on data from Spotlight’s database, which tracks LLM citations across multiple AI models. The data represents real-world citation patterns from over 1.2 million analyzed links.

September 19, 2025

Which Domains Do AI Models Trust Most? A 60-Day Analysis of Citation Patterns
In the rapidly evolving world of AI-powered search and content generation, understanding which sources AI models trust most is crucial for brands looking to optimize their visibility. Our latest analysis of over 850,000 citations across major AI models reveals fascinating patterns in domain preferences that could reshape your content strategy.

Key Finding

Each AI model has distinct domain preferences, with Wikipedia dominating ChatGPT’s citations (20,122), Reddit leading Perplexity’s (12,774), and YouTube topping Gemini’s trusted sources (1,821).

The Methodology

We analyzed citation data from our Spotlight platform, examining over 850,000 URL citations across seven major AI models over the past 60 days. The data reveals not just which domains get cited most frequently, but also the unique preferences of each AI model.

ChatGPT: The Wikipedia Champion

ChatGPT shows a clear preference for authoritative, encyclopedia-style content. Wikipedia dominates its citations with an astonishing 20,122 references in just 60 days.

Domain	Citations	Domain Type
en.wikipedia.org	20,122	Encyclopedia
reddit.com	11,251	Community
techradar.com	3,424	Tech News
investopedia.com	1,530	Financial Education
tomsguide.com	1,330	Tech Reviews

Insight: ChatGPT heavily favors established, authoritative sources. Wikipedia’s dominance suggests that comprehensive, well-sourced content performs exceptionally well with this model.

Perplexity: The Community-Driven Model

Perplexity shows a different pattern, with Reddit leading its citations at 12,774 references. This suggests Perplexity values real-world user experiences and community discussions.

Domain	Citations	Domain Type
reddit.com	12,774	Community
youtube.com	6,345	Video Content
translate.google.com	2,970	Translation Tool
play.google.com	1,871	App Store
bestbrokers.com	1,800	Financial Services

Insight: Perplexity’s preference for Reddit and YouTube suggests it values authentic user experiences and visual content. Brands should consider creating community-focused content and video materials.

Gemini: The Google Ecosystem Player

Google Gemini shows interesting patterns, with YouTube leading at 1,821 citations, followed by Google’s own Vertex AI Search at 1,631 citations.

Domain	Citations	Domain Type
youtube.com	1,821	Video Content
play.google.com	1,261	App Store
investopedia.com	1,072	Financial Education
pcmag.com	1,059	Tech Reviews

Insight: Gemini’s heavy reliance on Google’s own tools and YouTube suggests strong integration within the Google ecosystem. Video content and Google-optimized materials may perform better with this model.

Cross-Model Patterns: Universal Winners
- Reddit: Top performer in Perplexity (12,774), strong in ChatGPT (11,251)
- YouTube: Leading in Gemini (1,821), strong in Perplexity (6,345)
- Investopedia: Consistently cited across ChatGPT (1,530), Gemini (1,072)
- TechRadar: Strong performance across ChatGPT (3,424), Perplexity (1,208), Gemini (770)
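
A quick way to check for cross-model winners is to look for domains that appear in more than one model’s top list. The sketch below uses only the figures from the three per-model tables above; the `shared_domains` helper is illustrative, and domains cited by a model outside its published top list will not show up here.

```python
# Top-cited domains per model, copied from the tables in this post.
top_domains = {
    "ChatGPT": {
        "en.wikipedia.org": 20_122, "reddit.com": 11_251, "techradar.com": 3_424,
        "investopedia.com": 1_530, "tomsguide.com": 1_330,
    },
    "Perplexity": {
        "reddit.com": 12_774, "youtube.com": 6_345, "translate.google.com": 2_970,
        "play.google.com": 1_871, "bestbrokers.com": 1_800,
    },
    "Gemini": {
        "youtube.com": 1_821, "play.google.com": 1_261,
        "investopedia.com": 1_072, "pcmag.com": 1_059,
    },
}

def shared_domains(tables, min_models=2):
    """Return {domain: {model: citations}} for domains listed by at least min_models models."""
    by_domain = {}
    for model, domains in tables.items():
        for domain, count in domains.items():
            by_domain.setdefault(domain, {})[model] = count
    return {d: m for d, m in by_domain.items() if len(m) >= min_models}

shared = shared_domains(top_domains)
for domain, per_model in shared.items():
    print(domain, per_model)
```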

What This Means for Your Brand

1. Model-Specific Strategies
- For ChatGPT: Focus on comprehensive, encyclopedia-style content that could be referenced in Wikipedia
- For Perplexity: Engage with community platforms like Reddit and create video content for YouTube
- For Gemini: Optimize for Google ecosystem and create video content

2. Universal Strategies
- Create comprehensive, authoritative content
- Engage with community platforms
- Develop video content
- Focus on expert reviews and technical analysis

Key Takeaways

1. Model Preferences Vary Significantly: Each AI model has distinct domain preferences that require tailored strategies.
2. Authority Matters: Established, authoritative sources consistently perform well across models.
3. Community Engagement Works: Platforms like Reddit show strong citation patterns, indicating value in community-focused content.
4. Video Content is Powerful: YouTube’s strong performance across models suggests video content is highly valued.
5. Industry-Specific Patterns: Financial services and technology sectors show particularly strong citation patterns.

This analysis is based on data from Spotlight’s AI visibility monitoring platform, analyzing over 850,000 citations across seven major AI models over the past 60 days.

September 17, 2025
