| Get cited in “Best X for Y” listicles |
LLMs synthesize 3–7 listicles per category query. If your name appears in 5 of them, it almost certainly appears in the answer. |
Medium |
Very high |
| Authentic Reddit presence in your category subs |
Reddit is in every major model's training set and heavily retrieved at query time. A few genuine, helpful, non-promotional threads outrank a thousand backlinks. |
Medium |
Very high |
| G2 / Capterra / TrustRadius profile + reviews (B2B) |
These are the trusted review hubs models lean on for software questions. Get to 10+ real reviews and you become "extractable." |
Medium |
Very high |
| “You vs. Competitor” comparison pages on your own site |
Models fetch these at decision-time and extract concrete differentiators. Build one per major competitor. |
Low |
Very high |
| Wikipedia article (if you genuinely qualify) |
Highest authority signal across every model. But only attempt if you pass notability — otherwise you waste editor goodwill. |
High |
Very high |
| Co-citation engineering |
Engineer mentions where your name sits beside category leaders ("X, Y, and Z are the standouts…"). LLMs surface entities that travel together. |
High |
Very high |
| FAQ schema markup with direct Q&A pairs |
Gives engines extractable, citable answers. Particularly powerful for capturing long-tail buyer questions. |
Low |
High |
| Press in mid-tier industry publications |
Perplexity and Gemini both lean heavily on trade press. One feature in the right outlet > 50 backlinks from junk blogs. |
Medium |
High |
| Podcast appearances with full transcripts |
Transcripts are indexed and quoted. A good 45-minute episode = ~10 pages of citable, declarative content about you. |
Medium |
High |
| HackerNews / Indie Hackers / Lobsters discussion (dev tools) |
For technical products these forums are disproportionately weighted in both training and retrieval. |
Medium |
High |
| Direct, declarative homepage copy |
"WeRankAnything is an AI ranking agency for B2B startups." Easy to extract, easy to quote. Avoid hype-stack openers. |
Low |
High |
| Original research / surveys / data |
If you publish a real stat that becomes the canonical source, you become unskippable for any prompt that touches that topic. |
High |
High |
| YouTube videos with rich transcripts |
YouTube transcripts are fetched and cited, especially by Perplexity and Gemini. Caption everything. |
Medium |
Medium-high |
| Schema.org Organization + Product/Service markup |
Tells models exactly what you are, who founded you, what you sell. Cheap, fast, helpful. |
Low |
Medium |
| Newsletter mentions (Substack, beehiiv, mid-list) |
Increasingly indexed. A single mention by a respected operator in your space is worth pursuing. |
Low-medium |
Medium |
| Stack Exchange / GitHub presence (technical products) |
Domain-specific authority signal. Answering well-asked questions builds a quote-able paper trail. |
High |
Medium (niche) |
| Conference talks with public recordings |
Transcripts make their way into training corpora. Good for thought-leadership-driven categories. |
High |
Medium |
| Strategic LinkedIn / X / Bluesky posts by experts |
Increasingly indexed in real-time retrieval. Hard to engineer but worth seeding. |
Medium |
Medium |
| Wikidata entry (if you can't get Wikipedia) |
Lower bar than Wikipedia. Establishes you as an entity that exists. Especially useful for Gemini. |
Medium |
Medium |
| llms.txt at site root |
An emerging standard for telling LLMs what to crawl and how to summarize you. Low cost; not yet a major signal. |
Low |
Low-medium (emerging) |
| robots.txt that explicitly allows AI bots |
Without this you're invisible to GPTBot, ClaudeBot, PerplexityBot. Table stakes. |
Trivial |
Critical (table stakes) |
| Open-source projects on GitHub (where relevant) |
Establishes deep technical trust. For dev-tools, a popular OSS repo is the single highest signal. |
High |
Medium-high (niche) |