LinkStorm uses a powerful web crawler to analyze your site's internal linking structure, gathering essential data to help you optimize your links for SEO. Understanding how the crawler works will help you interpret reports and make better internal linking decisions.
How LinkStorm Crawls Your Site
Discovering URLs
LinkStorm finds pages on your site through:
Your Sitemap: If XML sitemap processing is enabled, LinkStorm will use it as the primary source for URL discovery.
Internal Links: The crawler follows links on your pages to discover additional content.
💡 Best practice: For the most accurate analysis, ensure your sitemap is up to date and contains all indexable pages.
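The two discovery sources above can be pictured with a short Python sketch. This is not LinkStorm's crawler code: the names `urls_from_sitemap`, `LinkCollector`, and `internal_links` are hypothetical, and the sketch assumes a standard sitemaps.org `urlset` file and links that are already present in the HTML (no JavaScript rendering).

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

# Namespace used by standard sitemaps.org XML sitemaps.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def urls_from_sitemap(xml_text: str) -> list[str]:
    """Return every <loc> value found in an XML sitemap document."""
    root = ET.fromstring(xml_text)
    return [loc.text.strip() for loc in root.iter(f"{SITEMAP_NS}loc") if loc.text]


class LinkCollector(HTMLParser):
    """Collect absolute URLs from the href of every <a> tag (hypothetical helper)."""

    def __init__(self, base_url: str):
        super().__init__()
        self.base_url = base_url
        self.links: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the page they were found on.
                self.links.append(urljoin(self.base_url, href))


def internal_links(html: str, page_url: str) -> list[str]:
    """Return the links on a page that stay on the same host as page_url."""
    collector = LinkCollector(page_url)
    collector.feed(html)
    collector.close()
    host = urlparse(page_url).netloc
    return [url for url in collector.links if urlparse(url).netloc == host]
```

A crawler built this way would seed its queue from `urls_from_sitemap` and then keep adding whatever `internal_links` returns for each fetched page, which is why an up-to-date sitemap gives it a more complete starting point.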
Crawling Behavior
When crawling, LinkStorm:
Renders JavaScript: This allows the crawler to fully load dynamic content, making it compatible with any website or CMS, including JavaScript-heavy frameworks.
Honors meta directives: Pages with noindex won't be considered indexable.
Detects canonical pages: If a page has a canonical tag pointing elsewhere, LinkStorm treats it as non-canonical.
Limits crawling to the specified scope: If a subdomain or path is entered, LinkStorm will only crawl pages within that specific subdomain or path (Learn how to organize your websites and projects).
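To make the noindex, canonical, and scope rules more concrete, here is a minimal Python sketch of how such checks could work. It is an illustration under stated assumptions, not LinkStorm's implementation: `HeadSignals`, `classify`, and `in_scope` are hypothetical names, only the robots meta tag and `rel="canonical"` link are inspected, and URLs are compared after stripping a trailing slash.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class HeadSignals(HTMLParser):
    """Pick up the robots meta directive and canonical link from a page."""

    def __init__(self):
        super().__init__()
        self.noindex = False
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "meta" and (a.get("name") or "").lower() == "robots":
            directives = [d.strip() for d in (a.get("content") or "").lower().split(",")]
            if "noindex" in directives:
                self.noindex = True
        elif tag == "link" and "canonical" in (a.get("rel") or "").lower().split():
            self.canonical = a.get("href")


def classify(html: str, page_url: str) -> str:
    """Return 'non-indexable', 'non-canonical', or 'indexable' for a page."""
    signals = HeadSignals()
    signals.feed(html)
    signals.close()
    if signals.noindex:
        return "non-indexable"
    if signals.canonical:
        # Resolve a relative canonical href before comparing.
        target = urljoin(page_url, signals.canonical)
        if target.rstrip("/") != page_url.rstrip("/"):
            return "non-canonical"
    return "indexable"


def in_scope(url: str, scope: str) -> bool:
    """True if url is on the scope's host and under the scope's path."""
    u, s = urlparse(url), urlparse(scope)
    if u.netloc != s.netloc:
        return False
    prefix = s.path.rstrip("/")
    return u.path == prefix or u.path.startswith(prefix + "/")
```

For example, with a scope of `https://example.com/docs`, a page at `https://example.com/docs/intro` would be crawled, while `https://example.com/docs-old/x` and anything on `shop.example.com` would be skipped.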
How Often LinkStorm Crawls Your Site
LinkStorm crawls your site:
When a new project is created: A full crawl is performed to gather initial data.
When you trigger a manual crawl: You can start a new crawl from the dashboard (this consumes credits).
Weekly for sitemap processing: If enabled, LinkStorm will check the XML sitemap once per week for new or updated content.
💡 To manually trigger a crawl, go to the Dashboard and click "Reset and recrawl now".
What Data LinkStorm Collects
During a crawl, LinkStorm gathers:
Page URLs: List of all discovered pages.
Internal Links: Connections between pages.
External Links: Outbound links pointing to other sites.
Anchor Texts: The words used in internal and external links.
Indexability & Canonical Status: Determines whether a page is indexable, non-indexable, or non-canonical.
Issues: Broken links, redirects, and nofollow tags.
💡 Internal link opportunities are computed once the crawl is completed.
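As a rough picture of the link data listed above, the following Python sketch extracts one record per link with its target, anchor text, internal or external status, and nofollow flag. The `LinkRecord` fields and the `collect_links` helper are assumptions made for illustration; they do not describe LinkStorm's actual data model or export format.

```python
from dataclasses import dataclass
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


@dataclass
class LinkRecord:
    """One link found on a page (hypothetical shape)."""
    source: str
    target: str
    anchor_text: str
    internal: bool
    nofollow: bool


class AnchorParser(HTMLParser):
    """Build a LinkRecord for every <a href=...> element on a page."""

    def __init__(self, page_url: str):
        super().__init__()
        self.page_url = page_url
        self.host = urlparse(page_url).netloc
        self.records: list[LinkRecord] = []
        self._open = None  # (target, nofollow) of the <a> currently being read
        self._text: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        a = dict(attrs)
        href = a.get("href")
        if not href:
            return
        target = urljoin(self.page_url, href)
        nofollow = "nofollow" in (a.get("rel") or "").lower().split()
        self._open = (target, nofollow)
        self._text = []

    def handle_data(self, data):
        if self._open is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._open is not None:
            target, nofollow = self._open
            self.records.append(LinkRecord(
                source=self.page_url,
                target=target,
                # Collapse runs of whitespace inside the anchor text.
                anchor_text=" ".join("".join(self._text).split()),
                internal=urlparse(target).netloc == self.host,
                nofollow=nofollow,
            ))
            self._open = None


def collect_links(html: str, page_url: str) -> list[LinkRecord]:
    parser = AnchorParser(page_url)
    parser.feed(html)
    parser.close()
    return parser.records
```

Detecting broken links and redirects would additionally require requesting each target and checking the HTTP status code, which this sketch leaves out.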
Managing Your Crawls
Controlling What Gets Crawled
To refine your crawls, you can:
Enable or disable sitemap processing in your dashboard.
Define a subdomain or path when adding a website to restrict crawling to specific sections.
When to Run a Manual Crawl
Consider running a new crawl if:
You've added or removed major site sections.
You've changed the internal linking structure.
You want the latest data before optimizing links.
Next Steps
Now that you understand how LinkStorm crawls your website, you can:
Review the Pages Report to analyze your indexed and non-indexed pages.
Check the Issues Report to fix broken links or redirects.
Use the Opportunities Report to implement AI-recommended internal links.