Screaming Frog Guide to Doing Almost Anything: 60+ Uses, Including AI-Assisted Workflows

This blog was originally published in May 2015, updated in February 2020, and updated again in June 2026.

When we looked at our own blog traffic, we realized this was one of the most historically popular blog posts on the Seer domain. After a brief moment of enthusiasm for the ever-present greatness of the Screaming Frog SEO Spider, we realized we were doing a disservice to both our readers and to how far the tool itself has come by not updating our content.

This guide was originally published in 2015 and last updated in 2020. In the years since, Screaming Frog has evolved dramatically, now offering database crawls that can handle enterprise-scale sites. It's also entered the AI ages with native integrations with OpenAI, Claude, and Gemini as well as a Model Context Protocol (MCP) server that allows AI clients to query crawl data.

And the SEO landscape has evolved alongside Screaming Frog: JavaScript frameworks are the norm, Google's crawl behavior has shifted, and new questions around LLM visibility have entered the conversation.

Below, you’ll find an updated guide to how SEOs, PPC professionals, and digital marketing experts can use Screaming Frog to streamline their workflow. We cover everything from what to prioritize and how to handle modern JavaScript architecture to how AI fits into the picture.

First, How Should You Prioritize Insights from Screaming Frog?

Screaming Frog is going to flag a lot. The real skill is knowing how to filter what comes back and distinguish what genuinely needs fixing from what's technically imperfect but strategically irrelevant.

A practical starting point for most audits: begin with crawl and indexation. Cross-reference Screaming Frog with Google Search Console to see whether pages are being fully crawled and indexed, or falling into "discovered, not indexed" or "crawled, not indexed" status. If content isn't getting picked up, everything downstream (including your keyword strategy and content calendar) is operating at reduced capacity.

From there, move to the issues wasting crawl budget: 404s, chains of 301 redirects, and unnecessary parameters. These are also the easiest to communicate to clients, because you can quantify the cost of inaction. Despite all of those hours the client has spent creating content, some of it is getting picked up later than it should be or isn't get picked up at all.

One dimension worth tracking that's become a bigger part of client conversations: speed of indexation. Delays between publishing and indexation can have real traffic implications, and Screaming Frog is one of the best tools for diagnosing what's causing them.

Now to get started, simply select what it is that you are looking to do:

Basic Crawling

I want to crawl my entire site
I want to crawl a single subdirectory
I want to crawl a specific set of subdomains or subdirectories
I want a list of all of the pages on my site
I want a list of all of the pages in a specific subdirectory
I want to find all of the subdomains on a site and verify internal links
I want to crawl an ecommerce site or other large site
I want to crawl a complex ecommerce site that has query parameters
I want to crawl a site hosted on an older server
I want to crawl a site that requires cookies
I want to crawl using a different user agent
I want to crawl pages that require authentication

Internal Links

I want information about all of the internal and external links on my site (anchor text, directives, links per page etc.)
I want to find broken internal links on a page or site
I want to find broken outbound links on a page or site (or all outbound links in general)
I want to find links that are being redirected
I am looking for internal linking opportunities

Site Content

I want to identify pages with thin content
I want a list of the image links on a particular page
I want to find images that are missing alt text or images that have lengthy alt text
I want to find every CSS file on my site
I want to identify all of the JavaScript files and plugins used on the site and what pages they appear on
I want to find where flash is embedded on-site
I want to find any internal PDFs that are linked on-site
I want to understand content segmentation within a site or group of pages
I want to find pages that have social sharing buttons
I want to find pages that are using iframes
I want to find pages that contain embedded video or audio content

Meta Data and Directives

I want to identify pages with lengthy page titles, meta descriptions, or URLs
I want to find duplicate page titles, meta descriptions, or URLs
I want to find duplicate content and/or URLs that need to be rewritten/redirected/canonicalized
I want to identify all of the pages that include meta directives e.g.: nofollow/noindex/noodp/canonical etc.
I want to verify that my robots.txt file is functioning as desired
I want to find or verify Schema markup or other microdata on my site

Sitemap

I want to create an XML Sitemap
How to create an XML Sitemap by Uploading URLs
I want to check my existing XML Sitemap

General Troubleshooting

I want to find issues that don't appear in Google Search Console
I want to identify why certain sections of my site aren't being indexed or aren’t ranking
I want to check if my site migration/redesign was successful
I want to find slow loading pages on my site
I want to find malware or spam on my site

PPC & Analytics

I want to verify that my Google Analytics code is on every page, or on a specific set of pages on my site
I want to validate a list of PPC URLs in bulk

Scraping

I want to scrape the meta data for a list of pages
I want to scrape a site for all of the pages that contain a specific footprint

URL Rewriting

I want to find and remove session id or other parameters from my crawled URLs
I want to rewrite the crawled URLs (e.g: replace .com with .co.uk, or write all URLs in lowercase)

Keyword Research

I want to know which pages my competitors value most
I want to know what anchor text my competitors are using for internal linking

Link Building

I want to analyze a list of prospective link locations
I want to find broken links for outreach opportunities
I want to verify my backlinks and view the anchor text
I want to make sure that I'm not part of a link network
I am in the process of cleaning up my backlinks and need to verify that links are being removed as requested

Artificial Intelligence (AI)

I want to query my crawl data using an AI tool
I want to understand what Screaming Frog's AI integrations can and can't do

Basic Crawling

How to crawl an entire site

When starting a crawl, it’s a good idea to take a moment and evaluate what kind of information you’re looking to get, how big the site is, and how much of the site you’ll need to crawl in order to access it all. Sometimes, with larger sites, it’s best to restrict the crawler to a sub-section of URLs to get a good representative sample of data. This keeps file sizes and data exports a bit more manageable. We go over this in further detail below. For crawling your entire site, including all subdomains, you’ll need to make some slight adjustments to the spider configuration to get started.

By default, Screaming Frog only crawls the subdomain that you enter. Any additional subdomains that the spider encounters will be viewed as external links. In order to crawl additional subdomains, you must change the settings in the Spider Configuration menu. By checking ‘Crawl All Subdomains’, you will ensure that the spider crawls any links that it encounters to other subdomains on your site.

Step 1:

pasted image 0 72

Step 2:

In addition, if you’re starting your crawl from a specific subfolder or subdirectory and still want Screaming Frog to crawl the whole site, check the box marked “Crawl Outside of Start Folder.”

By default, the SEO Spider is only set to crawl the subfolder or subdirectory you crawl from forwards. If you want to crawl the whole site and start from a specific subdirectory, be sure that the configuration is set to crawl outside the start folder.

Pro Tip:

To save time and disk space, be mindful of resources that you may not need in your crawl. Websites link to so much more than just pages. Uncheck Images, CSS, and JavaScript resources in order to reduce the size of the crawl.

Screaming Frog Guide to Doing Almost Anything: 60+ Uses, Including AI-Assisted Workflows

First, How Should You Prioritize Insights from Screaming Frog?

Basic Crawling

How to crawl an entire site

How to crawl a single subdirectory

How to crawl a specific set of subdomains or subdirectories

I want a list of all of the pages on my site

I want a list of all of the pages in a specific subdirectory

How to find all of the subdomains on a site and verify internal links.

How to crawl an ecommerce site or other large site

How to get a clean crawl on a complex ecommerce site

How to crawl a site hosted on an older server -- or how to crawl a site without crashing it

How to crawl a site that requires cookies

How to crawl using a different user-agent

How to crawl pages that require authentication

Internal Links

I want information about all of the internal and external links on my site (anchor text, directives, links per page etc.)

How to find broken internal links on a page or site

How to find broken outbound links on a page or site (or all outbound links in general)

How to find links that are being redirected

I am looking for internal linking opportunities

Site Content

How to identify pages with thin content

I want a list of the image links on a particular page

How to find images that are missing alt text or images that have lengthy alt text

How to find every CSS file on my site

How to identify JavaScript files and plugins used on the site and what pages they appear on

How to find where flash is embedded on-site

How to find any internal PDFs that are linked on-site

How to understand content segmentation within a site or group of pages

How to find pages that have social sharing buttons

How to find pages that are using iframes

How to find pages that contain embedded video or audio content

Meta Data and Directives

How to identify pages with lengthy page titles, meta descriptions, or URLs

How to find duplicate page titles, meta descriptions, or URLs

How to find duplicate content and/or URLs that need to be rewritten/redirected/canonicalized

How to identify all of the pages that include meta directives e.g.: nofollow/noindex/noodp/canonical etc.

How to verify that my robots.txt file is functioning as desired

How to find or verify Schema markup or other microdata on my site

Sitemap

How to create an XML Sitemap

Creating an XML Sitemap By Uploading URLs

How to check my existing XML Sitemap

General Troubleshooting

How to find issues that Google Search Console doesn't surface

How to identify why certain sections of my site aren't being indexed or aren’t ranking

How to check if my site migration/redesign was successful

How to find slow-loading pages on my site

How to find malware or spam on my site

PPC & Analytics

How to verify that my GA4 measurement ID is on every page, or on a specific set of pages on my site

How to validate a list of PPC URLs in bulk

Scraping

How to scrape the metadata for a list of pages

How to scrape a site for all of the pages that contain a specific footprint

URL Rewriting

How to find and remove session id or other parameters from my crawled URLs

How to rewrite the crawled URLs (e.g: replace .com with .co.uk, or write all URLs in lowercase)

Keyword Research

How to know which pages my competitors value most

How to know what anchor text my competitors are using for internal linking

How to know which meta keywords (if any) my competitors have added to their pages

Link Building

How to analyze a list of prospective link locations

How to find broken links for outreach opportunities

How to verify my backlinks and view the anchor text

How to make sure that I'm not part of a link network

I am in the process of cleaning up my backlinks and need to verify that links are being removed as requested

Bonus Round

How to Edit Meta Data

How to Crawl JavaScript-Rendered Sites

View Original HTML and Rendered HTML

Artificial Intelligence (AI)

I want to query my crawl data using an AI tool

I want to understand what Screaming Frog's AI integrations can and can't do

Final Remarks

Still nerding out on technical SEO?

For more SEO tutorials and the latest digital marketing updates, subscribe to the Seer newsletter below.

We love helping marketers like you.