A Website URL Extractor & Crawler is a free online tool that crawls any website from a starting URL and collects every internal link it finds. Paste a URL, set your crawl depth, and get a complete list with page titles and HTTP status codes. Use the free Word Spinner Website URL Extractor to audit your site structure. No signup required.
Every website has pages you do not know about. Old blog posts, orphaned landing pages, staging directories accidentally left indexed.
You cannot fix what you cannot find. Most site owners discover these pages by accident when a 404 shows up in Google Search Console or a client asks about a page that should have been deleted months ago.
A website URL extractor crawler solves this. You paste a starting URL, the tool follows every internal link using breadth-first search, and it returns a complete inventory of every page it found with titles and status codes. The whole process takes seconds and works entirely in your browser.
What a Website URL Extractor Does
The Word Spinner Website URL Extractor & Crawler starts from a single URL and follows every internal link it finds. For each page, it records the URL, the page title (if discoverable), and the HTTP status code. You control how deep the crawl goes from a quick one-level scan to a full site crawl.
The tool uses breadth-first search, meaning it checks every link on the starting page before moving deeper. This gives you a complete map of your internal link structure without skipping sections buried under deep navigation menus.
Here is what a standard crawl report looks like:
| URL | Page Title | Status | Depth |
|---|---|---|---|
| yoursite.com/about | About Us | Your Site | 200 | 1 |
| yoursite.com/blog | Blog | Your Site | 200 | 1 |
| yoursite.com/old-page | Old Product Page | 301 | 2 |
| yoursite.com/404-test | N/A | 404 | 2 |
Each row shows a discovered URL, the page title, the HTTP status code, and how many clicks from the start page the crawler found it. This one table can surface dozens of issues in seconds.

When to Use a Website URL Extractor
SEO Audits
Run a crawl before any SEO audit to discover which pages search engines can actually reach. Pages that are not linked internally may as well not exist. Use the Sitemap Finder & Checker alongside your crawl to compare what you think is indexed versus what your site structure actually supports.
Content Migration
Moving to a new domain or CMS? Crawl your entire site before the migration to get a complete URL inventory. After the move, crawl again and compare results using the Sitemap URLs Comparison Tool to catch missing redirects and broken paths.
Competitor Analysis
Enter a competitor domain with a shallow depth of 1 or 2 to see their publicly linked pages. You can discover content topics, service pages, and resource sections they prioritize from a single crawl.
Internal Link Audit
Pages with zero internal links pointing to them are nearly invisible to search engines. A URL extractor highlights these orphan pages so you can add internal links from relevant content and pass link equity where it matters.
How to Crawl a Website in 5 Steps
Step 1: Go to the Website URL Extractor & Crawler and paste the full URL of the page you want to start from. For a full site crawl, use your homepage. For a section crawl, use the category or subdirectory you want to audit.
Step 2: Set your crawl depth. Depth 1 only crawls links on the starting page. Depth 2 follows links from those pages. Depth 3 and beyond go further.
Start with depth 2 or 3 for most sites. It catches the majority of internal pages without overwhelming the results.
Step 3: Click start. The crawler works entirely in your browser. Results start appearing as each page is checked so you can begin reviewing before the crawl finishes.
Step 4: Review the results. Scan the URL list for 404s, redirect chains, and orphan pages. Note any page titles that look wrong or missing. Those are opportunities to improve on-page SEO.
Step 5: Export your data. Download the full crawl as CSV or TXT. Import it into a spreadsheet for deeper analysis or share it with your team as the starting point for a content cleanup.

What to Look For in the Crawl Results
- 404 pages. Every 404 in your crawl is a broken internal link. Find the source page and update or redirect the URL.
- Redirect chains. Two or more consecutive redirects waste crawl budget. Each hop slows search engine crawling and may reduce link equity.
- Orphan pages. High-value content with zero internal links. These pages rank poorly simply because search engines cannot find them through your site structure.
- Missing or weak titles. Pages with generic or missing title tags. Every page needs a unique descriptive title that matches the content.
- Deep pages. Content buried 5 or more clicks from the homepage. Add direct internal links from higher-level pages to surface important content.
Related Tools You Might Need
- Sitemap Finder & Checker Discover all sitemap files on any domain to cross-reference against your crawl results.
- XML Sitemap Generator Generate a fresh sitemap from your crawl data to submit to Google Search Console.
- Internal Link Checker Deep-dive into internal link health, anchor text distribution, and link equity flow.
- XML Sitemap Validator & SEO Analyzer Validate your sitemap after a crawl to catch errors before submitting to search engines.
Frequently Asked Questions
No. A URL extractor collects page addresses, titles, and status codes. A web scraper downloads the full content of each page. Use an extractor for site structure audits and a scraper when you need the actual text or data from each page.
For most sites, depth 2 or 3 catches the majority of useful pages. Depth 1 only gets the starting page direct links. Beyond depth 4, you may hit thousands of pages depending on site size. Start shallow and increase if needed.
The Word Spinner Website URL Extractor reads server-rendered HTML links. Pages that only load links via JavaScript after user interaction may not be fully captured. For JS-heavy sites, complement your crawl with a dedicated JavaScript crawler or review your rendered sitemap.
No. The tool works in your browser and sends standard HTTP requests at a reasonable rate. It will not overload your server or trigger rate limiting unless you run multiple deep crawls simultaneously.
Crawl depth controls how many link hops the crawler follows from the starting page. URL limit caps the total number of URLs collected. If you set depth to 5 and the site has 10,000 linked pages, the URL limit stops the crawl early so results stay manageable.
Crawl any website for free
Paste a URL, set crawl depth, and get every internal link with titles and status codes. Free, no signup, runs in your browser.
Try the Website URL Extractor & CrawlerNeed AI-powered content for your site?
Use Word Spinner to rewrite, humanize, and optimize content at scale. Generate, rephrase, and polish your writing with the AI Humanizer platform.
Start Free Trial