Why does fetching my sitemap fail in the browser?

Most sites do not send CORS headers that allow other websites to read their sitemap.xml. That is normal. Use Fetch when it works (e.g. same site or open CORS), or open the sitemap in a new tab, copy the XML, paste here, or upload the file — that always works.

What is a sitemap index?

A sitemap index is an XML file that lists other sitemap files instead of listing every page URL. If you see mostly .xml links after parsing, fetch or paste each child sitemap’s XML to collect the real page URLs.

Can I parse robots.txt?

Yes. Paste your robots.txt content. The tool finds Sitemap: lines and lists those URLs. You can try Fetch on each sitemap URL when CORS allows it.

Is my sitemap data uploaded anywhere?

No. Parsing and export run entirely in your browser. Nothing is sent to JSONTech servers.

How do I turn URLs into llms.txt?

Use the llms.txt Builder tool: copy your extracted URL list, paste it there, add a title and description, then download llms.txt.

Извлечение URL из sitemap

Из sitemap.xml или robots.txt

Нужен черновик llms.txt из списка URL? Используйте конструктор llms.txt.

Попробовать загрузить URL (только same-origin или хосты с открытым CORS)

Вставьте XML sitemap, robots.txt или по одному URL на строку

Фильтр URL (regex, необязательно)

Результаты (0)

Пока нет URL — вставьте XML и нажмите «Разобрать», или загрузите пример.

Pull URLs out of sitemap.xml or robots.txt fast

Sitemaps are the source of truth for what a site wants indexed. Extracting their URLs gives you a clean list for SEO audits, archive snapshots, llms.txt builders, AI training datasets, or migration checks — without writing a parser yourself or paying for a desktop tool to do it.

Use the extractor when you need to

Audit a competitor's site structure

Pull every URL from their sitemap to map out content categories and depth in minutes.

Build a list for an llms.txt file

Extract URLs and feed them into the llms.txt builder to publish an LLM-friendly content map.

Migrate or archive a site

Pull all URLs before a redesign so you can set up redirects or capture an archive of the old structure.

How to extract sitemap URLs

1
Paste sitemap.xml or robots.txt content, or fetch a public URL when CORS allows.
2
Click Extract to list every URL with its lastmod, priority, and changefreq if present.
3
Filter or sort the list, then export as JSON, CSV, or plain text.

Keep going

Turn URLs into llms.txt

Feed the extracted URL list into a generator that builds a clean llms.txt for AI crawlers.

Encode URL components

Encode special characters before using URLs in queries or scripts.

Test URL endpoints

Send requests to extracted URLs to verify status, redirects, or content type.

Format the JSON export

Beautify the exported JSON for inclusion in docs or downstream pipelines.

Common extraction workflows

Run a quick SEO audit

Pull every indexed URL and look for thin pages, duplicates, or missing content categories.

Bootstrap a llms.txt rollout

Extract sitemap URLs once and use them as the foundation of your llms.txt content list.

Plan a content migration

Capture the full URL inventory before changing CMS or restructuring sections.

Часто задаваемые вопросы

У большинства сайтов нет CORS-заголовков, разрешающих чужим сайтам читать sitemap.xml — это нормально. Используйте «Загрузить», когда это работает (тот же сайт или открытый CORS), либо откройте sitemap в новой вкладке, скопируйте XML, вставьте сюда или загрузите файл — так всегда сработает.

Индекс sitemap — это XML-файл со списком других sitemap-файлов, а не всех URL страниц. Если после разбора вы видите в основном ссылки на .xml, загрузите или вставьте XML каждого дочернего sitemap, чтобы собрать реальные URL страниц.

Да. Вставьте содержимое robots.txt. Инструмент находит строки Sitemap: и перечисляет эти URL. При разрешённом CORS можно попробовать «Загрузить» для каждого URL sitemap.

Нет. Разбор и экспорт выполняются полностью в браузере. На серверы JSONTech ничего не отправляется.

Используйте инструмент «Конструктор llms.txt»: скопируйте извлечённый список URL, вставьте туда, добавьте заголовок и описание, затем скачайте llms.txt.