Generate valid XML sitemaps with hreflang tags for multilingual sites, or organised standard sitemaps for single-language sites. Paste URLs or import from a sitemap, review the structure, and download ready-to-submit XML.
URL parsing, validation, and XML generation run in your browser. Optional AI features help match translated slugs and resolve ambiguous locale codes — these are clearly marked and only used when needed.
Paste URLs or fetch from a sitemap
Accepts a sitemap index or a single sitemap. Large sites often list multiple sitemaps in robots.txt— fetch each one separately to combine them.
Add your URLs
Paste URLs one per line, or import them from an existing sitemap URL. The tool auto-detects whether your site is multilingual (subfolder, subdomain, or ccTLD patterns) or single-language, and adjusts accordingly.
Review and adjust
For multilingual sites, locales and page groups are detected automatically. AI helps match translated slugs and resolve ambiguous codes. For single-language sites, pages are listed by content section. You review and accept before generation.
Choose format and download
Pick an output format: single file, per-locale, or per-category. The validator checks for hreflang errors before generating. Download as XML or a ZIP with a sitemap index for multi-file output.
Google uses hreflang annotations to serve the right language or regional version of a page to each user. Without them, search engines may index the wrong version, show duplicate content warnings, or split ranking signals across locales. You need hreflang tags when your site has:
Detects all three multilingual URL patterns: subfolder (/en-gb/), subdomain (en.example.com), and ccTLD (example.de). When no locale patterns are found, the tool switches to standard sitemap mode automatically.
Choose how to organise your sitemaps: a single file, one file per locale (e.g., sitemap-en.xml, sitemap-fr.xml), or one file per content category (e.g., sitemap-products.xml, sitemap-blog.xml). Multi-file output includes a sitemap index and downloads as a ZIP.
Google requires hreflang annotations to be confirmed from every page in the group. The validator checks that every page cluster has reciprocal links and flags orphan pages with only one locale variant.
Hreflang values use ISO 639-1 language codes and optional ISO 3166-1 Alpha-2 region codes. For ambiguous segments (e.g., /hk/ could be English or Chinese), AI suggests the most likely locale. Chinese locales are flagged when they may need zh-hans/zh-hant script subtags.
When localised URLs use translated slugs (e.g., /shoes/running and /schuhe/laufen), the tool uses AI text embeddings to find semantically equivalent paths across locales. Results include confidence scores so you can review each match before accepting.
Generated sitemaps follow the sitemap protocol specification: 50,000 URLs and 50 MB per file, automatic splitting with a sitemap index, and <lastmod> dates preserved from source sitemaps.
The x-default annotation tells Google which page to show when none of the specific language/region variants match the user. Select any URL as the x-default for every page cluster.
Import from a sitemap index and the tool preserves which child sitemap each URL came from. This enables per-category output that mirrors your existing sitemap organisation — useful for large sites with separate product, blog, and page sitemaps.
xhtml:link elements pointing to all its translated equivalents, helping Google and other search engines serve the right version to users in each locale.https://example.com/sitemap.xml) and the tool will fetch and extract all URLs. It handles both flat sitemaps and sitemap index files with multiple child sitemaps. When importing from a sitemap index, the tool preserves which child sitemap each URL came from, so you can split your output by category./en-gb/page), subdomain (en.example.com/page), and country-code top-level domain (example.de/page) patterns. Mixed structures across the same set of URLs are flagged for review./shoes/running and /schuhe/laufen), exact path matching won't group them. The tool uses text embeddings to compute semantic similarity between path segments and clusters pages that are above a confidence threshold, letting you match translated URLs without manual mapping.<lastmod> values, those dates are carried through to the generated output. This helps search engines prioritise which pages to re-crawl. If you paste URLs manually, lastmod is omitted since there is no source date to reference.en) targets all speakers of that language globally. Adding a region code (e.g., en-gb) targets speakers in a specific country. Google recommends using region codes when you have country-specific content like different currencies, legal terms, or spelling conventions. You can combine both: en as a fallback with en-gb and en-us for regional variants. See Google's language code documentation./en/, /fr/) are easiest to manage and consolidate domain authority. Subdomains (en.example.com) offer more server flexibility. ccTLDs (example.de) send the strongest geotargeting signal but split domain authority across separate domains. Google can handle all three with hreflang tags. See Google's multi-regional sites guide.zh-hans for Simplified Chinese and zh-hant for Traditional Chinese. You can combine with region codes: zh-hans-cn for Simplified Chinese (China), zh-hant-tw for Traditional Chinese (Taiwan). This tool detects bare zh locale segments and suggests adding the appropriate script subtag.robots.txt. See Google's submission guide.This tool implements the hreflang specification as documented by Google. For the full details on each topic, see the official guides: