Generate valid multilingual XML sitemaps with hreflang tags. Paste URLs or import from a sitemap, review detected locales, and download ready-to-submit XML.
URL parsing, validation, and XML generation run in your browser. Optional AI features help match translated slugs and resolve ambiguous locale codes — these are clearly marked and only used when needed.
15 URLs detected
Add your URLs
Paste multilingual URLs one per line, or import them from an existing sitemap URL. The tool parses subfolder, subdomain, and ccTLD patterns automatically.
Review and adjust
Most locales and page groups are detected deterministically using pattern matching. When ambiguous codes or translated slugs are found, AI suggests matches with confidence scores. You review and accept everything before generation.
Download the XML
The validator checks for common hreflang errors before generating the sitemap. Download as a single XML file or a ZIP with automatic splitting for large sites.
Google uses hreflang annotations to serve the right language or regional version of a page to each user. Without them, search engines may index the wrong version, show duplicate content warnings, or split ranking signals across locales. You need hreflang tags when your site has:
Detects and handles all three approaches to multilingual URLs: subfolder (/en-gb/), subdomain (en.example.com), and ccTLD (example.de). Mixed patterns across the same site are flagged for review. Google's multi-regional sites guide covers the trade-offs of each approach.
Google requires hreflang annotations to be confirmed from every page in the group. The validator checks that every page cluster has reciprocal links and flags orphan pages with only one locale variant.
Hreflang values use ISO 639-1 language codes and optional ISO 3166-1 Alpha-2 region codes. Most codes are resolved via static lookup. For ambiguous segments (e.g., /hk/ could be English or Chinese), AI suggests the most likely locale based on URL context. Chinese locales are flagged when they may need zh-hans/zh-hant script subtags.
The x-default annotation tells Google which page to show when none of the specific language/region variants match the user. This tool lets you select any URL as the x-default for every page cluster.
When localised URLs use translated slugs (e.g., /shoes/running and /schuhe/laufen), path matching alone can't group them. The tool uses AI text embeddings to find semantically equivalent paths across locales. Results include confidence scores so you can review each match before accepting.
Generated sitemaps follow the sitemap protocol specification with the xmlns:xhtml namespace for hreflang links, self-referencing alternates, and automatic file splitting when output exceeds the protocol's 50,000-URL-per-file limit.
xhtml:link elements pointing to all its translated equivalents, helping Google and other search engines serve the right version to users in each locale./en-gb/page), subdomain (en.example.com/page), and country-code top-level domain (example.de/page) patterns. Mixed structures across the same set of URLs are flagged for review./shoes/running and /schuhe/laufen), exact path matching won't group them. The tool uses text embeddings to compute semantic similarity between path segments and clusters pages that are above a confidence threshold, letting you match translated URLs without manual mapping.https://example.com/sitemap.xml) and the tool will fetch and extract all URLs. It handles both flat sitemaps and sitemap index files with multiple child sitemaps.en) targets all speakers of that language globally. Adding a region code (e.g., en-gb) targets speakers in a specific country. Google recommends using region codes when you have country-specific content like different currencies, legal terms, or spelling conventions. You can combine both: en as a fallback with en-gb and en-us for regional variants. See Google's language code documentation./en/, /fr/) are easiest to manage and consolidate domain authority. Subdomains (en.example.com) offer more server flexibility. ccTLDs (example.de) send the strongest geotargeting signal but split domain authority across separate domains. Google can handle all three with hreflang tags. See Google's multi-regional sites guide.zh-hans for Simplified Chinese and zh-hant for Traditional Chinese. You can combine with region codes: zh-hans-cn for Simplified Chinese (China), zh-hant-tw for Traditional Chinese (Taiwan). This tool detects bare zh locale segments and suggests adding the appropriate script subtag.robots.txt. See Google's submission guide.This tool implements the hreflang specification as documented by Google. For the full details on each topic, see the official guides: