HTTPS surface reachable (robots ✓, sitemap ✓, title ✓)
Why it matters: Public files — robots.txt, sitemap.xml, head meta — are what attackers see first during reconnaissance. Misadvertised paths, stale sitemaps, and verbose generators leak more than intended (ISO 27001 A.8.9).
robots.txt
present
User-agent: Googlebot-News
Disallow: /angebote/
User-agent: *
Disallow: /zeit/
Disallow: /templates/
Disallow: /hp_channels/
Disallow: /send/
Disallow: /rezepte/suche/
Disallow: */comment-thread?
Disallow: */liveblog-backend*
Disallow: /framebuilder/
Disallow: /campus/framebuilder/
Disallow: /navigation-teasers*
Disallow: *iqadcontroller.js
Allow: /llms.txt
User-agent: Baiduspider
Disallow: /
User-agent: GrapeshotCrawler
crawl-delay: 3
User-agent: GPTBot
Disallow: /
User-agent: Google-Extended
Disallow: /
Allow: /*-gxe$
User-agent: Applebot-Extended
Disallow: /
User-agent: CCBot
Disallow: /
User-agent: Bytespider
Disallow: /
User-agent: anthropic-ai
Disallow: /
User-agent: ClaudeBot
Disallow: /
User-agent: Timpibot
Disallow: /
User-agent: Meta-ExternalAgent
Disallow: /
User-agent: FacebookBot
Disallow: /
User-agent: Diffbot
Disallow: /
User-agent: PerplexityBot
Disallow: /
User-agent: Perplexity-User
Disallow: /
User-agent: TerraCotta
Disallow: /
Sitemap: https://www.zeit.de/gsitemaps/index.xml
# Legal notice: zeit.de expressly reserves the right to use its content for commercial text and data mining (§ 44 b UrhG).
# The use of robots or other automated means to access zeit.de or collect or mine data without
# the express permission of zeit.de is strictly prohibited.
# zeit.de may, in its discretion, permit certain automated access to certain zeit.de pages,
# If you would like to apply for permission to crawl zeit.de, collect or use data, please email online-syndication@zeit.de
sitemap.xml
present — 512 url(s)
head
- title
- DIE ZEIT | Nachrichten, News, Hintergründe und Debatten
- description
- Aktuelle Nachrichten, Kommentare, Analysen und Hintergrundberichte aus Politik, Wirtschaft, Gesellschaft, Wissen, Feuilleton und Sport lesen Sie bei der ZEIT.
social
- og:image
- https://img.zeit.de/administratives/sharing/fallback-image-die-zeit/wide__1300x731
- og:image:width
- 1300
- og:image:height
- 731
- og:site_name
- DIE ZEIT
- og:type
- website
- og:title
- DIE ZEIT Startseite
- og:description
- Aktuelle Nachrichten, Kommentare, Analysen und Hintergrundberichte aus Politik, Wirtschaft, Gesellschaft, Wissen, Feuilleton und Sport lesen Sie bei der ZEIT.
- og:url
- https://www.zeit.de/index
- twitter:card
- summary
- twitter:site
- @zeitonline
- twitter:creator
- @zeitonline
- twitter:title
- DIE ZEIT Startseite
- twitter:description
- Aktuelle Nachrichten, Kommentare, Analysen und Hintergrundberichte aus Politik, Wirtschaft, Gesellschaft, Wissen, Feuilleton und Sport lesen Sie bei der ZEIT.
- twitter:image
- https://img.zeit.de/administratives/sharing/fallback-image-die-zeit/wide__1300x731