Replace web scraping content with AI automation brand

- Remove all web scraping services, blog articles, locations, tools pages
- Remove fake author profiles and old categories
- Add 6 new AI automation blog articles targeting legal/consultancy firms
- Rewrite blog index with new AI automation content
- Update robots.txt with correct ukaiautomation.co.uk domain
- Update sitemap.xml with current pages only
Peter Foster
2026-03-21 10:04:47 +00:00
parent 1d705572ad
commit 37a6b01598
113 changed files with 611 additions and 47503 deletions

@@ -1,5 +1,5 @@
-# UK Data Services - robots.txt
-# https://ukdataservices.co.uk
+# UK AI Automation - robots.txt
+# https://ukaiautomation.co.uk
 User-agent: *
 Allow: /
@@ -13,10 +13,6 @@ Disallow: /vendor/
 Disallow: /config/
 Disallow: /database/
 Disallow: /docker/
-Disallow: /redis/
-Disallow: /google-oauth-callback
-Disallow: /google-oauth-callback.php
-Disallow: /oauth-callback.php
 # Block configuration and handler files
 Disallow: /*-handler.php
@@ -41,11 +37,7 @@ Allow: /assets/images/*.jpg
 Allow: /assets/images/*.svg
 # Sitemaps
-Sitemap: https://ukdataservices.co.uk/sitemap.xml
-Sitemap: https://ukdataservices.co.uk/sitemap-index.xml
-Sitemap: https://ukdataservices.co.uk/sitemap-blog.xml
-Sitemap: https://ukdataservices.co.uk/sitemap-services.xml
-Sitemap: https://ukdataservices.co.uk/sitemap-tools.xml
+Sitemap: https://ukaiautomation.co.uk/sitemap.xml
 # Crawl-delay for respectful crawling
 Crawl-delay: 1
@@ -59,10 +51,6 @@ User-agent: Bingbot
 Allow: /
 Crawl-delay: 1
-User-agent: Slurp
-Allow: /
-Crawl-delay: 2
 # AI crawlers - explicitly allowed for citation
 User-agent: GPTBot
 Allow: /
@@ -81,18 +69,3 @@ Allow: /
 User-agent: Google-Extended
 Allow: /
-User-agent: Applebot-Extended
-Allow: /
-User-agent: Bytespider
-Allow: /
-User-agent: CCBot
-Allow: /
-User-agent: FacebookBot
-Allow: /
-User-agent: Amazonbot
-Allow: /
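
Reviewer note: a quick way to sanity-check the effect of the robots.txt changes above is to evaluate a few paths against the rules that remain. The sketch below is illustrative only. It assumes the rules visible in this diff all belong to the `User-agent: *` group (the diff shows only an excerpt of the file), and it applies the longest-match rule described in RFC 9309, which individual crawlers may implement differently. `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical names, not part of the repository.

```python
import re

# Rules for "User-agent: *" as they appear in the diff AFTER this commit.
# Assumption: this is an excerpt; the full file contains rules not shown here.
RULES = [
    ("allow", "/"),
    ("disallow", "/vendor/"),
    ("disallow", "/config/"),
    ("disallow", "/database/"),
    ("disallow", "/docker/"),
    ("disallow", "/*-handler.php"),
    ("allow", "/assets/images/*.jpg"),
    ("allow", "/assets/images/*.svg"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern ('*' wildcard, trailing '$' anchor) to a regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with '.*' where the pattern had '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Longest matching pattern wins; on a tie, allow wins; no match means allowed."""
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for path in ["/", "/config/app.php", "/contact-handler.php",
                 "/assets/images/logo.svg", "/redis/dump.rdb"]:
        print(path, "allowed" if is_allowed(path) else "blocked")
```

Under this excerpt, `/redis/` and the OAuth callback paths are no longer disallowed once their `Disallow` lines are removed, so it may be worth confirming that those paths no longer exist on the site or are covered by rules outside the diff.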