View Issue Details
ID | Project | Category | View Status | Date Submitted | Last Update |
---|---|---|---|---|---|
0001389 | Composr | core | public | 2013-09-08 13:00 | 2015-07-14 23:45 |
Reporter | Chris Graham | Assigned To | Chris Graham | ||
Severity | Feature-request | ||||
Status | resolved | Resolution | fixed | ||
Product Version | |||||
Fixed in Version | |||||
Summary | 0001389: XML Sitemap index | ||||
Description | Currently Composr builds a single sitemap nightly. For a large website, this is resource-intensive and risks exceeding the sitemap limits of 50,000 URLs or 10MB per file. Ideally the sitemap API would output everything changed since the last "archival date" as a new sitemap, and then iteratively update the sitemap index file so that it includes a refreshed reference to the latest sitemap. It would have to calculate the archival date in a smart way, noting when the current sitemap approaches a certain limit (e.g. 5000 URLs per sitemap). | ||||
Additional Information | Old sitemaps would contain URLs that are duplicated in newer sitemaps if the article had been refreshed. This is not a serious problem, though. | ||||
Tags | No tags attached. | ||||
Time estimation (hours) | 10 | ||||
Sponsorship open | |||||
related to | 0000142 | resolved | Chris Graham | Merged sitemap API |
|
Resolved in https://github.com/ocproducts/composr/commit/29c9de41415bd5c07bd75377da7dc2ffa0d2eebc. The implementation differs from the description. It actually works much better: it is very efficient (can scale to millions of URLs) and can handle deletions of URLs from the sitemap. |
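
For illustration only, here is a minimal Python sketch of the general idea from the Description field: splitting URLs into several sitemaps of bounded size and listing them in a sitemap index. It is not the code from the commit above (Composr is written in PHP, and the note says the actual implementation differs). The function names, the `sitemap_N.xml` file naming, and the use of `YYYY-MM-DD` strings for last-modified dates are all assumptions made for this example.

```python
from xml.sax.saxutils import escape

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
MAX_URLS_PER_SITEMAP = 5000  # example threshold taken from the description


def partition(entries, limit=MAX_URLS_PER_SITEMAP):
    """Sort (url, lastmod) pairs by lastmod and split them into chunks of at most `limit`."""
    ordered = sorted(entries, key=lambda e: e[1])
    return [ordered[i:i + limit] for i in range(0, len(ordered), limit)]


def render_sitemap(chunk):
    """Render one chunk of (url, lastmod) pairs as a sitemap document."""
    rows = "".join(
        f"  <url><loc>{escape(url)}</loc><lastmod>{lastmod}</lastmod></url>\n"
        for url, lastmod in chunk
    )
    return (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        f'<urlset xmlns="{SITEMAP_NS}">\n{rows}</urlset>\n'
    )


def render_index(base_url, chunks):
    """Render a sitemap index; the sitemap_N.xml naming is hypothetical."""
    rows = "".join(
        f"  <sitemap><loc>{escape(base_url)}/sitemap_{n}.xml</loc>"
        # Each chunk is sorted ascending, so its last entry is the newest.
        f"<lastmod>{chunk[-1][1]}</lastmod></sitemap>\n"
        for n, chunk in enumerate(chunks, start=1)
    )
    return (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        f'<sitemapindex xmlns="{SITEMAP_NS}">\n{rows}</sitemapindex>\n'
    )


# Usage: 12,001 URLs end up in three sitemaps referenced by one index.
entries = [(f"https://example.com/page/{i}", "2015-07-14") for i in range(12001)]
chunks = partition(entries)
index_xml = render_index("https://example.com", chunks)
print([len(c) for c in chunks])  # [5000, 5000, 2001]
```

With this layout, only the newest sitemap and the index need regenerating when content changes. Older sitemaps may still list a URL that reappears in a newer one, which is the duplication mentioned under Additional Information.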