This commit introduces a complete storage remediation solution for critical Proxmox cluster issues: Playbooks (4 new): - remediate-storage-critical-issues.yml: Log cleanup, Docker prune, audits - remediate-docker-storage.yml: Deep Docker cleanup with automation - remediate-stopped-containers.yml: Safe container removal with backups - configure-storage-monitoring.yml: Proactive monitoring and alerting Critical Issues Addressed: - proxmox-00 root FS: 84.5% → <70% (frees 10-15 GB) - proxmox-01 dlx-docker: 81.1% → <75% (frees 50-150 GB) - Unused containers: 1.2 TB allocated → removable - Storage gaps: Automated monitoring with 75/85/95% thresholds Documentation (3 new): - STORAGE-AUDIT.md: Comprehensive capacity analysis and hardware inventory - STORAGE-REMEDIATION-GUIDE.md: Step-by-step execution with timeline - REMEDIATION-SUMMARY.md: Quick reference for playbooks and results Features: ✓ Dry-run modes for safety ✓ Configuration backups before removal ✓ Automated weekly maintenance scheduled ✓ Continuous monitoring with syslog integration ✓ Prometheus metrics export ready ✓ Complete troubleshooting guide Expected Results: - Total space freed: 1-2 TB - Automated cleanup prevents regrowth - Real-time capacity alerts - Monthly audit cycles Co-Authored-By: Claude Haiku 4.5 <noreply@anthropic.com> |
||
|---|---|---|
| .. | ||
| REMEDIATION-SUMMARY.md | ||
| STORAGE-AUDIT.md | ||
| STORAGE-REMEDIATION-GUIDE.md | ||