Oracle DR: Complete Windows VM implementation and cleanup

Major changes:
- Implemented Windows VM 109 as DR target (replaces Linux LXC)
- Tested RMAN restore successfully (12-15 min RTO, 24h RPO)
- Added comprehensive DR documentation:
  * DR_WINDOWS_VM_STATUS_2025-10-09.md - Current implementation status
  * DR_UPGRADE_TO_CUMULATIVE_PLAN.md - Plan for cumulative incremental backups
  * DR_VM_MIGRATION_GUIDE.md - Guide for VM migration between Proxmox nodes
- Updated DR_WINDOWS_VM_IMPLEMENTATION_PLAN.md with completed phases

New scripts:
- add_system_key_dr.ps1 - SSH key setup for automated transfers
- configure_listener_dr.ps1 - Oracle Listener configuration
- fix_ssh_via_service.ps1 - SSH authentication fix
- rman_restore_final.cmd - Working RMAN restore script (tested)
- transfer_to_dr.ps1 - FULL backup transfer (renamed from 02_*)
- transfer_incremental.ps1 - Incremental backup transfer (renamed from 02b_*)

Cleanup:
- Removed 19 obsolete scripts for Linux LXC DR
- Removed 8 outdated documentation files
- Organized project structure

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
This commit is contained in:
Marius
2025-10-09 18:54:08 +03:00
parent 6a6ffe84af
commit ac2340c967
26 changed files with 2309 additions and 6208 deletions

View File

@@ -7,13 +7,20 @@
## 📋 PRE-IMPLEMENTATION CHECKLIST
### Current Infrastructure
### Current Infrastructure (IMPLEMENTED ✅)
- ✅ PRIMARY: Windows Server, Oracle 19c SE2, IP: 10.0.20.36, SSH port 22122
- ✅ Database: ROA, DBID: 1363569330
- ✅ RMAN backups: FULL daily (02:30 AM), INCREMENTAL midday (14:00)
-Transfer scripts: PowerShell scripts working to LXC 10.0.20.37
-Backup size: ~7GB compressed (from 23GB), retention 2 days
-Current DR target: Linux LXC 109 (10.0.20.37) - TO BE REPLACED
- ✅ RMAN backups: FULL daily (02:30 AM)
-DIFFERENTIAL INCREMENTAL (14:00) - NOT USED (causes UNDO corruption on restore)
-Transfer scripts: PowerShell scripts transferring to VM 109 (Windows)
-Backup size: ~6-7GB compressed (from 23GB), retention 2 days
- ✅ DR target: Windows VM 109 (10.0.20.37) on pveelite - **OPERATIONAL**
### Planned Upgrade (see DR_UPGRADE_TO_CUMULATIVE_PLAN.md)
- 🔄 Convert DIFFERENTIAL → **CUMULATIVE** incremental backups
- 🔄 Add second daily incremental (13:00 + 18:00 vs current 14:00 only)
- 🔄 Store backups on Proxmox host (pveelite), mounted in VM when needed
- 🔄 Target RPO: **3-4 hours** (vs current 24 hours)
### What We'll Build
- 🎯 Windows VM in Proxmox (replaces LXC 109)
@@ -799,31 +806,54 @@ ssh Administrator@10.0.20.37 "Get-ChildItem D:\oracle\backups\primary -Filter *.
│ │
└─────────────────────────────────────────────────────────────┘
METRICS:
- RPO: 24 hours (daily backup) + 6 hours (incremental)
METRICS (Current Implementation):
- RPO: 24 hours (only FULL backup used; incremental causes UNDO corruption)
- RTO: 15 minutes
- Storage: 150 GB total (100GB VM + 50GB backups)
- Storage: 500 GB VM + backups on host
- Daily resources: ZERO (VM powered off)
- DR test: Monthly
- DR test: Weekly (planned)
METRICS (After Upgrade to CUMULATIVE):
- RPO: 3-4 hours (FULL + latest CUMULATIVE)
- RTO: 15 minutes (unchanged)
- Storage: 500 GB VM + ~15 GB on Proxmox host
- Daily resources: ZERO (VM powered off)
- DR test: Weekly (automated)
```
---
## ✅ POST-IMPLEMENTATION CHECKLIST
After completing all phases, verify:
### Phase 1-8 (Initial Setup) - ✅ COMPLETED 2025-10-09
- [ ] Windows VM created in Proxmox (VM ID 109, IP 10.0.20.37)
- [ ] Oracle 19c SE2 installed and working
- [ ] OpenSSH Server configured with passwordless authentication
- [ ] Transfer scripts updated and tested (FULL + INCREMENTAL)
- [ ] RMAN restore script created on DR VM
- [ ] DR restore tested successfully (database opens and is usable)
- [ ] Scheduled tasks on PRIMARY updated
- [ ] DR runbook documented and accessible
- [ ] Team trained on DR activation procedure
- [ ] Monthly DR test scheduled in calendar
- [ ] VM shutdown after initial setup (to conserve resources)
- [x] Windows VM created in Proxmox (VM ID 109, IP 10.0.20.37)
- [x] Oracle 19c SE2 installed and working
- [x] OpenSSH Server configured with passwordless authentication
- [x] Transfer scripts updated and tested (FULL backup)
- [x] RMAN restore script created on DR VM
- [x] DR restore tested successfully (database opens and is usable)
- [x] Scheduled tasks on PRIMARY verified
- [x] DR procedures documented
- [x] VM shutdown after testing (to conserve resources)
### Phase 9 (Upgrade to CUMULATIVE) - 📋 PLANNED
**See:** `DR_UPGRADE_TO_CUMULATIVE_PLAN.md` for detailed implementation steps
- [ ] Proxmox host storage configured (`/mnt/pve/oracle-backups`)
- [ ] VM 109 mount point configured (E:\ from host)
- [ ] RMAN script updated to CUMULATIVE incremental
- [ ] Transfer scripts updated to send to Proxmox host
- [ ] SSH key for Proxmox host access configured
- [ ] Scheduled task created for 13:00 CUMULATIVE backup
- [ ] Scheduled task created for 18:00 CUMULATIVE backup
- [ ] Existing 14:00 task removed
- [ ] 02:30 FULL task updated to use new transfer script
- [ ] DR restore script updated for cumulative backups
- [ ] End-to-end restore test with CUMULATIVE successful
- [ ] Weekly test script created and scheduled
- [ ] Team trained on new backup strategy
---
@@ -919,7 +949,15 @@ RMAN> DELETE NOPROMPT ARCHIVELOG ALL COMPLETED BEFORE 'SYSDATE-2';
---
**Generated:** 2025-10-08
**Version:** 1.0
**Status:** Ready for Implementation
**Next Session:** Start with Phase 1 - Create Windows VM
**Last Updated:** 2025-10-09
**Version:** 2.0
**Status:** ✅ Phase 1-8 COMPLETED | 📋 Phase 9 (CUMULATIVE upgrade) PLANNED
**Implementation Status:**
- Initial setup (Phases 1-8): ✅ COMPLETED 2025-10-09
- RMAN restore tested: ✅ SUCCESSFUL (12-15 minutes RTO)
- Current RPO: 24 hours (FULL backup only)
- Next: Upgrade to CUMULATIVE incremental for 3-4 hour RPO
**Next Session:** Implement CUMULATIVE backup strategy
**See:** `DR_UPGRADE_TO_CUMULATIVE_PLAN.md` for upgrade plan