Job Description
Disaster Recovery (3-11 Years)
Role Overview
The Disaster Recovery (DR) L3 Engineer is a senior technical role responsible for designing, implementing, maintaining, and optimizing enterprise-grade disaster recovery and business continuity solutions. This role provides deep technical expertise during DR architecture planning, develops runbooks, executes DR drills, resolves complex escalations, and ensures organizational resilience against system failures, cyberattacks, and large-scale outages.
Key Responsibilities
1. Disaster Recovery Architecture & Design
- Design and implement DR solutions across on-prem, cloud, and hybrid infrastructures.
- Define RPO/RTO requirements in alignment with business objectives.
- Evaluate and recommend DR technologies (backup solutions, replication, failover, orchestration tools).
- Architect DR strategies for mission-critical applications, databases, and infrastructure.
2. Operations & Maintenance
- Maintain DR sites, replication processes, DR automation scripts, and orchestration frameworks.
- Conduct periodic health checks of DR systems, backup integrity, replication lag, and failover readiness.
- Ensure compliance with DR policies, security standards, and audit requirements.
3. DR Testing & Drills
- Plan and execute DR drills (annual, semi-annual, or customer-driven).
- Document drill outcomes, perform gap analysis, and implement corrective actions.
- Create and maintain DR runbooks, recovery workflows, and step-by-step SOPs.
4. L3 Escalations & Troubleshooting
- Act as the highest point of technical escalation for DR/BCP issues.
- Troubleshoot complex failures involving storage, networking, compute, databases, backup tools, and cloud services.
- Perform root cause analysis (RCA) for DR-related incidents or failover failures.
5. Automation & Optimization
- Develop automation for backup validation, replication monitoring, and DR failover/failback workflows.
- Improve DR performance, reliability, and cost efficiency.
- Work with DevOps/SRE teams to integrate DR processes into CI/CD.
6. Cross-Functional Collaboration
- Work with application teams, infra teams, cloud CoE, InfoSec, and business stakeholders.
- Support audits and compliance checks (ISO 22301, SOC 2, RBI, PCI-DSS, depending on the organization).
- Lead technical communication during outages or DR events.
Required Skills & Experience
Technical Skills
- Expertise in DR/Backup tools: Veeam, Commvault, NetBackup, Rubrik, Zerto, Azure Site Recovery, AWS DR/Backup, VMware SRM, etc.
- Deep understanding of:
  - Storage (SAN/NAS), replication technologies
  - Virtualization (VMware/Hyper-V/KVM)
  - Windows/Linux servers
  - Network failover concepts (DNS, routing, load balancing)
- Strong experience with cloud DR (Azure/AWS/GCP).
- Experience in scripting/automation (PowerShell, Python, Bash).
Behavioral Skills
- Strong analytical and problem-solving skills.
- Ability to work under pressure during outages.
- Excellent documentation and communication skills.
Experience
- 6+ years of experience in Infrastructure/DR/Backup roles.
- At least 3 years in an L3/Senior engineering capacity.
Preferred Certifications
- DR/BCP: CBCI, ISO 22301 Lead Implementer
- Cloud: AWS/Azure Architect certifications
- Virtualization/Storage: VMware VCP, Dell/NetApp certifications
- Backup/DR tools: Vendor-specific certifications (Rubrik, Commvault, Veeam, Zerto, etc.)
Sample Job Title Variants
- Senior Disaster Recovery Engineer
- L3 Disaster Recovery Specialist
- DR & BCP Senior Engineer
- Resilience & Continuity Engineer
- Business Continuity & DR Architect
Job Classification
Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Quality Assurance and Testing
Role: Automation Test Engineer
Employment Type: Full time
Contact Details:
Company: Wissen Infotech
Location(s): Bengaluru
Keyskills:
Cross-Functional Coordination
L3 Escalations & Troubleshooting
Disaster Recovery Architecture & Design
DR Testing & Drills
Automation & Optimization