Responsibilities
- Create and execute comprehensive business continuity and disaster recovery (BCP/DR) programs based on industry standards (ISO 22301, NIST SP 800-34, ISO 27001).
- Perform Business Impact Analyses (BIA) to determine essential business functions, dependencies, and recovery priorities.
- Establish and maintain Recovery Time Objectives (RTO) and Recovery Point Objectives (RPO) for all critical systems and services.
- Develop and maintain disaster recovery playbooks and runbooks for various incident scenarios.
- Create and manage crisis communication frameworks for security incidents and business disruptions.
- Lead tabletop exercises and disaster recovery drills to validate recovery procedures.
- Design and implement backup and recovery solutions for AWS cloud infrastructure (primary focus).
- Build automated backup workflows for databases, storage systems, applications, and configurations.
- Implement immutable backup strategies and offsite replication for ransomware resilience.
- Monitor backup operations, validate recovery procedures, and maintain backup integrity.
- Optimize backup windows, retention policies, and storage costs across cloud environments.
- Implement Infrastructure as Code (IaC) for DR environment provisioning and configuration management.
- Develop automated failover and failback procedures for critical services.
- Design and maintain hot/warm/cold standby environments based on business requirements.
- Conduct regular disaster recovery testing and document test results with improvement recommendations.
- Build monitoring and alerting systems for backup health, replication lag, and recovery readiness.
- Maintain detailed recovery documentation including network diagrams, dependency maps, and configuration details.
- Collaborate with application teams to ensure application-consistent backups and recovery procedures.
- Ensure BCP/DR programs meet regulatory requirements and customer commitments.
- Maintain comprehensive documentation of recovery procedures, test results, and capability assessments.
- Track and report on key resilience metrics including RTO/RPO achievement, test success rates, and recovery drills.
- Coordinate with internal audit and compliance teams during assessments.
- Participate in vendor risk assessments for third-party backup and recovery solutions.
Compensation
Competitive
Work Arrangement
On-site
Team
Security
Responsibilities
- Design and implement comprehensive BCP/DR programs aligned with industry frameworks (ISO 22301, NIST SP 800-34, ISO 27001).
- Conduct Business Impact Analyses (BIA) to identify critical business functions, dependencies, and recovery priorities.
- Define and maintain Recovery Time Objectives (RTO) and Recovery Point Objectives (RPO) for all critical systems and services.
- Develop and maintain disaster recovery playbooks and runbooks for various incident scenarios.
- Create and manage crisis communication frameworks for security incidents and business disruptions.
- Lead tabletop exercises and disaster recovery drills to validate recovery procedures.
- Design and implement backup and recovery solutions for AWS cloud infrastructure (primary focus).
- Build automated backup workflows for databases, storage systems, applications, and configurations.
- Implement immutable backup strategies and offsite replication for ransomware resilience.
- Monitor backup operations, validate recovery procedures, and maintain backup integrity.
- Optimize backup windows, retention policies, and storage costs across cloud environments.
- Implement Infrastructure as Code (IaC) for DR environment provisioning and configuration management.
- Develop automated failover and failback procedures for critical services.
- Design and maintain hot/warm/cold standby environments based on business requirements.
- Conduct regular disaster recovery testing and document test results with improvement recommendations.
- Build monitoring and alerting systems for backup health, replication lag, and recovery readiness.
- Maintain detailed recovery documentation including network diagrams, dependency maps, and configuration details.
- Coordinate with application teams to ensure application-consistent backups and recovery procedures.
- Ensure BCP/DR programs meet regulatory requirements and customer commitments.
- Maintain comprehensive documentation of recovery procedures, test results, and capability assessments.
- Track and report on key resilience metrics including RTO/RPO achievement, test success rates, and recovery drills.
- Coordinate with internal audit and compliance teams during assessments.
- Participate in vendor risk assessments for third-party backup and recovery solutions.
Not provided