Global Disaster Recovery Engineer
4 weeks ago
Finning, CATERPILLAR’s strategic partner and a leader in equipment distribution and services, is looking for top talent to take on the role of Global Disaster Recovery & Resilience Engineer, responsible for building and operating the enterprise-wide Disaster Recovery (DR) and IT Service Continuity capability across Public Cloud (Azure/AWS), Hybrid Cloud (VMware in the UK and South America), and Private Managed Services (SAP on HANA and Lawson M3 at a private hosting provider). This role will design the DR strategy, runbooks/playbooks, and execution model, and will coordinate recovery with the Network, Identity & Access Management (IAM), Storage, Application, and Security (CSIRP) teams to ensure company readiness against technology disruptions. The DR Engineer will work closely with internal stakeholders across all Finning regions, vendors, and managed service providers to ensure the enterprise meets availability, RTO/RPO, compliance, and cost objectives, with a focus on operational excellence, automation, and auditability.
Major Job Functions:
Service Delivery (60%): Lead the execution of enterprise-wide DR processes across Public Cloud (Azure/AWS), Hybrid Cloud (VMware), and Private Managed Services (SAP and Lawson M3). Coordinate cross-domain recovery activities with Network, IAM, Storage, Application, and Security teams during DR drills and real incidents. Optimize DR tooling and the Managed Services DR process and documentation. Ensure recoverability of Entra ID and Microsoft 365, including break-glass accounts, Conditional Access rollback, and email continuity. Act as DR Incident Commander during outages, aligning with the Cybersecurity Incident Response Plan (CSIRP) for communication and escalation. Validate vendor DR capabilities and coordinate joint DR tests with Private Cloud providers.
Service Improvements (20%): Identify and suggest process enhancements through automation of DR workflows, using tools like PowerShell, Python, or Terraform to reduce recovery time and human error. Identify and drive remediation of single points of failure and resilience gaps across infrastructure and SaaS platforms. Implement continuous improvement initiatives based on lessons learned from DR tests and incidents. Work closely with the cloud financial management team (FinOps) to optimize costs associated with DR services (replication, storage, testing), ensuring financial efficiency without compromising availability.
Documentation (10%): Develop and maintain DR runbooks/playbooks for all critical systems, including cloud, hybrid, identity, and SaaS services. Keep dependency maps and CMDB entries accurate for recovery sequencing. Ensure all DR documentation is audit-ready, version-controlled, and reviewed quarterly.
Reporting (10%): Produce monthly DR readiness reports covering RTO/RPO compliance, backup/restore success rates, and DR test results. Maintain audit evidence for DR exercises and backup verification. Provide executive dashboards summarizing DR posture, risks, and improvement actions.
Requirements:
Education: Bachelor’s degree in Computer Science, Information Systems, Engineering, or equivalent experience.
5+ years in infrastructure/cloud operations, with 3+ years focused on Disaster Recovery/IT Service Continuity.
Disaster Recovery Execution: Ability to plan, coordinate, and execute DR drills and real failovers across multi-cloud, hybrid, and managed services environments. Proven record of leading multi-platform DR tests (cloud, VMware, and SaaS/Identity) and recovering complex applications in production or drills. Hands-on experience with Azure Site Recovery or equivalent, AWS Elastic Disaster Recovery, VMware SRM/Zerto or equivalent, and enterprise-grade backup and data protection solutions (e.g., Cohesity or similar).
Certifications (Preferred): Azure Solutions Architect, Azure Administrator, VMware Site Recovery Manager (SRM). ITIL Foundation. BC/DR: CBCP, ISO 22301 BCMS.
Demonstrated experience leading DR drills and real failovers across multi-cloud (Azure, AWS) and VMware environments. Deep understanding of replication, failover, and failback strategies. Identity and SaaS recovery (Entra ID, Microsoft 365). Networking for DR (DNS failover, ExpressRoute/Direct Connect, VPN). Advanced English proficiency (spoken and written). Proficiency in PowerShell, Python, or a similar tool for automating DR workflows and validation steps. Skilled in orchestrating recovery steps with Network, IAM, and Application teams. Ability to act as Incident Commander during outages, ensuring structured communication and escalation.
Santiago, Santiago Metropolitan Region, Chile | $1,800.00-$2,800.00 | 1 day ago
-
Sr. DevOps Engineer
2 days ago
Santiago de Chile | Soft Dev Team | Full-time
DevOps engineers play a pivotal role in streamlining the software development process, ensuring swift time-to-market, and adapting to market dynamics and competition. They are responsible for maintaining system stability and reliability while continually improving the mean time to recovery. Expertise in AWS (Amazon Web Services) is essential for this role....
-
Senior DevOps Engineer
3 weeks ago
Santiago Metropolitan Region, Chile | IDT Corporation | Full-time
Senior DevOps Engineer - Latam. IDT is looking for a Senior DevOps Engineer to join us remotely from Latam, designing, implementing, and maintaining our infrastructure and deployment pipelines. Your expertise in AWS, Kubernetes, HashiCorp Vault, and Consul will be essential to ensuring the reliability, scalability, and security of our applications....
-
Site Reliability Engineer
4 weeks ago
Santiago Metropolitan Region, Chile | GUX | Full-time
Site Reliability Engineer role at GUX. Base pay range Terraform. Main responsibilities: Define, measure, and operate SLOs, SLIs, and SLAs for critical cloud services. Ensure high availability, fault tolerance, and disaster recovery on AWS. Participate in incident management, on-call, and postmortems. Support the design, implementation, and maintenance of...
-
DevOps Engineer
2 weeks ago
Santiago, Chile | Manager Software | Full-time
Overview: Manager Software, a Chilean ERP company, seeks a DevOps Engineer with a strong background in cloud platforms, automation, and CI/CD pipelines. The role is based in Santiago and offers flexible remote work options. Responsibilities: Automate repetitive operational tasks and streamline deployment processes. Monitor, troubleshoot, and resolve incidents...
-
Product Support – Engineer
2 weeks ago
Santiago Metropolitan Region, Chile | Robert Walters | Full-time
At Robert Walters, we are looking for a Product Support Engineer for a major global technology company specializing in Collections & Recovery solutions, to join its team in Chile. The role aims to ensure that the products function correctly by providing advanced technical support to customers, resolving incidents...
-
Senior Mainframe Engineer
2 weeks ago
Santiago Metropolitan Region, Chile | Tata Consultancy Services | Full-time
A leading technology company is looking for a Senior Developer Engineer to work on global projects for a major bank. The ideal candidate must have experience with COBOL and legacy mainframe environments, as well as solid intermediate/advanced English. The position offers a collaborative environment, career opportunities, and comprehensive health insurance....
-
Azure Infrastructure Director: Cloud Transformation
4 weeks ago
Chile | EPAM Systems | Full-time
A global IT services company seeks a Senior Azure Infrastructure Architect to design and oversee advanced cloud solutions. This role requires leading cloud transformations, establishing secure infrastructure, and optimizing performance across enterprise applications. The successful candidate will possess over 12 years of experience in Solution Architecture,...
-
Sr. DevOps Engineer
5 days ago
Santiago de Chile | Flow RMS | Full-time
Senior DevOps Engineer (Remote - Latin America) **How to Apply** To be considered for this position, you must record a short video explaining why you are excited and qualified for this role. You can use a free tool like Loom to create your video. **Please send us a message on Indeed with a link to your video.** **Flow RMS Overview** Flow RMS is...
-
Remote Consulting Power Engineer for Global Grid Projects
4 weeks ago
Santiago Metropolitan Region, Chile | Acelerex | Full-time
A global consulting firm is seeking an Entry-Level Consulting Power Engineer to join their team remotely in Santiago, Chile. The successful candidate will be involved in delivering consulting and software implementation services, focusing on power engineering projects. Responsibilities include producing deliverables, preparing financial models, and analyzing...
-
Technical Support Engineer
2 weeks ago
Santiago Metropolitan Region, Chile | The Global Talent Co. | Full-time
Technical Support Engineer at The Global Talent Co. Full-time | Remote | LATAM | 4:00 pm - Midnight PST. About Us: At The Global Talent Co., we provide opportunities to work with leading innovative technology companies worldwide, offering stable employment, competitive compensation, career growth, and...