Senior Site Reliability Engineer
19 hours ago
Launchpad, a people-first technology company, is a leader in North America’s rapidly growing tech sector. Through two solutions, Launchpad supports its clients with digital transformation:
- Paasport™, our iPaaS solution, streamlines software integration and automates workflows.
- Nearshore Staff Augmentation, our managed IT staffing service, connects top IT talent across various geographical regions, bringing industry expertise to leading clients.
Based in Vancouver, Canada, our operational footprint spans North and South America, with a second headquarters in Santiago, Chile.
At Launchpad, we genuinely care about our people as individuals. If you are looking for a team that values growth, drive, and passion for your craft, if you’re seeking a place to achieve your goals and dreams with fairness and integrity, then we’d love to hear from you.
**About the Role**:
We are seeking a **Senior Site Reliability Engineer (SRE)** to play a pivotal role in ensuring the reliability, scalability, and performance of our infrastructure. This is a mission-critical role, requiring someone who can address both external product reliability and internal platform demands while contributing strategically to organizational objectives.
You will balance hands-on technical work with leadership in reliability initiatives, driving improvements across our platform and collaborating with stakeholders at all levels. This position is crucial to maintaining operational excellence as we navigate complex compliance standards and evolving business needs.
**Responsibilities**:
**Strategic and Leadership Responsibilities**:
- Drive the development and enhancement of reliability frameworks and processes.
- Collaborate with VPs, managers, and cross-functional teams, presenting monthly reliability reports and real-time data analysis.
- Lead initiatives to address key operational challenges and identify areas for process innovation.
**Technical Responsibilities**:
- Design, build, and maintain reliable and scalable infrastructure using **Azure (90%)** and **AWS (10%)**.
- Automate infrastructure and operational tasks with **Kubernetes**, **Terraform**, **Jenkins**, and **GitHub Actions**.
- Develop and refine monitoring solutions using tools like **Grafana**, **Prometheus**, **ELK Stack**, and **Azure Monitoring**.
- Manage incident response and conduct post-mortem analyses to improve system resiliency.
- Provide Level 3 operational support, including on-call availability (preferred).
- Address gaps in automation and optimize existing processes.
**Compliance and Reporting**:
- Ensure systems comply with **ISO 27001** and **SOC2** standards.
- Develop and improve reliability metrics and their communication to stakeholders.
**Qualifications**:
**Technical Skills**:
- Expertise in cloud infrastructure, particularly **Azure** and **AWS**.
- Strong experience with containerization and orchestration (**Kubernetes**) and infrastructure-as-code tools (**Terraform**).
- Proficiency in CI/CD pipelines (**Jenkins**, **GitHub Actions**) and monitoring tools (**Grafana**, **Prometheus**).
- Experience with secrets management tools such as **HashiCorp Vault** and incident management platforms like **OpsGenie**.
**Experience**:
- 7+ years in Site Reliability Engineering, DevOps, or similar roles.
- Proven track record of managing complex cloud environments and driving operational improvements.
- Familiarity with compliance frameworks such as **ISO 27001** and **SOC2**.
- Experience presenting technical data and initiatives to executive-level stakeholders.
**Soft Skills**:
- Exceptional communication skills to collaborate across diverse teams and organizational levels.
- Strong analytical mindset to address reliability challenges and identify innovative solutions.
- Ability to work in a fast-paced environment and meet urgent deadlines.
**Why work for Launchpad?**:
- 100% remote
- People-first culture
- Excellent compensation in US Dollars
- Hardware setup for working from home
- Work with global teams and prominent brands based in North America, Europe, and Asia
- Training allowances
- Personal time off (PTO) for vacations, study leave, personal time, etc.
- ...and more
At Launchpad, we genuinely care about our people as individuals. If you are looking for a team that values growth, drive, and passion for your craft, if you’re seeking a place to achieve your goals and dreams with fairness and integrity, then you are the future of Launchpad. Launchpad is committed to fostering a diverse and representative workforce and an inclusive work environment where all employees are respected and treated equally.
Are you ready to elevate your career at Launchpad? We want to hear your story. Contact us today.
-
Site Reliability Engineer
1 week ago
Santiago de Chile · Launchpad Technologies Inc. · Full-time
Recognized as one of Canada’s fastest-growing companies, Launchpad provides next-generation integration platform capabilities for connecting and managing enterprise automation and data integration. Headquartered in Vancouver, Canada, our operations span both North and South America, with a second headquarters located in Santiago, Chile. Our vision is to...
-
Site Reliability Engineer
17 hours ago
Región Metropolitana de Santiago, Chile · Infosys · Full-time
Infosys is a global leader in next-generation digital services and consulting. We enable clients in more than 50 countries to navigate their digital transformation. With over four decades of experience in managing the systems...
-
Site Reliability Engineer
1 week ago
Santiago de Chile · BairesDev · Full-time
At BairesDev®, we've been leading the way in technology projects for over 15 years. We deliver cutting-edge solutions to giants like Google and the most innovative startups in Silicon Valley. Our diverse 4,000+ team, composed of the world's Top 1% of tech talent, works remotely on roles that drive significant impact worldwide. Site Reliability Engineer at...
-
Site Reliability Engineer
3 weeks ago
Chile · UST España & Latam · Full-time
We’re still looking for talent… and we’d love for you to join our team! For more than 25 years, UST has partnered with the world’s leading companies to create real impact through business transformation. Driven by technology, inspired by people,...
-
Work From Home Site Reliability Engineer
2 weeks ago
Santiago de Chile · BairesDev SA · Full-time
Who we are: BairesDev is proud to be the fastest-growing company in America. With people on five continents and world-class clients, we are only as strong as the multicultural teams at the heart of our business. To consistently deliver the highest quality solutions to our clients, we only hire the Top 1% of the best talents and nurture their professional...
-
Site Reliability Engineer Specialist
2 weeks ago
Santiago de Chile · Proyectum · Full-time
**Company description**: Proyectum Chile's purpose is to promote the application of professional project management practices by offering training and consulting services, as well as recurring service offerings such as professional outsourcing. Together with 11 Latin American countries...
-
Work From Home Site Reliability Engineer
7 days ago
Santiago, Chile · BairesDev SA · Full-time
BairesDev® is a leading tech company empowering exceptional minds to redefine possibilities. We deliver cutting-edge solutions to giants like Google and top startups, working on projects that benefit millions. Site Reliability Engineer at BairesDev: We are looking for a Site Reliability Engineer to build and maintain highly reliable, scalable, and secure...
-
Lead Site Reliability Engineer – Cloud, CI/CD, Kubernetes
4 weeks ago
Chile · EPAM Systems · Full-time
A global tech consulting firm in Chile is seeking a Lead Site Reliability Engineer to oversee enterprise application infrastructure and drive reliability through advanced DevOps practices. The ideal candidate will have strong Python skills, extensive experience with cloud platforms like AWS and Azure, and expertise in managing Kubernetes clusters. This...
-
Lead Site Reliability Engineer
4 weeks ago
Chile · EPAM Systems · Full-time
We are seeking a Lead Site Reliability Engineer to oversee enterprise application infrastructure and drive reliability through advanced DevOps practices and tools. You will lead efforts to optimize cloud environments, implement robust CI/CD pipelines,...
-
Site Reliability Engineer
4 weeks ago
Región Metropolitana de Santiago, Chile · Grupo Falabella · Full-time
Company description: We are more than 90,000 people who, day after day, dedicate our passion and energy to fulfilling our Purpose of “Simplifying and Enjoying Life More”. That purpose is lived through our physical and digital ecosystem across all our companies (Falabella Retail, Sodimac, IKEA, Tottus, Mallplaza, Falabella...