Senior Site Reliability

hace 2 semanas

Región Metropolitana de Santiago Chile Canonical A tiempo completo

Senior Site Reliability / GitOps Engineer Join to apply for the Senior Site Reliability / GitOps Engineer role at Canonical. Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers, and industry leaders in many sectors. The company is a pioneer of global distributed collaboration, with 1200+ colleagues in 75+ countries and very few office‑based roles. Teams meet two to four times yearly in person, in interesting locations around the world, to align on strategy and execution. The company is founder‑led, profitable, and growing. Job Summary The Information Systems (IS) team at Canonical supports and maintains all of Canonical's IT production services, in charge of running services used by over 60 million Ubuntu users. As a Senior SRE & GitOps engineer you will drive operations automation to the next level, both in our private clouds and in public clouds, using open‑source infrastructure as code, CI/CD pipelines, and Canonical’s leading products for software operation automation. You will also improve Canonical products by providing feedback, submitting bugs, and collaborating on design and implementation with other teams. Responsibilities Drive the development of automation and GitOps in your team as an embedded tech lead. Closely collaborate with the IS architect to align solutions with the IS architecture vision. Design and architect services that IS can offer to the organization as products. Apply your experience of IaC to develop infrastructure as code practice within IS by constantly increasing automation and improving IaC processes. Automate software operations for re‑usability and consistency across private and public clouds, taking into consideration the complexities of distributed systems. Maintain operational responsibility for all of Canonical's core services, networks, and infrastructure. Develop skills in troubleshooting, capacity planning, and performance investigation; set up, maintain and use observability tools such as Prometheus, Grafana, and Elasticsearch; design, implement and maintain monitoring and alerting for various systems and services. Provide assistance and work with globally distributed engineering, operations, and support peers. Be given uninterrupted development time to focus on larger projects and automation of manual tasks. Share your experience, know‑how and best practices with other team members in design sessions, mentorship and ‘doing work together’. Carry final responsibility for time‑critical escalations. Qualifications A modern view on hosting architecture, driven by infrastructure as code across both private and public clouds. A product mindset thriving to develop products rather than solutions. Python software development experience with large projects. Experience working with Kubernetes or other container orchestration systems. Proven exposure to manage and deploy cloud infrastructure with code. Practical knowledge of Linux networking, routing, and firewalls. Affinity with various forms of Linux storage, from Ceph to databases. Hands‑on experience administering enterprise Linux servers. Extensive knowledge of cloud computing concepts and technologies. Bachelor's degree or greater, preferably in computer science or related engineering field. Able to communicate clearly and effectively in English over email, chat, video or voice calls and in‑person. Motivated and able to troubleshoot from kernel to web, and willing to ask others when appropriate. A willingness to be flexible and to learn new things quickly. Be inspired by the needs of fast‑changing environments. Happy to work within distributed teams. Be passionate and familiarised with open‑source, especially Ubuntu or Debian. Benefits Distributed work environment with twice‑yearly team sprints in person. Personal learning and development budget of USD 2,000 per year. Annual compensation review. Recognition rewards. Annual holiday leave. Maternity and paternity leave. Team Member Assistance Program & Wellness Platform. Opportunity to travel to new locations to meet colleagues. Priority Pass and travel upgrades for long‑haul company events. About Canonical Canonical is a pioneering tech firm at the forefront of the global move to open source. As the company that publishes Ubuntu, one of the most important open‑source projects and the platform for AI, IoT, and the cloud, we are changing the world of software. We recruit on a global basis and set a very high standard for people joining the company. We expect excellence; to succeed, we need to be the best at what we do. Most colleagues at Canonical have worked from home since our inception in 2004. Working here is a step into the future and will challenge you to think differently, work smarter, learn new skills, and raise your game. We are proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background create a better work environment and better products. Whatever your identity, we will give your application fair consideration. Seniority level Mid‑Senior level Employment type Full‑time Job function Engineering and Information Technology Industries Software Development #J-18808-Ljbffr

Riu1102 - Site Reliability Engineering

hace 1 hora

Santiago de Chile IT-TALENT Headhunter SENIOR IT A tiempo completo

**IT-Talent Headhunter SENIOR TI** busca para uno de sus clientes **Site Reliability Engineering** Requisitos/Conocimientos/Experiência: - Ingeniero Informático. Sistemas. Tecnología, Computación o afín - Kubernetes - Docker - Celery - Redis - Linux / Debian - Redes - Cloud - BBDD PostgreSQL Renta líquida: Indicar pretensión de renta Modalidad:...
Remote Senior Site Reliability Engineering Manager

hace 3 días

, , Chile Next League A tiempo completo

A leading sports technology consultant is seeking a Senior Engineering Manager for Site Reliability. The successful candidate will lead a team of site reliability engineers, ensuring high availability and performance of systems for clients like NASCAR. This role is remote and requires a minimum of 5 years in SRE and 2 years in management. Offering between...
Senior Engineering Manager, Site Reliability

hace 3 días

, , Chile Next League A tiempo completo

Senior Engineering Manager, Site Reliability Join to apply for the Senior Engineering Manager, Site Reliability role at Next League . As the Senior Manager of Site Reliability Engineering, you will be responsible for ensuring the reliability, scalability, and efficiency for a wide range of client systems, including organizations such as NASCAR, USOPC, and...
Site Reliability Engineer

hace 2 semanas

, Región Metropolitana de Santiago, Chile Infosys A tiempo completo

Site Reliability Engineer 1 week ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. Infosys is a global leader in next-generation digital services and consulting. We enable clients in more than 50 countries to navigate their digital transformation. With over four decades of experience in managing the systems...
Site Reliability Engineer

hace 2 semanas

Santiago de Chile Careers at SunDevs A tiempo completo

**Descripción del puesto**: Como Site Reliability Engineer en SunDevs, colaborarás con otros ingenieros de software senior y Platform Engineers para diseñar y desarrollar sistemas y plataformas en la nube altamente disponibles, escalables, seguras y mantenibles para resolver grandes desafíos. Brindarás asesoramiento y guía a nuestros ingenieros de...
Senior Associate, Site Reliability Engineer

hace 2 días

Santiago de Chile Kyndryl Chile SpA A tiempo completo

**Why Kyndryl** Kyndryl is a market leader that thinks and acts like a start-up. We design, build, manage, and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our...
Site Reliability Engineer

hace 1 hora

Santiago de Chile Launchpad Technologies Inc. A tiempo completo

Launchpad, a people-first technology company, is a leader in North America´s rapidly growing tech sector. Through two solutions, Launchpad supports its clients with digital transformation: - PaasportTM, our iPaaS solution, streamlines software integration and automates workflows. - Nearshore Staff Augmentation, our managed IT staffing service, connects top...
Site Reliability Engineer

hace 5 días

Santiago de Chile Kyndryl Chile SpA A tiempo completo

**Why Kyndryl** Kyndryl is a market leader that thinks and acts like a start-up. We design, build, manage, and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our...
Site Reliability Engineer

hace 2 semanas

Santiago de Chile Kabeli Selección A tiempo completo

En Kabeli buscamos ser socios estratégicos de nuestros clientes apoyándoles en la implementación de proyectos ágiles y en la transformación digital de sus procesos, con foco en un delivery continuo. Nuestro objetivo es agregar valor a sus distintos productos y servicios, a través de soluciones tecnológicas innovadoras y fáciles de...
Especialista en Site Reliability Engineer

hace 2 días

Santiago de Chile Proyectum A tiempo completo

**Descripción empresa**: Proyectum Chile, brinda como propósito el promover la aplicación de prácticas profesionales de Dirección de Proyectos, ofreciendo servicios de capacitación y consultoría. Así como también la incorporación de ofertas de servicios recurrentes, tales como Outsourcing de Profesionales. Junto a 11 países latinoamericanos...

América

Europa

Asia / Oceanía

África

Senior Site Reliability