Site Reliability

hace 4 días


Santiago de Chile Canonical - Jobs A tiempo completo

This role is an opportunity for a hands-on, but literally hands-off, technologist with a passion for Linux to build a career with Canonical and drive the success with those leveraging Ubuntu and open source products. If you have an affinity for operations automation and a passion for technology, then you will enjoy working with some of the best people in the industry at Canonical.

**Job Summary**:
The IS team at Canonical supports and maintains all of Canonical's IT production services. The team is in charge of running services used by over 60 million Ubuntu users.

As an SRE & Gitops engineer you'll be in a unique position to drive operations automation to the next level, both in our own private clouds as well as in the public clouds. We do this by utilizing the best of open source infrastructure as code software, software development practices such as CI/CD pipelines, and Canonical's leading products for software operation automation.

In addition to defining the infrastructure as code, you will improve Canonical products and the open-source technologies they're based on by providing critical feedback to developers on how their products operate at scale. This is done by submitting bugs (and sometimes writing pull requests) and collaborating on design and implementations with other teams within the company.

You'll be part of a global team of SREs that work together and support each other to provide the best possible services to our company, Canonical's customers and the Ubuntu Community.

**As a Site Reliability / Gitops Engineer engineer you will**:

- Automate software operations for reusability and consistency across private and public clouds, taking into consideration the complexities of distributed systems
- Develop infrastructure as code practice within IS by constantly increase automation and improve IaC processes
- Develop new features and improve the resilience and scalability of the existing cloud and container portfolio at Canonical
- Maintain operational responsibility for all of Canonical's core services, networks, and infrastructure
- Develop skills in troubleshooting, capacity planning, and performance investigation,
- Setting up, maintaining and using observability tools such as Prometheus, Grafana, and Elasticsearch; design, implement and maintain Implementing monitoring and alerting for various systems and services
- Collaborate with development teams to design service architecture, documentation, playbooks, policies and operational procedures
- Provide assistance and work with globally distributed engineering, operations, and support peers
- Be given uninterrupted software development time to focus on larger projects and automation of manual tasks
- Carry final responsibility for time-critical escalations
- Strong modern engineering background (peer-review, unit testing, SCM, CI/CD, Agile)
- Python software development experience, with large projects
- Practical knowledge of Linux networking, routing, and firewalls
- Affinity with various forms of Linux storage, from Ceph to Databases
- Hands-on experience administering enterprise Linux servers
- Extensive knowledge of cloud computing concepts and technologies
- Bachelor's degree or greater, preferably in computer science or related engineering field
- Motivated and able to troubleshoot from kernel to web, and willing to ask others when appropriate
- A willingness to be flexible and able to learn new things quickly
- Be inspired by the needs of fast-changing environments
- Happy to work within distributed teams
- Be passionate and familiarized about open-source, especially Ubuntu or Debian
- A residence in North-, Middle
- or South America

**What we offer you**:
Your base pay will depend on various factors including your geographical location, level of experience, knowledge and skills. Our compensation philosophy is to ensure equity right across our global workforce.
- Fully remote working environment - we've been working remotely since 2004
- Personal learning and development budget of 2,000USD per annum
- Annual compensation review
- Recognition rewards
- Annual holiday leave
- Parental Leave
- Employee Assistance Programme
- Opportunity to travel to new locations to meet colleagues at 'sprints'
- Priority Pass for travel and travel upgrades for long haul company events

**About Canonical**:
Canonical is a pioneering tech firm that is at the forefront of the global move to open source. As the company that publishes Ubuntu, one of the most important open source projects and the platform for AI, IoT and the cloud, we are changing the world on a daily basis. We recruit on a global basis and set a very high standard for people joining the company. We expect excellence - in order to succeed, we need to be the best at what we do.

Canonical has been a remote-first company since its inception in 2004. Work at Canonical is a step into the future, and will challenge you to think differently, work smarter, learn new skills, and raise your ga


  • Site Reliability Engineer

    hace 24 minutos


    , , Chile Capchase A tiempo completo

    Site Reliability Engineer – Capchase Join to apply for the Site Reliability Engineer role at Capchase Capchase is the #1 platform for vendor financing in tech. We help software and hardware vendors offer flexible installment payments as part of the sales process, improving conversion rates and cashflow. We provide an awesome buyer experience. Capchase was...


  • , , Chile Next League A tiempo completo

    A leading sports technology consultant is seeking a Senior Engineering Manager for Site Reliability. The successful candidate will lead a team of site reliability engineers, ensuring high availability and performance of systems for clients like NASCAR. This role is remote and requires a minimum of 5 years in SRE and 2 years in management. Offering between...


  • Santiago de Chile Launchpad Technologies Inc. A tiempo completo

    Launchpad, a people-first technology company, is a leader in North America´s rapidly growing tech sector. Through two solutions, Launchpad supports its clients with digital transformation: - PaasportTM, our iPaaS solution, streamlines software integration and automates workflows. - Nearshore Staff Augmentation, our managed IT staffing service, connects top...


  • Santiago de Chile Careers at SunDevs A tiempo completo

    **Descripción del puesto**: Como Site Reliability Engineer en SunDevs, colaborarás con otros ingenieros de software senior y Platform Engineers para diseñar y desarrollar sistemas y plataformas en la nube altamente disponibles, escalables, seguras y mantenibles para resolver grandes desafíos. Brindarás asesoramiento y guía a nuestros ingenieros de...


  • , , Chile Next League A tiempo completo

    Senior Engineering Manager, Site Reliability Join to apply for the Senior Engineering Manager, Site Reliability role at Next League . As the Senior Manager of Site Reliability Engineering, you will be responsible for ensuring the reliability, scalability, and efficiency for a wide range of client systems, including organizations such as NASCAR, USOPC, and...

  • Site Reliability Engineer

    hace 24 minutos


    , , Chile UST España & Latam A tiempo completo

    1 day ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. We’re still looking for talent… and we’d love for you to join our team! For more than 25 years, UST has partnered with the world’s leading companies to create real impact through business transformation. Driven by technology, inspired by people,...


  • , , Chile Capchase A tiempo completo

    A leading vendor financing platform is seeking a Site Reliability Engineer to ensure the availability and performance of our systems as we scale. This foundational role involves designing resilient architectures, managing incident responses, and collaborating with talented engineers. Candidates should possess a Bachelor's in Computer Science, experience in...

  • Site Reliability Engineer

    hace 24 minutos


    , Región Metropolitana de Santiago, Chile Infosys A tiempo completo

    Site Reliability Engineer 1 week ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. Infosys is a global leader in next-generation digital services and consulting. We enable clients in more than 50 countries to navigate their digital transformation. With over four decades of experience in managing the systems...


  • Santiago, Chile BairesDev SA A tiempo completo

    Who we are BairesDev is proud to be the fastest-growing company in America. With people on five continents and world-class clients, we are only as strong as the multicultural teams at the heart of our business. To consistently deliver the highest quality solutions to our clients, we only hire the Top 1% of the best talents and nurture their professional...

  • Site Reliability

    hace 24 minutos


    , Región Metropolitana de Santiago, Chile Canonical A tiempo completo

    Site Reliability / Gitops Engineer Join to apply for the Site Reliability / Gitops Engineer role at Canonical Job Summary The IS team at Canonical supports and maintains all of Canonical's IT production services. The team is in charge of running services used by over 60 million Ubuntu users. Responsibilities Apply your experience of IaC to develop...