Staff Site Reliability Engineer N5737

hace 2 semanas


Santiago, Metropolitana, Chile Nisum A tiempo completo
The Staff Site Reliability Engineer is a position of technical expertise, influence, and leadership in the technology realm.

The position requires the individual to apply their expert knowledge to ensure best practices and well-engineered architecture across multiple teams.

They will also be a key stakeholder and initiator of major changes to processes, engineering practices, and system administration. This position will be required to work in a space of solving critical issues and initiatives across multiple teams. It will require an extensive and deep understanding of cutting-edge practices and innovative approaches to problems.

Staff Site Reliability Engineers are also tasked with establishing and maintaining a positive and productive culture based on the client Leadership Principles.

Essential Functions and Responsibilities:

  • Identify and reduce redundant, unnecessary processes by asking the question, "Is this process improving our ability to reliably deliver software to our customers?"
  • Help resolve bugs causing consistent reliability issues by working to identify the root cause and proposing shortterm and longterm solutions to triage and resolve them.
  • Advocate for reliable design patterns in our distributed systems (graceful failure in the event of absent dependencies, sharenothing architecture, loose coupling of services, etc.) and work to get reliability features prioritized in team backlogs.
  • Identify reliability concerns before they become an outage via engagements with teams regarding major code revisions.
  • Collaborate with developers, designers, testing, and product management to address issues with service reliability or performance.
  • Work with Product to create and update KPIs for their products to allow business process level observability of throughput.
  • Work with product teams to set SLO, SLI, and error budgets for their services.
  • Participate in an oncall rotation and respond to major incidents.
  • Performs other related duties as assigned.
Knowledge, Skill and Abilities:

  • Demonstrated ability to work with monitoring and alerting platforms; New Relic and Opsgenie.
  • Practical knowledge of CI/CD technologies; Github, CodePipeline.
  • Practical knowledge of programming languages; Python and Java.
  • Practical knowledge of databases; Oracle, Postgres, DynamoDB.
  • Demonstrated ability with chaos engineering technologies.
  • Demonstrated ability to perform technical analysis in a discovery fashion, resulting in architecture artifacts such as a logical system deployment diagram, sequence diagram, state diagram, ERD, etc. This must be performed in UML.
  • Practical knowledge of design patterns, the conditions that indicate their usage and an ability to identify antipatterns when presented a diagram that contains the relevant information.
  • Practical knowledge of various application architectures; must be well familiar with Martin Fowler's Enterprise Application Design Patterns.
  • Can execute through soft skills as this position is one of leadership but has no direct reports.
Competencies:

Organizational or Student Impact:

  • Recommends and implements changes in technical/business processes; identifies areas for improvement.
  • Helps lead/coordinate extremely complex technical projects and programs and leads development and implementation of innovative solutions for specialized technical issues.
  • Works proactively; identifies and helps prevent/ solve problems that may cross disciplines.
  • Fully understands and quantifies project risks with impact. Identifies, generates, and implements innovative solutions.

Problem Solving & Decision Making:

  • This individual accomplishes goals and objectives independently.
  • Builds and leads teams, influencing decisions and results.
  • Uses discretion to fully scope, design, and implement solutions to complex technical problems.
  • The individual provides regular technical advice and direction to technical teams and management.
  • Models and helps set high standards for effective interactions with internal and external individuals.

Communication & Influence:

  • Communicates with parties within and outside of their job function and typically has responsibilities for communicating with parties external to the organization.
  • Works to influence others to accept and understand new concepts, practices, and approaches. Requires ability to communicate with executive leadership regarding matters of significant importance to the organization.
  • This individual may conduct briefings with senior leaders within the technical function.

Leadership:

  • Frequently responsible for providing guidance, coaching, and training to other employees across the Company within the area of expertise.
  • Responsible for managing large, complex project initiatives or strategically important solutions to the organization, involving large crossfunctional teams.
  • May have direct reports but generally fewer than three.
Minimum Qualifications:

  • The individual is acknowledged within the group as a subject matter expert.
  • Typically requires a University Degree or equivalent experience. 9 years of prior relevant experience.


Department Specific Minimum Qualifications:


  • Bachelor's degree in computer science, information technology, or a related field.
  • Appropriate prior experience and depth may be substituted for this qualification. 9 years experience in site reliability engineering, systems administration, or software development; automating approaches and technologies in engineering
  • Experience in webbased applications and integrations to enable those application using Java, Python, REST, JSON/YAML, XML, SQL and other technologies, including experience integrating thirdparty products
  • Experience in monitoring and alerting platforms, especially New Relic and Opsgenie.
Preferred Qualifications:

  • Technical Experience in any of the following:Amazon Web Services (AWS), Jira, Agile/Scrum, Python, OAuth 2, IDM/OSSO, Ruby/Rails, PHP, Hibernate/Seam, J2EE, Tomcat, jQuery, JavaScript, NOSQL, Angular, New Relic, Opsgenie.
Technical Certifications - Strong experience with distance education and distance learning students is preferred.

¿What can we offer you?
- Belong to an international and multicultural company that supports diversity.- Be part of international projects with a presence in North America, Pakistan, India and Latam.- Work environment with extensive experience in remote and distributed work, using agile methodologies.- Culture of constant learning and development in current technologies.- Pleasant and collaborative environment, with a focus on teamwork.- Access to learning platforms, Google Cloud certifications, Databricks, Tech Talks, etc.- Being part of various initiatives and continuous participation in internal and external activities of innovation, hackathon, technology, agility, talks, webinars, well-being and culture with the possibility not only to participate but also to be an exhibitor.- Since you live in Chile you will also have access to several benefits related to our center :)
  • Site Reliability Engineer

    hace 2 semanas


    Santiago, Metropolitana, Chile Launchpad Technologies Inc. A tiempo completo

    Launchpad, a people-first technology company, is a leader in North America ́s rapidly growing tech sector.Through two solutions, Launchpad supports its clients with digital transformation: PaasportTM, our iPaaS solution, streamlines software integration and automates workflows. Nearshore Staff Augmentation, our managed IT staffing service, connects top IT...


  • Santiago, Metropolitana, Chile Nisum A tiempo completo

    Nisum is a leading global digital commerce firm headquartered in California, with services spanning digital strategy and transformation, insights and analytics, blockchain, business agility, and custom software development. Founded in 2000 with the customer-centric motto "_Building Success Together_," Nisum has grown to over 1,800 professionals across the...

  • Site Reliability Engineer

    hace 2 semanas


    Santiago, Metropolitana, Chile Kyndryl Chile SpA A tiempo completo

    Why KyndrylKyndryl is a market leader that thinks and acts like a start-up. We design, build, manage, and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl?We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our...


  • Santiago, Metropolitana, Chile Amaris Consulting A tiempo completo

    Job description Buscamos consultores dinámicos para hacer crecer nuestro equipo de Sistemas de Información y Digital en Santiago de Chile.Tu experiencia, conocimiento y compromiso nos ayudarán a enfrentar los desafíos de nuestros clientes.Estarás apoyando diferentes proyectos a través de tu experiencia como SRE (Site Reliability Engineer).Sus...

  • Site Reliability Engineer

    hace 2 semanas


    Santiago, Metropolitana, Chile Abenis A tiempo completo

    En Abenis nos encontramos en búsqueda de un (a) Site Reliability Engineer o SRE para uno de nuestros clientes, importante multinacional ubicada en la comuna de Providencia.Funciones: Participará y mejorará el ciclo de vida del desarrollo de software, desde el inicio y el diseño, hasta el desarrollo, la implementación, la operación y el refinamiento....


  • Santiago, Metropolitana, Chile amaris A tiempo completo

    Job descriptionBuscamos consultores dinámicos para hacer crecer nuestro equipo de Sistemas de Información y Digital en Santiago de Chile. Tu experiência, conocimiento y compromiso nos ayudarán a enfrentar los desafíos de nuestros clientes.Estarás apoyando diferentes proyectos a través de tu experiência como SRE (Site Reliability Engineer).Sus...


  • Santiago, Metropolitana, Chile Amaris Consulting A tiempo completo

    Who are we? :Amaris Consulting es una firma independiente de asesoría tecnológica que ofrece servicios de orientación y soluciones para las empresas.Reúne a más de 7 600 personas distribuidas en 5 continentes y más de 60 países. Con más de 1 000 clientes en todo el mundo, hemos implementado soluciones en proyectos importantes durante más de una...


  • Santiago, Metropolitana, Chile Kyndryl Chile SpA A tiempo completo

    Why KyndrylKyndryl is a market leader that thinks and acts like a start-up. We design, build, manage, and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl?We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our...


  • Santiago, Metropolitana, Chile Launchpad Technologies Inc. A tiempo completo

    Recognized as one of Canada's fastest-growing companies, Launchpad provides next-generation integration platform capabilities for connecting and managing enterprise automation and data integration. Headquartered in Vancouver, Canada, our operations span both North and South Americas, with a second headquarter located in Santiago, Chile.Our vision is to bring...


  • Santiago, Metropolitana, Chile IT-TALENT Headhunter SENIOR IT A tiempo completo

    IT-Talent Headhunter SENIOR TI busca para uno de sus clientes Site Reliability EngineeringRequisitos/Conocimientos/Experiência: Ingeniero Informático. Sistemas. Tecnología, Computación o afín Kubernetes Docker Celery Redis Linux / Debian Redes Cloud BBDD PostgreSQLRenta líquida: Indicar pretensión de rentaModalidad: Mixta. 3x2Contrato directo con...

  • Reliability Engineer

    hace 2 semanas


    Santiago, Metropolitana, Chile Goodyear A tiempo completo

    Lugar: CL - Maipu PlantRepresentante de Adquisición de Talento de Goodyear: SONIA NIETOLI-DNIReliability EngineerPrimary Purpose: Ensure equipment complies with all safety, environmental and other regulatory compliances Provide the necessary engineering and maintenance support to the production organization in order to promote the production of the highest...


  • Santiago, Metropolitana, Chile Neoris A tiempo completo

    Site Reliability Engineering Senior:Date:Nov 27, 2023Location: SANTIAGO, CLCompany:NEORISNEORIS es un Acelerador Digital que ayuda a las compañías a entrar en el futuro. Combinamos un profundo conocimiento de industria con el más alto expertise tecnológico en el mercado para crear soluciones personalizadas que permiten superar los desafíos de negocio y...

  • SSP Project Engineer

    hace 2 semanas


    Santiago, Metropolitana, Chile MPH Global A tiempo completo

    We are looking for a SSP Project Engineer for one of our clients with the following details: Start date: 16th June 2024 Location: Chile Duration: 3 years Profile: Master's degree in environmental engineering, Soil Science, or a related discipline Proven experience as a project engineer - Soil Site Pollution Knowledge of contaminated soils, geology...


  • Santiago, Metropolitana, Chile Logicalis A tiempo completo

    Job Description Experiencia previa como Site Reliability Engineer o en roles similares. Conocimientos sólidos en sistemas operativos, redes y arquitecturas de infraestructura distribuida. Experiencia en el diseño e implementación de soluciones de monitoreo utilizando cualquiera de las siguientes herramientas de monitoreo como: o Prometheus o Grafana...

  • Site Reliability Engineer

    hace 2 semanas


    Santiago, Metropolitana, Chile Agilesoft A tiempo completo

    Agilesoft es una start-up formada por gente joven, enfocada en el desarrollo de software. Somos amantes de la tecnología, con un ambiente laboral entretenido y grato. Más que cumplir horarios, trabajamos bajo metas, que nos impulsan a resultados innovadores y de calidad.Descripción:Trabajamos principalmente desarrollando plataformas web y aplicaciones...

  • Site Reliability Engineer

    hace 2 semanas


    Santiago, Metropolitana, Chile Muruna A tiempo completo

    Administrador de incidentes autónomo y altamente motivado con habilidades de gestión de incidente y DevOps. Esta posición es responsable de identificar y proponer mejoras al proceso de gestión de incidentes, de liderar el análisis de la causa raíz y la solución en todos los equipos de productos, brindando una comprensión clara del impacto técnico y...

  • Site Reliability Engineer

    hace 2 semanas


    Santiago, Metropolitana, Chile ZerviZ A tiempo completo

    En ZerviZ, prestamos servicios de consultoría y desarrollo de productos digitales en todo Latinoamérica. Nos especializamos en soluciones de Customer Experience (CX). Contamos con amplia experiência en Plataformas WEB, CRM, procesos back-office, desarrollos back-end, e integración de nuevos desarrollos sobre sistemas core de nuestros clientes. Una de...

  • Site Reliability Engineer

    hace 2 semanas


    Santiago, Metropolitana, Chile Agilesoft SpA A tiempo completo

    Agilesoft es una start-up formada por gente joven, enfocada en el desarrollo de software. Somos amantes de la tecnología, con un ambiente laboral entretenido y grato. Más que cumplir horarios, trabajamos bajo metas, que nos impulsan a resultados innovadores y de calidad.Descripción:Trabajamos principalmente desarrollando plataformas web y aplicaciones...

  • Site Reliability Engineer

    hace 2 semanas


    Santiago, Metropolitana, Chile Toku A tiempo completo

    En Toku, buscamos SREs apasionados por el desarrollo de software y motivados para unirse a un equipo de alto rendimiento. Buscamos a una persona que siempre esté buscando maneras de mejorar nuestros productos y la experiência de nuestros usuarios. Valoramos a personas curiosas, proactivas, que se adapten rápidamente a los cambios, que puedan pensar de...

  • Site Reliability Engineer

    hace 2 semanas


    Santiago, Metropolitana, Chile AMARIS CONSULTING SPA A tiempo completo

    Buscamos consultores dinámicos para hacer crecer nuestro equipo de Sistemas de Información y Digital en Chile. Tu experiência, conocimiento y compromiso nos ayudarán a afrontar los retos de nuestros clientes.Estarás apoyando diferentes proyectos a través de tu experiência como SRE.Sus principales responsabilidades:Asesorar sobre la elección y la...