Empregos
>
São Paulo

    SRE and Incident Manager - São Paulo, Brasil - Pluxee

    Pluxee
    Pluxee São Paulo, Brasil

    há 1 hora

    Default job background
    Descrição

    SRE and Incident Manager - Brazil / Chile (Latin America)

    Pluxee is the leading global employee benefits and engagement partner that opens up a world of opportunities to help people enjoy more of what really matters.

    We believe that living life to the full means making the most of every moment and sharing experiences with the people we care about. To make these experiences meaningful, fulfilling, and personalized, we combine our 45+ years of experience with the agility and energy of a new digital brand.

    The SRE and Incident Manager - LATAM will be responsible for overseeing the entire SRE (Site Reliability Engineering) and incident management process, ensuring that infrastructure and application under his / her scope are design, developed and implemented with reliability and resiliency, increasing efficiency through automation, which at the end improving customer satisfaction and retention. He/she should also proactively drive a culture of continuous improvement

    During the operational phase of the system under his / her scope, he is responsible for the timely resolution of incidents and minimizing the impact on our systems and services offered to clients, consumers and merchants in several Latin American countries.

    He / She will be a part of a distributed international Run & Operations team that collaborates with other cross-functional teams to drive continuous improvement of our incident response capabilities at all levels (local, regional and global).

    Our systems and services are built on a cloud-native, distributed architecture and integrated to country systems. The current functional scope is composed of an identity management system, a store locator, several web portals, a couple of mobile applications, as well as several payment capabilities.

    Technical expertise: solid experience with Azure services, like App Services, Functions, Application Gateways, APIM, and understanding of common server-side frameworks, like , , PHP Synfony, ".Net" as well as at least one observability platform, preferably Datadog.

    Process-centric approach: strive to understand and optimize the end-to-end flow of work, rather than solely focusing on individual tasks or functional silos.

    Assertive communication: clearly express his / her thoughts, needs, and boundaries in a direct, and respectful manner while considering the need to engage multiple parties (internal and external) towards a common goal.

    Your next challenge:

    The SRE and Incident Manager – LATAM will have the following responsibilities to ensure reliability and resiliency of the system under his / her scope:

    • Ensuring reliability - getting systems back to steady-state as quickly as possible
    • Eliminating toil - automating wherever possible
    • Blameless postmortems - driving better cross-team collaboration
    • Observing what matters - gaining full visibility into system health
    • Being pro-active - living and breathing SLOs to identify and remediate issues before SLAs are violated
    • Architecting for resiliency - Informing architectural design decisions to build more reliable systems

    The SRE and Incident Manager – LATAM will have the following responsibilities during the lifecycle of an incident or crisis:

    • Review and maintain the incident and crisis management processes, escalation paths, and communication plans.
    • Review and maintain an incident and crisis response plan, outlining steps, procedures, and roles during incidents.
    • Identify and log incidents promptly, using monitoring tools, alerts, and reports.
    • Ensure incidents are prioritized according to predefined criteria, allocating appropriate resources based on the incident's impact.
    • Coordinate and communicate effectively with technical teams, support staff, and stakeholders throughout the incident lifecycle.
    • Initiate escalation processes when incidents cannot be resolved within predefined timeframes or by initial response teams.
    • Lead or participate in incident investigations and root cause analysis, identifying underlying causes and recommending preventive measures.
    • Maintain accurate records of incidents, including timelines, actions taken, and resolutions.
    • Lead or participate in the preparation of incident reports and root cause analysis reports.
    • Share relevant reports with stakeholders, highlighting trends, recurring issues, and improvement opportunities.
    • Drive continuous improvement of the incident management process by analysing incident data, identifying patterns, and suggesting enhancements.

    You're a match :

    • The Incident Manager - LATAM will be responsible for the following technical activities:
    • Review and track patch management activities to ensure that applications and services are up to date with the latest security patches, bug fixes, and upgrades.
    • Coordinate and schedule regular maintenance windows for applying patches or implementing upgrades, minimizing disruption to users.
    • Track support lifecycle of all configuration items supporting the systems and services under his / her responsibility.
    • Collaborate with other stakeholders to ensure timely upgrades of software and services ensuring that they remain enjoying the proper support and service levels.
    • Track capacity and utilization of resources and collaborate with other stakeholders to ensure services remain available, even under high loads.
    • Regularly review monitoring data with Tech Leads and DevOps to identify performance issues, capacity bottlenecks, or potential risks.
    • Proactively propose new monitors and adjustments in the monitoring thresholds aiming at detecting events and incidents the earliest.
    • Review and maintain process documentation, including support procedures, runbooks, knowledge articles, and FAQs (Frequently Asked Questions), for all systems and services under his responsibility.
    • Review contracts from vendors and service providers to ensure they fully support the SLAs (Service Level Agreements) agreed with internal clients.
    • Propose architectural changes to increase systems resiliency, based on insights and experience gained from incidents and maintenance activities.
    • Conduct training sessions, workshops, or awareness programs to educate staff and countries on the support model, incident response procedures and best practices.
    • Be the main point of contact from Run & Operations in matters related to the Release Management process that may affect the services availability.

    SKILLS REQUIRED:

    Soft Skills :

    • Fluency in English and Spanish (Mandatory) / Portuguese is a Plus
    • Assertive communication, Leadership, Problem-Solving, Stress Management, Collaboration and Teamwork, Conflict Resolution, Continuous Learning and Adaptability.

    Knowledge / Experience:

    Technical Knowledge, Incident Management Tools, Troubleshooting and Root Cause Analysis, Application Stacks Knowledge, Incident Response and Recovery, IT Infrastructure and Networks, Data Analysis and Reporting, IT Security and Compliance.

    To get this challenge:

    • Video call Discussion with TA Expert.
    • Video call Discussion with Hiring Manager
    • Video call Discussion with Operation Manager
    • Video call Discussion with HRBP.

    Your team

    • IT TEAM

    Your Location:

    Preferred: Brazil / Chile (Latin America)


  • Fiserv, Inc.

    Incident Management

    há 6 dias


    Fiserv, Inc. Sao Paulo, Brasil

    **Calling all innovators - find your future at Fiserv.** · We're Fiserv, a global leader in Fintech and payments, and we move money and information in a way that moves the world. We connect financial institutions, corporations, merchants, and consumers to one another millions of ...


  • Trustly Serra, Brasil

    **WHO WE ARE** · - The Trustly Americas team combines PayWithMyBank, a 2012 Silicon Valley startup, with Trustly AB of Sweden, following their 2019 merger. Our team represents 30 nationalities, serving 8,300 merchants, connecting to 650 million consumers, and 12,000 banks across ...

  • Fiserv, Inc.

    Incident Management

    há 4 dias


    Fiserv, Inc. São Paulo, Brasil

    **Calling all innovators - find your future at Fiserv.**We're Fiserv, a global leader in Fintech and payments, and we move money and information in a way that moves the world. We connect financial institutions, corporations, merchants, and consumers to one another millions of tim ...


  • Pluxee São Paulo, Brasil

    SRE and Incident Manager Brazil / Chile (Latin America)Pluxee is the leading global employee benefits and engagement partner that opens up a world of ...


  • Acronis São Paulo, Brasil

    Acronis is a world leader in cyber protection—empowering people by providing them with cutting-edge technology that enables them to monitor, control, and protect the data that their businesses and lives depend on. We are looking for an Incident and Escalation Manager for our Tech ...


  • Acronis São Paulo, Brasil

    Acronis is a world leader in cyber protection—empowering people by providing them with cutting-edge technology that enables them to monitor, control, and protect the data that their businesses and lives depend on. We are looking for an Incident and Escalation Manager for our Tech ...

  • Curriculum

    Gerente

    há 6 dias


    Curriculum Sao Paulo, Brasil

    Incident Manager /Commander Pleno. Propriedade total dos incidentes P1 e P2, garantindo os recursos necessários para resolver o problema o mais rápido possível. Gerenciar interrupções conduzindo, coordenando e escalando internamente entre os times da empresa, para atuação dos Inc ...

  • Curriculum

    Gerente

    há 6 dias


    Curriculum Sao Paulo, Brasil

    Incident Manager / Commander Pleno - Segundo Turno. Propriedade total dos incidentes P1 e P2, garantindo os recursos necessários para resolver o problema o mais rápido possível. Gerenciar interrupções conduzindo, coordenando e escalando internamente entre os times da empresa, par ...

  • Portal Buscar Vagas BR

    Consultor ABAP Pi

    há 4 dias


    Portal Buscar Vagas BR Sao Paulo, Brasil

    Atuação em desenvolvimento ABAP tanto em novas demandas quanto incidentes SAP. · (ALVs, Module Pool ,batch input, SmartForms, grupos de funções, screen e field exits, · Enhancement Point, BADIs, BAPIs, SM30,Tabelas, Aplicação de Notas SAP) Salário a Combinar Inglês avançado. · Co ...

  • Divulga Vagas

    Consultor ABAP Pi

    há 5 dias


    Divulga Vagas Sao Paulo, Brasil

    Atuação em desenvolvimento ABAP tanto em novas demandas quanto incidentes SAP. · (ALVs, Module Pool ,batch input, SmartForms, grupos de funções, screen e field exits, · Enhancement Point, BADIs, BAPIs, SM30,Tabelas, Aplicação de Notas SAP) Salário a Combinar Local: Híbrido 50% Pr ...


  • SONDA IT Sao Paulo, Brasil

    **Viva a experiência SONDA** · **Aqui você transforma sua carreira** · **Por que fazer parte da SONDA?** · Porque assim como você, amamos o que fazemos, temos paixão por inovação e estamos convencidos de que através do conhecimento e da tecnologia vamos transformar não só os negó ...

  • Santander

    Bra IT Senior Analyst

    há 1 semana


    Santander Sao Paulo, Brasil

    BRA IT Senior Analyst · Country: Brazil · **WHAT YOU WILL BE DOING** · **Gerenciamento, Suporte e Administração de underlay, overlay Cloud Privada VMware utilizando ferramentas, linguagens para automação e monitoração. · - Transformações ambientes para Cloud Privada · - Pesquisa ...

  • ACTUALIZE, Grupo Fractalia

    Ticket Manager

    há 6 dias


    ACTUALIZE, Grupo Fractalia Sao Paulo, Brasil

    Responsabilidades e atribuições: Avaliação e direcionamento de tickets, incidentes e requisições utilizando a ferramenta ITSM Service Now. Garantir o SLO e SLA, monitorando e atuando na ferramenta, visando métricas, pontuando Analistas e Coordenadores. Analisar e efetuar cobrança ...

  • Santander

    Bra IT Sr Leader

    há 1 dia


    Santander Sao Paulo, Brasil

    BRA IT Sr Leader · Country: Brazil · **WHAT YOU WILL BE DOING** · Estamos à procura de um Manager de Network para atuar na área de SRE Networking (Site Reliability Engineering), sendo responsável pela na definição e implementação, atuação em suporte avançado e interlocução com fa ...


  • SONDA IT Sao Paulo, Brasil

    **Viva a experiência SONDA** · **Aqui você transforma sua carreira** · **Por que fazer parte da SONDA?** · Porque assim como você, amamos o que fazemos, temos paixão por inovação e estamos convencidos de que através do conhecimento e da tecnologia vamos transformar não só os negó ...

  • Santander

    IT Leader

    há 1 semana


    Santander Sao Paulo, Brasil

    IT Leader · Country: Brazil · **# QUEM SOMOS** · A F1RST é o futuro e sua carreira está aqui Nossa cultura é baseada em "Pessoas, Inovação e Resultados". · Fazemos parte do maior banco Internacional do Brasil, um dos maiores bancos do mundo. Promovendo serviços e experiências pa ...

  • To Brasil

    Ticket Manager

    há 1 semana


    To Brasil Sao Paulo, Brasil

    Estamos em busca de um(a) Ticket Manager para atuar em uma Instituição Financeira fundada em 1857 de forma **Híbrida (3x Escritório - SP Av. Amador Bueno e 2x Home Office) **e CLT + Benefícios. · Requisitos Indispensáveis: · - Realizar a gestão da fila de Tickets, controlando, di ...


  • Adecco Sao Paulo, Brasil

    -Adecco · São Paulo, SP - Brasil · **Salário compatível com o mercado** · - CLT- Remoto- Pleno**Descrição**: · - A vaga será uma contratação Adecco, para uma empresa multinacional (global) atuante no segmento de Tecnologia da Informação.**Descrição das Atividades**: · - Analista ...


  • Adecco Sao Paulo, Brasil

    Adecco Outsourcing- São Paulo - Brasil- Vaga exclusiva para PWD- CLT- Remote- PlenoDescription · - A vaga será uma contratação Adecco, para uma empresa multinacional (global) atuante no segmento de Tecnologia da Informação. · **Descrição das Atividades**: · - Analista de Serviços ...


  • EY Sao Paulo, Brasil

    Na EY, você terá a chance de construir uma carreira tão única quanto você, com alcance global, cultura inclusiva e tecnologia para se tornar a melhor versão de você. E nós contamos com sua voz e perspectivas únicas para ajudar a EY a se tornar ainda melhor. Junte-se a nós e const ...