• Notera att ansökningsdagen för den här annonsen kan ha passerat. Läs annonsen noggrant innan du går vidare med din ansökan.

Site Reliability Engineers (SREs) are people who use engineering-based approaches to solve operations problems. SRE owns and develops the infrastructure needed for the Electrolux Connectivity Platform and supporting services. SRE is also responsible for making sure the services – both internal and external systems - have the characteristics and qualities needed for the intended use.

You will work to understand the operational requirements and develop an infrastructure architecture and tools that meet these requirements. You will monitor the performance of the system and refine the management of the infrastructure from both a performance and cost perspective so that it is optimal and balanced at all times.

You will also work closely with our DevOps teams to deliver efficiently by empowering them with excellent tools that you develop. These might be for example monitoring tools, infrastructure pipeline components etc.

Responsibilities

  • Engage in and improve the whole lifecycle of services — from inception and design, through deployment, operation and refinement
  • Support services prior to production through activities like system design consulting, developing software platforms and frameworks, capacity planning and launch reviews
  • Contribute improvements to the availability, scalability, latency, and efficiency of the services once they are live
  • Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity
  • Practice sustainable incident response and blameless postmortems
  • Contribute to our deployment and automation tools
  • Promote Site Reliability Engineering best practices
  • Be part of our on-call rotation with other engineers around the world

Minimum Qualifications

· BS or MS in Computer Science or a related technical field

· 3+ years experience working with infrastructure engineering in large-scale production service environment

· 3+ years experience in analyzing and troubleshooting distributed systems using logging, distributed tracing, stack traces and metrics

· Automation skills and a desire to automate everything

· Comfortable with at least one of the following languages: Java, Python, Go. Can learn a new language quickly

· Systematic problem-solving approach with strong sense of ownership

· Good communication skills

Preferred Qualifications

· You are a Software Engineer

· A good understanding in large-scale distributed systems

· Experience working with Public Cloud (AWS, Azure or GCP)

· Experience working with container orchestration e.g. Kubernetes

· Experience in monitoring and metrics systems, e.g. Prometheus, Grafana

· A good knowledge of Site Reliability Engineering principles

· Experience with on-call rotation, incident response and blameless postmortem

· CI/CD automation experience

· A great team player

· Fluency in English

Öppen för alla
Vi fokuserar på din kompetens, inte dina övriga förutsättningar. Vi är öppna för att anpassa rollen eller arbetsplatsen efter dina behov.

Detta är en jobbannons med titeln "Site Reliability Engineer" hos företaget AB Electrolux och publicerades på webbjobb.io den 7 april 2021 klockan 17:53.

Hur du söker jobbet

Ansökan sker via e-post till [email protected].

webbjobb-logo-white webbjobb-logo-grey webbjobb-logo-black