Note that the application deadline for this ad may have passed. Read the ad carefully before proceeding with your application.

We are looking for a Site Reliability Engineer to join the Infrastructure Engineering team.

Your role within our Kingdom

Our job is to build effective, stable, and reliable large-scale infrastructure tools and services for our games and product teams, so that they can focus on creating great games. We strive to empower developer teams to be autonomous and flexible, and we continuously work toward a self-service model for our tech by collaborating closely with development teams throughout the full product life cycle.

We engineer and provide the shared infrastructure serving all of our games, as well as the developer environments and supporting tech like observability, log management, and event transport. This includes everything from working in the Data Centers and writing orchestration and automation for our production stack to troubleshooting distributed systems and resolving production incidents.

We are currently at the beginning of multiple projects redefining our infrastructure. Among other things, we have major efforts underway to modernize our platform as well as all supporting software and orchestration.

The Application team is responsible for designing, building, and maintaining the production applications’ infrastructure. These applications serve billions of data objects representing game states, messaging, and much more, and constantly handle hundreds of thousands of requests per second with low-millisecond response times.

Being part of the team, you will write software to support and automate our infrastructure as well as manage and plan our environment, working in close collaboration with the rest of the Infrastructure Engineering organization and backend-developer teams.

Among other things, you will:

Develop and maintain utilities and libraries supporting our distributed infrastructure, working with technologies like MySQL, Cassandra, Kafka, OpenTSDB, and so on.
Build automation and improve the resilience of the systems serving our games
Evaluate hardware and software, run benchmarks, and perform capacity planning for existing and future deployments
Do performance analysis, optimization, and workload characterization to minimize the resource utilization and cost of our backend
Work closely with other teams on incident resolution and on proactively strengthening King’s site reliability
Create and maintain our deployment pipelines
Provide subject matter expertise for our technologies and systems to stakeholders
Troubleshoot problems, manage incidents, and take part in on-call duty

Skills to create thrills

Comfortable working in a Linux computing environment
Strong development skills in Python, and some knowledge of Java, Perl, SQL, or similar
Experience automating and orchestrating distributed systems as well as creating internal tools such as backup management or metrics collection
Interest or experience in:
Database technologies like MySQL, Cassandra, HDFS/Hadoop, etc
Monitoring systems like OpenTSDB, InfluxDB, Graphite, etc
Log management systems like Graylog, the ELK stack, etc
Orchestration frameworks like Ansible, Salt, etc
Familiarity with Linux performance tools
Good communication skills

Our Kingdom

King is a leading interactive entertainment company for the mobile world. We have developed more than 200 fun titles, and our games are played and enjoyed all around the world. King has game studios in Barcelona, Berlin, London, Malmö, Seattle, and Stockholm, along with offices in Bucharest, Malta, San Francisco, New York, and Tokyo. King is an independent unit of Activision Blizzard Inc. (Nasdaq: ATVI), which acquired King in February 2016.

This is a job ad titled "Site Reliability Engineer" at the company Midasplayer ab, published on webbjobb.io on 26 July 2017 at 00:00.

How to apply for the job
