Lead Site Reliability Engineer (Inside IR35)

Sign up to see company details
  • Contract 60 days
  • £750 - £800 (GBP) / day
  • Remote Only
  • 28/06/2021

An experienced SRE with a strong understanding of Kubernetes to help build 3 Container Platforms in 3 different regions servicing 200 workloads or a 1000+ pods for a payment solutions provider.  Must be analytically thinking and comfortable troubleshooting technical problems in high-pressure situations.  You should understand the importance of peer review, test automation, continuous integration & delivery as well as GitOps.

Description

An experienced SRE with a strong understanding of Kubernetes to help build 3 Container Platforms in 3 different regions servicing 200 workloads or a 1000+ pods for a payment solutions provider.  Must be analytically thinking and comfortable troubleshooting technical problems in high-pressure situations.  You should understand the importance of peer review, test automation, continuous integration & delivery as well as GitOps.

 

What you will be doing:

Help support the build out of the Terraform scripts using CI/CD best practices “GitOps” to iteratively build out the container platform and integrate into supporting services like monitoring using Prometheus and alerting Icinga.  You will work remotely within a team of engineers.

Skills: -

  • Ability to communicate written and verbally in English
  • 2+ years' experience working with AWS
  • Understanding of Networks and network configuration
  • 1+ years working with Terraform
  • 2+ years working with Kubernetes
  • 2+ years docker
  • 1+ Istio
  • 3+ Git
  • Can programme in one of the following languages:  Golang, Python, Java, Kotlin, JavaScript
  • Have used one of the following: Jenkins, CircleCi, Gitlab CI, TravisCI, Concourse CI, AWS Code pipeline
  • Bash/Sh/Fish
  • Linux (Centos /Redhat/Alpine)
  • aws-cli

 

Have used at least 10 of these Kubernetes resources/Services:

  • Pods
  • Secrets
  • Configmaps
  • Deployments
  • Replicasets
  • bindings
  • daemonsets
  • Ingress
  • Service
  • Persistent Volumes
  • Storage classes
  • Flux
  • Cert manager
  • Helm charts
  • Kustomize

 

Have used at least 10 of these AWS services:

  • IAM
  • Route53
  • VPC
  • Subnets
  • Security Groups
  • Auto Scaling Groups
  • EC2
  • Application Load balancer
  • Network Load balancer
  • API Gateway
  • Fargate
  • ECR
  • EBS
  • EFS
  • S3
  • KMS
  • Secrets Manager
  • SQS
  • Lambda
  • WAF

 

Desirable: -

  • Understanding of Zero Trust Network principles
  • Linting
  • Visual Studio Code                
  • Jira/Confluence
  • ngrep/grep/strace/awk/

Skills

IT Infrastructure Products
Alpine
Amazon AWS
CentOS
Docker
Kubernetes
Linux
RedHat
Programming Languages & Frameworks
Java
Python
Software Development Tools
Bash
Jenkins
Terraform

Industry Experience

IT
Consultancy and Professional Services