Job Details
Location

Poland (Remote)

Employment Type

B2B

Experience

5+ years

Position

Senior/Lead

Relout is a place created by ambitious people with a passion for technology. We work on international projects for clients from various industries. We help startups, software houses, and enterprises transform and scale their businesses. From infrastructure management to observability and automation – we’re building the foundation to scale for success.

We’re hiring talented engineers who are passionate about using software-based approaches to solve complex problems. You’ll be part of an engineering organization with a strong focus on using industry standards and cutting-edge technology to manage and scale cloud-native and Kubernetes applications and infrastructure. Working at Relout, you’ll lead and contribute to initiatives aimed at scaling our systems, processes, and automation.

Read more about us and other career opportunities on our Careers page.

 

As a Site Reliability Engineer you will:

  • Take responsibility for the scalability, stability, and availability of low-latency and high-performance systems.
  • Run and maintain cloud and Kubernetes services and infrastructure.
  • Optimize monitoring for proactive fault detection, reduce failure detection time, and improve failure repair time.
  • Implement observability and monitoring for latency, traffic, errors, and saturation.
  • Troubleshoot the performance and stability of systems and applications.
  • Introduce SRE principles such as SLOs, embracing and managing risk, and eliminating toil.
  • Optimize cloud and Kubernetes setup for security and cost-effectiveness.
  • Help prevent production issues proactively and investigate them when they occur.
  • Create a bridge between development and operations by applying a DevOps mindset to system administration topics.
  • Balance feature development speed and reliability with well-defined service level objectives.
 

Recommended skills and experience:

  • Cloud & Kubernetes management & development (AWS, Azure or GCP)
  • Maintaining and developing Helm charts and Kubernetes manifests
  • Implementing monitoring solutions (Prometheus, Datadog, etc.)
  • Familiarity with infrastructure automation & orchestration tools – Terraform, Packer, Ansible, SaltStack, etc.
  • Sysadmin background with a deep understanding of Linux system internals and architecture
  • Attention to detail in building reusable and scalable software and scripts
  • Ability to debug and troubleshoot stability and performance issues
  • Fluent verbal and written command of Polish and English

What we offer:

    Apply now! Join the Relout team today!