Senior Platform & DevOps Engineer (Unified Role)

Remote or onsite

Job Type: Full-time

Key Responsibilities:

  • Architect, build and maintain Linux-based server environments with a strong focus on reliability, security and performance
  • Design and operate network infrastructure (switches, routers, firewalls, VPN, routing, VLANs, HA topologies)
  • Define and enforce infrastructure standards, system hardening, automation and access policies
  • Lead troubleshooting and incident response across compute, network and storage layers
  • Implement automation for provisioning, patching and configuration management
  • Establish high-availability, backup and DR strategies across environments
  • Drive documentation quality, operational runbooks and infrastructure governance
  • Architect and maintain GitLab CI/CD pipelines, build systems and automated release workflows
  • Lead container strategy using Docker, Docker Compose and best practices for secure images
  • Deploy, manage and optimize Kubernetes clusters (bare metal or cloud), including networking, storage and scaling models
  • Use Talos to create standardized, reproducible cluster infrastructure and immutable operating environments
  • Build, optimize and extend observability stacks with Prometheus, Grafana and exporters
  • Collaborate with software teams to improve build, test, packaging and deployment cycles
  • Introduce and enforce DevOps best practices, including IaC, automation, reproducible builds and environment consistency

Requirements:

  • 7+ years in Linux systems engineering, DevOps or platform engineering
  • Strong expertise with L2/L3 networking, firewalls, routing and troubleshooting
  • Expert-level experience with GitLab CI, runners, pipelines and artifacts
  • Deep containerization knowledge and real-world production experience with Docker and Kubernetes
  • Experience with immutable Linux (preferably Talos; experience with Nix or Flatcar is a plus)
  • Strong understanding of Prometheus, Grafana and operational metrics
  • Solid scripting abilities in Bash and at least one other language (Python, Go preferred)
  • Ability to design reliable, secure and scalable architectures with minimal oversight
  • Strong ownership mindset and ability to lead initiatives independently

Alternative Focused Roles:

Senior Platform Engineer - Focused

This role calls for deep expertise in Linux systems and network infrastructure. You will architect and operate production-grade environments, enforce standards and handle high-complexity issues across servers, networks and security.

Key Responsibilities:

  • Architect, build and manage secure Linux server fleets across multiple sites and environments
  • Operate and optimize network infrastructure: switching, routing, firewalls, VPN, segmentation
  • Design HA topologies, monitoring, alerting and capacity planning
  • Lead root-cause analysis and solve complex performance or connectivity issues
  • Automate infrastructure lifecycle management using scripts and modern tools
  • Implement platform-wide security controls, access policies and compliance standards
  • Maintain detailed documentation, infrastructure diagrams and operational procedures
  • Mentor developers and junior engineers on system, network and operational topics

Skill Requirements:

  • 7+ years in Linux operations or platform engineering
  • Deep knowledge of networking protocols, topology design and troubleshooting
  • Experience with HA clustering, storage stacks, virtualization and distributed systems
  • Strong automation experience
  • Proven ability to handle production incidents, migrations and architecture decisions

Senior DevOps Engineer - Focused

We are seeking a senior DevOps expert who can design, implement and optimize a fully automated CI/CD and Kubernetes-based delivery ecosystem. You will empower developers, improve reliability and define DevOps practices company-wide.

Key Responsibilities:

  • Architect robust GitLab CI/CD workflows for multi-stage builds, tests and deployments
  • Define containerization standards and manage Docker image lifecycle for multiple services
  • Lead design and operations of Kubernetes clusters, including networking, ingress, storage and scaling
  • Build standardized and immutable OS environments using Talos
  • Maintain and extend Prometheus- and Grafana-based observability systems
  • Drive automation across the entire SDLC, eliminating manual processes
  • Improve developer productivity with tooling, templates, pipelines and automation
  • Contribute to reliability engineering efforts, SLOs, performance monitoring and failure analysis

Skill Requirements:

  • 7+ years in DevOps or infrastructure automation
  • Deep expertise with GitLab CI, Docker and Kubernetes (production environments)
  • Experience with immutable infrastructure tools such as Talos
  • Strong understanding of observability, alerting and infrastructure metrics
  • IaC experience (Terraform, Ansible, Helm, Kustomize, etc.) is a strong plus
  • Ability to design scalable, secure and automated delivery systems from scratch
  • Strong communication skills and ability to mentor developers on DevOps practices

🚀 Join us to drive the next generation of telecom and observability solutions!

Why Join Us?

  • Work on innovative telecom and observability solutions
  • Collaborate with skilled engineers, architects, and business leaders
  • Opportunity to make data-driven decisions that shape product direction
  • Competitive salary, benefits, and career growth opportunities