Your Role
:
We are seeking a Staff Site Reliability Engineer (Infrastructure & Site Reliability Engineering) with extensive experience in AWS, AZURE, Kubernetes, and GitOps to lead our Site Reliability Engineering (SRE) team.
The successful candidate will deeply understand SRE practices and have a track record of implementing high-quality site reliability engineering practices (SLAs, SLOs, Proactive Alert Management, Incident Response/Review, Postmortems, etc.).
In this role, you will work with our SRE and cross-functional engineering teams to develop and operate our development and production infrastructure and operations
Responsibilities:
Work collaboratively with software engineering on infrastructure and deployment requirements;Contribute actively and assist in our automation and observability initiativesBuild and maintain operational tools for deployment, monitoring, and analysis of cloud (AWS & AZURE) infrastructure and systemsCollaborate with senior team members in responding to production incidents, actively contribute to postmortems, and engage in continuous improvement efforts as part of on-call rotations for exposure to critical issue resolutionEstablish and drive operations performance through SLOsProvide project management, sprint planning, and road-mapping support to the SRE teamExpert-level technical skills and able to provide mentoring to team membersOur team uses practices to maximize our development velocity, including but not limited to: continuous integration/deployment, code review via GitHub pull requests Ideal Attributes
Strong customer orientationExcellent interpersonal and organizational skillsAttention to detail and focus on qualityStrong communication skills to effectively liaise with both technical and non-technical staffAbility to act decisively and works well under pressureMust be a collaborative problem solverStrong bias for ownership and action Qualifications:
At least 10 + years of experience designing, building, and maintaining SAAS environments6+ years of experience designing, building, and maintaining AWS/AZURE infrastructure with Terraform3+ years of experience building and running Kubernetes, Clickhouse, MySQL, and Kafka clustersExperience with observability (monitoring, logging, tracing, metrics)Experience with GitOps CI/CD processesExperience with scripting with Python, Go (Golang), bash, or PowerShell, and AWS CLI tools Our benefits:
10 study days per year2 volunteering days per year30-day holidays after 5-year tenure, Sabbatical Leave4 weeks of paternity leaveUp to 8700 PLN personal education budget per year300 PLN corrective glasses reimbursement every two yearsMedical care with Luxmed – individual, partner, or family package fully paid by the companyThe company fully pays for group life insurancePension scheme (employee capital plans) with 1.5% employer contributionUnlimited access to LinkedIn LearningEnglish/Polish classesMyBenefit platform with a monthly subsidy of 103 PLN (with various vouchers and Multisport cards available)500 PLN per year of race fee reimbursementSolarian Referral ProgramSolarWinds Appreciation ProgramEmployee Assistance ProgramFree lunches at the office on Wednesdays