Site Reliability Engineer, SRE
Job Description:
- Scale and secure our rapidly growing infrastructure
- Automate critical processes
- Ensure a seamless experience for new users
- Make sure the infrastructure keeps up with the growth
- Ensure system scalability and high traffic handling
- Define and deploy monitoring, alerting, and logging systems
- Respond to and resolve production incidents
- Conduct thorough post-mortems
- Monitor server logs for abnormalities
- Design, manage and maintain automation tools for operational processes
Requirements:
- 5+ years of relevant work experience
- Working experience with AWS
- Docker
- Git
- CI/CD tools like Gitlab CI, Jenkins, etc.
- Experience with IaC tools like Terraform, CloudFormation, Ansible, Puppet, Packer
- Proficiency with Linux and other Unix-based systems
- Experience setting up build automation
- Excellent understanding of security and safety best practices
- Bachelor’s degree in Computer Science or equivalent work experience
- Excellent written and verbal English communication skills
- Ability to work with mixed US and EU based teams
Benefits:
- No overtime
- No work on weekends
- No late working hours
- In-house learning programs
- Tech lectures
- Knowledge sharing
- Remote work with provided MacBook
Apply tot his job Apply To this Job