
Website GlobalFinanceHQ Global Financial Solutions
Empowering Financial Success
Job Description
About the Role
We’re seeking a Site Reliability Engineer to design, maintain, and optimize our infrastructure to ensure our services remain highly available and reliable.
Responsibilities
- Design, parity, and maintain scalable, loosely coupled services
- Proactive monitoring, alerting, and remediation of potential issues
- Participate in on-call rotations to ensure 24/7 system reliability
- Collaborate with software engineers to implement infrastructure as code (IaC)
- Conduct chaos engineering and disaster recovery drills
- Stay up-to-date with emerging infrastructure trends and best practices
Requirements
- Proven experience (5+ years) in Site Reliability Engineering or a similar role
- Strong proficiency in infrastructure as code tools (e.g., Terraform, CloudFormation)
- Experience with containerization (e.g., Docker, Kubernetes) and orchestration
- Excellent problem-solving and scripting skills
- Strong knowledge of Linux/Unix administration and debugging
- Experience with cloud platforms (e.g., AWS, GCP, Azure)
Benefits
- Competitive salary and equity compensation plan
- Comprehensive health, dental, and vision insurance
- Generous PTO package and flexible work hours
- 401(k) plan with company matching
- Professional development opportunities and tuition assistance
- Dynamic and inclusive engineering environment
About the Company
Our mission is to provide highly available, scalable, and secure infrastructure for our suite of products and services, ensuring our users have the best possible experience.
Job ID: site-reliability-engineer-BULXm