
Tech Innovators Inc.
Building the future of software
Job Description
About the Role
Ensure high availability and scalability of cloud infrastructure. Develop reliable systems to monitor and improve production services. Collaborate with development teams to resolve production incidents.
Responsibilities
- Implement monitoring solutions (Prometheus/Grafana)
- Design fault-tolerant architecture
- Develop automation scripts using Python/Terraform
- Collaborate with DevOps and engineering teams
- Participate in on-call rotations for incident response
- Optimize system performance and carry out capacity planning
Requirements
- Bachelor’s degree in Computer Science, IT, or an equivalent field
- 4+ years of SRE or SysAdmin experience
- Experience with Kubernetes/Docker
- Strong scripting skills (Python/Bash)
- Familiarity with CI/CD pipelines
- Strong problem-solving + analytical skills
Benefits
- Comprehensive healthcare benefits
- 401(k) matching + retirement plans
- Flexible working hours + remote opportunities
- Professional development + certifications
- Collaborative team environment + cultural events
- Annual performance bonus + stock options
About the Company
At ReliableCloud, we specialize in delivering robust cloud infrastructure solutions. We prioritize innovation, dependability, and a supportive work culture.
Job ID: site-reliability-engineer-(sre)-dv7xn