Site Reliability Engineer at Andela January, 2026

Oops! It seems this job from Andela has expired

Posted: Jan 20, 2026

Deadline: Not specified

Andela provides companies with access to the top 1% of global tech talent. We identify high-potential developers on the African continent, shape them into world-class technical leaders, and pair them with companies as full-time, distributed team members. Accelerate your product roadmap while minimizing time spent interviewing, on-boarding, and training ne...

Site Reliability Engineer
- Job Type: Remote
- Qualification: BA/BSc/HND
- Experience: 5 years
- Location: Nairobi
- Job Field: ICT / Computer
The Senior Site Reliability Engineer is a technical leadership role responsible for designing, implementing, and maintaining highly available, scalable, and secure on-premise infrastructure for banking applications, including Mobile Banking and Internet Banking platforms. The role leads SRE initiatives, mentors junior engineers, drives continuous improvement in production support, and leads observability strategy using OpenShift, Kubernetes, Prometheus, Grafana, and the ELK Stack in on-premise data centers.

Key Responsibilities
- Design and architect a highly available and scalable OpenShift/Kubernetes infrastructure for banking applications on on-premise servers
- Lead and implement a comprehensive monitoring and observability strategy using Prometheus and Grafana
- Design and oversee centralized logging infrastructure using ELK Stack (Elasticsearch, Logstash, Kibana)
- Lead SRE best practices implementation and adoption of production support standards across teams
- Mentor and coach junior SRE and DevOps engineers on OpenShift, Kubernetes, monitoring, and production support
- Define and implement Service Level Indicators (SLIs), Objectives (SLOs), and Agreements (SLAs) with measurable metrics
- Lead incident response strategy, post-incident reviews, and drive continuous improvement in production stability
- Architect and implement advanced alerting, monitoring dashboards, and visualization strategies using Prometheus and Grafana
- Design automation frameworks and tools to reduce operational toil and improve production efficiency
- Lead OpenShift/Kubernetes cluster upgrades, security patches, and infrastructure modernization on-premise
- Establish production support procedures, on-call rotation policies, and escalation frameworks
- Optimize system performance, cost, and resource utilization across containerized on-premise infrastructure
- Conduct capacity planning, performance optimization, and infrastructure scaling initiatives
- Lead technical architecture reviews and infrastructure design decisions for banking applications
- Manage on-premise data center resources and infrastructure planning
- Participate in 24/7 on-call rotation and escalation for critical production incidents
- Ensure compliance, security hardening, and disaster recovery procedures for financial systems
Qualifications
- BSc in Computer Science, Information Technology, Software Engineering, or related field
- 5+ years of hands-on SRE, DevOps, or Production Engineering experience
- 3+ years of experience leading SRE teams or managing production support operations
- 3+ years of hands-on experience managing OpenShift and Kubernetes on on-premise infrastructure
- Expert-level experience with Prometheus for monitoring and alerting in production
- Expert-level experience with Grafana for creating comprehensive monitoring dashboards
- Advanced experience with ELK Stack (Elasticsearch, Logstash, Kibana) for logging and log analysis
- Proven experience designing and scaling production systems for high-traffic banking applications
- Deep expertise in Linux/Unix system administration and container networking
- Advanced knowledge of CI/CD automation and deployment strategies
- Hands-on experience with database management, tuning, and optimization on-premises
- Strong experience with infrastructure automation and Infrastructure as Code
- Proven 24/7 production support experience in mission-critical environments
- Experience managing on-premise data center infrastructure
- Proven leadership skills and ability to mentor junior engineers
- Excellent communication skills and ability to present to executive stakeholders
- Experience in financial services or banking sector is highly preferred

Method of Application

Interested and qualified? Go to Andela on www.linkedin.com to apply


