Mabopane
FULL_TIME
Skilled work
Global MNC Tech is seeking a highly motivated and experienced Site Reliability Engineer (SRE) to join our global remote engineering team. As an SRE, you will play a critical role in ensuring the reliability, scalability, performance, and security of our cloud-based platforms and mission-critical systems. You will work at the intersection of software engineering and systems engineering, applying automation, observability, and reliability best practices to build resilient and highly available services used by millions of users worldwide.
This role is ideal for professionals who are passionate about building stable systems, solving complex infrastructure problems, and driving a culture of reliability across distributed teams.
Design, implement, and maintain highly reliable, scalable, and fault-tolerant systems across cloud environments.
Monitor system performance, availability, and capacity using advanced observability tools.
Develop and maintain automation for deployments, scaling, monitoring, and incident response.
Lead incident management processes, including root cause analysis and post-incident reviews.
Collaborate with software engineers to improve system reliability and performance.
Implement CI/CD pipelines and infrastructure-as-code practices.
Define and track SLOs, SLIs, and SLAs to ensure service reliability.
Optimize system performance, cost efficiency, and resource utilization.
Ensure security, compliance, and disaster recovery readiness.
Continuously improve operational processes and reliability standards.
Strong experience with cloud platforms such as AWS, Azure, or Google Cloud Platform.
Proficiency in Linux/Unix system administration.
Solid programming skills in one or more languages (Python, Go, Java, Bash, or similar).
Experience with containerization and orchestration tools (Docker, Kubernetes).
Hands-on experience with monitoring and observability tools (Prometheus, Grafana, Datadog, New Relic, etc.).
Strong understanding of networking, distributed systems, and system architecture.
Experience with infrastructure-as-code tools (Terraform, Ansible, CloudFormation).
Knowledge of CI/CD tools (Jenkins, GitHub Actions, GitLab CI, CircleCI).
Excellent problem-solving and troubleshooting skills.
Strong communication skills and ability to work in a remote global environment.
3+ years of experience in Site Reliability Engineering, DevOps, Cloud Engineering, or similar roles.
Proven track record of managing production systems at scale.
Experience supporting high-availability systems and 24/7 services.
Prior experience in a global or distributed team is a plus.
100% Remote – Work from anywhere globally.
Flexible working hours with some overlap required for global team collaboration.
On-call rotation may be required for critical systems support.
Deep understanding of reliability engineering principles and best practices.
Ability to design resilient architectures and automate repetitive tasks.
Strong analytical mindset with attention to detail.
Ability to work independently and manage priorities in a fast-paced environment.
Passion for continuous learning and adopting new technologies.
Strong sense of ownership and accountability for system reliability.
Competitive global compensation package.
Fully remote work with flexible schedules.
Health insurance and wellness programs.
Paid time off, holidays, and sick leave.
Learning and development budget.
Access to cutting-edge technologies and global projects.
Career growth opportunities in a multinational environment.
Supportive and inclusive company culture.
At Global MNC Tech, we believe reliability is the foundation of innovation. You will join a world-class engineering organization that values autonomy, technical excellence, and continuous improvement. You will have the opportunity to work on global-scale systems, collaborate with top talent across continents, and make a real impact on the stability and performance of products used worldwide.
This is more than a job — it is an opportunity to shape the future of reliable digital infrastructure.
Interested candidates are invited to submit their updated resume along with a brief cover letter highlighting their experience in SRE or cloud engineering roles.