Salem
Full-time
Skilled work
Global MNC Tech is seeking a highly motivated Junior Site Reliability Engineer (SRE) to join our Cloud Operations team. The ideal candidate will play a crucial role in monitoring, maintaining, and improving the reliability of our cloud infrastructure. You will be responsible for implementing alerting systems, responding to incidents, and ensuring the smooth operation of our services. This role is perfect for a proactive, detail-oriented individual who is passionate about cloud technologies and system reliability.
As a Junior SRE, you will collaborate closely with engineering and operations teams, helping to design and maintain robust monitoring solutions, analyze system performance, and support continuous improvement of our cloud services.
Monitor cloud-based applications, services, and infrastructure to ensure high availability and performance.
Configure, maintain, and enhance monitoring and alerting tools (e.g., Prometheus, Grafana, CloudWatch, or equivalent).
Respond promptly to incidents and outages, performing initial triage and escalation as necessary.
Collaborate with engineering teams to identify root causes of issues and implement long-term fixes.
Assist in maintaining documentation, runbooks, and operational procedures.
Participate in the on-call rotation and support incident management processes.
Proactively suggest improvements to monitoring, alerting, and overall system reliability.
Bachelor's degree in Computer Science, Information Technology, or a related field.
Basic understanding of cloud platforms (AWS, Azure, or GCP) and their core services.
Familiarity with monitoring and alerting tools (e.g., Prometheus, Grafana, CloudWatch, Nagios).
Knowledge of scripting languages such as Python, Bash, or PowerShell.
Understanding of Linux/Unix operating systems and command-line operations.
Strong analytical and problem-solving skills.
Effective communication skills and ability to work in a collaborative team environment.
0–2 years of experience in a technical support, systems administration, or cloud operations role.
Internships or academic projects that demonstrate cloud infrastructure monitoring experience are valued.
Full-time role (40 hours per week).
Flexible working hours with participation in an on-call rotation to support 24/7 cloud operations.
Remote-friendly options may be available depending on location and team requirements.
Ability to quickly learn new cloud technologies and monitoring tools.
Strong attention to detail and ability to work under pressure during incidents.
Capacity to prioritize tasks effectively in a dynamic environment.
Analytical mindset to identify trends and potential issues and to recommend improvements.
Team-oriented attitude with strong collaboration skills across engineering and operations teams.
Competitive salary and performance-based incentives.
Comprehensive health, dental, and vision insurance plans.
Paid time off, holidays, and parental leave policies.
Access to professional development and training programs.
Flexible remote work opportunities and supportive work-life balance.
Employee wellness programs and team-building activities.
Be part of a global technology leader driving innovation in cloud solutions.
Work in a collaborative, inclusive, and growth-focused environment.
Gain hands-on experience with cutting-edge cloud technologies and tools.
Grow your career in Site Reliability Engineering with mentorship from experienced SREs.
Contribute to impactful projects that serve millions of users worldwide.
Interested candidates are invited to submit their resume and a cover letter detailing relevant experience. Please include "Junior SRE - Cloud Monitoring & Alerting" in the subject line.
Applications will be reviewed on a rolling basis, and shortlisted candidates will be contacted for interviews.