Senior SRE - Scalable Backend Infrastructure

Location

Ramgundam

Job Type

FULL_TIME

Experience

Skilled work

Job Description

Job Summary

Global MNC Tech is seeking an experienced and highly skilled Senior Site Reliability Engineer (SRE) to join our dynamic engineering team. In this role, you will be responsible for designing, implementing, and maintaining scalable backend infrastructure that powers critical services for millions of users worldwide. You will work at the intersection of software engineering and systems operations, ensuring the reliability, scalability, and performance of our cloud-based systems while driving automation and operational excellence.

This is a high-impact position ideal for professionals who thrive in fast-paced environments, enjoy solving complex technical challenges, and are passionate about building robust infrastructure for global-scale applications.


Key Responsibilities

  • Architect, build, and maintain highly available, fault-tolerant, and scalable backend systems.

  • Implement monitoring, alerting, and incident response processes to ensure service reliability and uptime.

  • Collaborate with software engineering teams to optimize deployment pipelines, improve observability, and integrate reliability best practices into development cycles.

  • Automate operational tasks, including infrastructure provisioning, deployment, and remediation workflows.

  • Conduct root cause analyses for incidents, identify trends, and implement preventive measures to reduce recurrence.

  • Define and track SLOs, SLAs, and SLIs, ensuring alignment with business and customer expectations.

  • Mentor junior engineers, provide guidance on reliability engineering principles, and champion a culture of resilience across the organization.


Required Skills and Qualifications

  • Bachelors or Masters degree in Computer Science, Engineering, or a related field.

  • Strong expertise in Linux/Unix systems administration and backend system architecture.

  • Proficient in at least one programming/scripting language such as Python, Go, Java, or Ruby.

  • Hands-on experience with cloud platforms (AWS, GCP, or Azure) and container orchestration technologies (Kubernetes, Docker).

  • Deep understanding of distributed systems, networking, databases (SQL/NoSQL), and caching mechanisms.

  • Experience with CI/CD pipelines, infrastructure-as-code (Terraform, CloudFormation), and configuration management tools (Ansible, Puppet, Chef).

  • Proven track record of incident management and troubleshooting in production environments.

  • Strong analytical, problem-solving, and communication skills.


Experience

  • 5+ years of experience in Site Reliability Engineering, DevOps, or Systems Engineering.

  • Demonstrated experience in designing, scaling, and maintaining high-traffic backend infrastructure.

  • Experience working in cross-functional, globally distributed teams is a plus.


Working Hours

  • Standard work hours: 40 hours/week (flexible and remote-friendly).

  • Occasional on-call rotations for incident response may be required, compensated with appropriate time off.


Knowledge, Skills, and Abilities

  • Strong understanding of high availability, fault tolerance, and disaster recovery strategies.

  • Ability to analyze large-scale system performance and implement solutions for optimization.

  • Excellent collaboration skills to work effectively with software developers, product managers, and other stakeholders.

  • Ability to mentor and train junior engineers in best practices for reliability and operations.

  • Proactive mindset with the ability to drive improvements and anticipate potential issues.


Benefits

  • Competitive salary and performance-based bonuses.

  • Fully remote and flexible working options.

  • Comprehensive health, dental, and vision insurance.

  • Generous paid time off and parental leave policies.

  • Professional development budget and access to training programs.

  • Collaborative and inclusive work culture with global exposure.


Why Join Global MNC Tech?

  • Work on cutting-edge backend technologies that impact millions of users globally.

  • Be part of a fast-growing, innovative organization that values reliability, performance, and operational excellence.

  • Collaborate with highly skilled engineers in a culture of learning and continuous improvement.

  • Make a tangible impact by shaping the infrastructure that powers the next generation of digital services.


How to Apply

Interested candidates are encouraged to submit:

  • A current resume highlighting relevant SRE experience.

  • A cover letter detailing your achievements in scalable backend infrastructure and reliability engineering.

Submit applications via us or email to us with the subject line: Senior SRE – Scalable Backend Infrastructure Application.

Additional Details

Similar Jobs

Apply Now