Kota
Full-time
Skilled work
Global MNC Tech is seeking a highly skilled and experienced Site Reliability Engineering (SRE) Lead to oversee and enhance the reliability, performance, and scalability of our global network infrastructure. This role requires a visionary leader with deep expertise in cloud architecture, distributed systems, and network operations. The SRE Lead will drive initiatives to improve system availability, implement proactive monitoring, and foster a culture of resilience and automation across global teams.
Lead a global SRE team to ensure the highest levels of network reliability, availability, and performance.
Define and implement SRE best practices, including SLIs, SLOs, and error budgets for critical systems.
Develop strategies for proactive incident detection, root cause analysis, and rapid remediation of network issues.
Collaborate with engineering, operations, and product teams to ensure system reliability is integrated into design and development.
Drive automation of operational tasks, monitoring, and alerting to reduce manual intervention and improve efficiency.
Oversee capacity planning, performance tuning, and risk assessments to support global network scaling.
Mentor and guide team members, fostering continuous learning, knowledge sharing, and professional growth.
Partner with cross-functional teams to design disaster recovery and business continuity strategies.
Strong expertise in SRE principles, distributed systems, and global network operations.
Proficiency with cloud platforms such as AWS, Azure, or GCP, and with hybrid and multi-cloud architectures.
Deep knowledge of network protocols (TCP/IP, BGP, DNS, HTTP/HTTPS) and network troubleshooting tools.
Experience with infrastructure as code tools (Terraform, Ansible, CloudFormation).
Familiarity with monitoring, logging, and observability tools (Prometheus, Grafana, ELK stack).
Strong programming/scripting skills in Python, Go, Bash, or similar languages.
Excellent leadership, communication, and stakeholder management skills.
8–10 years of experience in site reliability, network engineering, or related roles.
3–5 years in a leadership or technical management capacity.
Proven experience managing global-scale, high-availability networks and critical infrastructure.
Full-time role that requires flexible coordination with teams across global time zones.
May require occasional off-hours support for incident response or global network maintenance.
Strong analytical and problem-solving capabilities.
Ability to make data-driven decisions and prioritize initiatives effectively.
Excellent project management skills and ability to lead cross-functional teams.
Ability to foster a culture of resilience, automation, and operational excellence.
Exceptional communication skills for both technical and non-technical stakeholders.
Competitive salary with performance-based incentives.
Comprehensive health, dental, and vision coverage.
Generous paid time off, holidays, and parental leave.
Professional development programs and training opportunities.
Flexible work arrangements and remote work support.
Opportunities to work on cutting-edge technologies at a global scale.
At Global MNC Tech, we are committed to creating a culture of innovation, collaboration, and excellence. Joining our team means working on world-class technology, shaping the future of global network reliability, and advancing your career in an inclusive and empowering environment.
Interested candidates should submit a resume and a cover letter outlining relevant experience and leadership achievements, using the subject line "SRE Lead – Global Network Reliability Application".