Lucknow
FULL_TIME
Skilled work
Global MNC Tech is seeking an experienced and highly motivated Senior Site Reliability Engineer (SRE) – Observability & Logging to join our global technology team. This role is critical in ensuring the reliability, scalability, performance, and observability of our large-scale, cloud-native systems.
As a Senior SRE, you will lead the design and implementation of advanced observability platforms with a strong focus on logging, monitoring, and analytics using the ELK Stack (Elasticsearch, Logstash, Kibana) and modern telemetry tools. You will work closely with engineering, DevOps, platform, and security teams to create proactive monitoring solutions, reduce system downtime, and drive data-driven operational excellence across distributed systems.
Design, build, and maintain enterprise-grade observability and logging platforms using ELK Stack.
Architect scalable log ingestion pipelines for high-volume distributed systems.
Define and implement monitoring strategies for microservices, cloud infrastructure, and applications.
Create dashboards, alerts, and visualizations to improve system visibility and operational insights.
Lead root cause analysis (RCA) for production incidents using logs, metrics, and traces.
Automate reliability processes including alerting, remediation, and capacity planning.
Collaborate with development teams to integrate observability into CI/CD pipelines.
Establish SLOs, SLIs, and error budgets to drive reliability engineering practices.
Mentor junior SREs and engineers on observability best practices.
Continuously optimize performance, cost, and data retention strategies for logging systems.
Strong hands-on experience with ELK Stack (Elasticsearch, Logstash, Kibana).
Proficiency in Linux/Unix systems administration.
Experience with cloud platforms such as AWS, Azure, or GCP.
Strong scripting skills in Python, Bash, or similar languages.
Experience with containerization and orchestration (Docker, Kubernetes).
Knowledge of observability tools such as Prometheus, Grafana, OpenTelemetry, Splunk, or Datadog.
Solid understanding of networking, distributed systems, and system architecture.
Experience working with CI/CD tools (Jenkins, GitLab, GitHub Actions, ArgoCD).
Excellent problem-solving and analytical skills.
6+ years of experience in SRE, DevOps, or Systems Engineering roles.
At least 3+ years of hands-on experience building and managing observability platforms.
Experience supporting high-availability, mission-critical production systems.
Proven experience working in large-scale, cloud-native environments.
Full-time position (40 hours per week).
Flexible working hours aligned with global team operations.
On-call rotation may be required for critical incident response.
Remote or hybrid work options depending on location and team needs.
Deep understanding of reliability engineering principles and SRE practices.
Ability to analyze complex system behavior and troubleshoot performance issues.
Strong communication skills to collaborate with cross-functional teams.
Leadership mindset with the ability to influence technical decisions.
Capability to handle pressure in high-impact production environments.
Passion for automation, system resilience, and continuous improvement.
Competitive salary and performance-based bonuses.
Comprehensive health insurance and wellness programs.
Paid time off, holidays, and flexible leave policies.
Learning and development budget for certifications and training.
Access to global projects and cutting-edge technologies.
Retirement benefits and long-term career growth opportunities.
Remote work flexibility and work-life balance initiatives.
At Global MNC Tech, we build technology that powers businesses worldwide. You will be part of a global engineering culture that values innovation, ownership, and technical excellence. As a Senior SRE, you will have a direct impact on system reliability and user experience while working on complex, large-scale systems used by millions of users.
We offer a collaborative environment, exposure to global enterprise projects, and a strong focus on continuous learning and career progression.
Interested candidates are invited to submit their updated resume along with a brief cover letter highlighting their experience in SRE and observability platforms.