Job Description
Job Title
Cloud Infrastructure Engineering (Tech Lead - SRE) - Azure, AWS, GCP Infra Operations - Bangalore
Experience: 8+ Years
Shift: 11:00 AM to 8:00 PM / 12:00 PM to 9:00 PM IST
Position Overview
The Cloud Operations Tech Lead provides technical leadership, operational continuity, and escalation support for Fiserv's enterprise multi-cloud operations across AWS, Azure, and GCP. This role serves as the technical authority and shift-to-shift continuity point for Cloud Operations Engineers (L1) and Site Reliability Engineers (SREs, L2), ensuring consistent execution, rapid incident resolution, and continuous operational improvement.
The Tech Lead is responsible for mentoring engineers, guiding technical decision-making during incidents, and ensuring that operational practices align with SRE principles, automation-first execution, security, and compliance standards. This role bridges hands-on operations and leadership, providing stability across a global, follow-the-sun operations model.
Key Responsibilities
Technical Leadership & Operational Continuity
- Serve as the technical lead and escalation point for Cloud Operations Engineers (L1) and SREs
- Manage resource scheduling and rotational coverage to ensure shift continuity and operational objectives are met
- Provide shift-to-shift continuity, ensuring clear handoffs, consistent decision-making, and risk awareness
- Act as the final technical authority during complex or high-severity incidents
- Ensure operational consistency across regions, shifts, and teams
Incident Management & Escalation
- Lead and coordinate response to major incidents, service outages, and degraded service conditions
- Drive real-time troubleshooting, impact assessment, and recovery efforts
- Ensure incidents are escalated appropriately and resolved efficiently
- Facilitate and review post-incident reviews, root cause analysis, and corrective actions
- Track recurring issues and ensure they are addressed through automation or engineering fixes
Mentorship & Team Development
Mentor Cloud Operations Engineers and SREs on:
- Cloud infrastructure fundamentals
- Incident response and escalation best practices
- SRE concepts (reliability, toil reduction, automation)
- Support skill development and readiness for progression from Cloud Operations Engineer to Cloud Site Reliability Engineer (SRE)
- Promote operational excellence, accountability, and a blameless culture
Operational Excellence & Process Governance
- Enforce adherence to runbooks, SOPs, change management, and security policies
- Review and approve operational procedures and runbook updates
- Identify gaps in operational coverage, tooling, or documentation
- Drive standardization and continuous improvement across cloud operations
Automation & Reliability Enablement
Partner with SRE and Platform Engineering teams to:
- Reduce manual toil
- Expand automation and auto-remediation
- Improve monitoring, alerting, and observability
- Identify recurring operational patterns suitable for automation
- Ensure automation is safe, documented, and consistently executed
Cross-Functional Collaboration
Collaborate closely with:
- Application and product engineering teams
- Platform engineering
- Security and compliance teams
- Vendors and cloud service providers
- Act as the operations representative in technical discussions impacting reliability and supportability
- Communicate operational risks, trends, and improvement opportunities to leadership
Required Qualifications
- 8+ years of experience in Cloud Operations, SRE, Infrastructure Engineering, or Production Support
- Hands-on experience operating enterprise environments in AWS, Azure, and/or GCP
Strong background in:
- Incident management and escalation
- Cloud infrastructure (compute, networking, storage, IAM)
- Monitoring and observability
- Proven ability to lead technical teams during incidents
- Experience mentoring engineers in operational and reliability practices
- Strong written and verbal communication skills
- Ability to operate effectively in a 24x7 global operations model
Preferred Qualifications
- Prior experience as a Tech Lead, Lead SRE, or Senior Cloud Operations Engineer
- Strong understanding of SRE principles including reliability, error budgets, and toil reduction
- Experience driving automation using scripting or Infrastructure as Code
- Familiarity with regulated or enterprise environments (financial services preferred)
- Cloud certifications (AWS, Azure, or GCP)
Success in This Role Looks Like
- Incidents are resolved faster with clear technical leadership
- Operational handoffs are clean, consistent, and low-risk
- L1 and SRE engineers show measurable skill growth and confidence
- Recurring issues are reduced through automation and process improvements
- Cloud operations demonstrate improved reliability, stability, and predictability
- Leadership has clear visibility into operational health and risk
Job Classification
Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Technical Lead
Employment Type: Full-time
Contact Details:
Company: Fiserv
Location(s): Pune
Keyskills:
aws
technical lead
technical leadership
sre
change management
incident response
cloud
operations
automation
iam
gcp
compliance
cloud infrastructure
infrastructure as code
azure