Your browser does not support javascript! Please enable it, otherwise web will not work for you.

GCP Platform Engineer @ Fint Solutions

Home > Production & Manufacturing - Other

 Posted 46 days ago — confirm the vacancy is still active.

 GCP Platform Engineer

Job Description

GCP Platform Engineer

Role: GCP Platform Engineer (L3 Production Platforms) (Hands-on)

Location: Chennai ( On-site)

Experience: 8+ years overall (minimum 3+ years hands-on in Google Cloud Platform)

Position Type: Full-time

Mode: Onsite (5 days working from office)

Role & responsibilities

Role Summary

We are hiring a GCP Platform Engineer (L3) with real-world experience engineering and supporting production-grade data platforms on Google Cloud Platform.

This role requires a platform engineering mindset with deep expertise in GCP service configuration, runtime behaviour, monitoring, performance tuning, automation, and security compliance. The engineer will design, configure, operate, and troubleshoot GCP data platform services and act as the L3 escalation point for complex platform issues raised by development teams.

The ideal candidate understands how configuration parameters influence service behaviour, performance, cost, and reliability, and is comfortable debugging issues across distributed systems in live production environments.

Key Responsibilities

Platform Engineering & Production Ownership

  • Engineer, configure, and operate production-grade GCP data platform services, including:
  • Dataproc
  • Dataflow
  • Cloud Composer (Airflow)
  • Pub/Sub
  • Google Cloud Storage (GCS)
  • Own platform configurations to ensure high availability, performance, scalability, and security compliance.
  • Design and maintain Terraform-based Infrastructure-as-Code (IaC) modules for standardized and repeatable platform provisioning.
  • Implement and enforce security and governance controls, including:
  • IAM least-privilege models
  • CMEK
  • VPC Service Controls
  • Organization policies
  • Workload Identity

L3 Troubleshooting & Deep Technical Support

  • Act as the L3 escalation point for platform-related issues from data and application engineering teams.
  • Troubleshoot complex production issues, including:
  • Dataproc / Spark / YARN job failures and performance bottlenecks
  • Dataflow pipeline backlogs, worker tuning, and throughput issues
  • Composer / Airflow scheduler failures, DAG dependency issues
  • Pub/Sub throughput, retention, and delivery behaviour issues
  • Analyse and clearly explain how GCP service configurations impact runtime behaviour, monitoring metrics, performance, and cost.
  • Perform root-cause analysis (RCA) and implement long-term engineering fixes rather than short-term workarounds.

Monitoring, Performance & Reliability Engineering

  • Design and maintain monitoring, alerting, and observability using Cloud Monitoring and logging.
  • Interpret service-level metrics and logs to diagnose:
  • Performance degradation
  • Scaling and capacity bottlenecks
  • Reliability and availability risks
  • Tune platform configurations for optimal performance, reliability, and cost efficiency.
  • Ensure platforms meet production uptime, SLA, and compliance requirements.

Automation, CI/CD & Engineering Tooling

  • Build automation using Terraform, Python, and scripting to minimize manual intervention.
  • Integrate CI/CD pipelines for:
  • Platform configuration changes
  • Cloud Composer DAG deployments
  • Dataflow template promotions
  • Develop reusable frameworks for:
  • Dependency packaging
  • Deployment workflows
  • Environment provisioning
  • Operational consistency

Platform Standards, Documentation & Enablement

  • Define and enforce platform standards and best practices.
  • Create and maintain:
  • Platform runbooks
  • Troubleshooting guides
  • Onboarding documentation
  • Enable development teams through self-service tooling, templates, and clear usage guidelines.

What This Role Is NOT

To avoid confusion, this role is explicitly not the following:

  • Not an L1 / L2 Operations or Support role no ticket triage, alert acknowledgment, or routine support tasks.
  • Not a shift-based or NOC role no 247 rotations or follow-the-sun support.
  • Not a DevOps-only role CI/CD is an enabler, not the primary responsibility.
  • Not a pure Data Engineer role focus is on platform engineering, not business data transformations.
  • Not a Cloud Administrator role requires deep understanding of service behaviour, not just provisioning.
  • Not a break-fix role expectation is to engineer permanent solutions, automation, and standards.

Required Skills

GCP Platform & Data Services

  • Strong hands-on production experience with:
  • Dataproc (autoscaling, HA, init actions, Spark/YARN debugging)
  • Dataflow (Flex Templates, worker sizing, backlog handling)
  • Cloud Composer / Airflow (scheduler behaviour, scaling, DAG troubleshooting)
  • Pub/Sub (throughput, retention, delivery semantics)
  • GCS (IAM, lifecycle policies, CMEK, cross-project access)
  • Strong understanding of how configuration parameters impact performance, reliability, monitoring, and cost.

Engineering & Automation

  • Proven experience with Terraform for IaC.
  • Strong Python scripting skills (additional scripting languages are a plus).
  • Experience integrating CI/CD pipelines for cloud and data platforms.
  • Solid understanding of distributed systems, Spark, and Hadoop.

Monitoring, Performance & Databases

  • Experience analyzing production metrics and logs.
  • Ability to explain performance and reliability impacts of configuration changes.
  • Working knowledge of databases (SQL / NoSQL) is a strong plus.

Leadership & Ownership

  • Ability to lead platform initiatives while remaining hands-on.
  • Strong communication skills to explain complex platform behavior clearly.
  • High ownership mindset with strong documentation discipline.

Preferred Certifications

  • Google Professional Data Engineer
  • Google Professional Cloud Architect
  • Google Associate Cloud Engineer

Note: Looking for Immediate or 15 days notice period.

If Interested you can share me your updated resume on sr***********u@fi****c.com.

Job Classification

Industry: Software Product
Functional Area / Department: Production, Manufacturing & Engineering
Role Category: Production & Manufacturing - Other
Role: Production & Manufacturing - Other
Employement Type: Full time

Contact Details:

Company: Fint Solutions
Location(s): Chennai

+ View Contactajax loader


Keyskills:   Pubsub Infrastructure As Code Dataproc Security Cloud Storage L3 L3 Escalations Iac Ci/Cd Platform Standards Composing IAM Terraform GCP GCS Data Flow Google Cloud Storage Google Cloud Platforms governance Python

 Fraud Alert to job seekers!

₹ Not Disclosed

Fint Solutions

Axis Max Life Insurance is one of the trusted name amongst the several most admired companies in financial domain.

Job Listings