Job Title:  Consultant | Data Analytics | Delhi | Operations, Industry & Domain Solutions

Job requisition ID:  103530
Date:  Apr 27, 2026
Location:  Delhi
Designation:  Consultant
Entity:  Deloitte Touche Tohmatsu India LLP

Job Title: Senior DevOps Engineer (Cloud, On-Prem & Open Source Infrastructure)

Experience Required

5+ years of hands-on experience in DevOps, SRE, or Platform Engineering roles

Role Overview

We are seeking a Senior DevOps Engineer who can design, build, and operate highly available, scalable, and secure infrastructure across cloud and on-prem environments. The role requires deep expertise in automation, infrastructure as code, CI/CD, observability, and open-source ecosystems.

You will be responsible for building end-to-end delivery pipelines, ensuring system reliability, and enabling engineering teams with robust platform capabilities.

Key Responsibilities

1. Infrastructure Design & Management

  • Design and implement highly available (HA), fault-tolerant, and scalable architectures
  • Manage infrastructure across AWS / GCP / Azure (strong hands-on experience with at least one is required)
  • Build and maintain hybrid infrastructure (cloud + on-prem)
  • Implement disaster recovery (DR), backup, and failover strategies

2. Infrastructure as Code (IaC) & Automation

  • Develop and manage infrastructure using:
      • Terraform (mandatory)
      • Ansible / Puppet / Chef (or a comparable automation tool)
  • Create reusable modules and standardized infrastructure patterns
  • Automate provisioning, configuration, and environment management

3. CI/CD & Release Engineering

  • Build and maintain robust CI/CD pipelines using Jenkins / GitHub Actions / GitLab CI / ArgoCD / Tekton
  • Implement:
      • Blue-Green deployments
      • Canary releases
      • Rolling deployments
  • Automate build, test, security scanning, and deployment processes

4. Containerization & Orchestration

  • Hands-on experience with:
      • Docker
      • Kubernetes (mandatory)
  • Manage production-grade Kubernetes clusters (EKS / GKE / AKS, or on-prem platforms such as OpenShift / Rancher)
  • Helm charts / Kustomize for deployment management
  • Service mesh (Istio / Linkerd – preferred)

5. Observability & Monitoring (Full Stack)

Design and implement an end-to-end observability stack:

Monitoring & Metrics

  • Prometheus, Grafana, Datadog, New Relic

Logging

  • ELK Stack (Elasticsearch, Logstash, Kibana)
  • EFK Stack (Elasticsearch, Fluentd / Fluent Bit, Kibana)
  • Loki (Grafana stack)

Tracing

  • Jaeger / OpenTelemetry / Zipkin

Alerting

  • Alertmanager, PagerDuty, Opsgenie

6. Cloud & Platform Engineering

Strong hands-on experience with at least one of:

  • AWS (EC2, VPC, EKS, S3, IAM, RDS, CloudWatch)
  • OR GCP (GKE, IAM, Cloud Build, Cloud Monitoring, formerly Stackdriver)
  • OR Azure (AKS, Azure DevOps, Azure Monitor, VNets)

7. Networking, Security & Reliability

  • Deep understanding of:
      • Load balancing (Nginx, HAProxy, ALB/NLB)
      • DNS, CDN, routing, VPC design
  • Security best practices:
      • IAM, RBAC, secrets management (Vault, AWS Secrets Manager)
      • TLS/SSL, certificate management
  • Implement zero-downtime deployments and high availability architectures

8. Open Source Ecosystem

Strong exposure to open-source tools used in production:

  • Kubernetes ecosystem tools
  • Terraform, Ansible
  • Prometheus, Grafana, ELK
  • Linux system internals and tooling
  • GitOps tools (ArgoCD / FluxCD)

9. Scripting & Programming

  • Strong scripting skills in Python / Bash / Go (preferred)
  • Ability to build automation tools and internal DevOps utilities

Required Technical Expertise Summary

  • One cloud platform (AWS/GCP/Azure) – strong hands-on
  • On-prem infrastructure management experience
  • Kubernetes at production scale
  • Terraform (must-have)
  • CI/CD pipelines (end-to-end ownership)
  • Observability stack (metrics, logs, traces)
  • HA & scalable system design
  • Linux system administration
  • Open-source-first engineering mindset

Preferred Qualifications

  • Experience with microservices architecture
  • GitOps implementation experience
  • Service mesh (Istio/Linkerd)
  • FinOps / cost optimization in cloud
  • SRE practices (SLI/SLO/SLA)
  • Performance tuning and capacity planning

Soft Skills

  • Strong ownership and accountability
  • Ability to troubleshoot complex distributed systems
  • Good collaboration with development and security teams
  • Proactive mindset for automation and reliability improvements
  • Comfortable working in fast-paced engineering environments

Educational Qualification: BE / B.Tech, MCA