Platform & Site Reliability Engineer- SSE

Company: CGI
Apply for the Platform & Site Reliability Engineer- SSE
Location: Bengaluru
Job Description:

Position Description:

Company Profile:Founded in , CGI is among the largest independent IT and business consulting services firms in the world. With 94, consultants and professionals across the globe, CGI delivers an end-to-end portfolio of capabilities, from strategic IT and business consulting to systems integration, managed IT and business process services and intellectual property solutions. CGI works with clients through a local relationship model complemented by a global delivery network that helps clients digitally transform their organizations and accelerate results. CGI Fiscal reported revenue is CA$14.68 billion and CGI shares are listed on the TSX (GIB.A) and the NYSE (GIB). Learn more at .

Platform & Site Reliability Engineering- SSE

Job Description:

Looking for a highly skilled Senior resource to join our Platform & Reliability Engineering team. In this role, you will be responsible for designing, building, and maintaining scalable and reliable platform solutions that empower software and AIOps delivery. Accelerate AIOps delivery and SRE operations, high-elite performance as measured by DORA. Key Responsibilities:

. Platform Development: Design, implement, and optimize core platform services, APIs, and automation frameworks to support software development, AIOps, and SRE operations. . Infrastructure as Code (IaC): Develop and maintain infrastructure using tools such as Terraform. . Cloud Engineering: Architect and optimize cloud-native solutions in GCP and on-prem OpenShift, ensuring reliability, scalability, and cost-efficiency. . Automation & CI/CD: Implement and enhance CI/CD pipelines using tools such as GitHub Actions, GitLab CI, Jenkins, or ArgoCD to improve software delivery. Ensure standardized DORA observability across prioritized development programs using Gathr as the platform. . Observability & Performance: Establish monitoring, logging, and alerting strategies using Prometheus, Grafana, OpenTelemetry, NewRelic, or similar technologies. Enable % SLO observability for onboarded services in SRE. . Security & Compliance: Embed security best practices in platform solutions, including identity management, secrets management, and policy enforcement. . AIOps & SRE Enablement: Support AIOps 24/7 in production through SRE and enhance automation capabilities for proactive incident resolution. . Decommissioning & Optimization: Contribute to decommissioning NSO-ONAP tenant software and optimizing platform services . Technical Leadership: Provide mentorship and guidance to junior developers and advocate for engineering excellence and DevOps culture.

Required Skills & Experience:

. 7+ years of professional software development experience, with at least 3 years focused on platform engineering, DevOps, or SRE. . Proficiency in at least one programming language such as Python, Go, Java, or Rust. . Hands-on experience with cloud platforms (GCP and on-prem OpenShift) and cloud-native technologies such as Kubernetes. . Strong knowledge of Infrastructure as Code (Terraform). . Experience with containerization technologies (Docker, Kubernetes, Helm). . Expertise in CI/CD tooling and best practices for software deployment and release automation. . Familiarity with monitoring, logging, and observability tools (Prometheus, Grafana, OpenTelemetry, NewRelic, ELK stack). . Strong problem-solving skills and a track record of delivering scalable and maintainable solutions. . Excellent communication and collaboration skills, with experience working in agile environments.

Nice to Have:

. Experience with service meshes and API gateways. . Knowledge of SRE principles and reliability engineering. . Experience with FinOps and cost optimization in cloud environments. . Exposure to policy-as-code frameworks.

Skills:

  • DevOps
  • Google Cloud Platform
  • Kubernetes
  • Python
  • Terraform
  • Posted: February 25th, 2026