IT Ops Specialist role (Google Cloud Platform Support)

Full time @TD BANK in Information Technology (IT) Email Job

Job Detail

  • Job ID 11085
  • Experience  2 Years
  • Qualifications  Degree Bachelor
Bottom Promo

Job Description

“In this role, you’ll join a team to provide 24/7 support of our Google Cloud Platform (GCP) environment, focusing on operational support and optimizing its health and performance. Your expertise in Kubernetes Engine (GKE) will be crucial, as you’ll oversee the management and security of our containerized applications. This includes ensuring efficient resource allocation and adherence to best practices for container deployments. Additionally, you’ll be responsible for monitoring the performance and availability of GCP at a platform level, proactively identifying and resolving any potential issues.

24/7 On-call support for GCP Public Cloud Operations.

Responsible for DEV to PROD GCP Cloud PaaS/IaaS support and processes. This is to ensure quality, performance, and availability of Public Cloud services (GCP).

The successful candidate must have demonstrated ability to learn new technologies and processes, resolve incidents, and solving problems by collaborating with others.

The candidate will be responsible for providing operational support for platforms and infrastructure hosted on TD’s GCP Public Cloud. The role requires familiarity with ITIL processes (change, incident, and problem management) and availability for off-hours escalated support.

Provide planning, communication, and reporting of day-to-day ticket metrics and longer-term tactical objectives.

Level 2 support of TD business line GCP Cloud infrastructure including PaaS/IaaS/Containers across all production and test environments.

Manage non-standard/complex P1, P2 (major incidents), and P3 and P4 incidents and service requests.

Ensure customer service satisfaction and enable continuous improvements.

Oversee higher complexity operational and preventive maintenance tasks.

Manage complex remedial and unscheduled urgent changes.

Able to be accessible via a mobile device to support on-call escalations.

Drive root cause analysis on repeatable incidents to help prevent issues in the future.

Creation of support documentation and scripts.

Oversee vendor’s service delivery and escalation.

Provide operational consultancy for future-state technologies.

Prioritize activities to align with compliance, regulatory requirements, and business objectives.

Keep informed of technology solutions initiatives and IT direction to provide strong support to the businesses

Current or prior experience supporting GKE (Google Kubernetes Engine) – MUST HAVE

Current or prior experience supporting GCP (Google Cloud Platform) – MUST HAVE

Strong to expert knowledge of supporting GCP including GKE workloads
Familiar with supporting GCP services such as BigQuery, Cloud SQL (SQL/PostgreSQL), REDIS, Cassandra, BigTable, Cloud Filestore, Persistent Storage, Apigee, Kafka, Dataflow, GCS.
Experience and knowledge supporting an Azure Public Cloud environment (while not necessary) would be valuable.
Thorough problem determination skills to troubleshoot and r esolve business application issues .
Knowledge with OS technologies (Windows, RedHat Linux).
Familiar with CI/CD tools such as Github, Jenkins, etc.
DevOps and Agile understanding.
Working knowledge of Local Area Networks (LAN) and Wide Area Networks (WAN).
Comfortable with working in a rapidly changing, technically complex environment.
Knowledge of scripting languages and tools such as Python, JavaScript, Powershell, Bash.
Comfortable with the Agile methodology.
The successful applicant must have a solid understanding of incident, change, and problem management methodologies as well as solid experience in a large, high-performance production environment.”

Bottom Promo

Required skills

Other jobs you may like