GCP Data Architect
Candidate LinkedIn profile required.
Key Responsibilities:
- Install, configure, and upgrade Google AgentSpace and related agents (Ops Agent, Logging Agent, Monitoring Agent) across Linux/Windows systems and container environments.
- Design and implement automation for agent and connector deployment using scripts, orchestration tools (e.g., Ansible, Terraform), or GCP-native solutions.
- Integrate Google Cloud Monitoring and Logging with third-party tools such as:
  - ServiceNow (for alerts/incidents)
  - Splunk or Elastic (for log ingestion)
  - Prometheus/Grafana or Dynatrace/AppDynamics (for metric visualization)
- Install and configure connectors/adapters for third-party observability and incident management platforms.
- Develop and optimize custom queries (e.g., Monitoring Query Language (MQL) and Logging queries) to extract meaningful insights from telemetry data.
- Create and maintain dashboards and reports for monitoring, SLA tracking, system health, and compliance.
- Collaborate with SRE, CloudOps, and application teams to enable alerting, anomaly detection, and correlation across systems.
- Troubleshoot agent or connector-related issues, including telemetry gaps, broken integrations, or data latency.
- Maintain configuration baselines, deployment templates, and operational documentation.
- Coordinate with Google Cloud support and third-party vendors for issue resolution, RCA, and product enhancements.
Required Skills & Qualifications:
- 5–7+ years of hands-on experience in monitoring agent and connector deployment in enterprise/cloud environments.
- In-depth knowledge of Google Cloud Operations Suite (Monitoring, Logging, Trace, Profiler).
- Experience in:
  - Installing and managing the Ops Agent and Logging Agent
  - Deploying connectors to third-party tools (ServiceNow, Splunk, Dynatrace, etc.)
  - Writing Monitoring Query Language (MQL) and Logging queries
- Hands-on experience building queries and custom dashboards for visualization and analytics.
- Strong scripting skills (Bash, Python, PowerShell).
- Familiarity with Infrastructure as Code (IaC) tools (Terraform, Ansible).
- Exposure to Kubernetes (GKE), Compute Engine, and hybrid environments.
- Good understanding of telemetry pipelines, log forwarding, and metric ingestion.
- Experience with reporting tools such as Looker, Grafana, or Looker Studio (formerly Google Data Studio) is a plus.
- Google Cloud certification (Associate Cloud Engineer or Professional Cloud DevOps Engineer) is preferred.