Lead Network Load Balancing Engineer
Design, build, and maintain data ingestion and transformation pipelines that aggregate observability data from multiple systems (infrastructure, applications, APIs, and cloud services).
Integrate diverse telemetry sources into enterprise observability platforms (e.g., Splunk, Datadog, Prometheus, Grafana, Elastic Stack, New Relic, or Dynatrace).
Develop custom collectors, exporters, or plugins using OpenTelemetry, Fluentd/Fluent Bit, or other open-source agents.
Implement and manage observability instrumentation across applications and services using APM agents, SDKs, and tracing frameworks.
Automate deployment and configuration of monitoring tools using Terraform, Ansible, Helm, or CI/CD pipelines.
Develop reusable scripts and modules for telemetry onboarding and dashboard creation.
Maintain observability-as-code practices for reproducibility and governance.
Define and enforce data models, naming conventions, and tagging standards for metrics, logs, and traces.
Collaborate with SRE and DevOps teams to ensure consistent telemetry across all environments.
Validate and optimize data ingestion for performance, cost, and relevance.
Research and evaluate new observability tools, APIs, and technologies to enhance integration capabilities.
Contribute to improving telemetry quality, automation, and visualization.