Observability Engineer (Grafana, Elasticsearch and ELK Stack), Amsterdam

Omschrijving:

Voor onze eindklant KPN is Harvey nash op zoek naar een Observability Engineer

Start: 01-09-2025

Eind: 31-12-2025 (verlenging mogelijk)

Standplaats: Amsterdam

Uren: 32

We are seeking a skilled Observability Engineer to design, implement, and manage observability solutions using Grafana, Elasticsearch, and the ELK Stack (Elasticsearch, Logstash, Kibana). The ideal candidate will enhance the visibility into our systems, applications, and infrastructure, enabling proactive monitoring, rapid incident resolution, and data-driven decision-making. You will collaborate with cross-functional teams to ensure system reliability, performance, and scalability in a dynamic, cloud-native environment.

Key Responsibilities:

Observability Implementation:

  • Design and deploy Grafana dashboards to visualize metrics, logs, and traces, providing actionable insights into system performance.
  • Configure and optimize Elasticsearch and the ELK Stack for log aggregation, search, and analysis to support observability goals.
  • Integrate Grafana with Elasticsearch and other data sources (e.g., Prometheus, Open Telemetry, Cloudwatch) to create unified observability pipelines.
  • Set up alerting rules in Grafana and Kibana to notify teams of anomalies or critical issues via integrations (e.g., Slack, OpsGenie, email).

Log and Data Management:

  • Implement and manage Logstash pipelines to collect, process, and transform log data for ingestion into Elasticsearch.
  • Optimize Elasticsearch indices, mappings, and queries to ensure efficient storage, retrieval, and analysis of large-scale log and metric data.
  • Correlate logs, metrics, and traces across systems to identify root causes of issues and improve system reliability.

System Monitoring and Troubleshooting:

  • Use Grafana and Kibana to monitor infrastructure (e.g., servers, Kubernetes clusters) and application performance (e.g., latency, error rates).
  • Analyze telemetry data to detect trends, anomalies, and potential failures in distributed systems.
  • Support incident response by leveraging ELK Stack and Grafana for real-time debugging and root cause analysis.

Automation and Optimization:

  • Automate the provisioning and configuration of observability tools (Grafana, ELK Stack) using infrastructure-as-code (IaC) tools like Terraform or Ansible.
  • Optimize ELK Stack performance for high-volume log data, including index lifecycle management and resource efficiency.
  • Script automation tasks (e.g., dashboard provisioning, alert setup) using Python, Bash, or Grafana/Elastic APIs.

Collaboration and Best Practices:

  • Work with DevOps, development, and SRE teams to embed observability into development and deployment processes.
  • Promote a culture of observability by training teams on using Grafana and ELK Stack for data exploration and troubleshooting.
  • Document configurations, processes, and observability best practices for team reference.
  • Compliance and Security:
  • Ensure observability solutions meet security and compliance requirements (e.g., data retention, access controls).
  • Implement role-based access control (RBAC) in Grafana and Elasticsearch to secure sensitive data.

Experience:

  • 5-7 years of experience in observability, monitoring, or DevOps roles.
  • Hands-on expertise with Grafana for dashboard creation, alerting, and data visualization.
  • Proven experience with Elasticsearch and the ELK Stack (Logstash, Kibana) for log management and analysis.

Technical Skills:

  • Proficiency in configuring and optimizing Elasticsearch indices, mappings, and search queries.
  • Experience with Logstash for log parsing, filtering, and transformation.
  • Strong knowledge of Kibana for log visualization, dashboard creation, and data exploration.
  • Familiarity with integrating Grafana with Elasticsearch and other data sources (e.g., Prometheus, Open Telemetry).
  • Experience with cloud platforms (AWS) and containerized environments (Docker, Kubernetes).

Soft Skills:

  • Analytical mindset with strong problem-solving skills for root cause analysis.
  • Excellent communication and collaboration skills to work with cross-functional teams.
  • Ability to document processes and train others on observability tools.
  • Ability to work according to KPN guidelines


Trefwoorden:



OPDRACHT​GEVER:

bedrijfsnaam:
Harvey Nash B.V.
contactpersoon:
Amy Marriott
type:
ZZP, freelance, interim vacature
locatie:
Amsterdam
provincie:
Noord-Holland
uurtarief:
Tarief in overleg
start project:
01-09-2025
referentie:
ITC-BBBH115590
duur opdracht:
5 maanden
uren per week:
32 uur
publicatiedatum:
31-07-2025 11:37:23
terug naar zoekresultaten  |  vorige  |  volgende  |  alle vacatures