MulticoreWare

Cloud Engineer – Observability

Job Description:

Responsible for architecting, designing, and implementing our comprehensive observability stack, including tracing, telemetry, logging, health monitoring, visualization, and dashboards. You will play a key role in ensuring the reliability, performance, and operational efficiency of our services.

Key Responsibilities:
  • Design and implement a robust observability framework using technologies like Prometheus, Grafana, OpenTelemetry, ELK Stack, Zabbix, and Jaeger.
  • Develop and maintain health monitoring and alerting systems for our OpenStack and Kubernetes-based platforms, with a focus on GPU-supported environments.
  • Create and manage visualization dashboards to monitor system performance, resource utilization, and operational health.
  • Implement scalable, distributed logging and tracing solutions to diagnose, troubleshoot, and resolve system issues effectively.
  • Collaborate with development and operations teams to integrate observability practices into the development lifecycle.
  • Conduct performance analysis and optimization to ensure system reliability and efficiency.
  • Stay updated with the latest trends and technologies in observability and performance monitoring.
Qualifications:
  • Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field.
  • Proven experience in observability, monitoring, and system performance analysis, particularly in a cloud or data center environment.
  • Expertise in implementing and managing observability tools such as Prometheus, Grafana, OpenTelemetry, ELK Stack, Zabbix, and Jaeger.
  • Strong understanding of container orchestration using Kubernetes, and familiarity with OpenStack and GPU computing.
  • Proficiency in scripting and automation using languages such as Python, Shell, or Go.
  • Excellent problem-solving skills and the ability to work independently or as part of a team.
  • Strong communication skills and the ability to work in a fast-paced, dynamic environment.