Sahasrara
Job ID: 1

DevOps Engineer – On-Premise Infrastructure (Linux, SLURM, Kubernetes, Ansible, CI/CD)

Location: Onsite

Experience: 5+ years

Experience/Skills:

  • Strong expertise in Linux system administration (8+ years).
  • In-depth knowledge of SLURM workload manager and scheduling policies.
  • Experience deploying and managing Kubernetes clusters (on-premise or cloud-native).
  • Proficiency in Ansible for configuration management and automation.
  • Solid understanding of CI/CD concepts and tools (e.g., Jenkins, GitLab CI, ArgoCD).
  • Familiarity with networking, firewalls, DNS, and system hardening techniques.
  • Scripting skills (e.g., Bash, Python) for automation and monitoring.

Responsibilities:

  • Manage and maintain on-premise Linux server environments (CentOS, RHEL, Ubuntu).
  • Administer and optimize SLURM clusters for high-performance computing workloads.
  • Design, deploy, and manage containerized applications using Kubernetes.
  • Automate infrastructure provisioning and configuration using Ansible.
  • Develop and maintain robust CI/CD pipelines (e.g., GitLab CI, Jenkins) for application delivery.
  • Monitor system health, performance, and security using appropriate tools and best practices.
  • Troubleshoot system issues, identify root causes, and implement long-term solutions.
  • Collaborate with software development, QA, and IT operations teams to align on deployment and support strategies.
  • Document processes, configurations, and best practices.

Preferred Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or related field.
  • Experience with monitoring tools (Prometheus, Grafana, ELK stack).
  • Knowledge of infrastructure-as-code (e.g., Terraform).
  • Familiarity with containerization technologies (Docker, Podman).

    Date Posted: 2025-04-23

    Kindly share your CV with us at [email protected].

    © Sahasrara Metatech Private Limited. All rights reserved