Job ID: 2
SRE
Location: In Office
Experience: 1-3 years
Job Description:
We are looking for a talented Site Reliability Engineer (SRE) to join our team full-time. As an SRE, you will play a critical role in ensuring the reliability, availability, and performance of our systems and applications. You will work closely with our development, operations, and infrastructure teams to implement best practices and improve our overall reliability and scalability.
Please send your CV by email to [email protected] after submitting your application.
Responsibilities:
- Design, build, and maintain scalable, reliable, and efficient systems and infrastructure.
- Implement and manage monitoring, alerting, and logging solutions to proactively identify and address issues.
- Automate deployment, configuration, and orchestration processes using tools such as Ansible, Terraform, or Kubernetes.
- Conduct performance analysis and capacity planning to ensure optimal system performance and resource utilization.
- Troubleshoot and resolve complex technical issues related to infrastructure, networking, and application services.
- Implement and enforce security measures to protect against vulnerabilities and ensure compliance with industry standards.
- Collaborate with development teams to optimize applications for performance, reliability, and scalability.
- Participate in the on-call rotation and respond to incidents promptly to minimize downtime and service disruptions.
- Continuously improve processes and workflows to increase efficiency and reduce manual intervention.
- Stay updated on the latest trends and advancements in site reliability engineering and cloud technologies.
Requirements:
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- Proven experience as a Site Reliability Engineer, DevOps Engineer, or Systems Engineer.
- Strong understanding of Linux operating systems and shell scripting.
- Experience with cloud platforms such as AWS, Azure, or Google Cloud Platform.
- Proficiency in infrastructure as code (IaC) tools such as Terraform or CloudFormation.
- Knowledge of containerization and orchestration technologies such as Docker and Kubernetes.
- Familiarity with monitoring and logging tools such as Prometheus, Grafana, ELK Stack, or Splunk.
- Strong problem-solving skills and attention to detail.
- Excellent communication and interpersonal skills.
- Ability to work independently as well as collaboratively in a team environment.
- Certification in relevant technologies (e.g., AWS Certified DevOps Engineer, Certified Kubernetes Administrator) is a plus.
Benefits:
- Competitive salary package commensurate with experience and skills.
- Professional development opportunities and reimbursement for certifications.
- Dynamic and collaborative work environment with opportunities for growth and advancement.
- Employee discounts and wellness programs.
Date Posted: 2024-03-04
AWS, GCP, IaC, Networking, Linux, Git