Facebook Pixel

Job Description

The Senior System Reliability Engineer is responsible for ensuring the stability and performance of complex systems and applications. This role involves designing, implementing, and maintaining robust systems that meet service-level requirements. The engineer will work closely with cross-functional teams to identify potential issues, conduct root cause analysis, and implement solutions to improve system reliability.


Responsibilities

  • Design and implement strategies to improve the reliability and availability of systems and applications.
  • Set up monitoring and alerting systems to detect potential performance issues.
  • Conduct performance testing and analysis to identify bottlenecks and areas for improvement.
  • Develop and maintain documentation for system architecture, processes, and procedures.
  • Troubleshoot and resolve complex issues related to system reliability and performance.
  • Collaborate with development teams to define and implement best practices for software deployment and orchestration.
  • Perform root cause analysis of system failures and implement preventive measures.
  • Continuously monitor system performance and initiate corrective actions as necessary.
  • Participate in on-call rotation for system emergencies and provide technical support when needed.
  • Stay up-to-date with emerging technologies and industry trends.
  • Bachelor's degree in computer science, engineering, or a related field.
  • Minimum of 5 years of experience in system reliability engineering or related roles.
  • Strong knowledge of Linux operating systems and command-line tools.
  • Proficiency in scripting languages, such as Python or Bash.
  • Experience with cloud infrastructure, such as AWS or Azure.
  • Familiarity with containerization technologies, such as Docker or Kubernetes.
  • Understanding of networking protocols and troubleshooting techniques.
  • Excellent problem-solving and analytical skills.

  • Bachelor's degree in computer science, engineering, or a related field.
  • Minimum of 5 years of experience in system reliability engineering or related roles.
  • Strong knowledge of Linux operating systems and command-line tools.
  • Proficiency in scripting languages, such as Python or Bash.
  • Experience with cloud infrastructure, such as AWS or Azure.
  • Familiarity with containerization technologies, such as Docker or Kubernetes.
  • Understanding of networking protocols and troubleshooting techniques.
  • Excellent problem-solving and analytical skills.

Job Details

Role Function: N/A Work Type: Full-Time
Role Level: Mid-Level Country: United Arab Emirates
City: Dubai Number of Vacancies: 1
Job Category: Engineering Company Website: https://www.talentmate.com/
Skills & Expertise
Good Communication Skill Attention to detail

What We Offer

  • Health Insurance
  • Visa
  • Paid Annual Leaves
  • Maternity and Paternity Leaves

About the Company

Searching, interviewing and hiring are all part of the professional life. The TALENTMATE Portal idea is to fill and help professionals doing one of them by bringing together the requisites under One Roof. Whether you're hunting for your Next Job Opportunity or Looking for Potential Employers, we're here to lend you a Helping Hand.

Report

Similar Jobs

Disclaimer: talentmate.com is only a platform to bring jobseekers & employers together. Applicants are advised to research the bonafides of the prospective employer independently. We do NOT endorse any requests for money payments and strictly advice against sharing personal or bank related information. We also recommend you visit Security Advice for more information. If you suspect any fraud or malpractice, email us at abuse@talentmate.com.