HPC Engineer

ManpowerGroup Middle East

Saudi Arabia

Posted on: 22 Feb 2024

{{ flashMessage.message }}

JOB DESCRIPTION / ROLE

Employment: Full Time

Job Title: HPC Engineer
Location: Riyadh, Saudi Arab
Role Type : Permanent

The HPC Engineer works with the research scientists, engineers, and collaborates with technical leadership in the design, development, installation, and maintenance of software for the High Performance Computing (HPC) systems. The HPC Engineer is responsible for supporting the planning, implementation, availability, performance, security, maintenance, and repair of high_performance computing infrastructure.

Responsibilities
- Support day-to-day operations for the ML/AI team by monitoring computing resource performance, managing configurations, and addressing security administration.
- Apply revisions to system firmware and software.
- Engage and collaborate with vendors to assist with support activities as required.
- Develop new HPC software deployment plans, custom scripts, and testing procedures to ensure operational reliability for the AI researchers.
- Design, install, configure, and perform document management for cluster infrastructure, including operating systems, job schedulers, resource managers, provisioning managers, configuration managers, network devices, and other components for the HPC environment.
- Explore emerging technologies and technical developments to address expanding ML/AI requirements.
- Identify new services and develop implementation plans.
- Stay current with best practices in the HPC field.

REQUIREMENTS

Qualifications
- +3 years of experience designing & architecting Linux environments (specifically Linux, HPC).
- Bachelor's degree in computer science, software engineering, or a related field.
- Experience in managing/administering Linux and Windows server environments for scientific computing.
- Understanding of GPU and accelerator technologies.
- Experience of managing high volumes of servers.
- Experience with HPC cluster job schedulers such as SLURM, LSF.
- Working knowledge of cluster configuration managements tools such as Ansible, Puppet, Salt.
- Knowledge of some of the following: Kubernetes, GitLab, CI/CD, Docker, Grafana, Prometheus, etc

ABOUT THE COMPANY

We lead in the creation and delivery of innovative workforce solutions and services that enable our clients to win in the changing world of work.

ManpowerGroup powers the success of many of the world's most dynamic organizations. We deliver innovative workforce solutions that enhance competitiveness, increase efficiency and spur productivity. Combining global reach with local expertise - 3600 offices in over 80 countries - we know the changing world of work and bring a deep understanding of the companies we work for and the industries we service.

ManpowerGroup entered the Middle East in December 2007 after acquiring local company Clarendon Parker, thus bringing 15 years in-depth local knowledge combined with a global footprint and industry shaping expertise and thought leadership. Manpower Middle East supports clients in the Middle East and North Africa regions. Our business is aligned to key skill specializations to ensure our clients requirements are met by expert and knowledgeable consultants that understand your industry and role requirement.

Our consultants are experts in finding the right talent across all industries in a broad-range of occupations including:

  • IT & Telecommunications
  • Engineering & Construction, Oil & Gas
  • Banking, Finance & Legal
  • Sales & Business Development
  • Marketing, Public Relations & Communications
  • Human Resources & Training
  • Customer & Support Services (Secretarial and Administrative)
  • Operational, Supply Chain & Logistics
  • Executive Recruitment
  • Emiratization Solutions
  • Recruitment Program Outsourcing Solutions
  • Managed Service Provider Solutions
  • Talent Based Outsourcing Solutions
  • Outsourced Staffing Solutions

Advertise Here
INSTALL APP
×