HPC Installer with Linux and Network Expertise

Job Description:

We are seeking a highly skilled HPC Installer with expertise in Linux systems and network infrastructure to join our team. As an HPC Installer, you will be responsible for installing, configuring, and maintaining high-performance computing (HPC) systems and for ensuring their performance and reliability. Your proficiency in Linux operating systems and network architecture will be crucial to delivering successful HPC installations.

Responsibilities:

  • Install, configure, and optimize high-performance computing systems, including hardware components, operating systems, cluster management software, and job schedulers.
  • Collaborate with system administrators, researchers, and IT professionals to understand their requirements and tailor HPC installations to meet their needs.
  • Configure and manage network infrastructure, including switches, routers, and high-speed interconnects, ensuring efficient and reliable data transfer within the HPC environment.
  • Deploy and optimize storage systems, including parallel file systems, network-attached storage, and object storage, to meet the demands of HPC workloads.
  • Monitor system performance using appropriate tools, diagnose issues, and apply performance tuning techniques to maximize the efficiency and throughput of HPC applications.
  • Ensure security and access control mechanisms are in place, applying best practices for user authentication, network security, and data integrity within the HPC environment.
  • Document installation processes, system configurations, and any modifications, maintaining detailed records and writing clear reports for reference and future troubleshooting.
  • Stay up to date with advancements in HPC technology, Linux systems, and network architecture, continuously expanding your knowledge and skills to deliver cutting-edge solutions.

Requirements:

  • Bachelor’s degree in Computer Science, Information Technology, or a related field (or equivalent experience).
  • Proven experience in installing and configuring high-performance computing systems, preferably in a Linux environment.
  • Strong knowledge of Linux operating systems (e.g., CentOS, Ubuntu, Red Hat Enterprise Linux) and their administration.
  • Proficiency in network infrastructure, including Ethernet, InfiniBand, and other high-speed interconnects, with hands-on experience in configuring network switches and routers.
  • Experience in writing scripts (Bash, Python).
  • Knowledge of performance tuning techniques and tools to analyze and optimize HPC system performance.
  • Familiarity with security best practices in HPC environments, including user authentication, access control, and data protection.
  • Proficiency in system monitoring and troubleshooting using relevant tools.
  • Excellent documentation and reporting skills.
  • Strong problem-solving and communication skills, with the ability to work effectively in a team environment.

If you possess the required skills and are passionate about high-performance computing, Linux systems, and network architecture, we would love to hear from you. Join our team and contribute to the successful installation and operation of cutting-edge HPC systems.

Apply here.