QUALIFICATIONS AND JOB DESCRIPTION
We are looking for a passionate Linux System Administration Specialist to be part of a large IT team supporting University main campus, and its extensions across Turkey with thousands of users with the following qualifications.This position is remote.
- Bachelor’s degree in computer science, engineering or equivalent combination of education and relevant experience
- Minimum 2-3 years of experience in Linux system administration (Redhat/CentOS)
- Understanding of server system hardware and Linux system internals
- Experience installing, troubleshooting, updating/patching, monitoring and maintenance of linux systems (RedHat/CentOS, Ubuntu)
- Ability to write scripts (Bash, Python) to automate administrative tasks
- Experience with SAN&NAS storages (Netapp, Dell etc)
- Experience with LDAP and DNS
- Experience in systems administration and automation, TCP/IP networking and virtualization
- Availability to travel for a limited number of on-site cluster system installations, maintenance or trade shows
- Ability to interact with internal engineering, tech support and sales teams
- Preferably, have experience in parallel file systems like BeeGFS and/or Lustre
- Preferably, have experience in the following technologies/software applications: Materials Studio, Matlab, Gaussian, Namd, OpenFoam, Ansy, Fluent, GNU ve Intel compilers, IntelMPI, MPICH, OpenMPI, MVAPICH and other MPI implementations
- Preferably, have experience and knowledge of InfiniBand network technology
- Preferably, have experience in architecting and supporting HPC System – SLURM
- Excellent written and verbal communications skills in English
JOB DESCRIPTION
- Manages and administers production systems used by researchers on a HPC cluster environment
- Maintain the HPC systems availability, but also create and document site procedures, system diagrams, and other configuration or support documents
- Installs, maintains, and troubleshoots Linux servers, both physical and virtual
- Provides issue support, installation support and solution support
- Provides user support: account maintenance, software installations, (OS) updates and upgrades, technical support for users.
- Manages setup and provisioning of storage servers (Beegfs)
- Initiates preventive maintenance and hardening on the systems as well as manage repair of system/environment problems
- Technical troubleshooting and performance monitoring.Analyzes system faults and troubleshooting and run diagnostic tests on operating systems and hardware to detect problems
- Provides training, support, installation and configuration assistance to researchers