Operations Engineer - Linux / HPC
We’re partnering with a global, research-driven technology organisation operating at significant scale across high-performance compute environments. They’re looking for a hands-on Linux Operations Engineer who genuinely enjoys production responsibility and problem solving.
This role is operations-first - focused on keeping complex HPC systems running reliably in a demanding, 24/7 environment. If you enjoy deep systems work, incident ownership, and supporting real users with real workloads, this role will suit you well.
What You’ll Be Doing
- Provide front-line operational support for 24/7 Linux HPC compute, storage, and networking environments
- Own incidents end-to-end: triage, troubleshooting, root cause analysis, and resolution
- Respond to alerts and system issues in a timely and structured manner
- Participate in planned maintenance windows (evenings or weekends on a rotating basis)
- Support global infrastructure projects across compute, storage, and interconnects
- Write tooling and automation to diagnose issues and reduce operational overhead
- Work across multiple codebases and languages to support and extend operational tooling
- Implement and maintain performance, fault, and health monitoring systems
- Develop and improve internal documentation and operational runbooks
- Work closely with internal users, infrastructure teams, and external vendors
- Participate in an on-call rotation and scheduled maintenance coverage
- Adhere to all cybersecurity, hardware, and software usage policies
Skills & Experience
- 2+ years’ experience operating Linux systems in production environments
- A clear interest in operations as a primary job function
- Strong troubleshooting and root cause analysis skills
- Proficiency in at least one scripting or programming language (e.g. Python, Go, C)
- Ability to learn new tools and technologies quickly
- Strong written and verbal communication skills
- Comfortable working in a fast-paced, high-expectation environment
- Reliable and predictable availability for on-call and maintenance rotations
If you’re an Operations Engineer with strong Linux and HPC experience, this is a role worth exploring.
Visa sponsorship and relocation support available.