dutchstartup.ai

Vacancy

HPC System Engineer

Full-time · Posted on 14 Jun 2026

What you'll do

About this role

# Systems Engineer (Cloudmeter)

Nebius is seeking a highly skilled Systems Engineer (Cloudmeter) to support benchmarking of GPU platforms for machine learning and AI workloads. You will play a critical role in evaluating the performance of GPU-based hardware for various deep learning and AI frameworks, enabling data-driven decisions for platform optimization and next-generation hardware development.

## Responsibilities:

  • Work closely with hardware and development teams to profile and analyze GPU performance at the system and kernel level
  • Evaluate and compare GPU performance across different platforms, architectures, and software stacks (e.g., CUDA, ROCm)
  • Perform acceptance testing for new GPU clusters, ensuring hardware and software meet performance, stability, and compatibility requirements for AI workloads
  • Perform experiments across diverse GPU system configurations to assess the impact of varying interconnect strategies and system-level optimizations on performance and scalability

## Requirements:

  • Proficient in Unix/Linux, plus Python and Bash for automation
  • Good understanding of the GPU stack: CUDA, NCCL, drivers, and relevant libraries
  • Proven ability to troubleshoot complex system issues including hardware, software, and networking problems
  • Familiarity with containerized environments (e.g., Docker, Kubernetes)

## Desirable:

  • Experience with modern deep learning frameworks (PyTorch, JAX, vLLM, TensorRT-LLM)
  • Experience with job schedulers and resource managers (Slurm, Volcano, etc.)

Skills & experience

Linux, Unix, Python, Bash, CUDA, NCCL, Docker, Kubernetes, PyTorch, JAX, vLLM, TensorRT-LLM, Slurm, Volcano, ROCm
Where you'll work

About Nebius Group

Nebius Group, based in Amsterdam, is a technology company focused on delivering full-stack AI cloud infrastructure. The company offers GPU clusters, cloud platforms, and developer tools for managing the full machine learning lifecycle, from data processing to fine-tuning and inference.

More at this company

More vacancies at Nebius Group

  • Senior Software Engineer (Token Factory) · Full-time
  • Technical Product Manager - Soperator · Full-time
  • AI/ML Specialist Solutions Architect · Full-time
  • Staff / Principal Applied AI Researcher (Agentic Search) · Full-time
  • HPC System Engineer · Full-time
  • ML Infrastructure Engineer · Full-time