Senior AI Infrastructure Engineer (f/m/x)
IT IS INFRASTRUCTURE FROM PERSON TO PERSON.
SHARE YOUR PASSION.
An innovative corporate culture in a group as multifaceted as the BMW Group thrives on complex systems and networks. With good ideas, enthusiasm, and team spirit, our IT experts develop distinctively smart and modern systems. They benefit from sufficient budgets as well as standardised processes that help them implement solutions more efficiently. This makes it possible to build an IT landscape that creates new opportunities and thereby secures the foundation of our corporate culture and our success.
We shape the future of domain-specific AI systems at the BMW Group by designing, training, and operating new foundation models. Our team sets standards for safe and scalable AI in engineering and production.
What awaits you?
- You will design, build, and operate GPU‑centric AI infrastructure (especially NVIDIA) across on‑prem and cloud environments, with a strong focus on performance, scalability, and efficiency.
- As part of your role, you take ownership of the architecture and operation of high‑performance compute environments for distributed training and optimised model execution.
- By optimizing compute, storage, and high‑performance networking (e.g., InfiniBand, NCCL), you enable large‑scale AI workloads in industrial contexts.
- You are responsible for developing and operating core infrastructure components such as scheduling and resource management systems (e.g., SLURM, Ray, Run:ai), ensuring efficient utilization of shared GPU resources.
- Using modern tooling, you build and maintain automated, reproducible infrastructure (e.g., Docker, Kubernetes, Terraform, Ansible, CI/CD).
- You contribute to BMW-specific AI use cases by providing reliable and scalable infrastructure.
- Your role is rounded off by taking technical ownership of the AI infrastructure stack, defining best practices, and guiding less experienced engineers.
What should you bring along?
- University degree in Computer Science, Computer/Electrical Engineering or related subjects.
- 8–10 years of professional industry experience building and operating AI and HPC infrastructure.
- Strong hands-on experience with GPU systems (especially NVIDIA), including drivers, CUDA, and performance optimisation.
- Experience with distributed systems and high‑performance networking (e.g. InfiniBand, NCCL), combined with experience in cloud environments (AWS, Azure) alongside on‑prem infrastructure.
- Practical experience with resource scheduling and workload orchestration (e.g., SLURM, Ray, NVIDIA Run:ai).
- Strong experience in infrastructure automation (e.g., Docker, Kubernetes, Terraform, Ansible, CI/CD) and proficiency in Python for infrastructure and system-level tooling.
- Nice to have: experience with training, fine-tuning, or serving ML models in production, as well as exposure to large-scale industrial AI use cases (e.g. simulation, robotics, engineering).
Are you excited to innovate in the mobility of tomorrow? Apply now!
Note: Please apply exclusively online via our career portal. Applications through other channels (especially email) cannot be considered.
What do we offer?
- Challenging projects with which we shape the mobility of tomorrow together.
- Wide range of personal and professional development opportunities.
- Attractive, fair and performance-related remuneration.
- High level of job security.
- Annual special payments such as vacation pay, Christmas bonus, and profit sharing.
- Flexible working hours, six weeks of annual leave, and overtime compensation.
- Discounted conditions for BMW & MINI.
- Many other benefits at bmw.jobs/benefits
Earliest starting date: immediately
Type of employment: permanent
Working hours: full-time
If you apply, the next steps in the selection process include an online test and subsequent interviews with the hiring manager (either virtually or in person).
You can find helpful tips on your application and the application process here.
At the BMW Group, we place great importance on equal treatment and equal opportunities. Our recruiting decisions are based on the personality, experience, and skills of the applicants. Learn more here.