IT Infrastructure Expert (f/m/d) for Machine Learning / MR

Location

Erlangen, Bayern, Germany

Job type

Full-time

About the job

This job is sourced from a job board.

About the role

Do you want to help create the future of healthcare? Our name, Siemens Healthineers, was selected to honor our people who dedicate their energy and passion to this cause. It reflects their pioneering spirit combined with our long history of engineering in the ever-evolving healthcare industry. We offer you a flexible and dynamic environment with opportunities to go beyond your comfort zone in order to grow personally and professionally. Sounds interesting? Then come and join our global team as IT Infrastructure Expert (f/m/d) for Machine Learning in Magnetic Resonance Imaging (MRI).

Choose the best place for your work: within the scope of this position, it is possible, in consultation with your manager, to work remotely (within Germany) for up to an average of 60% of your working hours.

Your tasks and responsibilities:

You administer and maintain a machine learning GPU cluster consisting of NVIDIA DGX systems.
You manage Kubernetes clusters for balancing workloads across multiple nodes.
You develop and maintain a cloud-based GPU training infrastructure.
You manage storage servers (local and cloud) and ensure seamless connectivity to the GPU cluster.
You develop tooling to improve our ML Ops pipeline.
You monitor system performance, troubleshoot issues as they arise, and support users in optimizing training performance and efficiency.
You ensure that the infrastructure in use, both local and cloud, complies with the organization's security policies and the product development process.
You collaborate with data scientists and engineers to optimize the cluster for machine learning workloads.
You perform regular system updates and maintenance tasks.
You oversee data storage, backup, versioning, and recovery processes.

Your qualifications and experience:

You hold a university degree in Computer Science, Mathematics, Engineering, or an equivalent field.
You have several years of professional experience in IT infrastructure administration, preferably within medical imaging or healthcare.
Experience in managing and maintaining GPU clusters, ideally NVIDIA DGX systems, is beneficial.
You have strong knowledge of and hands-on experience with Kubernetes for container orchestration.
You are proficient in Linux system administration, including shell scripting and automation.
You have experience using cloud infrastructure and basic knowledge of Windows system administration.
You understand network configurations, protocols, and troubleshooting.
You have experience with managing storage servers and ensuring connectivity to compute clusters.

Your attributes and skills:

Since the development teams of the Application Release Train are spread internationally across three locations, communication in English is not a problem for you.
We win together, which is why you are proactively committed to delivering the best possible solutions to our customers.
You present your ideas and results confidently and convincingly in cross-functional development teams.
Personally, you are characterized by your strong teamwork and cooperation skills as well as your persuasiveness.
The highest quality standards in product development are a matter of course for you.

About the company

Siemens Healthineers is a leading global medical technology company. 73,000 dedicated colleagues in over 70 countries are driven to shape the future of healthcare. An estimated 5 million patients across the globe benefit every day from our innovative technologies and services in the areas of diagnostic and therapeutic imaging, laboratory diagnostics and molecular medicine, as well as digital health and enterprise services.

Skills

Machine Learning
Kubernetes
Linux
Cloud
Windows
Network
Storage