Senior Storage Engineer required by one of the most exciting, well-resourced AI infrastructure startups in the world. The company works in close partnership with NVIDIA and other key organisations shaping the future of data centres and AI infrastructure.
As a Senior Storage Engineer, you will operate, optimize, and scale distributed storage systems powering some of the world’s most advanced AI infrastructure. This is a highly hands-on role working with InfiniBand fabrics and AI-optimized storage platforms, delivering the performance and reliability required for large-scale GPU workloads.
What we offer
- Salary up to $275,000 + 20% Bonus
- Huge equity upside
- Remote US
Responsibilities
- Operate and support production storage platforms powering large-scale AI workloads, including ETL.
- Maintain performance, stability, and reliability across customer environments
- Monitor and tune storage systems to ensure predictable throughput and low latency.
- Troubleshoot end-to-end I/O issues across GPU clients, RDMA networks (InfiniBand or RoCE), and storage infrastructure.
- Plan and execute upgrades, expansions, and maintenance with minimal disruption.
- Support customer onboarding, including storage configuration, namespaces, and access controls.
- Run performance validation and benchmarking
- Own incidents, lead root cause analysis, and improve reliability through automation and documentation.
Requirements
- Strong Linux systems experience operating storage infrastructure in production environments.
- Hands-on experience with high-performance or distributed storage systems supporting large-scale AI or HPC clusters.
- Deep understanding of storage architectures including parallel file systems, file, object, and block storage (e.g. Lustre, VAST, DDN).
- Experience troubleshooting end-to-end I/O performance across clients, RDMA networks (InfiniBand or RoCE), and storage systems.
- Experience analyzing and optimizing storage performance, including benchmarking, reliability, and data protection concepts.
- ETL and integrations supporting AI/ML workloads.
Benefits
- Medical, dental, and vision insurance for the employee and family
- Equity Scheme
- Bonus
- 401(k) with a generous employer match
- Company-paid Life Insurance
- Flexible Spending Account
- Mental Wellness Benefits
- Flexible PTO
Why Join?
They’re at the forefront of the AI industry, building bleeding-edge infrastructure that enables fully secure, scalable AI-ready solutions – propelling the entire AI landscape. Joining at this early stage (just two years in), you’ll have the opportunity to work hands-on with world-class GPU infrastructure, learn from senior experts, and grow as the platform scales globally.
Interested in learning more? Apply today.
Are you looking for remote jobs near your area? At Yulys, thousands of employers are looking for exceptional talent like yours. Find a perfect job now.