F
Foundry is hiring a
Site Reliability Engineer, Supply
About Foundry
Foundry is building the future of AI infrastructure with our Cloud Platform, providing self-serve access to high-performance GPU compute for training, fine-tuning, and serving AI models. We’re simplifying infrastructure for dynamic AI workflows, enabling AI practitioners to focus on innovation, not infrastructure. We’re well-funded ($80M, Series A), growing quickly, and looking for talented people to join our team.
Job Description
Manage GPU provisioning, spot bidding, and node pool health across clouds and on-prem. Work on the systems behind our global GPU fleet.Location
Onsite
Salary
Not Specified
Benefits
Not Specified
Similar Jobs
GPUProvisioningSpot BiddingNode Pool Health
📍 Onsite in Palo Alto or San Francisco, CA
99% match
GPUSchedulingFault-tolerantExecutionJob DAGs
📍 Onsite in Palo Alto or San Francisco, CA
92% match
InfrastructureMLCustomer-facing
📍 Onsite in Palo Alto or San Francisco, CA
91% match
Product ManagementML Infrastructure
📍 Onsite in Palo Alto or San Francisco, CA
89% match