See all roles

Member of Technical Staff, Inference (Bay Area, Remote)

Work from home Full-time role Hiring

What You’ll Do Build low-latency inference pipelines for on-device deployment, enabling real-time next-token and diffusion-based control loops in robotics Design and optimize distributed inference systems on GPU clusters, pushing throughput with large-batch serving and efficient resource utilization Implement efficient low-level code (CUDA, Triton, custom kernels) and integrate it seamlessly into high-level frameworks Optimize workloads for both throughput (batching, scheduling, quantization) and latency (caching, memory management, graph compilation) Develop monitoring and debugging tools to guarantee reliability, determinism, and rapid diagnosis of regressions across both stacks What You’ll Bring Deep experience in distributed systems, ML infrastructure, or high-performance serving (8+ years) Production-grade expertise in Python, with strong background in systems languages (C++/Rust/Go) Low-level performance mastery: CUDA, Triton, kernel optimization, quantization, memory and compute scheduling Proven track record scaling inference workloads in both throughput-oriented cluster environments and latency-critical on-device deployments System-level mindset with a history of tuning hardware–software interactions for maximum efficiency, throughput, and responsiveness Apply To This Job

You might like

Member of Technical Staff, Training (Bay Area, Remote)

Work from home Full-time role

Marketing Analyst (Attribution Focus) (Promova)

Work from home Full-time role

Student and Family Experience Manager (Immediate Opening)

Work from home Full-time role

Customer Sales Representative (remote work)

Work from home Full-time role

Account Manager Industrial Markets Region: France - Africa

Work from home Full-time role

VP of Engineering

Work from home Full-time role

Member of Technical Staff, Foundation Models (Bay Area)

Work from home Full-time role

Member of Technical Staff, Data Agent (Bay Area, Remote)

Work from home Full-time role

Member of Technical Staff, Platform (Bay Area, Remote)

Work from home Full-time role

Account Manager Industrial Markets Region: Europe - Middle Eas

Work from home Full-time role

Sports Performance Coach Part time

Work from home Full-time role

Experienced Remote Chat Support Specialist – Flexible Work Schedule & Competitive Hourly Rate

Work from home Full-time role

Experienced Data Entry Specialist – Remote Opportunity with arenaflex

Work from home Full-time role

Experienced Customer Service Representative – Work from Home Opportunity at arenaflex

Work from home Full-time role

Experienced Live Chat Support Representative – Entry-Level Opportunity at arenaflex

Work from home Full-time role

Patient Access Scheduler 1, BHMG Scheduling, FT 8A-4:30P

Work from home Full-time role

Experienced Live Chat Customer Service Representative – Remote Work Opportunity at arenaflex

Work from home Full-time role

Experienced Customer Service Representative – Entry-Level Opportunity for Remote Work at arenaflex

Work from home Full-time role

Experienced Senior Customer Support Lead – Driving Customer Satisfaction and Operational Efficiency at arenaflex

Work from home Full-time role

Experienced Customer Service Associate – Retail Store Operations and Customer Experience

Work from home Full-time role