Figure AI

Staff AI Inference and Acceleration Engineer

Figure AI · San Jose, CA
San Jose, CA $180K–$275K Posted 2026-06-26
Salary
$180K–$275K
Type
Full-time
Experience
8+ yr

Figure is an AI robotics company developing autonomous general-purpose humanoid robots. The goal of the company is to ship humanoid robots with human level intelligence. Its robots are engineered to perform a variety of tasks in the home and commercial markets. Figure is headquartered in San Jose, CA.

We are looking for a Staff AI Inference & Acceleration Engineer to join the Platform Software team and own the on-board inference architecture for Figure’s humanoid robots. You will be the technical authority on how AI workloads are mapped, optimized, and executed across the robot’s compute hardware — driving down power consumption and cost while meeting the strict latency and reliability demands of a real-time autonomous system.

Responsibilities:

  • Own the on-board inference architecture — mapping models to available accelerators (NPU, GPU, DSP, CPU) based on latency, power, and memory budgets.
  • Partition inference workloads across heterogeneous compute resources, balancing real-time performance with power and thermal constraints.
  • Define and maintain a system-level compute budget across all inference tasks running on the robot.
  • Evaluate next-generation acceleration hardware and contribute to the definition of future compute platform requirements.
  • Optimize inference toolchains end-to-end — from model export through runtime execution — for target hardware.
  • Apply quantization (INT8, INT4, mixed-precision), pruning, operator fusion, and other compression techniques to reduce compute, memory, and power footprint.
  • Profile inference pipelines to identify and eliminate bottlenecks in latency, memory bandwidth, and power consumption.
  • Optimize kernel scheduling, memory layout, and data movement across the compute hierarchy.
  • Partner closely with the AI/ML team to define model architecture constraints that are hardware-friendly from the outset.
  • Work with the Platform Software team on runtime integration, scheduling, and power management.
  • Engage with silicon vendors and research teams to track the accelerator landscape and influence hardware roadmaps.

Requirements:

  • M.S. or Ph.D. in Computer Engineering, Electrical Engineering, Computer Science, or a related field — or equivalent industry experience.
  • At least 8 years of industry experience in hardware acceleration, ML systems, or compute architecture.
  • Deep understanding of AI/ML inference — model formats (ONNX, TFLite, etc.), inference runtimes, and deployment pipelines.
  • Hands-on experience optimizing models for edge or embedded hardware using quantization, pruning, and operator-level tuning.
  • Strong understanding of computer architecture — memory hierarchies, data movement, and heterogeneous compute.
  • Experience profiling and benchmarking inference workloads across CPU, GPU, NPU, DSP.
  • Familiarity with low-level toolchains and compilation frameworks (e.g. TVM, MLIR, TensorRT, Torch, SNPE/QNN, JAX, CUDA, ROCm).
  • Solid software engineering skills in C++ and Python.
  • Strong cross-functional communication skills — able to work effectively across hardware, software, and AI/ML teams.

Bonus Qualifications:

  • Knowledge of real-time operating constraints and their impact on inference scheduling.
  • Track record of co-designing model architectures with ML teams to meet hardware constraints.

The US base salary range for this full-time position is between $180,000 - $275,000 annually.

The pay offered for this position may vary based on several individual factors, including job-related knowledge, skills, and experience. The total compensation package may also include additional components/benefits depending on the specific role. This information will be shared if an employment offer is extended.

PythonC++
$110K — 10th pctl $265K — 90th pctl
This role’s midpoint $227K vs. market median $185K for Engineering roles
+20%
above median
Based on 14,000+ Engineering roles with disclosed salary ranges tracked on NewJob.
Figure AI

Figure AI

Robotics · Series C · San Jose, USA

Stage & Valuation
Series C · $39B
Key Investors
Microsoft, Brookfield Asset Management, ARK Invest
Open roles on NewJob
Most hiring in
Engineering (25) · Data & ML (12) · Operations (11)
Figure AI develops autonomous general-purpose humanoid robots designed to address labor shortages and assist in unsafe working environments. The company focuses on integrating artificial intelligence with advanced robotics for commercial and industrial applications.
Robotics Artificial Intelligence Hardware
D
Helix AI Engineer, Android
San Jose, CA
Data & ML
$150K–$400K
D
Helix AI Engineer, Backend Infrastructure
San Jose, CA
Data & ML
$150K–$400K
E
AI Training Infrastructure Engineer – Humanoid Whole Body Control
San Jose, CA
Engineering
$200K–$350K
See all 35+ roles at Figure AI →
A
Staff AI Research Engineer
Agility Robotics Fremont, CA Hybrid
Engineering
$216K–$338K
A
Senior Staff Vehicle Hardware Verification and Validation Engineer Lead
Aurora Innovation Mountain View, CA
Engineering
$189K–$274K
A
Staff AI Embedded Software Engineer - Connected Devices
Axon Enterprise Seattle, WA
Engineering
$168K–$270K
C
Inference System & Performance Engineer - Member of Technical Staff
Callosum Technologies London, UK
Engineering
$101K–$192K
See all Engineering roles →

Interested in this role?

Apply directly on the company site — no recruiter middleman, no account required.

Apply now →
Apply on company site