Mirendil

Member of Technical Staff, Inference

Mirendil · San Francisco, CA
San Francisco, CA $350K–$500K Posted 2026-06-24
Salary
$350K–$500K
Type
Full-time
Experience
8+ yr

Mirendil

Mirendil is a tech-first company focused on solving core bottlenecks that unlock step-change acceleration across science and technology. Our first goal is to democratize frontier AI R&D across scientific disciplines. We believe accelerating scientific discovery is one of the most powerful ways to improve the future of humanity, and that AI will play a central role in making that possible.

We are building a frontier AI research company and training our own models end-to-end. Our work spans areas such as model training, reinforcement learning, reasoning systems, and infrastructure for large-scale experiments. Our team includes researchers and engineers from Anthropic, Google DeepMind, xAI, OpenAI, Microsoft, Apple, and MIT.

The Role

We are looking for an engineer to own the inference systems that power our models in production and research. You'll work across the full inference stack, from serving infrastructure down to hardware-level optimization. Some example areas you might work on (not limited to):

  • Design and build high-throughput, low-latency inference serving systems for frontier models, optimizing for both research iteration and production deployment
  • Optimize inference performance across GPU and accelerator hardware - maximizing FLOPs utilization, memory bandwidth, and compute efficiency for large-scale models
  • Enable and extend distributed inference frameworks (e.g. vLLM, SGLang, TensorRT-LLM) to support novel architectures, long-context workloads, and agentic inference patterns
  • Implement and validate inference-time optimizations: speculative decoding, quantization, KV cache management, and batching strategies
  • Build observability and reliability infrastructure so the team can measure latency, throughput, and cost across every serving configuration
  • Partner directly with teams to bring new model architectures and post-training techniques into production quickly

If you're excited about pushing the performance limits of frontier model inference, we'd love to hear from you.

We offer a base salary of $350,000–$500,000 USD and a meaningful equity grant, depending on experience and background, along with competitive benefits.

LLM
$65K — 10th pctl $230K — 90th pctl
This role’s midpoint $425K vs. market median $130K for Operations roles
+225%
above median
Based on 10,000+ Operations roles with disclosed salary ranges tracked on NewJob.
O
Member of Technical Staff, AI for AI Systems
San Francisco, CA
Operations
$350K–$500K
O
Member of Technical Staff, Agent Harness
San Francisco, CA
Operations
$350K–$500K
E
Member of Technical Staff, Design Engineer
San Francisco, CA
Engineering
$350K–$500K
See all 10+ roles at Mirendil →
C
Member of Technical Staff, Multilingual
Cohere London, UK Remote
Operations
$110K–$370K
N
Member of Technical Staff, Formal Verification
Netpreme Santa Clara, CA
Operations
P
Member of Technical Staff, Infrastructure
Physical Superintelligence Boston, MA Hybrid
Operations
S
Member of Technical Staff — Audio and Voice AI
Stuut Technologies San Francisco, CA
Operations
$220K–$320K
See all Operations roles →

Interested in this role?

Apply directly on the company site — no recruiter middleman, no account required.

Apply now →
Apply on company site