G
Gimlet Labs

Member of Technical Staff - AI Research

San Francisco, CA Posted 2025-03-31
Type
Full-time
Experience
8+ yr
Source
Ashby
About Us

Gimlet is building the next generation of AI infrastructure: large-scale AI datacenters and the orchestration platform that coordinates them.

The future of AI will require vastly more compute than exists today. But as AI workloads become more complex and new hardware architectures emerge, simply deploying more GPUs isn't enough. The challenge is making increasingly diverse compute work together.

Gimlet's platform intelligently partitions and routes workloads across heterogeneous hardware, enabling step-function improvements in performance and efficiency. Customers deploy through production-grade APIs without needing to think about hardware selection, placement, or optimization.

We work with foundation labs, hyperscalers, and AI-native companies to power production workloads at massive scale and help define the infrastructure layer for the future of AI.

ABOUT THE ROLE

Gimlet Labs is seeking an Member of Staff focused on AI Research (Intern). As an AI Researcher (Intern), you will be evaluating and implementing techniques to drive performance and quality optimizations across the latest AI models. The research team is responsible for exploring new model architectures and experimenting with novel inference efficiency techniques such as KV caching and FlashAttention. The team will design and prototype frameworks leveraging fine-tuning and knowledge distillation to push the boundaries of model performance.

WHAT YOU WILL WORK ON

- Monitoring and evaluating cutting-edge AI research

- Researching ways to improve model accuracy, performance and efficiency

- Prototyping frameworks with the latest fine-tuning and distillation techniques 

YOU MAY BE A GOOD FIT IF

- Currently pursuing degree in computer science, engineering, or comparable area of study

- Experience with AI/ML or distributed systems.

STRONG CANDIDATES MAY ALSO HAVE

- Experience with PyTorch, TensorFlow, ONNX and other AI frameworks

- Familiarity with distributed systems and orchestration frameworks (e.g., Kubernetes)

- Software development experience with Python and C++

- Understanding of the latest AI research and techniques

WHAT MAKES GIMLET DIFFERENT

At Gimlet, you will work on infrastructure problems that span the full stack of modern AI systems. Our team operates across datacenters, networking, distributed systems, compilers, runtimes, orchestration, and performance engineering to build the foundation for the next generation of AI infrastructure.

As an early member of the team, you will have significant ownership, work alongside highly technical engineers, and help shape both the systems we build and how we scale the company.

We value people who are excited to work across domains, take ownership of meaningful problems, and build technology that enables the next generation of AI.
PyTorchTensorFlowKubernetesPythonC++
Gimlet Labs is hiring for the member of technical staff - ai research role. NewJob aggregates active openings directly from Gimlet Labs's applicant tracking system, so this listing is current. More jobs at Gimlet Labs →
Apply on company site