Mellis

Research Scientist, Foundation Model

Mellis · Palo Alto, CA
Palo Alto, CA $185K–$400K Posted 2026-05-16
Salary
$185K–$400K
Type
Full-time
Experience
5+ yr

About the Role

At Pika, we are pioneering the next generation of creative infrastructure built around real-time, multimodal generation and intelligent agentic platforms. We are seeking accomplished Research Scientists in Foundation Models with expertise in pre-training and mid-training large-scale multimodal foundation models to advance our mission of making agentic, real-time generative technology accessible and transformative for millions of creators. This is a staff and lead-level opportunity.

As a key member of our research team, you will design and implement core technologies, develop new methodologies for large-scale multimodal pre-training/mid-training (text, image, audio, and video), and drive innovative approaches for foundational model architecture. You will collaborate closely with engineering and product teams, shaping the future of real-time creative and agentic platforms at scale.

What You’ll Do

  • Lead research and development on pre-training and mid-training of multimodal foundation models at scale.
  • Design and prototype novel algorithms and architectures for high-fidelity, real-time multimodal synthesis and interaction across modalities.
  • Focus on scalable data pipeline curation and model training strategies for broad, diverse, and sensory-rich datasets.
  • Advance state-of-the-art techniques in diffusion, autoregressive, and other generative models for large-scale pre-training and fine-tuning.
  • Identify, create, and leverage large, high-quality cross-modal datasets.
  • Bring research advancements into production-ready systems in collaboration with engineering and product teams.
  • Publish work in top-tier conferences and journals, and clearly communicate research both internally and externally.
  • Stay at the forefront of foundational model and real-time multimodal AI research.

What We’re Looking For

  • 5+ years of research experience in large-scale pre-training/mid-training of multimodal foundation models (LLMs, VLMs, Audio LMs, or similar), ideally at the staff or lead scientist level.
  • Track record as a first author on major publications in top conferences or journals (e.g., NeurIPS, ICML, ICLR).
  • Extensive hands-on experience with large-scale multimodal model design, training, and deployment.
  • Deep understanding and implementation experience with generative architectures (diffusion, autoregressive, cross-modal, etc.).
  • Expertise in high-throughput, scalable dataset curation and model pipeline optimization for multimodal applications.
  • Strong programming and prototyping skills (Python, PyTorch, TensorFlow, etc.) and experience deploying research into production systems.
  • Excellent communication and collaboration skills, and a passion for building creative enabling technology.

What We Offer

  • Competitive salary and substantial equity in a high-growth startup
  • Full health benefits + 401k matching and more
  • Collaborative, mission-driven team environment with major growth opportunities
  • Flexible on-site/remote hybrid (HQ in Palo Alto, CA)

About Pika

Pika empowers creators by building state-of-the-art agentic and multimedia platforms. Our vision is to break down technical barriers to creativity, making real-time generative and intelligent orchestration accessible to all. Join us and help shape the next evolution of creative technology!

If you are a leading researcher excited to build and scale real-time multimodal foundation models, we want to hear from you.

PythonPyTorchTensorFlow
$105K — 10th pctl $265K — 90th pctl
This role’s midpoint $292K vs. market median $180K for Data & ML roles
+60%
above median
Based on 2,000+ Data & ML roles with disclosed salary ranges tracked on NewJob.
E
Multimodal LLM Researcher
Palo Alto, CA
Engineering
$185K–$400K
E
Real-time Video Researcher
Palo Alto, CA
Engineering
$185K–$400K
D
Research Scientist, Data
Palo Alto, CA
Data & ML
$185K–$400K
See all 10+ roles at Mellis →
W
Senior Research Scientist, Foundation Model for Simulation
Waymo Mountain View, CA Hybrid
Data & ML
$204K–$259K
L
Applied Research Scientist, Agents
Labelbox San Francisco, CA
Data & ML
$250K–$300K
P
Applied AI Scientist, Small Language Model and AI Training
Postman San Francisco, CA
Data & ML
$218K–$288K
S
Senior Research Scientist, Battery Materials Simulation
SandboxAQ United States Remote
Data & ML
$168K–$252K
See all Data & ML roles →

Interested in this role?

Apply directly on the company site — no recruiter middleman, no account required.

Apply now →
Apply on company site