A
ABBYY Solutions

Senior Machine Learning Engineer - AI-Assisted Data Annotation

Bangalore, India Posted 2026-05-21
Type
Full-time
Experience
5+ yr
Source
Greenhouse
Join ABBYY and be part of a team that celebrates your unique work style. With flexible work options, a supportive team, and rewards that reflect your value, you can focus on what matters most – driving your growth, while fueling ours.
Our commitment to respect, transparency, and simplicity means you can trust us to always choose to do the right thing.
As a trusted partner for purpose-built AI and intelligent automation, we solve highly complex problems for our enterprise customers and put their information to work to transform the way they do business.  Over 10,000 customers trust ABBYY, including many Fortune 500 ones. You will work on further developing a portfolio already containing client names such as DHL, Johnson & Johnson, FDA, DMV, PwC, KeyBank, Spotify, and H&R BLOCK. About the Role  
We are seeking a  Senior Machine Learning Engineer – AI-Assisted Data Annotation  to own the automated annotation track within ABBYY’s  Document AI Data team .  
This role sits at the intersection of  large model capabilities and production data engineering , leveraging LLMs and vision-language models to generate high-quality training data at scale. You will design and build  AI-assisted annotation pipelines , ensuring outputs are accurate, measurable, and reliable for downstream model training.  
This is an ideal role for engineers who combine  deep model expertise with strong system-building instincts  and thrive in fast-moving, experimental environments.  
Key Responsibilities  
Technical Development & Innovation  


• Design and implement  AI-powered annotation pipelines  using large models to generate ground truth labels at scale  



• Develop and refine  prompting strategies, few-shot examples, and fine-tuning approaches  to improve accuracy and consistency  



• Build systems for  label verification, confidence scoring, and quality validation  



• Evaluate which tasks are suitable for  automated annotation vs. human review , and define decision criteria  



• Create  evaluation frameworks  to benchmark automated annotations against human-labeled data  



• Continuously improve annotation quality using feedback from human review workflows  

Project Ownership & Leadership  


• Own the automated annotation track  end-to-end , from architecture through production monitoring  



• Drive technical decisions across  model selection, pipeline design, and validation strategies  



• Define integration points with  platform infrastructure and model serving systems  



• Collaborate with Data Operations to design  human-in-the-loop workflows  for efficient review  



• Contribute to roadmap planning with Principal-level technical leadership  

Infrastructure & Scale  


• Build and optimize  large-scale inference pipelines  for processing millions of documents  



• Implement monitoring and alerting for  quality degradation and system failures  



• Design batching, caching, and fallback mechanisms to balance  cost, throughput, and accuracy  



• Collaborate with Platform teams on  model serving, APIs, and infrastructure scaling  



• Maintain clear documentation of  annotation strategies, metrics, and known limitations  

Qualifications  
Education & Experience  


• MS or PhD in Computer Science, Engineering, Mathematics, or related field  



• 5+ years of experience in  Machine Learning / AI , with focus on:   



• Large Language Models (LLMs)  



• Vision-Language Models (VLMs)  



• Data annotation or labeling systems  



• Demonstrated success using  large AI models to automate annotation at production scale  



• Strong background in  evaluation design and quality measurement  

Technical Expertise  


• Deep expertise in  LLMs and VLMs , including prompting, instruction tuning, and output evaluation  



• Strong understanding of  document understanding tasks  (classification, extraction, layout analysis, semantic parsing)  



• Experience designing  label quality metrics , confidence scoring, and agreement analysis  



• Strong programming skills in  Python  and proficiency with  PyTorch or similar frameworks  



• Experience with  large-scale inference pipelines and model serving systems  



• Familiarity with  human-in-the-loop annotation systems  and automation trade-offs  

Leadership & Communication  


• Proven ability to independently own complex technical workstreams  



• Strong collaboration with  data operations, platform, and modeling teams  



• Ability to clearly communicate  quality trade-offs and system behavior  to diverse stakeholders  



• Rigorous, data-driven problem-solving approach  

Here are some of our local benefits:  


• Comprehensive medical, accidental, and life insurance  



• Weekly wellness sessions to support your physical and mental well-being  



• A generous paid time off policy  

 
 



Join ABBYY, and you will:
Love how you work










• We provide remote and hybrid working options to fit all lifestyles.

• We use flexible hours across most of our teams to allow you to find your own definition of balance.

• Encouraging a culture of giving, we provide two paid volunteering days off every year so you can take time to contribute to the causes you care about.

• To ensure your family is cared for, we offer paid parental leave in all our locations.









Love whom you work with










• We are a global team of 600+ colleagues, spread across 15 countries on four continents.

• With colleagues representing 30+ nationalities, our workforce reflects the world.

• Innovation and excellence run through our veins. Our teams gather the expertise which has garnered ABBYY more than 140 technology patents.

• We are guided by the values of respect, transparency, and simplicity.

• "Team Environment" is in the top three highest-scoring drivers of engagement across all of our departments.









Love what you work on


• We are a company with more than 35 years of experience in the technology market;

• Over 10,000 customers trust ABBYY, including many Fortune 500 ones, with names such as DHL, Johnson & Johnson, FDA, DMV, PwC, KeyBank, Spotify, and H&R BLOCK;

• We have modernized the capture market by creating the first low-code/no-code IDP platform.

• Our Machine Learning, Natural Language Processing, Computer Vision Technologies, and a marketplace built with AI, can transform any document in any process;

• Top Analyst firms recognize ABBYY's market leadership, including Gartner, Everest PEAK Matrix ® Assessment, ISG Intelligent Automation Lens, and NelsonHall, amongst others.

ABBYY is an Equal Employment Opportunity employer that values the strength that diversity brings to the workplace. To learn more about our commitment to Diversity and Inclusion, check out the careers section on our website.
PythonPyTorch
ABBYY Solutions is hiring for the senior machine learning engineer - ai-assisted data annotation role. NewJob aggregates active openings directly from ABBYY Solutions's applicant tracking system, so this listing is current. More jobs at ABBYY Solutions →
Apply on company site