Looks like you have JavaScript disabled. For the full Twine experience, you will need to re-enable it.

Find Freelance Jobs>AI Engineer Jobs>AI Engineer Jobs in Bengaluru>Job Details

Evnek Technologies Pvt Ltd - Senior AI Engineer

AI Engineer

💰 Negotiable

📍 Bengaluru, India

Twine Jobs

Based in Manchester, United Kingdom

Last online 6 months ago

AI Engineer is needed in Bengaluru, India.

Client: Evnek Technologies Pvt Ltd

Location: Bengaluru, KA, in

Contract: Freelance

Job Description

This is a remote position.

Job Title: Senior AI Engineer

Experience: 7–8 Years

Location: Remote

Timings: 5:30 PM – 2:30 AM IST

Notice Period: Immediate Joiners Only

We are hiring a highly skilled Senior AI Engineer to design, build, and scale enterprise-grade document intelligence systems powered by self-hosted LLM infrastructure. This is a hands-on individual contributor role focused on architecting and operating production-ready OCR-driven pipelines that process long-form, document-heavy workloads (200+ pages) with deterministic, auditable outputs.

You will partner closely with AI leadership while independently owning architecture decisions, system optimization, and production reliability across document processing and LLM inference systems.

Key Responsibilities

1. Enterprise Document Intelligence Architecture

Architect and implement end-to-end pipelines for OCR-heavy, long-form document processing.
Lead PDF ingestion, layout-aware parsing, and multi-page document reconstruction.
Design chunking, segmentation, and metadata tracking strategies for large-scale workflows.
Implement deterministic retry mechanisms, exception handling, and failure recovery systems.
Optimize processing frameworks to handle 200+ page documents at scale with high reliability.
Ensure system scalability, performance consistency, and operational stability under heavy document loads.

2. OCR & Structured Data Extraction Engineering

Integrate and optimize OCR engines such as Tesseract, PaddleOCR, and layout-aware or vision-language models.
Build layout-sensitive extraction systems leveraging bounding boxes and structural metadata.
Implement deterministic schema validation, cross-field consistency checks, and rule-based validation layers.
Minimize manual QA dependency through automated validation and control frameworks.
Maintain traceability from every extracted field to its original document span for auditability.

3. Self-Hosted LLM Infrastructure (Production Ownership)

Deploy and manage open-source LLMs using vLLM and Hugging Face TGI.
Operate GPU-backed serving infrastructure for large-scale inference workloads.
Optimize inference performance via KV cache strategies, intelligent batching, context window tuning, and throughput-latency balancing.
Monitor GPU utilization and improve cost efficiency per inference request.
Own uptime, observability, and reliability of LLM production environments.

4. Deterministic Validation & Governance Systems

Design validation layers external to LLM outputs.
Build schema enforcement engines, invariant checks, rule systems, and rejection logic.
Implement automated exception routing without default human review.
Ensure reproducibility, audit trails, and measurable correctness guarantees.
Establish reliability standards for high-stakes and compliance-driven use cases.

5. Production Engineering & Scalability

Architect systems capable of managing large document volumes and high concurrency.
Handle complex failure states with robust fallback and recovery mechanisms.
Build observability frameworks including structured logging, distributed tracing, and metrics monitoring.
Improve monitoring, alerting, and operational tooling across AI document pipelines.
Collaborate cross-functionally to deliver production-grade AI solutions.

Requirements

6+ years of strong Python-based backend engineering experience.
Proven expertise in building OCR-driven document intelligence systems in production.
Experience handling long-form PDFs (100+ pages).
Hands-on experience with:
vLLM or Hugging Face TGI
GPU-backed LLM serving infrastructure
Open-source LLMs such as LLaMA, Qwen, Mistral, etc.
Experience designing deterministic validation systems (schema + rule enforcement).
Strong debugging, systems-thinking, and architectural decision-making skills.
Ability to clearly communicate technical trade-offs and business impact.

Strongly Preferred

Experience with layout-aware models such as LayoutLM, DocFormer, or vision-language architectures.
Demonstrated experience optimizing GPU utilization and inference cost efficiency.
Background in regulated industries (finance, healthcare, compliance).
Familiarity with document-intensive workflows like underwriting, loan processing, or insurance claims.

Posted a month ago

No longer accepting applications

Get instant notifications for new AI Engineer jobs. Enter your email:

How It Works
🔍
Get quality leads
Review job leads for free, filter by local or global clients, and get real time notifications for new opportunities.
🎉
Apply with ease
Pick the best leads, unlock contact details, and apply effortlessly with Twine's AI application tools.
📈
Grow your career
Showcase your work, pitch to the best leads, land new clients and use Twine’s tools to find more opportunities.
Sign up for free

- Hire an expert AI Engineer
- Hire a AI Engineer in Bengaluru