Open to Remote Roles

Senior AI Engineer

Building AI Systems
That Ship to Production

I design and deliver production multi-agent systems — from LangGraph orchestration and MCP server development, to system prompt engineering and agent evaluation pipelines. I care about agents that work in the real world, not just in demos.

Multi-Agent Systems MCP Servers Prompt Engineering Agent Evaluation

10+

AI Systems Built

Years in AI/ML

MCP Integrations

View Work Let's Connect

🐍 Python

🔗 LangGraph

⚡ FastAPI

🤖 MCP

Core Expertise

Where I Deliver Value

Deep specialization across the full lifecycle of production AI systems

Agent Architecture

Designing and shipping production multi-agent pipelines using LangChain and LangGraph — with robust state machines, tool routing, and failure handling.

Multi-agent orchestration
Autonomous agent design
LangGraph state machines
Tool use & routing logic

MCP Server Development

Building custom MCP (Model Context Protocol) servers and integrating enterprise tools — giving LLMs structured, reliable access to real systems.

Custom MCP server build
Tool schema design
Jira / Confluence / GitHub
Playwright-based scrapers

Prompt Engineering

Crafting production-grade system prompts — with precise output formatting, structured reasoning chains, and versioned prompt management.

System prompt design
Chain-of-thought prompting
Structured output control
Prompt versioning

Agent Evaluation

Designing eval frameworks that catch regressions before production — LLM-as-judge pipelines, tracing, scoring, and behavioral reliability testing.

Eval framework design
LLM-as-judge pipelines
Regression testing
Tracing & observability

RAG & LLM Infrastructure

End-to-end RAG pipelines — from embedding strategy and chunking to retrieval optimization and vector store management at scale.

RAG pipeline design
Vector DB management
Embedding strategies
Retrieval optimization

Cloud & API Backend

Production-grade FastAPI backends and cloud deployments on AWS — containerized, CI/CD-ready, and built to scale with AI workloads.

FastAPI microservices
AWS / GCP deployment
Docker & CI/CD
Async API design

Featured Projects

Things I've Shipped

Production AI systems delivering measurable impact

Featured

AI-Powered Invoice Processing & Chat System

Automated invoice processing using multimodal GenAI with a conversational interface — cutting processing time from hours to minutes per batch.

LangChain Gemini Multimodal Flask PDF Parsing

75% Cost Reduction · Batch Processing

Featured

WebAI: Intelligent Website Assistant

24/7 AI-powered customer assistant for businesses — trained on real website content, with real-time knowledge base updates and natural conversation flow.

Next.js LangChain FAISS Vector Search

24/7 Support · 80% Efficiency Gain

ML-Powered Supply Chain Forecasting

Enterprise ML pipeline for supply chain optimization — 92% accuracy, containerized deployment with full CI/CD.

Python Scikit-learn Flask Docker

92% Accuracy · 25% Cost Reduction

Enterprise Knowledge Assistant

RAG chatbot integrated with Slack — instant access to org knowledge with 95% retrieval accuracy and 1000+ queries/week.

RAG Slack API OpenAI FAISS

1000+ Queries/Week · 95% Accuracy

Enterprise Sales AI Agent

AWS Bedrock conversational AI for sales automation — full serverless architecture built and deployed in 48 hours.

AWS Bedrock Lambda S3 API Gateway

48hr Build · Serverless Architecture

Technical Skills

Tech Stack

Tools and technologies I use to build production AI systems

Languages & Frameworks

Python

FastAPI

PostgreSQL

Pandas / NumPy

Agent & LLM Stack

LangChain / LangGraph

MCP Servers

Prompt Engineering

Agent Evaluation

RAG Systems

Cloud & DevOps

AWS (EC2, S3, Lambda)

Docker

CI/CD · GitHub Actions

Grafana / Loki

LLM Models & Tools

OpenAI / Anthropic Claude

Gemini

Ollama / vLLM

Vector DBs (FAISS, Chroma)

About Me

I build AI systems that work in production

Senior AI Engineer with 3+ years building and deploying intelligent systems. My core focus is production multi-agent architecture — specifically agent orchestration, MCP server development, precise prompt engineering, and the eval frameworks that make it all trustworthy enough to ship.

I've shipped everything from invoice-parsing chatbots and enterprise RAG systems to Slack-integrated knowledge assistants and cloud-native AI agents on AWS Bedrock. I care deeply about the gap between "it works in a notebook" and "it works in prod" — and I've spent years closing that gap.

Production-First Mindset

Every system I design accounts for failure modes, deterministic routing, and observability from day one.

MCP & Tooling Expert

Built custom MCP servers integrating Jira, Confluence, GitHub, Grafana, and LMS platforms for real enterprise workflows.

Evaluation-Driven Development

I don't ship agents without evals. LLM-as-judge, regression suites, and tracing are part of every delivery.

Let's Work Together Resume

Get In Touch

Let's Build Something Real

Open to remote Senior AI Engineer roles and interesting agent / LLM infrastructure projects

Email

akshaykumarbedre.bm@gmail.com

Phone

(+91) 9164623536

akshay-kumar-bm

GitHub

akshay-kumar-bm

Building AI Systems That Ship to Production

Where I Deliver Value

Agent Architecture

MCP Server Development

Prompt Engineering

Agent Evaluation

RAG & LLM Infrastructure

Cloud & API Backend

Things I've Shipped

AI-Powered Invoice Processing & Chat System

WebAI: Intelligent Website Assistant

ML-Powered Supply Chain Forecasting

Enterprise Knowledge Assistant

Enterprise Sales AI Agent

Tech Stack

Languages & Frameworks

Agent & LLM Stack

Cloud & DevOps

LLM Models & Tools

I build AI systems that work in production

Production-First Mindset

MCP & Tooling Expert

Evaluation-Driven Development

Let's Build Something Real

Email

Phone

LinkedIn

GitHub

Technologies Used:

Key Features:

Building AI Systems
That Ship to Production