Open to new projects — Next slot: May 2026

Himanshu Kumar Singh

Senior Software Consultant

I help engineering teams ship production-grade AI systems - from architecture design to cloud deployment. 10 years. 50+ projects. Zero hand-holding.

India (UTC+5:30) · Remote Worldwide

About Me

Companies come to me when they need to move fast — a working AI product, a backend that won't buckle under load, or a cloud setup that doesn't cost a fortune to run.

Over 10 years I've architected and shipped systems across e-commerce, road safety, supply chain, and AI/ML — from a 96× workflow speedup at a startup to enterprise RAG platforms used by global engineering teams. I work independently, which means you get a senior engineer on every call, not a rotating cast of juniors.

My stack is Python-first, cloud-native, and AI-ready. I don't hand you a proof of concept — I hand you production code with tests, docs, and a CI/CD pipeline.

Who I work with

  • Startups that need to move from MVP to scale without re-architecting everything
  • Product teams adding AI/LLM features to existing systems
  • Engineering managers who need an outside architecture review before committing to a direction
  • International companies that need a reliable, async-friendly senior engineer in IST timezone

10

Years Experience

50+

Projects Delivered

AI/ML

Specialist

Global

Remote Clients

Consulting Services

End-to-end software consulting for businesses worldwide. From architecture design to production deployment.

Core ServicesAlso Available
Core Service

AI & LLM Solutions

Production-ready RAG pipelines, LLM integrations, and intelligent agents — not demos. I handle retrieval architecture, prompt engineering, evaluation, and deployment so your AI feature actually works at scale.

RAG
LangChain
OpenAI
Chatbots
Core Service

Backend Architecture

FastAPI and Django APIs built to handle real traffic. I design for observability, testability, and the day your load doubles — with documentation your team can actually use.

FastAPI
Django
Microservices
APIs
Core Service

Cloud & DevOps

AWS and GCP infrastructure provisioned with Terraform, containerised with Docker, and shipped through automated CI/CD pipelines. Zero-downtime deployments, cost-aware design.

AWS
GCP
Terraform
Docker
Also Available

Technical Consulting

Architecture reviews, stack selection, performance audits, and code quality assessments. I give you an honest external opinion — not one shaped by what's easiest for me to build.

Architecture
Code Review
Mentoring
Strategy
Also Available

Full-Stack Development

End-to-end web applications — Python backend, React/Next.js frontend, cloud deployment. Useful when you need one person who can own the full vertical.

React.js
Next.js
Full-Stack
SaaS
Also Available

Data Engineering

ETL pipelines, database optimisation, and real-time processing built on PostgreSQL, Redis, and RabbitMQ. Data that arrives clean, on time, and queryable.

PostgreSQL
Redis
ETL
Pipelines

Engagements start at a free 30-minute discovery call. I work on project-based and monthly retainer models — scoped after we've talked through your goals.

Skills & Technologies

Languages & Frameworks
Python
Django
Flask
FastAPI
Bash
PHP
JavaScript
React.js
Next.js
AI & Machine Learning
LLM
OpenAI
LangChain
RAG
MCP
PyTorch
HuggingFace Transformers
Cloud & DevOps
AWS (EC2, RDS, ECS, S3, Bedrock)
GCP
Docker
Docker Compose
Terraform
Ansible
CircleCI
Databases & Messaging
PostgreSQL
MySQL
MongoDB
Neo4j
Redis
RabbitMQ
Pinecone
Data Science & Media
NumPy
Pandas
Matplotlib
OpenCV
FFmpeg
Architecture
Multithreading
Multiprocessing
Distributed Systems
GraphQL
REST API

Professional Experience

Independent Consultant
Senior Software Consultant
February 2024 – Present

Enterprise Knowledge Intelligence Platform (RAG)

Technologies: FastAPI, LangChain, OpenAI API, Neo4j, Pinecone, BM25, MMR, MongoDB, Next.js

Helped an enterprise engineering team stop losing hours searching GitHub and Confluence by natural language — built the AI search layer from scratch.

  • Architected a hybrid RAG platform that queries GitHub and Confluence knowledge using natural language
  • Implemented code summarization and Neo4j knowledge graph relationships to improve retrieval precision
  • Built LangChain pipelines with BM25 + MMR retrieval, query rewriting, and context compression
  • Delivered a Next.js developer portal with CI/CD on GitHub Actions

Hotel Intelligence MCP Server

Technologies: FastAPI, MCP, LangChain, HuggingFace Transformers, Playwright, BeautifulSoup, PostgreSQL

Built an AI-callable hotel search and pricing tool that travel agents can query in plain English rather than navigating multiple booking interfaces.

  • Designed an MCP server exposing hotel search, pricing comparison, and availability as AI-callable tools
  • Engineered a resilient scraping backend with retry logic, anti-bot handling, and rate limiting
  • Integrated transformer-based intent parsing to resolve ambiguous user travel queries
  • Persisted pricing history and availability windows for comparative and trend analysis

Unified Distributed Operations Portal

Technologies: FastAPI, React.js (Microfrontend), GCP Firestore, Pub/Sub, Cloud Functions, AWS, Terraform, Auth0, AVP

Designed a multi-team operations platform that lets independently owned services plug in without requiring a shared codebase or central deployment team.

  • Architected a microservices and microfrontend platform for independently onboarded team services
  • Built schema-governed configuration pipelines with Apache Avro and event-driven propagation
  • Implemented cross-cloud identity with Auth0 and policy-based authorization using Amazon Verified Permissions
  • Provisioned and shipped distributed modules with Terraform and CircleCI pipelines

Credentialing & Verification Platform

Technologies: FastAPI, Next.js, MongoDB, AWS IAM Identity Center, Terraform, CircleCI

Built the backend and client portal for a professional credentialing workflow — replacing a manual verification process with automated orchestration.

  • Built credential lifecycle APIs and verification orchestration workflows on FastAPI
  • Integrated enterprise SSO with AWS IAM Identity Center for secure client onboarding
  • Developed a responsive Next.js portal for submission, tracking, and compliance views
  • Automated infrastructure and release workflows across environments
UST Global, Pune
Senior Software Engineer II
February 2023 – February 2024

Admin and Tooling Platform

Technologies: FastAPI, GCP, AWS, Terraform, CircleCI, React.js, SQLAlchemy

Internal admin platform and tooling infrastructure for a large technology services firm, spanning GCP and AWS environments.

  • Led architecture design sessions and code reviews for a team of 6 engineers
  • Achieved >80% test coverage across core application modules
  • Provisioned multi-environment GCP infrastructure with Terraform, reducing manual setup time by ~70%
  • Reduced deployment cycle from manual releases to fully automated CircleCI pipelines
R System International, Noida
Senior Software Engineer
October 2022 – February 2023

SCREIM (Supply Chain Resilience Evaluation, Integration & Monitoring)

Technologies: Django, AWS, Terraform

Supply chain resilience monitoring system — tracking disruption signals across supplier networks and surfacing risk indicators to operations teams.

  • Led architectural design discussions for a supply chain risk monitoring system used across multiple supplier tiers
  • Developed core Django modules for disruption signal ingestion and risk scoring
  • Owned sprint planning and server deployment workflows across staging and production environments
Idemia Syscom India Pvt Ltd, Noida
Senior Software Engineer
June 2021 – October 2022

MestaCompact (Road Safety Device)

Technologies: Flask, Python, Matplotlib, OpenCV, FFmpeg, NumPy, React.js

Road safety device platform — real-time video and image processing pipeline for detecting road hazards and driver behaviour on European highways.

  • Architected real-time image and video processing pipelines handling continuous road footage using OpenCV and FFmpeg
  • Led code reviews and drove adoption of processing optimisations that reduced pipeline latency
  • Packaged and deployed the application to embedded road safety hardware units
Mall91 / Ongraph Technologies, Noida
Software Engineer → Team Lead
April 2019 – June 2021

3Automation.com (RPA Solution)

Technologies: Flask, Django, AWS, PostgreSQL, Tornado, Ansible, Docker, Redis, RabbitMQ

RPA (Robotic Process Automation) platform — a bot execution engine that replaced manual business workflows with automated task queues.

  • Led a team of 9 engineers as Team Lead
  • Built the bot engine from scratch, managing frontend, backend, and execution engine
  • Cut workflow execution time from 48 hours to 30 minutes (96× improvement) by redesigning the execution engine with Python multithreading, Pandas pipelines, and TinyDB — eliminating the single-threaded bottleneck in the bot runner
  • Architected distributed task queue system with Redis and RabbitMQ
Webkul Software Pvt Ltd, Noida
Software Engineer
July 2016 – April 2019

Multi-channel Connector & Prestashop Odoo Bridge

Technologies: Python, Odoo, PHP, JavaScript, jQuery

E-commerce integration modules connecting online storefronts to ERP systems for international retail clients.

  • Built and shipped Python/PHP integration modules connecting Prestashop storefronts to Odoo ERP for 10+ international clients
  • Managed deployment and customisation cycles across client environments with varying Odoo versions
  • Delivered e-commerce connector solutions serving clients across Europe and the Middle East
Selected outcomes

Proof from recent engagements

These are representative delivery outcomes, used here instead of unverifiable testimonials.

3Automation.com

96× faster workflow execution

Cut a 48-hour processing job down to 30 minutes by redesigning the execution engine with Python multithreading, Pandas pipelines, and TinyDB.

Independent consultant

Enterprise RAG search layer

Built the retrieval and knowledge graph layer that let engineering teams search GitHub and Confluence in natural language.

Independent consultant

Cross-cloud operations portal

Delivered a platform spanning GCP and AWS with Auth0 and policy-based authorization for independently owned services.

Education & Achievements

ABES Engineering College
Ghaziabad, India

Bachelor of Technology

Computer Science Engineering

August 2012 – August 2016

How We Work

From Idea to Deployment

A battle-tested 6-step process built for clarity, speed, and production-grade results — every time.

STEP 01

Discovery Call

Free 30-min call to map your goals, constraints, timeline, and budget. No jargon — just clarity.

Project scope document
STEP 02

Proposal & Architecture

Full technical architecture — stack selection, system design, API contracts, cloud infra plan, and phased roadmap.

Architecture doc + roadmap
STEP 03

Agile Development

1–2 week sprints with daily updates. Clean, documented, test-covered code following SOLID principles.

Working software each sprint
STEP 04

Testing & QA

Unit, integration, load tests and security audits. Nothing ships without a passing test suite.

Test reports + coverage badge
STEP 05

CI/CD & Deployment

Automated pipelines, Docker containers, zero-downtime rollouts to AWS or GCP.

Live production environment
STEP 06

Monitoring & Support

Logging, alerts, and performance dashboards post-launch. Ongoing optimisation as you grow.

Runbook + monitoring setup
Get In Touch

Start a Conversation

Have a project in mind? Let's discuss how I can help you achieve your goals.

Selected Outcomes
96× faster
Workflow execution time cut at 3Automation
Enterprise RAG
Search layer for GitHub and Confluence knowledge
Cross-cloud
Distributed ops platform with Auth0 and AVP

Why Work With Me?

  • 10 years building systems that go to production, not decks
  • Clients across international time zones
  • End-to-end ownership: I don't hand off — I ship
  • Async-first communication, weekly written updates included
  • Code with tests, docs, and a handover runbook — always
  • I stay available post-launch for the questions that come up at 2am
Book a Free 30-min Discovery Call
IST (UTC+5:30) · Flexible hours for global clients