Tulika Das

I build agentic AI systems that reason, act, and deliver — from multi-agent pipelines to RAG-powered solutions.

Open to AI Engineer roles

Tulika Das

About

Building AI that works

Based in India with a deep focus on large language models, retrieval-augmented generation, and agentic AI systems. I enjoy the full arc of building — from designing pipelines and evaluating models to shipping clean, working code.

My recent work spans scalable LLM backends, memory-based systems, hybrid RAG architectures, and AI safety tools. I care about building things that are not just technically interesting but genuinely useful.

Experience

Where I've worked

Oct 2025 – Feb 2026

AI Engineer Intern

Tulu Health, Delhi

  • Built and optimized LLM prompt pipelines and multilingual Voice AI systems for healthcare workflows, improving response relevance by 25–30%.
  • Deployed Voice AI proof-of-concept systems integrating telephony platforms with a FastAPI backend, cutting demo setup time by 40%.
  • Designed a multi-agent orchestration MVP using LangGraph for hospital triage with emergency-priority routing, structured state transitions, and audit logging.
Jan 2025 – May 2025

AI Intern

Neuro Web Solutions, Rajkot

  • Implemented LangChain and a custom prompt engineering framework with 15+ templates for Algoace, an AI-powered programming assessment platform built for 1,000+ students.
  • Built a pseudocode evaluation system with multi-metric scoring across correctness, efficiency, and complexity dimensions.
  • Launched an adaptive student support feature that delivers personalized learning resources based on student performance.

Projects

Things I've built

Multi-Agent Complaint System

Multi-Agent Complaint System

Problem

Urgent complaints get buried under low-priority tickets, causing missed escalations.

Solution

Multi-agent system with LangGraph + Llama-3 that auto-routes by urgency, escalates to human managers, and sends automated resolutions.

Result Benchmarked on 10-case golden dataset with 63% severity accuracy, 30% error rate; under-escalation identified as primary failure mode.
LangGraphLLaMA-3GeminiHITLPython
Research Agent

Autonomous Research Agent (ReAct)

Problem

Manual research is slow and fragmented — hours of reading to synthesize a single topic.

Solution

ReAct agent using LangGraph + Groq that autonomously searches multiple angles and compiles a structured 6-section report.

Result One prompt, structured multi-angle report in minutes.
LangGraphGroqReActPython
SheGuard AI

SheGuard AI

Problem

Existing safety tools for women are reactive, alerting after danger occurs rather than providing proactive, situational guidance.

Solution

AI safety advisor using Llama-3.3-70b + LangChain with Chain-of-Thought reasoning, age-specific safety tips, self-defense techniques, and relevant resources.

Result Rated 9–10/10 by real users across age groups from 7 to 30+, with feedback like "very helpful", "very informative", and "responses are presented in a structured way".
LLaMALangChainSafety AIPython
LLM Backend

Production-Ready LLM Backend

Problem

Production LLM deployment needs more than API calls, teams need rate limiting, cost control, auth, and fallback logic from day one.

Solution

FastAPI backend with per-user rate limiting, token cost accounting, auth, and provider fallback, the unglamorous layer that makes LLMs production-worthy.

Result Teams skip infra firefighting entirely and ship features instead.
FastAPILLMPythonSystem Design

Recognition

Achievements

🏆 Distinguished Social Impact Innovation

Agentic AI Innovation Challenge 2025 · Ready Tensor

Built SheGuard, an AI-powered personal safety advisor for women, using LLaMA 3.3-70b, LangChain, and deployed on Hugging Face Spaces. Ranked in the top 38 out of 671 entries in the Distinguished Social Impact Innovation category.

LLaMALangChainGradioReady Tensor
Read Publication

🥇 HackerRank Orchestrate Hackathon

24-Hour Hackathon · HackerRank

Built a hybrid RAG support triage agent — support docs chunked at paragraph level, dual-indexed with FAISS and TF-IDF, fused via Reciprocal Rank Fusion, and filtered by company metadata so the LLM only sees relevant context. Output is structured JSON with automatic escalation for high-risk tickets, and the system checkpoints after every ticket so nothing is lost mid-run. Ranked 529 out of 1,349 participants.

RAGFAISSTF-IDFRRFPython

Skills

What I work with

Core Stack

Python LangChain LangGraph FastAPI Prompt Engineering Agentic AI Multi-agent Systems

LLM & Retrieval

RAG RAGAS Evals ChromaDB FAISS HuggingFace Llama 3 Gemini Groq

Dev & Deployment

Gradio Streamlit HuggingFace Spaces GitHub Actions Git Docker

ML Foundations

Scikit-learn TensorFlow Pandas NumPy NLP

Blog

Blogs

RAG · Evaluation · RAGAS

My RAG Pipeline Looked Fine Until I Measured It

Built a RAG system, ran a few queries, called it done. Turns out that was just expensive vibe checking. Here's what RAGAS revealed about silent failures in retrieval and generation.

5 min read · Medium

Read Code

LangGraph · Multi-Agent · Production

LangGraph is the Easy Part. Here's the Hard 80%

Getting it to work locally was just the first step. This is what it actually takes to ship a multi-agent complaint routing system into the real world.

5 min read · Medium

Read

Education

Academic background

Sept 2023 – June 2025

MSc in Data Science

Vellore Institute of Technology, Amaravati

Aug 2019 – July 2022

BSc in Mathematics

Ispat Autonomous College, Rourkela

Contact

Let's Build Together

tulika.das105@gmail.com

Send a message