Tulika Das

About

Building AI that works

Based in India with a deep focus on large language models, retrieval-augmented generation, and agentic AI systems. I enjoy the full arc of building — from designing pipelines and evaluating models to shipping clean, working code.

My recent work spans scalable LLM backends, memory-based systems, hybrid RAG architectures, and AI safety tools. I care about building things that are not just technically interesting but genuinely useful.

Experience

Where I've worked

Oct 2025 – Feb 2026

AI Engineer Intern

Tulu Health, Delhi

Built and optimized LLM prompt pipelines and multilingual Voice AI systems for healthcare workflows, improving response relevance by 25–30%.
Deployed Voice AI proof-of-concept systems integrating telephony platforms with a FastAPI backend, cutting demo setup time by 40%.
Designed a multi-agent orchestration MVP using LangGraph for hospital triage with emergency-priority routing, structured state transitions, and audit logging.

Jan 2025 – May 2025

AI Intern

Neuro Web Solutions, Rajkot

Implemented LangChain and a custom prompt engineering framework with 15+ templates for Algoace, an AI-powered programming assessment platform built for 1,000+ students.
Built a pseudocode evaluation system with multi-metric scoring across correctness, efficiency, and complexity dimensions.
Launched an adaptive student support feature that delivers personalized learning resources based on student performance.

Projects

Things I've built

Multi-Agent Complaint System

Problem

Urgent complaints get buried under low-priority tickets, causing missed escalations.

Solution

Multi-agent system with LangGraph + Llama-3 that auto-routes by urgency, escalates to human managers, and sends automated resolutions.

Result Benchmarked on 10-case golden dataset with 63% severity accuracy, 30% error rate; under-escalation identified as primary failure mode.

LangGraphLLaMA-3GeminiHITLPython

Code ▶ See it in Action

Autonomous Research Agent (ReAct)

Problem

Manual research is slow and fragmented — hours of reading to synthesize a single topic.

Solution

ReAct agent using LangGraph + Groq that autonomously searches multiple angles and compiles a structured 6-section report.

Result One prompt, structured multi-angle report in minutes.

LangGraphGroqReActPython

Code ▶ See it in Action

SheGuard AI

Problem

Existing safety tools for women are reactive, alerting after danger occurs rather than providing proactive, situational guidance.

Solution

AI safety advisor using Llama-3.3-70b + LangChain with Chain-of-Thought reasoning, age-specific safety tips, self-defense techniques, and relevant resources.

Result Rated 9–10/10 by real users across age groups from 7 to 30+, with feedback like "very helpful", "very informative", and "responses are presented in a structured way".

LLaMALangChainSafety AIPython

Code ▶ Live Demo

Production-Ready LLM Backend

Problem

Production LLM deployment needs more than API calls, teams need rate limiting, cost control, auth, and fallback logic from day one.

Solution

FastAPI backend with per-user rate limiting, token cost accounting, auth, and provider fallback, the unglamorous layer that makes LLMs production-worthy.

Result Teams skip infra firefighting entirely and ship features instead.

FastAPILLMPythonSystem Design

Code ▶ See it in Action

Recognition

Achievements

🏆 Distinguished Social Impact Innovation

Agentic AI Innovation Challenge 2025 · Ready Tensor

Built SheGuard, an AI-powered personal safety advisor for women, using LLaMA 3.3-70b, LangChain, and deployed on Hugging Face Spaces. Ranked in the top 38 out of 671 entries in the Distinguished Social Impact Innovation category.

LLaMALangChainGradioReady Tensor

Read Publication

🥇 HackerRank Orchestrate Hackathon

24-Hour Hackathon · HackerRank

Built a hybrid RAG support triage agent — support docs chunked at paragraph level, dual-indexed with FAISS and TF-IDF, fused via Reciprocal Rank Fusion, and filtered by company metadata so the LLM only sees relevant context. Output is structured JSON with automatic escalation for high-risk tickets, and the system checkpoints after every ticket so nothing is lost mid-run. Ranked 529 out of 1,349 participants.

RAGFAISSTF-IDFRRFPython

Source Code Leaderboard

Skills

What I work with

Core Stack

Python LangChain LangGraph FastAPI Prompt Engineering Agentic AI Multi-agent Systems

LLM & Retrieval

RAG RAGAS Evals ChromaDB FAISS HuggingFace Llama 3 Gemini Groq

Dev & Deployment

Gradio Streamlit HuggingFace Spaces GitHub Actions Git Docker

ML Foundations

Scikit-learn TensorFlow Pandas NumPy NLP

Blog

Blogs

RAG · Evaluation · RAGAS

My RAG Pipeline Looked Fine Until I Measured It

Built a RAG system, ran a few queries, called it done. Turns out that was just expensive vibe checking. Here's what RAGAS revealed about silent failures in retrieval and generation.

5 min read · Medium

Read Code

LangGraph · Multi-Agent · Production

LangGraph is the Easy Part. Here's the Hard 80%

Getting it to work locally was just the first step. This is what it actually takes to ship a multi-agent complaint routing system into the real world.

5 min read · Medium

Read

Tulika Das

Building AI that works

Where I've worked

Things I've built

Achievements

What I work with

Blogs

Academic background

Let's Build Together