Agentic AI Workflows: A Production Playbook
Most agentic AI demos break in production. This is the four-stage playbook I use to design agents that actually ship: Perceive, Plan, Act, Reflect — with real examples and the failure modes nobody warns you about.
Demos lie. The autonomous agent that booked your flight in a YouTube video falls over the second a real user asks it to reschedule. I have shipped multi-agent systems for healthcare triage at CareBow and claims automation at REDO; this is the playbook that actually survives production.
TL;DR
- An agent is just an LLM with a loop and tools. Treat it that way.
- Use the Perceive → Plan → Act → Reflect loop to architect every agent.
- Default to single-agent. Add a second agent only when there are at least two distinct skill sets the model needs to switch between.
- Build the human-in-the-loop checkpoint first, not last.
- Cost and latency will kill you before quality does. Budget both before you write the first prompt.
Stage 1: Perceive
Define every input the agent receives:
- The user prompt (raw, untrusted)
- Retrieved context (RAG)
- Tool outputs from the previous turn
- Memory from prior sessions
- System instructions
The failure mode here is context pollution — stuffing too much irrelevant text into the prompt and watching quality collapse. Your job is the opposite of "give it everything." Curate.
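Curation can be mechanical. Below is a minimal sketch of a budget-aware context assembler; the `ContextItem` type, source names, and character budget are illustrative assumptions, not a prescribed API:

```python
from dataclasses import dataclass

@dataclass
class ContextItem:
    source: str    # e.g. "system", "memory", "rag", "tool_output", "user"
    text: str
    priority: int  # lower = more important; assumed convention

def assemble_context(items: list[ContextItem], budget_chars: int) -> str:
    """Keep the highest-priority items until the budget is spent; drop the rest."""
    kept, used = [], 0
    for item in sorted(items, key=lambda i: i.priority):
        if used + len(item.text) > budget_chars:
            continue  # drop whole items rather than truncating mid-thought
        kept.append(item)
        used += len(item.text)
    # Re-sort into a stable prompt order regardless of priority
    order = {"system": 0, "memory": 1, "rag": 2, "tool_output": 3, "user": 4}
    kept.sort(key=lambda i: order.get(i.source, 99))
    return "\n\n".join(f"[{i.source}]\n{i.text}" for i in kept)
```

The point is the shape, not the heuristic: every input competes for a fixed budget, and low-value context is dropped explicitly instead of silently degrading the prompt.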
Stage 2: Plan
Decide how the agent reasons. Three architectures, in order of complexity:
- Single-agent. One LLM, one prompt, a list of tools. 90% of production use cases.
- Supervisor + workers. A planner agent that delegates to specialist agents. Use this when sub-tasks need different system prompts.
- Open-ended multi-agent. Many agents talking to each other. Almost never the right answer outside of research demos.
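The single-agent case is small enough to sketch in full. Here is a minimal loop skeleton; `call_model` and `run_tool` are hypothetical stand-ins for your provider's chat API and your tool executor:

```python
def call_model(messages: list[dict], tool_names: list[str]) -> dict:
    """Stand-in for a real LLM call. A real implementation would hit your
    provider's chat API and return either a tool call or a final answer."""
    return {"type": "final", "content": "done"}

def run_tool(name: str, args: dict) -> str:
    """Stand-in tool executor."""
    return f"{name} result"

def agent_loop(user_prompt: str, tools: dict, max_turns: int = 5) -> str:
    """One LLM, one prompt, a list of tools -- looped with a hard stop."""
    messages = [{"role": "user", "content": user_prompt}]
    for _ in range(max_turns):
        step = call_model(messages, list(tools))   # Plan
        if step["type"] == "final":
            return step["content"]
        result = run_tool(step["name"], step.get("args", {}))  # Act
        messages.append({"role": "tool", "content": result})   # Perceive next turn
    return "max turns exceeded"  # never let the loop run unbounded
```

Note the `max_turns` cap: an agent loop without a hard stop is the first thing that burns your cost and latency budget.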
Stage 3: Act
The agent calls tools. Two rules:
- Structured outputs only. Define a JSON schema for every tool input and output. Free-text agent output is a debugging nightmare.
- Tool count discipline. Past 7 tools, agents start picking the wrong one. Cluster tools into namespaces or split into sub-agents.
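Both rules reduce to one gate in code: every tool call passes a hard validation step before execution. A minimal sketch, using illustrative tool names and a deliberately simple schema format (production code would typically use JSON Schema):

```python
# Assumed example tools; in practice this registry mirrors your real tool set.
TOOL_SCHEMAS = {
    "lookup_claim": {"claim_id": str},
    "send_email": {"to": str, "body": str},
}

class ToolCallError(ValueError):
    pass

def validate_tool_call(name: str, args: dict) -> None:
    """Reject hallucinated tool names and malformed arguments before executing."""
    if name not in TOOL_SCHEMAS:
        raise ToolCallError(f"unknown tool: {name!r}")
    schema = TOOL_SCHEMAS[name]
    missing = set(schema) - set(args)
    if missing:
        raise ToolCallError(f"{name}: missing args {sorted(missing)}")
    for key, typ in schema.items():
        if not isinstance(args[key], typ):
            raise ToolCallError(f"{name}: {key} must be {typ.__name__}")
```

A rejected call goes back to the model as an error message, not to the tool; the agent usually self-corrects on the retry.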
Stage 4: Reflect
The checkpoint that separates demo from product:
- Confidence-based routing: high-confidence outputs auto-execute, low-confidence go to a human review queue with the agent's reasoning attached.
- Failure logging: every refusal, hallucination, and tool error gets stored with the full trace.
- Feedback loop: human corrections become eval examples, which become prompt improvements.
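Confidence-based routing is a few lines once the agent emits a confidence score. A minimal sketch; the threshold, field names, and in-memory queue are assumptions to keep the example self-contained:

```python
REVIEW_THRESHOLD = 0.7   # tune per domain; below this, a human sees it first
review_queue: list[dict] = []  # stand-in for a real review queue/table

def route(output: dict) -> str:
    """Auto-execute confident outputs; escalate the rest with reasoning attached."""
    if output["confidence"] >= REVIEW_THRESHOLD:
        return "auto_execute"
    review_queue.append({
        "output": output,
        "reasoning": output.get("reasoning", ""),  # reviewers see why, not just what
    })
    return "human_review"
```

The queue entries double as raw material for the feedback loop: each human correction becomes an eval example.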
Real Production Examples
CareBow symptom triage. The agent classifies a patient query into one of four care levels (self-care, teleconsult, in-home visit, emergency). Confidence under 0.7? Routes to a clinician with a structured context packet, not a raw LLM trace.
REDO claims automation. A classification agent processes 500+ claims/day. The agent never auto-denies — it only auto-approves clear cases or escalates with a one-paragraph explanation. Result: 40% ops cost reduction without a single wrongful denial.
Failure Modes Nobody Warns You About
- Latency death spirals. A four-tool agent with 2-second tool latencies takes 10+ seconds end to end. Users abandon at 3 seconds.
- Context window creep. Agent loops accumulate context. By turn 5 you are paying for a 30k-token prompt.
- Tool hallucination. Agents will invent tool names. Hard-validate every tool call against your schema.
- Eval rot. Your eval set goes stale fast. Refresh 10% of it monthly.
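Context window creep has a cheap mitigation: trim the message history every turn, keeping the system prompt and the most recent turns. A minimal sketch, assuming a simple role/content message format and a character budget as a proxy for tokens:

```python
def trim_history(messages: list[dict], max_chars: int = 8000) -> list[dict]:
    """Keep the system prompt plus as many recent turns as fit the budget."""
    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]
    kept, used = [], 0
    for m in reversed(rest):  # walk newest-first so recent turns win
        if used + len(m["content"]) > max_chars:
            break
        kept.append(m)
        used += len(m["content"])
    return system + list(reversed(kept))  # restore chronological order
```

In production you would count tokens rather than characters and summarize dropped turns rather than discard them, but the invariant is the same: the prompt cannot grow without bound just because the loop keeps running.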
Frequently Asked
What is an agentic AI workflow?
An agentic AI workflow is an LLM-based system that loops over a perceive-plan-act-reflect cycle, calls external tools, and adapts its next step based on prior outputs — as opposed to a single one-shot prompt.
When should you use multi-agent systems vs single-agent?
Default to single-agent. Use multi-agent only when sub-tasks require materially different system prompts or skill sets. Open-ended agent-to-agent communication is rarely the right answer in production.
How do you handle hallucinations in agentic systems?
With confidence-based routing and human-in-the-loop checkpoints. High-confidence outputs auto-execute, low-confidence outputs are escalated to a human reviewer with the full agent trace attached as context.
Manvendra Kumar
Senior AI Product Manager · Pittsburgh, PA. Founder of CareBow. 5+ years shipping production AI platforms — LangChain, agentic workflows, 500+ daily claims automated.