Company Internal DatabaseSensitive database containing information about clinical patients etc.

Open-Source Medical/Biology data Available(Benmort reports, MIMIC Clinical Database, Pubmed Reports)

Pre-processingWe want to pre-process our unstructured medical data to reduce tokens and clean unnecessary info as much as possible.

Passing the Multi-Source Multi-Hop Query

AI Models Engineassigning different models for different tasks (e.g., a fast model for drafting, a powerful model for strategic decisions)

Qwen | Mistral AI | LLaMA by Meta | Open Weight GPT-OSS Models

Standard Operating Procedure (SOP)Our planner agent will create a JSON-based SOP for our RAG agentic system, which defines how, when, and what actions to take.

Multi-Agentic RAGBased on the plan a team of agents goes to work, each querying its own dedicated knowledge store

Criteria Synthesizer AgentAll the findings from the specialist agents are collected, and consolidated all information and into a formal report

Multiple new set of rules to update the planning design and perform the evaluation again

Pareto 5D EvaluationThe generated document is run through a rigorous, multi-dimensional evaluation system that produces a 5-point report card.

Scientific Rigor, Compliance, Ethics | Recruitment Feasibility | Operational Simplicity

Performance Diagnostician agentThis high-level agent analyzes the scores and identifies the single biggest weakness (e.g., “The feasibility score is only 0.39, which is too low.”).

SOP Architect agentIts job is to intelligently re-write the rules (the GuidSOP) the Inner Loop follows. Based on the diagnosis, it proposes 2-3 new, “mutated” SOPs.

Scientific ToolsTools that Bridge Agentic Reasoning with Real-World Scientific Data

Medical Sub-agentsMedical Sub agents System for Autonomous Scientific Discovery

LangGraphMulti Agentic based Medical Thinking System

Knowledge Base Data PipelineData Pipeline to Transform Raw Agentic Traces into Specialized, Algorithm-Ready Datasets

Agentic Training Architecture

Evaluation of Finetuned EngineWe evaluate Finetuned based agentic architecture on three factors, qualitative, quantitative, and performance.

LLM judges by faithfulness, relevance, soundness, and depth.Quantitative eval measures retrieval precision and recall.Performance eval tracks latency (time) and cost (tokens) per query.

RL AlgorithmsRL Algorithms for training different sub-agents

Monitoring the Training ProcessMonitoring the training/Inferencing process of ai agents

Office