Vaidya.ai

Introduction

A single, OpenAI-compatible endpoint for healthcare AI - symptom checking, wellness management, emergency assist, patient journey support, and more.

What is Vaidya AI API?

Vaidya AI API is an OpenAI/vLLM-compatible Chat Completions endpoint built for healthcare. Send a standard model and messages body — the model implicitly understands your healthcare request and routes to the right capability.

Why use it?

  • One endpoint, many use cases - Symptom Checker, Wellness Management, Emergency Assist, Patient Journey Assist, Administrator Assist, and more behind a single POST.
  • Drop-in compatible - works with the OpenAI Python/JS SDK, requests, curl, or any HTTP client.
  • Predictable pricing - credit-based model so you know the cost per call before you ship.

Vaidya Model 2.0

Vaidya 2.0 is powered by our latest innovations in LLM post-training techniques, a deep-research augmented reasoning framework, and doctor-annotated synthetic healthcare data creation. Our rubric-based reinforcement learning trains Vaidya to think across the full spectrum of what makes a great clinical response - accuracy, completeness, empathy, and context. Parallel agentic thinking supercharges Vaidya 2.0 with the ability to deep-search the web, gathering and reasoning on top of the latest and most reliable healthcare information.

Benchmark performance

Vaidya 2.0 was rigorously evaluated on OpenAI's HealthBench Hard and outperformed every major model globally - including GPT-5 Thinking, Gemini-3-Pro, and Baichuan-M3-235B.

RankModelHealthBench Hard Score
1Vaidya 2.0 by Fractal50.1
2OpenAI GPT-5 Thinking46.2
3Baichuan-M3-235B44.6
4OpenAI GPT-5.2 Thinking42.1
5OpenAI GPT-5 Mini40.3
6Google Gemini 3 Pro25.5
7Qwen3 A3B-30B18.7
8Google MedGemma 27B15.5

Supported Use Cases

Vaidya AI API supports a range of healthcare use cases. The model implicitly understands your request and responds accordingly — no special parameter required:

Use CaseWhat it does
Symptom CheckerTriage symptoms and provide safe next steps and red flags
Wellness ManagementPersonalized health scoring, plans, and lifestyle recommendations
Emergency AssistRapid guidance for urgent health situations
Patient Journey AssistEnd-to-end support across care milestones and follow-ups
Administrator AssistSupport for clinical and administrative workflows

Requests are priced on a credit-based model. See Monitoring Usage for quota and billing details.

Quick overview

Base URLhttps://api.vaidya.ai
EndpointPOST vaidya/chat/completions
AuthBearer token - get your key from console.vaidya.ai
FormatStandard OpenAI chat request

Minimal request

The smallest valid call matches OpenAI Chat Completions:

curl --request POST \
  --url https://api.vaidya.ai/vaidya/chat/completions \
  --header "authorization: Bearer $VAIDYA_API_KEY" \
  --header "content-type: application/json" \
  --data '{
  "model": "Vaidya-v2",
  "messages": [
    {
      "role": "user",
      "content": "Explain the pathogenesis of rheumatoid arthritis."
    }
  ]
  "temperature": 0.2
}'

Same body as JSON:

{
  "model": "Vaidya-v2",
  "messages": [
    {
      "role": "user",
      "content": "Explain the pathogenesis of rheumatoid arthritis."
    }
  ]
  "temperature": 0.2
}

Error handling scenarios

Failures return JSON with an error object (message, type, code) and an HTTP status, similar to OpenAI-style APIs. Typical situations:

ScenarioStatusWhat to do
No Authorization header, wrong key format, or invalid key401Send Authorization: Bearer <key> with a key from console.vaidya.ai. See Authentication.
Key revoked or expired401Create a new key in the console and update your config.
Key lacks access to the resource or project403Check account or project permissions in the console.
messages missing roles, empty content, or invalid structure400Match the chat schema: alternating user/assistant turns with string content (or valid multimodal parts).
File upload missing when required400Attach the required PDF or image.
Request or file payload too large413Shorten text, compress images, or split work across calls.
Field types or enums fail validation422Align types with the API reference (e.g. temperature as number).
Too many requests in a short window429Retry with exponential backoff; honor Retry-After when present.
Client or upstream timeout408Increase timeout, retry idempotent calls, or shorten prompts.
Transient server or overload500 / 503Retry with backoff; log the response/request id if you need support.

Who is this for?

  • Health apps - add symptom triage, drug info, or lab analysis to your product.
  • Wellness platforms - generate health scores and personalized plans.
  • Clinical tools - build copilots and workflow assistants for care teams.
  • Enterprise - embed healthcare AI into internal tools and customer-facing products.

Next steps

  1. Get your API key at console.vaidya.ai.
  2. Write better prompts - read the Prompt Guide.
  3. Start building - jump to the Chat Completions API reference.
  4. Ship safely - follow Best Practices for prompts, retries, security, and disclaimers.
  5. Watch usage - use the Console Monitoring Usage page for quotas, charts, and alerts.

On this page