Knowledge Health Score: How to Know If Your AI Actually Learned Anything
Product

Knowledge Health Score: How to Know If Your AI Actually Learned Anything

April 23, 20268 min read

You uploaded 50 pages to train your AI chatbot. It says "Training Complete". But did it actually learn anything useful? The Knowledge Health Score is a 0-100 grade that answers exactly that question.

The Problem: "Training Complete" Tells You Nothing

When you train an AI chatbot with your website, product docs, or FAQs, the system splits everything into text chunks, creates vector embeddings, and stores them. Training complete. ✅

But "training complete" only means the data went in — not that the AI understands your business. It's like a student who read the textbook but can't name a single key concept from it.

What "trained" means
Your content was split into chunks and stored. The AI can search through it.
What it doesn't mean
The AI understands your products, services, team, or policies. It has no structured knowledge — just raw text fragments.
What the Health Score adds
A diagnostic grade showing volume, topic density, concept depth, and data consistency — like a report card for your AI.

What Gets Analyzed: Entities, Concepts, and More

Before we explain the score, you need to understand what the AI is looking for when it analyzes your content. Think of it like a restaurant's knowledge base:

Comparison of entities (specific named things) versus concepts (cross-cutting ideas)
Entities are specific things customers ask about. Concepts are themes that connect them.
TypeWhat It IsExample
EntityA specific thing a customer might ask about"Butter Chicken", "Chef Ramesh", "Weekend Brunch"
ConceptA cross-cutting idea that connects entities"Dietary Accommodations" (links Vegan + Gluten-Free + Nut-Free)
ContradictionTwo pages saying different thingsPage A: "Open till 11pm" vs Page B: "Open till 10pm"
GapSomething customers ask about but isn't covered5 visitors asked about parking, but no page mentions it
Entity ≠ mention. Your company name might appear on every page, but the AI creates only one entity card for it. More pages mentioning it means higher confidence in the summary, not more cards.

How the Score Is Calculated

The Health Score starts at 0 and earns points for actual coverage. It's not a percentage — it's a grade built from four dimensions, minus penalties.

Health score breakdown showing Volume (25), Density (2), Depth (0), and Clean Bonus (+10) totaling 37
A real example: 283 pages trained, but only 27 topics identified. Score: 37/100.

The Four Dimensions

ComponentMax PointsWhat It MeasuresAnalogy
Volume25How much content you've trained"Did you read the textbook?"
Entity Density25How many key topics the AI found per page"Can you name the important things?"
Concept Depth20How well ideas connect across topics"Do you understand how things relate?"
Clean Bonus+10No contradictions in your knowledge"Is your information consistent?"

Penalties That Lower Your Score

If the AI finds problems, points get subtracted:

ProblemPenaltyExample
Critical contradiction (pricing, contact info)-15 eachTwo different phone numbers on different pages
Warning contradiction (features, specs)-5 eachDifferent feature lists for the same product
Info contradiction (minor)-2 eachSlightly different descriptions of the same service
Knowledge gap-2 each (max -15)Customers asking about something you haven't documented

What Your Score Means

ScoreGradeWhat To Do
0–15🔴 PoorAdd more training content. Your AI doesn't have enough material to work with.
16–35🟠 WeakContent exists, but the AI hasn't identified enough key topics. Check if pages have enough detail.
36–55🟡 FairDecent foundation. Focus on connecting concepts across topics and filling gaps.
56–75🟢 GoodSolid coverage. Most visitor questions should get good answers. Look for contradictions.
76–100💎 ExcellentDeep knowledge with connected concepts and consistent information. Your AI is well-trained.

Real Example: 283 Pages, Score 37

One of our test customers trained their AI with 283 pages from their IT services website. The result? A health score of 37 out of 100. Here's why:

25/25
Volume
2/25
Density
0/20
Depth
+10
Clean Bonus

The score of 37 correctly tells this admin: "You have plenty of content, but the AI barely scratched the surface. It identified 27 things worth remembering across 283 pages — that's less than 1 topic per 10 pages."

A score of 37 isn't bad — it's honest. It tells you exactly where to focus: not more content, but deeper analysis of the content you already have.

How It Works Behind the Scenes

When you enable Auto-Synthesis in your Genius settings, the analysis runs automatically after every training. Here's what happens:

1
Entity Extraction
The AI reads all your pages in batches of 5, identifying products, services, people, locations, and policies. Each entity gets a one-line summary and confidence score. Duplicates are automatically merged.
2
Concept Mapping
Next, it looks for cross-cutting themes — ideas that span multiple entities. "Return Policy" might connect to "30-Day Guarantee" and "Refund Process." These connections help the AI give more complete answers.
3
Contradiction Detection
The AI compares pages against each other, looking for conflicting information. If your pricing page says "$99/month" but your FAQ says "$79/month", it flags this as a critical contradiction.
4
Gap Analysis
By looking at what visitors are actually asking in chat, the AI identifies topics that keep coming up but aren't well-covered in your content. This requires at least 5 conversations to activate.
5
Health Score
All the data from steps 1-4 feeds into the health score formula. The score is stored and shown on your Knowledge Health dashboard.

Does This Cost Extra?

Two important facts:

Synthesis (once per training)
Uses AI calls to analyze your content. For a 50-page site, this is about 33 AI calls costing roughly $0.03. Runs once, not per visitor message.
Chat (per visitor message)
Adding the synthesis feed to chat adds ZERO extra AI calls. The entity summaries are fetched via a simple database lookup, not an AI call. Your per-message cost doesn't change.
The synthesis pipeline uses the lightweight Gemini Flash model. For most businesses (under 100 pages), the entire analysis costs less than a nickel.

How to Enable It

Three toggles in Settings → AI & Behavior → Search Quality:

ToggleWhat It DoesCost Impact
Query ExpansionRephrases questions 2 ways for broader search+1 AI call per message
Knowledge Synthesis FeedAdds entity summaries to chat context$0.00 (database lookup)
Auto-Synthesis After TrainingRuns full analysis after every trainingOnce per training (~$0.03 for 50 pages)

Build a smarter AI chatbot

GetGenius trains on your website and docs to deliver accurate, consistent answers 24/7. No per-seat pricing. AI included in every plan.

Start free trial

Keep Reading