Filing cabinet pretending to be a KB

You don’t have a knowledge base. You have a filing cabinet.

I’ve had this conversation a lot this year. An organisation deploys a Copilot-style assistant or builds a RAG pipeline. They point it at their SharePoint, their Confluence, their shared drive. The first demo goes well. Then they roll it out and the thing starts confidently giving wrong answers, citing outdated policies, mashing together contradictory guidance from three different documents.

“Our AI is hallucinating,” they say.

Sometimes it is. But more often, the AI is doing exactly what it was asked to do. It’s retrieving content from the sources you pointed it at. The problem is those sources aren’t a Knowledge Management System (KMS). They’re a filing cabinet.

This distinction matters enormously, and most people building AI projects right now don’t fully understand it.

Knowledge isn’t data. It’s not even information.

Data is a raw observation. The fact that a customer called at 2:47pm on Tuesday.

Information is data given context and meaning. The fact that 73% of Tuesday afternoon calls in your contact centre are billing disputes.

Knowledge is the ability to act on that information correctly. What the agent should say, which process to follow, what exceptions apply, what the safe answer is in a regulated context.

What’s the difference?

Knowledge

The ability to act correctly

What the agent should say, which process to follow, what the exceptions are under the current policy.

This is what people, and AI, need to actually do work.

Information

Data given context and meaning

73% of Tuesday afternoon calls in your contact centre are billing disputes.

Useful, but not yet actionable for the agent handling the next call.

Data

Raw observation

A customer called at 2:47pm on Tuesday. A PDF was uploaded to SharePoint in March 2022. A ticket was logged.

Without context, none of this is useful.

A document repository manages data and information. A Knowledge Management System (KMS) manages knowledge: the actionable, role-specific, policy-governed guidance that people and systems need to do work correctly.

When you point AI at a repository, you’re asking it to synthesise knowledge from sources that were never organised as knowledge. You’re asking it to construct the right answer from documents that were never written to answer questions.

You haven’t given your AI a knowledge base. You’ve given it a library.

Libraries are useful. But “search the library for an answer to my question” and “consult the knowledge base” are not the same, and you’ll notice the difference real fast in production.

What a Document Management System actually is

Document Management Systems (DMS), and their bigger cousins Enterprise Content Management (ECM) platforms, do exactly what they say. They manage documents.

Version control. Access permissions. Check-in and check-out. Retention policies. Search across metadata and file names. Occasionally full-text search within documents.

These are genuinely useful. But the design centre for a DMS is file governance and retrieval. The question it answers is: “Can I find the document I’m looking for and trust it’s the right version?” That’s a valuable question. It’s not the same question as: “What should I do when a customer escalates a billing dispute under the latest policy that came into effect last month?”

A DMS is optimised for “store safely and retrieve accurately.”
A KMS is optimised for “give the right answer, in context, to the right person.”

What a Knowledge Management System actually does

A KMS is organised around answers, not files.

The content types are different. Instead of documents, PDFs, and presentation decks, a KMS contains structured articles: how-to guides, troubleshooting, policy interpretations, decision based procedures. Each article answers a specific question or handles a specific scenario. Each has a defined scope so you know what it covers and what it doesn’t.

The metadata is also different. A repository tells you who created a document and when. A KMS tells you which product the article applies to, which customer segment, which channel, which regulation, whether it’s been validated, and who’s responsible for it when things change.

The governance is different. Every article has an owner, a review date, quality metrics, and a feedback mechanism. Knowledge decays. Someone is accountable for keeping it accurate.

And critically, a KMS surfaces knowledge at the point of need: inside the CRM, inside the ticketing system, inside the AI assistant, formatted for the channel it’s appearing in. Not sitting in a portal you have to remember to navigate to.

A DMS may contain knowledge. But it is not organised as knowledge.

That sentence explains a large proportion of the AI disappointment I’m seeing in the market right now.

Document management
Knowledge management
Primary object
Files — PDFs, Word docs, presentations, spreadsheets.
Answers — procedures, decisions, troubleshooting guides, policy interpretations.
Main goal
Store safely. Retrieve accurately. Control versions.
Give the right answer, in context, to the right person.
Structure
Folder hierarchies and file-level metadata.
Taxonomies, templates, and relationships between concepts, tasks, and roles.
Governance
Document owners and retention policies.
Knowledge owners, quality metrics, review cycles, and feedback loops.
AI suitability
Retrieves documents. Chunks may be incomplete, inconsistent, or outdated.
Purpose-built for retrieval. Structured, scoped, validated, and current.

Why this matters so much for AI

Retrieval-Augmented Generation (RAG) works by fetching relevant passages from your enterprise sources and using them to ground the model’s answer. It’s a great technique for reducing hallucination and grounding AI output to your actual policies and procedures.

But it only works if what you’re retrieving is actually knowledge.

A KMS gives AI several things a document repository can’t:

Granularity. Knowledge articles are small, focused, and self-contained. They map cleanly to retrieval chunks. A 47-page policy manual doesn’t.

Context. Rich metadata including what the content applies to, when it’s valid, and what the exceptions are helps AI retrieve the right thing for the specific query, not just something approximately related.

Currency. Clear ownership and review cycles mean stale content gets flagged and updated. Without governance, your AI will confidently cite the policy from 2022.

Feedback loops. Usage analytics and quality signals let you see what’s being retrieved, whether it’s helping, and where the gaps are. A repository gives you none of this.

Industry analysis on RAG implementations put the reduction in unhelpful or inaccurate answers at roughly 50% when AI is grounded in curated, structured knowledge sources versus raw document repositories. I’d treat that number as directionally right rather than precise, but the direction is consistent with everything I’ve seen in practice.

Without a proper knowledge foundation, RAG doesn’t eliminate bad answers. It generates them faster, with confidence, at scale. Also known as “Confident nonsense at scale” by a Senior PM I know.

Six signs you have a repository, not a KMS

Most organisations don’t know which one they have until they try to connect AI to it. Here’s how to check before that happens.

Six signs you have a repository, not a KMS

If three or more of these apply, your AI project has a problem that even the best models won’t solve.

1

Your primary content type is documents

PDFs, Word files, and PowerPoint decks with no consistent templates or structure. The file format isn’t the issue. The absence of structure around questions and tasks is.

2

Taxonomy is just folder paths

“HR > Policies > 2024” tells you where the file lives. It doesn’t tell you who it applies to, in which context, or under which conditions, you’re organising documents by where they live, not by what they mean.

3

Nobody owns the content

Articles without a named owner, or review date is an orphan. Orphan articles accumulate quickly. Your AI will cite them anyway.

4

Search returns contradictions

Three versions of the same guidance from three different years, with no indication of which is current. AI will synthesise those contradictions into something that sounds authoritative.

5

Your AI project wired directly to SharePoint

If the plan was “connect Copilot to SharePoint and it’ll answer questions,” someone made a significant assumption about what’s actually in that SharePoint.

6

Success is measured in storage, not resolution

If your KPIs are number of documents and access counts rather than accuracy rates and task completion, you’re thinking like a repository. Your AI will act accordingly.

What to do about it

The hard truth is that knowledge management requires work. Not technology. Work. Content audits. Governance frameworks. Ownership assignments. Article templates. Taxonomy decisions. Review cycles.

None of that is AI’s job. All of it is a prerequisite for AI to do its job.

If you’re planning an AI knowledge project, start by asking honestly whether what you’re pointing AI at is actually organised as knowledge. If it isn’t, you’re not choosing between “fast and good” and “slow and thorough.” You’re choosing between “build it properly” and “build it twice.”

If you’re already deployed and wondering why your AI keeps getting it wrong, the answer is almost certainly in the quality of your sources, not the capability of the model.

Models can only work with what you give it.


Have you done an honest audit of what you’re actually pointing your AI at? I’ve run this diagnostic with a lot of organisations, usually ones who’ve already tried the shortcut. If you’re trying to figure out where your knowledge base actually stands before you start an implementation, reply and let me know. Happy to help you think it through.

Leave a Reply

Your email address will not be published. Required fields are marked *