The Sovereign AI Flywheel
Your Team Uses the Tool.
The Tool Becomes Your AI.
CaveauAI isn't a search engine you rent. It's a data engine that builds a proprietary AI model from your daily workflows — entirely on European infrastructure you control.
Instant Utility — Day One
Your team uploads their working documents — contracts, case law, internal manuals, regulatory filings — to our EU-hosted bare-metal servers. Within minutes, they have enterprise-grade search across everything they uploaded.
No training required. No AI expertise needed. They type a question in plain language, and CaveauAI returns precise answers with clickable citations back to the source document. Powered by open-source models up to 72 billion parameters.
- Upload PDF, DOCX, TXT, HTML, or Markdown — up to 50 MB per file
- Vector, keyword, and hybrid search modes
- Every answer cites its source document and paragraph
- EU-hosted infrastructure — your data never crosses jurisdictional boundaries
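The three search modes above can be illustrated with a toy blend of keyword and vector scoring. Everything in this sketch — the corpus, the bag-of-words "embedding", and the 0.5/0.5 weighting — is an illustrative stand-in, not CaveauAI's actual ranking pipeline.

```python
# Minimal sketch of hybrid retrieval: a weighted blend of a keyword score
# and a vector-similarity score, with each hit carrying its citation.
from collections import Counter
from math import sqrt

DOCS = {
    "contract.pdf#p4": "termination notice period of ninety days",
    "manual.docx#p12": "employees submit expense reports monthly",
    "filing.pdf#p2":   "regulatory notice filed with the authority",
}

def keyword_score(query, text):
    # Fraction of query terms that appear in the document chunk.
    q, t = set(query.split()), set(text.split())
    return len(q & t) / len(q)

def embed(text):
    # Toy "embedding": a bag-of-words term-frequency vector.
    return Counter(text.split())

def cosine(a, b):
    dot = sum(a[k] * b[k] for k in a)
    na = sqrt(sum(v * v for v in a.values()))
    nb = sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def hybrid_search(query, alpha=0.5):
    qv = embed(query)
    scored = [
        (alpha * keyword_score(query, text) + (1 - alpha) * cosine(qv, embed(text)), src)
        for src, text in DOCS.items()
    ]
    # Each result keeps its source reference (document and paragraph).
    return sorted(scored, reverse=True)

best_score, best_src = hybrid_search("termination notice period")[0]
print(best_src)  # contract.pdf#p4
```

Setting `alpha` to 1.0 or 0.0 recovers pure keyword or pure vector mode, which is the sense in which hybrid search generalises both.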
The Invisible Asset Build
Here is where CaveauAI diverges from every other enterprise search tool on the market.
While your team does their normal work — asking questions, clicking cited sources, accepting or refining answers — the platform is quietly building something valuable in the background. Every interaction is securely logged in your isolated database: the question asked, the context retrieved, the answer generated, and whether your expert accepted it.
By simply using the tool, your organisation is passively generating a high-quality, human-verified, domain-specific instruction dataset. No annotation team. No data labelling project. No extra budget line. Your daily workflows produce the training data.
- Every query-answer pair is logged with full context
- Expert acceptance signals act as implicit human verification
- Data stays in your isolated database — never shared, never aggregated
- Zero additional effort from your team
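As a sketch of the kind of record this produces, the snippet below assembles one logged interaction into an instruction-tuning example. The field names and the helper function are hypothetical; only the four logged elements (query, context, answer, feedback) come from the description above.

```python
# Illustrative mapping from one logged interaction to an instruction-style
# training record. Field names are assumptions, not CaveauAI's schema.
import json
from datetime import datetime, timezone

def to_instruction_record(query, context_chunks, answer, feedback):
    return {
        "instruction": query,
        "input": "\n".join(context_chunks),   # retrieved context window
        "output": answer,
        "verified": feedback == "accept",     # expert acceptance as implicit verification
        "logged_at": datetime.now(timezone.utc).isoformat(),
    }

record = to_instruction_record(
    query="What is the notice period in the framework agreement?",
    context_chunks=["Clause 4.2: either party may terminate with 90 days' notice."],
    answer="The notice period is 90 days (clause 4.2).",
    feedback="accept",
)
print(json.dumps(record, indent=2))
```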
Your Sovereign Model — Month 4+
Once your organisation has generated enough high-quality interactions — typically four to six months of normal use — we take that proprietary dataset and fine-tune a compact, specialised model exclusively for your business.
This model runs on our dedicated NVIDIA RTX PRO 6000 infrastructure. It understands your corporate terminology, your document structures, your regulatory context, and your team's specific way of asking questions. It doesn't guess — it knows.
- Fine-tuned on your verified interaction data — not generic internet text
- Smaller model, drastically lower inference cost, faster responses
- Runs on dedicated GPU hardware — not shared cloud instances
- The model belongs to your organisation
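The selection step before fine-tuning might look like the sketch below: keep only accepted or refined interactions, and check whether the dataset has reached a minimum size. The function name and the threshold are illustrative assumptions, not CaveauAI figures.

```python
# Sketch: filter the interaction log down to human-verified training pairs.
# The min_pairs threshold is a placeholder, not a documented requirement.
def select_training_pairs(interactions, min_pairs=5000):
    pairs = [
        {"prompt": i["query"], "completion": i["answer"]}
        for i in interactions
        if i["feedback"] in ("accept", "refine")  # rejected answers are dropped
    ]
    ready = len(pairs) >= min_pairs
    return pairs, ready

log = [
    {"query": "q1", "answer": "a1", "feedback": "accept"},
    {"query": "q2", "answer": "a2", "feedback": "reject"},
    {"query": "q3", "answer": "a3", "feedback": "refine"},
]
pairs, ready = select_training_pairs(log, min_pairs=2)
print(len(pairs), ready)  # 2 True
```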
What This Means for Your Organisation
No Vendor Lock-in
You aren't renting an AI service. You are building a proprietary asset that understands your corporate language, your contract structures, your regulatory environment. If you leave, the model and the data go with you. Your investment compounds — it doesn't evaporate when a subscription ends.
Cost & Speed
A fine-tuned 8B-parameter model that knows your domain can outperform a generic 70B model on your specific tasks — at a fraction of the compute cost. Smaller model, faster inference, lower monthly spend. The economics improve the longer you use the platform.
Absolute Data Sovereignty
The entire process — from initial document upload, through every query your team runs, to the final fine-tuning of your custom model — happens on our secure European bare-metal servers. Your data never touches OpenAI, Google, or any third-party AI provider. Full GDPR compliance by architecture, not by policy.
Built on Hardware You Can Point To
No abstraction layers. No "somewhere in the cloud." Dedicated European bare-metal servers with GPUs allocated to your workload.
Your Data Never Leaves Your Control
A cryptographically isolated MariaDB instance per tenant. WireGuard mesh encryption on the 10.8.0.x subnet. Dedicated bare-metal NVIDIA RTX Blackwell clusters. No data ever traverses the public internet or touches a US cloud API. You own the GGUF model artefact.
No Ambient Telemetry
We log exactly what you permit to build your model, and nothing more. No tracking pixels, no shared tenant tables. Full GDPR compliance by physical architecture, not just by policy.
Start Phase 1 today. Upload your first document for free →
Generic AI vs. Your Sovereign Model
| | ChatGPT / Copilot (Renting) | CaveauAI Flywheel (Owning) |
|---|---|---|
| Data location | US cloud (OpenAI / Microsoft) | EU bare-metal (your tenant) |
| Model ownership | Vendor-owned, shared | Your organisation owns it |
| Domain knowledge | Generic, internet-trained | Fine-tuned on your verified data |
| Training data used | Your prompts may train their model | Your data trains only your model |
| Cost trajectory | Fixed per-seat, no compounding | Decreasing — smaller model, lower cost |
| If you cancel | You keep nothing | You keep the model and data |
How It Actually Works
For the security team. No marketing. Just the architecture.
Data Logging
We log what you explicitly permit. Nothing more.
When a user submits a query, the platform records four things: the raw query string, the retrieved document chunks (context window), the generated response, and the user's feedback signal (accept, reject, or refine). Each record is timestamped, tied to a session ID, and written to your isolated MariaDB instance — a dedicated database on your tenant, not a shared table.
There is no ambient telemetry. We do not log mouse movements, session duration, navigation patterns, or anything outside the explicit query-response-feedback loop. The logging schema is documented and available for audit. You can disable logging entirely at the tenant level, though this removes your ability to accumulate fine-tuning data.
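The four-field record described above could map to a table like the following. Production uses an isolated MariaDB instance per tenant; SQLite stands in here purely so the sketch runs self-contained, and the column names are illustrative, not the documented schema.

```python
# Sketch of a per-tenant interaction log table (SQLite as a runnable
# stand-in for the isolated MariaDB instance described in the text).
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE interaction_log (
        id          INTEGER PRIMARY KEY,
        session_id  TEXT NOT NULL,
        logged_at   TEXT NOT NULL DEFAULT (datetime('now')),
        query       TEXT NOT NULL,  -- raw query string
        context     TEXT NOT NULL,  -- retrieved document chunks
        response    TEXT NOT NULL,  -- generated answer
        feedback    TEXT CHECK (feedback IN ('accept', 'reject', 'refine'))
    )
""")
conn.execute(
    "INSERT INTO interaction_log (session_id, query, context, response, feedback) "
    "VALUES (?, ?, ?, ?, ?)",
    ("sess-42", "notice period?", "Clause 4.2 ...", "90 days.", "accept"),
)
row = conn.execute("SELECT feedback FROM interaction_log").fetchone()
print(row[0])  # accept
```

Nothing outside the query-response-feedback loop appears in the schema, which is the auditable property the text claims.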
The Air-Gap
The fine-tuning process does not happen on the same servers that handle your daily queries. It runs on a dedicated GPU cluster code-named hippo — physically separate Hetzner bare-metal nodes provisioned exclusively for training workloads.
Data transfer between your CaveauAI tenant and the training cluster happens over our private WireGuard mesh VPN on the 10.8.0.x subnet. No data traverses the public internet. The WireGuard tunnel is point-to-point with pre-shared keys rotated on a defined schedule.
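A point-to-point tunnel of the kind described might be configured like the fragment below. The keys, the specific addresses within the subnet, the endpoint, and the port are all placeholders; only the 10.8.0.x subnet and the rotated pre-shared key come from the text.

```ini
# Illustrative WireGuard peer config for the tenant-to-training link.
# All keys, hosts, and ports are placeholders.
[Interface]
Address = 10.8.0.2/24
PrivateKey = <tenant-private-key>
ListenPort = 51820

[Peer]
# hippo training cluster
PublicKey = <hippo-public-key>
PresharedKey = <rotated-preshared-key>
Endpoint = <hippo-host>:51820
AllowedIPs = 10.8.0.1/32
```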
At no point in this pipeline does your data touch a cloud provider's API, a shared compute cluster, or a network segment accessible from the public internet. The training data is purged from hippo after successful model delivery and client confirmation.
The Output
The deliverable is a set of model weights. Specifically: a quantised GGUF file (or safetensors if you prefer full-precision) containing your fine-tuned 8B parameter model, plus the LoRA adapter weights used during training.
This file belongs entirely to your organisation. It is not a license. It is not a service subscription. It is a binary artefact that encodes your corporate knowledge, tuned on your verified interaction data, that you can run on any compatible inference engine — llama.cpp, vLLM, Ollama, or our managed infrastructure.
We do not retain a copy of your weights. We do not use your model to serve other clients. We do not use your training data to improve a shared model. After delivery and your written confirmation, the weights and training artefacts are purged from our training infrastructure. You receive the only copy.
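As a usage sketch, the delivered GGUF file could be loaded locally with llama.cpp's CLI along these lines; the file name and prompt are illustrative, and any GGUF-compatible engine works equally.

```
# Run the delivered weights on your own hardware with llama.cpp.
llama-cli -m ./your-org-8b-q4_k_m.gguf \
  --prompt "Summarise clause 4.2 of the framework agreement." \
  --n-predict 256
```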
Start Building Your AI Asset Today
The sooner your team starts using CaveauAI, the sooner you accumulate the data that makes your sovereign model possible. Day one delivers instant value. Month four delivers a competitive moat.