Core Concepts

The mental model. Ten minutes of reading that will save you hours of orientation. If you only read one docs page, read this one.

One platform, many products

StellarBase is a modular platform with three complementary products. You can use any one of them standalone; they multiply when used together.

Product	What it is	When to use
StellarBase	Core platform — knowledge base, agents, workflows, chat, search, collaboration	You have data + want to build AI workflows on top of it
StellarCloud	Managed open-source inference — OCR, embeddings, NER, rerankers, LLMs	You need AI models but don’t want to host them
StellarGate	Privacy proxy — anonymizes any request before it reaches a third-party LLM	You want GPT-4 / Claude / Gemini but can’t send real data

Three building blocks inside StellarBase

1. Knowledge base

The knowledge base is your unified data layer. Every document, every email, every image, every row of a database you connect — all ingested, parsed, indexed, and linked. It’s not a storage layer; it’s a semantic layer that sits on top of your existing systems.

Key properties:

50+ connectors — Google Drive, SharePoint, iManage, Slack, Teams, Notion, databases, S3, plus custom connectors
Every format — PDF, DOCX, images, audio, video, code, logs, email — parsed automatically by StellarOCR
24+ languages — content in any European language, searchable in any other
Continuous sync — as sources change, the knowledge base follows

2. Agents

An agent is a scoped AI specialist you configure. You write its system prompt (this is where your domain knowledge lives), you choose its tools, you scope its knowledge. Every answer is cited to source.

Agents are the main way you encode your company’s playbook into the platform. A law firm builds a “MSA Reviewer” with the firm’s red-lines. A hospital builds a “Guideline Agent” that knows NCCN inside out. A manufacturer builds a “Rotating Equipment” agent that mirrors the senior foreman’s reasoning.

3. Workflows

A workflow is a deterministic pipeline. You arrange nodes (data sources, models, agents, outputs) in a graph. Same inputs produce the same outputs, every time. Workflows can be visual (drag-and-drop) or declarative (described to an agent in plain language). Workflows roll out through Q3 2026.

Workflows are how you automate. An insurance broker runs “portfolio review” weekly. A government ministry runs “beneficiary oversight” nightly. A manufacturer runs “predictive maintenance” every 15 minutes. The workflow is the same; it executes against fresh data.

Key mental shifts

It’s a platform, not a specialized tool

StellarBase does not come pre-configured for “legal review” or “medical literature” or “contract drafting”. It comes with the building blocks you use to assemble those. This is deliberate — your workflow is different from your neighbour’s, and a platform that bends to both is more valuable than two tools that each cover half.

The practical consequence: the first week you use StellarBase is about configuration, not training. You connect sources, write agent prompts, design workflows. After that, usage feels like any normal tool — but the tool you’re using is one you (partially) built.

Connect your own models

StellarBase is model-agnostic. You can:

Use the open-source models on StellarCloud (cheapest, EU-hosted)
Use commercial LLMs via StellarGate (GPT-4, Claude, Gemini — with PII anonymized before they see it)
Plug in your own models — HuggingFace endpoints, REST APIs, gRPC, Python scripts
Mix all three — different agents can use different models

Your organization’s custom ML — fraud detector, artifact classifier, failure predictor — becomes a callable tool that any agent or workflow can invoke.

Everything cited, nothing invented

The platform is built so that every output can be traced to an input. Agents cite the specific passage they’re quoting. Workflows produce structured outputs (CSV, JSON, database rows) with source metadata attached to every row. This matters for regulated work — audit trails, compliance reports, court submissions — and it matters for trust in general.

Deploy where the data lives

You’re not forced into anyone’s cloud. Managed EU cloud is the fastest path. On-premise deployment inside your own Kubernetes works identically. For classified or regulated workloads, air-gapped deployments are first-class — the same platform runs with zero internet egress.

Glossary

Term	Meaning
Base	An isolated container — its own data, its own agents, its own permissions. Think of it like a workspace in Notion or Slack. A ministry might have one per department; a law firm one per matter.
Connector	A configured link to an external source (Drive folder, SharePoint library, Postgres DB, custom API).
DSM	Dynamic Semantic Module — the proprietary engine that builds the knowledge graph linking documents, people, entities, concepts.
Agent	A configured AI specialist — system prompt + tool allowlist + knowledge scope + output schema.
Tool	Anything an agent can call — built-in (search, OCR), external (your API), or workflow-composed.
Workflow	A graph of nodes producing deterministic outputs. Can be triggered on schedule, on event, or on demand.
Citation	A back-reference from an AI output to its source — document + page + passage span.
StellarGate	The privacy-proxy product. Anonymizes prompts before they reach external LLMs, de-anonymizes the response.
StellarCloud	The inference-API product. EU-hosted open-source models available over HTTP.

Next steps

Quick Start — connect a source, spin up an agent, run a query
Platform Overview — deeper tour of the knowledge base + agents + workflows
Deployment — cloud, self-hosted, air-gapped, hybrid