StellarBase
StellarCloud Managed Inference

Open-source AI models,
production-ready.

We host and serve the best open-source AI models — language detection, OCR, NER, embeddings, rerankers, and LLMs. Simple API, transparent per-token pricing, EU infrastructure.

EU EU-hosted
·
Pay per million tokens
·
Zero data retention
Specialized AI Models

Purpose-built. State of the art. Hosted for you.

Small, fast, accurate models for NLP and vision tasks. Curated and tuned by us — you just call the API.

Text Processing

Language Detection

Identify 1,000+ languages with high accuracy — including rare and low-resource ones.

GlotLID v3

Lemmatization

Neural lemmatizer reducing words to base forms across 60+ languages.

Stanza

Named Entity Recognition

Zero-shot NER — detect any entity type (person, org, amount, custom) without fine-tuning.

GLiNER-relex large

Entity Linking

Link detected entities to knowledge bases and canonical identifiers (Wikidata, custom KB).

GLiNER Linker large
Document Processing

StellarOCR

One endpoint · structured output

Upload a PDF or image. StellarOCR detects structure, extracts body text, handles complex tables, and preserves math formulas — returning clean, structured output. No stitching, no orchestration.

€0.90 / 1K pages · one flat rate
What it handles
Structure detection

Headers, paragraphs, columns, figures, reading order — preserved.

Text extraction

Printed, handwritten, multi-language. Fast and layout-aware.

Complex tables

Dedicated high-fidelity extraction for tables and difficult scans.

Math formulas

Inline and display equations converted to clean LaTeX.

Embeddings & Retrieval

Text Embeddings — Small

Fast multilingual dense embeddings for semantic search and similarity. 1024-dim.

Qwen3-Embedding 0.6B

Text Embeddings — Large

Higher-quality multilingual embeddings for retrieval where recall matters most.

Qwen3-Embedding 8B

Text Embeddings — Long Context

8K context multilingual embeddings. Ideal for whole-document retrieval.

BGE-M3

Image Embeddings

Visual embeddings for image search and cross-modal retrieval. Self-supervised.

DINOv3 ViT-L

Reranking

Cross-encoder re-ranking of search results. Dramatic quality boost over bi-encoders.

Qwen3-Reranker 0.6B
Large Language Models

The best open-weights LLMs, served on our GPUs

We run them on our infrastructure so you don't have to. Full generative capability without sending your data to US model providers.

GPT-OSS 120B

Apache 2.0

Open-weights LLM by OpenAI. Instruction-tuned, broad capabilities.

Parameters
120B
Context
128K

Devstral 2

Apache 2.0

Mistral coding model. Strong on software tasks, tool use, agentic workflows.

Parameters
24B
Context
128K

Qwen 3.5 397B A17B

Apache 2.0

Alibaba MoE flagship. Multilingual reasoning, long context, frontier-tier quality.

Parameters
397B · 17B active
Context
256K
More models coming. We curate open-weights LLMs based on benchmarks and EU-alignment. Tell us what you'd like to see.
How it works

Ship in under 5 minutes

No complex setup. Sign up, generate keys, integrate. We handle model serving, scaling, and uptime.

01

Sign up

Create your account, choose a plan.

02

Create an API key

Scoped tokens with per-model limits.

03

Call any model

OpenAI-compatible endpoints.

04

Pay per token

Transparent usage, invoiced monthly.

Call any model OpenAI-compatible
curl
curl https://api.stellarcloud.ai/v1/chat/completions \
  -H "Authorization: Bearer $STELLARCLOUD_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-oss-120b",
    "messages": [{ "role": "user", "content": "Hello" }]
  }'
EU 100% EU Infrastructure

Inference in Europe. Governed by Europe.

All models run on GPUs in EU data centers. Your prompts and responses never leave the region. No US Cloud Act exposure. GDPR compliant by design.

EU data centers only (Frankfurt, Amsterdam)
Zero data retention — no prompt logging
EU legal jurisdiction, Czech HQ
GDPR, EU Data Act, DORA aligned
Why EU matters
Data sovereignty

Your data stays in EU. No foreign government can subpoena it via extraterritorial laws.

Regulatory alignment

GDPR, EU Data Act, DORA — built in. No SCCs needed for EU data transfers.

Low latency for EU users

Sub-100ms to your EU customers. No transatlantic round-trip.

Pricing

Pay per million tokens. No minimums.

Transparent usage-based pricing. Only pay for what you use. Pre-paid credits or monthly invoicing for teams.

Per-token billing

Simple rates per million input/output tokens. Different rates per model.

Free tier included

Every account gets free monthly credits to test all models before committing.

Real-time usage dashboard

Track spend per model, per key, per team member. Set hard limits and alerts.

Transparent rates for every model
Per-million-token pricing across specialized models and LLMs.
See full pricing
Start building

Open-source models, zero hassle.

Create an account, generate an API key, and ship. Free credits on signup.

View pricing