StellarCloud Managed Inference

Open-source AI models,
production-ready.

We host and serve the best open-source AI models — language detection, OCR, NER, embeddings, rerankers, and LLMs. Simple API, transparent per-token pricing, EU infrastructure.

EU-hosted

Pay per million tokens

Zero data retention

Get API key View pricing

Specialized AI Models

Purpose-built. State of the art. Hosted for you.

Small, fast, accurate models for NLP and vision tasks. Curated and tuned by us — you just call the API.

Text Processing

Language Detection

Identify 1,000+ languages with high accuracy — including rare and low-resource ones.

Language ID model

Lemmatization

Neural lemmatizer reducing words to base forms across 60+ languages.

Neural lemmatizer

Named Entity Recognition

Zero-shot NER — detect any entity type (person, org, amount, custom) without fine-tuning.

Zero-shot NER

Entity Linking

Link detected entities to knowledge bases and canonical identifiers (Wikidata, custom KB).

Entity linker

Document Processing

StellarOCR

One endpoint · structured output

Upload a PDF or image. StellarOCR detects structure, extracts body text, handles complex tables, and preserves math formulas — returning clean, structured output. No stitching, no orchestration.

€0.90 / 1K pages · one flat rate

What it handles

Structure detection

Headers, paragraphs, columns, figures, reading order — preserved.

Text extraction

Printed, handwritten, multi-language. Fast and layout-aware.

Complex tables

Dedicated high-fidelity extraction for tables and difficult scans.

Math formulas

Inline and display equations converted to clean LaTeX.

Embeddings & Retrieval

Text Embeddings — Small

Fast multilingual dense embeddings for semantic search and similarity. 1024-dim.

Embedding (small)

Text Embeddings — Large

Higher-quality multilingual embeddings for retrieval where recall matters most.

Embedding (large)

Text Embeddings — Long Context

8K context multilingual embeddings. Ideal for whole-document retrieval.

Multilingual embedding

Image Embeddings

Visual embeddings for image search and cross-modal retrieval. Self-supervised.

Image embedding

Reranking

Cross-encoder re-ranking of search results. Dramatic quality boost over bi-encoders.

Cross-encoder reranker

Large Language Models

The best open-weights LLMs, served on our GPUs

We run them on our infrastructure so you don't have to. Full generative capability without sending your data to US model providers.

GPT-OSS 120B

Apache 2.0

Open-weights LLM by OpenAI. Instruction-tuned, broad capabilities.

Parameters

120B

Context

128K

Devstral 2

Apache 2.0

Mistral coding model. Strong on software tasks, tool use, agentic workflows.

Parameters

24B

Context

128K

Qwen 3.5 397B A17B

Apache 2.0

Alibaba MoE flagship. Multilingual reasoning, long context, frontier-tier quality.

Parameters

397B · 17B active

Context

256K

More models coming. We curate open-weights LLMs based on benchmarks and EU-alignment. Tell us what you'd like to see.

How it works

Ship in under 5 minutes

No complex setup. Sign up, generate keys, integrate. We handle model serving, scaling, and uptime.

Sign up

Create your account, choose a plan.

Create an API key

Scoped tokens with per-model limits.

Call any model

OpenAI-compatible endpoints.

Pay per token

Transparent usage, invoiced monthly.

Call any model OpenAI-compatible

curl

curl https://api.stellarcloud.ai/v1/chat/completions \
  -H "Authorization: Bearer $STELLARCLOUD_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-oss-120b",
    "messages": [{ "role": "user", "content": "Hello" }]
  }'

100% EU Infrastructure

Inference in Europe. Governed by Europe.

All models run on GPUs in EU data centers. Your prompts and responses never leave the region. No US Cloud Act exposure. GDPR compliant by design.

EU data centers only (Frankfurt, Amsterdam)

Zero data retention — no prompt logging

EU legal jurisdiction, Czech HQ

GDPR, EU Data Act, DORA aligned

Why EU matters

Data sovereignty

Your data stays in EU. No foreign government can subpoena it via extraterritorial laws.

Regulatory alignment

GDPR, EU Data Act, DORA — built in. No SCCs needed for EU data transfers.

Low latency for EU users

Sub-100ms to your EU customers. No transatlantic round-trip.

Pricing

Pay per million tokens. No minimums.

Transparent usage-based pricing. Only pay for what you use. Pre-paid credits or monthly invoicing for teams.

Per-token billing

Simple rates per million input/output tokens. Different rates per model.

Free tier included

Every account gets free monthly credits to test all models before committing.

Real-time usage dashboard

Track spend per model, per key, per team member. Set hard limits and alerts.

Transparent rates for every model

Per-million-token pricing across specialized models and LLMs.

See full pricing

Start building

Open-source models, zero hassle.

Create an account, generate an API key, and ship. Free credits on signup.

Get API key View pricing

Open-source AI models, production-ready.

Purpose-built. State of the art. Hosted for you.

Language Detection

Lemmatization

Named Entity Recognition

Entity Linking

StellarOCR

Text Embeddings — Small

Text Embeddings — Large

Text Embeddings — Long Context

Image Embeddings

Reranking

The best open-weights LLMs, served on our GPUs

GPT-OSS 120B

Devstral 2

Qwen 3.5 397B A17B

Ship in under 5 minutes

Sign up

Create an API key

Call any model

Pay per token

Inference in Europe. Governed by Europe.

Pay per million tokens. No minimums.

Per-token billing

Free tier included

Real-time usage dashboard

Open-source models, zero hassle.

Open-source AI models,
production-ready.