Lyceum
European GPU cloud for serverless inference, training, and on-demand GPU clusters
Lyceum is a European GPU cloud for running AI workloads, built and hosted in the EU. It offers serverless inference with pay-per-token, OpenAI-compatible API access, and dedicated endpoints for reserved capacity.
For training and larger jobs it provides on-demand GPU VMs (1-8 GPUs, ready in seconds), large-scale clusters (8-8000 GPUs with InfiniBand), serverless Python and Docker execution, and Jupyter cloud notebooks.
Data stays in European data centres, so teams with EU data-residency or sovereignty needs can run inference and training without leaving EU jurisdiction. Lyceum is Berlin-based (Germany).
Pricing: Usage-based
What is Lyceum?
Lyceum is a GPU cloud platform for AI teams, built and hosted in Europe. It brings serverless inference, dedicated endpoints, on-demand GPU VMs, and large-scale training clusters into one platform, with GPUs running in European data centres for teams that need EU data residency.
What it offers
- Serverless inference: pay-per-token, OpenAI-compatible API access to open-source models including Llama 3.3 70B, gpt-oss 120B, Qwen3.5, Kimi K2.6, GLM-5.2, and DeepSeek V4 Pro. Point an OpenAI-style client at Lyceum and stream completions.
- Dedicated endpoints: reserved GPU capacity for production models when you need consistent throughput.
- On-demand GPU VMs: full root access, provisioned in seconds, with per-second billing and no minimum commitment. Hardware includes NVIDIA H100 (80GB) and B200 Blackwell GPUs.
- Large-scale clusters: 8 to 8,000 GPUs connected over InfiniBand for distributed training.
- Serverless execution and notebooks: run any Docker container or Python job on GPUs, or launch Jupyter notebooks, without managing infrastructure.
EU hosting and sovereignty
GPUs are hosted in European data centres with GDPR compliance and EU data residency, so inference and training stay under EU jurisdiction. This makes Lyceum a fit for teams with sovereignty, residency, or compliance requirements that rule out US-based providers.
Pricing
Usage-based with three models: serverless (pay-per-token, from around $0.13/1M input tokens on smaller models), on-demand VMs (per-second billing, from $3.29/hr for an H100 instance), and long-term contracts for reserved capacity. No minimum spend, and signing up needs no credit card. Prices as listed on lyceum.technology, June 2026, check their pricing page for current rates.
Who it is for
Teams that want European GPU compute without running their own hardware, from small AI startups shipping with serverless inference to infra teams orchestrating large training clusters. Lyceum is Berlin-based (Germany).
Lyceum Alternatives
Explore 76 products in the Inference APIs category. View all Lyceum alternatives.
vLLM
High-throughput LLM inference engine with PagedAttention for efficient GPU memory usage
Is your product missing?