Auditable by design
Every agent action is a typed, logged CloudEvent. Full execution traces for compliance, debugging, and replay — no black boxes.
Most AI projects fail in production — not because the models are wrong, but because the infrastructure around them isn’t built for enterprise requirements: auditability, governance, security, and integration with existing systems.
Alquimia is the AI execution layer that closes this gap. It wraps LLMs and autonomous agents in explicit contracts, deterministic execution paths, and enterprise-grade integrations — so your AI investments deliver measurable outcomes, not just demos.
Ships as two open-source components:
alquimia-core — Python SDK. The agent execution engine: evaluation strategies, memory management, tool integration (MCP, Llama Stack, A2A), and OCI-based agent registry.alquimia-runtime — FastAPI service. HTTP API wrapping the core SDK with async inference, SSE streaming, Redis-backed session state, Keycloak auth, Vault secrets, and a full registry management API.Auditable by design
Every agent action is a typed, logged CloudEvent. Full execution traces for compliance, debugging, and replay — no black boxes.
Enterprise security
Keycloak OIDC, JWT, or API token auth. HashiCorp Vault for secrets. OCI artifact distribution for agent configs. Zero plaintext credentials.
Multi-model, multi-tool
Connect OpenAI, Anthropic, Ollama, vLLM, or any OpenAI-compatible endpoint. Integrate MCP servers, Llama Stack, or other agents as tools.
Production-ready infrastructure
Kubernetes-native with Kustomize overlays. Kafka-backed event bus with HMAC-signed CloudEvents. OpenTelemetry observability. PostgreSQL audit logs. Redis session state.
Governance & control
Agent configurations are versioned, validated, and distributed as OCI artifacts. Hold/unhold agents without redeployment. Dry-run validation before publishing.
Human-in-the-loop
Require human approval before any tool execution. Configurable per tool: NONE, LAZY, or REQUIRED. Approval flows over the same SSE stream.
Multi-channel
Agents receive messages from WhatsApp, Slack, Email, and web — normalized through the same execution pipeline. One agent, many channels.
Memory & context
Short-term token windowing and long-term Chain-of-Density summarization. Session state persisted in Redis with configurable TTL. RAG via Qdrant.
| Repository | Description |
|---|---|
| Alquimia-ai/alquimia-core | Python SDK — agent execution engine, evaluation strategies, memory, tools, registry |
| Alquimia-ai/alquimia-runtime | FastAPI service — HTTP API, async inference, SSE streaming, Keycloak auth, Vault secrets |