Same models.
Real workspace.
The substrate for production LLM applications. Everything the runtime layer needs — call recording, memory, streaming, cost attribution, governance — included. No library patchwork required.
Every LLM app rebuilds
the same infrastructure.
Logging middleware. A context management layer. A token counter. A billing tracker. A streaming handler. A guardrail check. A vector database. A memory store. None of this is specific to your product — but every team building an LLM application builds it from scratch.
By the time the infrastructure is done, the application logic that actually differentiates your product is buried under library integrations, version conflicts, and maintenance surface you didn’t want to own.
All of the above — included in every node.
The runtime, complete
Everything below is live on a new node. No configuration required, no third-party services to connect, nothing to build.
Streaming SSE
Token-by-token server-sent events. Wire your UI directly to the node endpoint — no streaming handler to build.
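Wiring a UI to a token stream comes down to parsing SSE `data:` lines. A minimal sketch of that client-side step — the event payload shape (`{"token": ...}`) and the `[DONE]` terminator are illustrative assumptions, not Interlocute's documented wire format:

```python
import json

def accumulate_tokens(sse_lines):
    """Collect token fragments from an SSE stream into the full response."""
    tokens = []
    for line in sse_lines:
        if not line.startswith("data: "):
            continue  # skip comments, event names, and keep-alive blanks
        payload = line[len("data: "):]
        if payload == "[DONE]":  # assumed end-of-stream sentinel
            break
        tokens.append(json.loads(payload)["token"])
    return "".join(tokens)

# Simulated stream, as an HTTP client would yield it line by line.
stream = [
    'data: {"token": "Hel"}',
    'data: {"token": "lo"}',
    "data: [DONE]",
]
print(accumulate_tokens(stream))  # -> Hello
```

In a browser, `EventSource` does the line parsing for you; the sketch shows what any non-browser client has to do by hand.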
Thread management
Multi-turn conversations with isolated state per thread. Context windowing and history managed automatically.
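"Context windowing managed automatically" means trimming history to what fits the model's window. A sketch of the idea, using word counts as a stand-in for a real tokenizer — the logic is illustrative, not Interlocute's implementation:

```python
def window(history, budget):
    """Keep the most recent turns that fit within a token budget."""
    kept, used = [], 0
    for turn in reversed(history):          # newest turns first
        cost = len(turn.split())            # stand-in for real token counting
        if used + cost > budget:
            break                           # oldest turns fall out of the window
        kept.append(turn)
        used += cost
    return list(reversed(kept))             # restore chronological order

history = ["a b", "c d e", "f"]
print(window(history, budget=4))  # -> ['c d e', 'f']
```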
Long-term memory
Cross-thread persistent context. The node remembers users across sessions with no separate memory store to provision.
RAG
Upload documents, get grounded responses. Vector search, chunking, and context injection — no vector database to manage.
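The chunking step mentioned above typically splits documents into overlapping windows before embedding. A toy version of that idea — chunk sizes and overlap here are illustrative defaults, not Interlocute's settings:

```python
def chunk(text, size=50, overlap=10):
    """Split text into overlapping word windows for embedding."""
    words = text.split()
    step = size - overlap  # each window starts `step` words after the last
    return [
        " ".join(words[i:i + size])
        for i in range(0, max(len(words) - overlap, 1), step)
    ]

print(chunk("a b c d e f", size=4, overlap=2))  # -> ['a b c d', 'c d e f']
```

The overlap keeps a sentence that straddles a boundary retrievable from either side.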
Tool use
Pre-configured function calling. Define tools in your node configuration; the runtime handles invocation and result injection.
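Under the hood, "the runtime handles invocation" means mapping the model's requested call to a registered handler and returning the result for injection. A hypothetical sketch — the tool registry shape and call format are illustrative, not Interlocute's configuration schema:

```python
def get_weather(city: str) -> str:
    return f"Sunny in {city}"  # stand-in for a real lookup

# Hypothetical registry: tool name -> parameter schema plus handler.
TOOLS = {
    "get_weather": {
        "parameters": {"city": "string"},
        "handler": get_weather,
    },
}

def dispatch(tool_call):
    """Run the tool the model asked for; the runtime performs this step."""
    tool = TOOLS[tool_call["name"]]
    return tool["handler"](**tool_call["arguments"])

result = dispatch({"name": "get_weather", "arguments": {"city": "Oslo"}})
print(result)  # -> Sunny in Oslo
```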
Scheduling
Cron-style triggered execution. Your node runs on a schedule with no cron server, no queue, and no worker process to manage.
Not bolted on.
Built in.
Production LLM apps fail in ways that are invisible without the right instrumentation. A context window silently exceeded. A tool call that returned nothing. A latency spike in a specific capability. A user whose thread accumulated unexpected cost.
Every node interaction is automatically recorded with the full processing trace — not just the final response. You see what ran, in what order, with what inputs and outputs, at what cost.
Capability traces
Every preprocessing step — RAG retrieval, memory lookup, tool calls — logged with inputs and outputs.
Token-level accounting
Input, output, and computation tokens broken down per call — not just aggregate monthly totals.
Latency per call
Time to first token and total response time. Compare models, detect regressions, and investigate spikes.
Governance log
Guardrail refusals, quota hits, and access denials — logged with reason codes for every enforcement action.
Know exactly what
your AI costs.
Usage is metered per call and attributed simultaneously along three dimensions: the node it ran on, the thread it belongs to, and the API key that authorised it. Query any combination to get a complete picture.
For multi-tenant applications, issue a separate API key per customer. Every request is automatically tagged to that key. Chargeback becomes a query, not a spreadsheet exercise.
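"Chargeback becomes a query" can be pictured as a fold over exported call records. The field names (`api_key`, `node`, `cost`) are assumptions about the export shape, for illustration only:

```python
from collections import defaultdict

def cost_by_key(records):
    """Total metered cost per API key -- one key per customer."""
    totals = defaultdict(float)
    for r in records:
        totals[r["api_key"]] += r["cost"]
    return dict(totals)

# Sample records with illustrative values.
records = [
    {"api_key": "cust-a", "node": "support-bot", "cost": 0.012},
    {"api_key": "cust-b", "node": "support-bot", "cost": 0.007},
    {"api_key": "cust-a", "node": "summariser", "cost": 0.020},
]
print(cost_by_key(records))  # per-customer totals
```

Grouping by `node` or thread instead of `api_key` gives the other two attribution dimensions.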
Built for production, not prototypes
Interlocute is the right substrate when the app you're building has to actually work — reliably, observably, and at real usage volumes.
SaaS builders
Adding AI to a product you ship to customers. You need call records, cost attribution per customer, guardrails, and a runtime that scales without ops work.
Platform and agency builders
Running AI workloads on behalf of multiple clients. Per-key attribution and per-node isolation make chargeback and separation of concerns straightforward.
Internal tooling teams
Deploying AI assistants, automation, or knowledge tools inside an organisation. Governance, audit trails, and budget controls are non-negotiables from day one.
Agent builders
Building autonomous workflows that run on schedules or in response to events. Each node is addressable, configurable, and independently observable.
Integration developers
Connecting LLM capabilities to existing systems — CRMs, support tools, data pipelines. Inbound event triggers (email, SMS, webhook) route directly to a node.
Teams evaluating in production
Running A/B tests on models, prompts, or capability configurations against real traffic. Every call record is exportable for systematic evaluation.
Frequently Asked Questions
For Builders
Is Interlocute a framework?
Do I still write code?
Can I start with one use case and grow?
Do I need to manage infrastructure?
How does Interlocute relate to Azure or cloud infrastructure?
What if I already use LangChain, LlamaIndex, or a similar library?
How does cost work for a production application?
Is Interlocute production-ready today?
Deploy your first node.
Create a node, configure it for your use case, and call it from your application. The full runtime substrate is live from the first request.
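Calling a node amounts to one authenticated HTTP request. A sketch of assembling that request — the `.example` hostname, path, header, and payload fields are placeholders, not Interlocute's actual API:

```python
import json

def build_node_request(node_id, api_key, message):
    """Assemble the HTTP request an application would send to a node."""
    return {
        "url": f"https://api.interlocute.example/nodes/{node_id}/messages",
        "headers": {
            "Authorization": f"Bearer {api_key}",  # the key also tags usage
            "Content-Type": "application/json",
        },
        "body": json.dumps({"message": message, "thread": "demo-thread"}),
    }

req = build_node_request("support-bot", "sk-demo", "Hello!")
print(req["url"])
```

Send it with any HTTP client; the per-key cost attribution described above comes for free from the `Authorization` header.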