Intelligence (CIF)

Token tracking, budgets, and cost analytics

The Common Intelligence Framework (CIF) is the observability and cost management layer of the EVC Platform. It tracks every LLM call, enforces budgets, provides cost analytics, manages provider routing, and monitors provider health across the entire organization.

Token tracking and call logging

Every LLM call across the platform is logged with full token accounting. CIF captures input tokens, output tokens, model, provider, latency, and cost for every call regardless of whether it originates from a build, a Crucible session, or a Promptr Studio playground execution.

The call log provides a searchable, filterable table of every LLM interaction. Filter by project, user, provider, model, or date range. Each entry links back to the originating build or session for full traceability.

CIF call log table showing recent LLM calls with provider, model, tokens, and cost

Token counts are reported by the LLM provider and recorded without modification. The platform does not estimate tokens; it uses the provider-reported values from each response.

[Screenshot: Call detail view showing full token breakdown, latency, and linked build — pending capture]

Budget configuration and enforcement

Set spending limits at the organization, project, or user level. CIF enforces budgets in real time by checking available balance before every LLM call.

Budget levels

Organization budget — the total spend cap for the billing period. All activity across all projects counts against this limit.
Project budget — a per-project allocation within the organization budget. Prevents one project from consuming the entire allocation.
User budget — optional per-user caps for controlling individual usage.

Budget configuration page showing org, project, and user budget settings

Enforcement behavior

When a budget is exhausted, new LLM calls are rejected with a clear error message. In-flight builds that hit the budget limit fail gracefully with a budget-exceeded status. Notifications are sent when budgets reach 80% and 100% of the configured limit.

[Screenshot: Budget usage meter showing current spend against limit with warning threshold — pending capture]

Budget enforcement is real-time. The balance is checked before each LLM call, not at build submission. This prevents cost overruns from long-running builds.

Cost analytics and provider comparison

CIF provides dashboards for analyzing LLM spend across dimensions: time, project, user, provider, and model. Use these analytics to optimize cost-efficiency and identify spending trends.

The cost analytics dashboard shows total spend over time with breakdowns by provider, model, and project. Compare providers to see which offers the best cost-per-quality ratio for your workloads.

Cost analytics dashboard with spend-over-time chart and provider breakdown

The provider comparison view shows side-by-side metrics for each provider: average cost per call, average latency, token efficiency, and error rate. Use this data to inform provider routing decisions.

Provider comparison table showing cost, latency, and error rate per provider

Provider routing policies

Routing policies control which provider and model are used for different types of work. Configure policies at the organization or project level to balance cost, quality, and availability.

Policy types

Cost-optimized — routes to the lowest-cost provider that meets the minimum quality threshold.
Quality-optimized — routes to the highest-rated provider regardless of cost.
Latency-optimized — routes to the fastest-responding provider.
Manual — always uses the explicitly specified provider and model.

Routing policies apply to automatic provider selection. When a user explicitly selects a provider and model at build time, the manual selection takes precedence over the policy.

Provider health monitoring

CIF continuously monitors the health and availability of configured LLM providers. Health data feeds into routing decisions and surfaces in the organization dashboard.

The provider health dashboard shows real-time status for each configured provider: availability, current latency, error rate, and rate limit utilization. Historical data shows uptime trends and incident windows.

Provider health dashboard showing status indicators and latency graphs

When a provider experiences degraded performance or outage, the routing layer can automatically fail over to an alternative provider if a routing policy with fallback is configured.

[Screenshot: Provider health history showing uptime percentage and incident timeline — pending capture]

SDK integration

For teams that call LLM providers directly in their own applications, the CIF SDK provides a drop-in wrapper that routes calls through the platform for tracking, budget enforcement, and analytics.

The SDK wraps your existing provider SDK calls and adds transparent instrumentation. Token counts, costs, and latency are reported to CIF without changing your application logic. Budget enforcement applies the same rules as platform-native calls.

[Screenshot: SDK integration code example showing wrapper initialization and usage — pending capture]

The SDK supports Node.js and Python. Additional language support is planned. See the API Reference for REST-based integration without the SDK.