Blog — Model updates & releases

NEW · BUDGET v2.1.0 · Jun 24, 2026

NLC ShipLow 1.0 is live — six-lane budget fusion for everything-at-once

ShipLow is a budget multi-agent that doesn't compromise on capabilities. A fast router reads your request and dispatches it to the specialist executor that's strongest at that specific kind of work — six dedicated lanes — all under one nlc-shiplow brand id. $0.30/M input, $0.60/M output — a fraction of NLC PRO.

The six lanes

· General chat & reasoning — fast all-rounder lane with tool-use and image input.
· Code — single-file generation, refactors, and bug fixes at competition-grade quality.
· Agent — multi-file and repo-level engineering with patch-style edits.
· Math — careful step-by-step numeric reasoning and proofs.
· Vision — image attached → auto-routed here. Reads screenshots, diagrams, UI mockups.
· Multilingual — Thai / Indic / CJK / Arabic, natural and fluent.

Auto-routing — the router classifies in <1s and dispatches. You see one streamed answer.
128K context — fits a multi-file repo, a long doc, or an hours-long conversation.
MCP tools work — call your connectors from any lane.
$0.30 / $0.60 per 1M tokens — over 10× cheaper than NLC PRO and 15× cheaper than NLC ULTRA MAX. Drop back to those any time you want maximum quality.

Budget Multi-agent Vision MCP tools 128K context

NEW v2.0.0 · Jun 20, 2026

NLC ULTRA MAX 1.0 is live — multi-agent routing for full-stack work

ULTRA MAX is our premium tier: a fast router classifies your request as FRONTEND, BACKEND, or GENERAL, then dispatches it to the best executor model. Frontend tasks go to a vision-capable model; backend tasks go to a deep-reasoning model. You get one streamed response under the nlc-ultra brand id — you never see which sub-agent ran.

Multi-agent Full-stack 1M context

UPDATE v1.9.0 · Jun 20, 2026

Migrated to a new inference provider — faster, more reliable

We've switched our upstream inference to a new provider for better latency and reliability. All models (PRO, Vision, Fast, ULTRA MAX, Embed, Rerank) now run on the new infrastructure. No API changes — your existing keys and code work as-is.

Infrastructure Performance

FEATURE v1.8.0 · Jun 20, 2026

Anthropic Messages API now supported

You can now use the Anthropic Python/JS SDK with NLC AI. Point your client at https://oddsforge.org/inference and use your NLC API key. Works with Claude Code, Anthropic SDK, and any Anthropic-compatible client.

Anthropic Claude Code New protocol

FEATURE v1.7.0 · Jun 19, 2026

NLC Embed 1.0 + NLC Rerank 1.0 — RAG pipeline endpoints

Two new endpoints for building RAG pipelines: /v1/embeddings generates vector embeddings, and /v1/rerank reranks documents against a query. Both at $0.20/M input tokens.

Embeddings Rerank RAG

SECURITY v1.6.0 · Jun 19, 2026

MFA, API key scopes, audit logs, GDPR export

Major security update: TOTP two-factor authentication, per-API-key scopes (e.g. embeddings-only), admin + user audit logs, webhook delivery logs with retry, SSRF guards, GDPR data export and account deletion.

MFA Scopes GDPR Audit log

Model updates & releases

Meet the models

NLC ULTRA MAX 1.0

NLC Vision 1.0

NLC Fast 1.0

NLC PRO 1.0

NLC ShipLow 1.0

NLC Embed 1.0

NLC Rerank 1.0