Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.tensormesh.ai/llms.txt

Use this file to discover all available pages before exploring further.

May 27, 2026

New Serverless Models Ten models are now available for serverless inference:
ModelFamilyParametersContext
DeepSeek-V4-FlashDeepSeek-V4284B1M
gemma-4-31B-itGoogle Gemma 430.7B256K
GLM-5.1-NVFP4-MTPGLM-5.1433B128K
MiniMax-M2.5MiniMaxAI228B196K
Kimi-K2.6Kimi1T256K
gpt-oss-120bOpenAI gpt-oss116B131K
gpt-oss-20bOpenAI gpt-oss20B131K
Qwen3.5-397B-A17B-FP8Qwen3.5397B262K
Qwen3.6-27B-FP8Qwen3.627B262K
Qwen3-Coder-30B-A3B-InstructQwen3 Coder30.5B262K
See Serverless Inference for more details. External Storage External Storage gives your serverless models a persistent KV cache bucket — so context is remembered across requests and sessions. Subscribe to Bronze, Silver, or Gold plans from Operations → Storage to increase bucket capacity and drive a higher fraction of your tokens to $0. (External Storage) Reserved GPU Clusters On-demand GPU deployments have been removed from the platform. If you need dedicated GPU capacity for high-volume production workloads, contact us through Deploy → Reserved to request a reserved cluster. The Tensormesh team will work with you on a tailored capacity plan. (Reserved Deployments)
Overview Page — The Overview page has been updated with quick actions linking to Serverless, Demos, Claude Code, and Codex CLI. The stats shows Cache Hit Rate, Models Called, Monthly Spending, and Account Balance. A cached tokens chart visualizes your cache performance over time, and a serverless model catalog is shown at the bottom for quick access.Cache SavingsManagement → Cache Savings now shows estimated savings, a stacked area chart of input spend vs. cache savings over time, and a top-models-by-savings table.SDK & CLI — The SDK and CLI have been updated to focus on serverless inference.New Docs Pages — Dedicated documentation for External Storage, Serverless Usage, Cache Savings, and a Glossary of key terms.

April 29, 2026

Account Deletion — You can now delete your own account from Management → Account. Account deletion requires email confirmation and is permanent.Billing Transaction Details — Transaction details on the Billing page now show a deeper breakdown.

April 15, 2026

Serverless Inference Run models via API with no infrastructure to manage. Pay-per-token pricing with $0 for cached tokens. Track per-model token usage and costs under Operations → Serverless Usage. The API is OpenAI-compatible — point any existing SDK to https://serverless.tensormesh.ai. (Serverless Inference) Tensormesh Demos A new Demos section with interactive benchmarks. Navigate to Operations → Demos to run live inference demos and observe KV cache acceleration across TTFT, E2E latency, and inter-token latency. CLI Documentation A new CLI tab in the docs with guides and a full command reference for the tm CLI tool — covering installation, authentication, inference, model management, billing, and admin workflows. (CLI) Python SDK A new Python SDK tab with guides for the tensormesh package — covering sync and async clients, inference, control plane resources, and migration from OpenAI/Fireworks. (Python SDK) API & SDK Reference Documentation Full interactive API & SDK docs for all Tensormesh endpoints with an in-browser playground and code examples in cURL, Python, and JavaScript. (API & SDK Reference)
Email Notification Preferences — Toggle email notifications for deployment updates from Management → Account.Support Ticket Attachments — You can now attach files when creating support tickets.Quick Actions on Dashboard — Quick action cards on the Overview page for faster navigation to common operations.

March 17, 2026

MiniMax-M2.5 on Serverless MiniMaxAI/MiniMax-M2.5 is now available for serverless inference. Built on a 228B-parameter Mixture-of-Experts architecture with a 196K context window, it excels at advanced reasoning, coding, and building autonomous systems that combine tool orchestration with large-scale information processing.
Cost Saving Breakdown — Cache Savings page now displays a step-by-step savings breakdown with calculation formulas.Browser Notifications — Notification preferences can now be configured from Management → Account.