token-diet

Name: token-diet
Rating: 5 (3 reviews)
Author: VDADev2022

Author: VDADev2022

Production-ready token optimization reduces costs by 40-75% using retrieval pruning, smart caching, and model routing. It's ideal for optimizing API costs, latency, or managing long contexts, especially in RAG pipelines, high-volume systems, multi-turn conversations, or when context exceeds 2K tokens.

3stars

Updated 3 months ago

View on GitHub ↗License: MIT

How to add

/plugin marketplace add VDADev2022/token-diet

The exact command may vary by repository. Check the README on GitHub.

For the skill author

Drop this on your repo README

Shows your skill is listed on Skillteca, generates a backlink and trackable traffic.

[![Listada na Skillteca](https://www.skillteca.com.br/api/badge/token-diet/svg)](https://www.skillteca.com.br/skills/token-diet?utm_source=badge&utm_medium=readme&utm_campaign=badge)

#llm #api

Related skills

See all in Design e Frontend →

webapp-testing

153.1k

Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots, and viewing browser logs.

Design e Frontend#testby anthropics

brand-guidelines

153.1k

Applies Anthropic's official brand colors and typography to any artifact that may benefit from its look-and-feel. Use it when brand colors, style guidelines, visual formatting, or company design standards apply.

Design e Frontendby anthropics

frontend-design

153.1k

Creates distinctive, production-grade frontend interfaces with high design quality, generating creative, polished code and UI design that avoids generic AI aesthetics. Use for building web components, pages, and applications, or for styling/beautifying web UIs.

Design e Frontend#css#aiby anthropics

mcp-builder

153.1k

Guide for creating high-quality MCP (Model Context Protocol) servers that enable LLMs to interact with external services through well-designed tools. Use when building MCP servers to integrate external APIs or services, whether in Python (FastMCP) or Node/TypeScript (MCP SDK).

Design e Frontend#llm#typescriptby anthropics

Category alert

Get new Design e Frontend skills every Monday

One short email with only the new Design e Frontend skills. 4 minutes of reading, no spam, unsubscribe with one click.

You confirm your email on the first send. No spam. Unsubscribe with one click.

Token Diet v3.0 (Production-Ready)

Deployment-grade token optimization with execution order, ROI metrics, guardrails, and measurable outcomes.

Execution Flow (Order Matters)

Q → Retrieve → Prune → Cache → Route → Build Prompt → Compress → Call LLM → Measure → Update State

Why this order: Early pruning eliminates waste before caching/routing decisions. Compression happens last (post safety checks). Measurement feeds back into next iteration.

token-diet

How to add

Drop this on your repo README

Related skills

webapp-testing

brand-guidelines

frontend-design

mcp-builder

Get new Design e Frontend skills every Monday

Token Diet v3.0 (Production-Ready)

Execution Flow (Order Matters)

1. Retrieval Pruning (Hig

Comments · No comments