Note for first-time users: If you haven’t deployed OpenClaw yet, start with Tencent Cloud OpenClaw first. You can launch your OpenClaw instance in seconds with one click, then come back to this guide to explore more advanced use cases.
As you use OpenClaw over time, it accumulates more and more context and memory. If left unchecked, the token cost for the same question can grow from a few thousand to tens or even hundreds of thousands. This article focuses on tactics that deliver immediate results and introduces several simple, practical ways to reduce cost.
For everyday conversations, the fastest way to see results is to make good use of a few built-in slash commands. They are all very easy to use: just send them directly to OpenClaw in the chat box with no extra prefix.
/compact: Compress the Current Session ContextPurpose:
Ask OpenClaw to summarize and compress the current conversation history, keeping the key information while dropping unnecessary detail, so that token usage is significantly reduced in later turns.
When to use it:
How to use it:
Send this directly in the current chat window:
/compact
OpenClaw will try to condense the previous conversation into a shorter “summary memory,” and future responses will prioritize that summary, reducing the amount of context sent in each request.

/reset: Keep Memory, Reset the Current TopicPurpose:
Reset the short-term context of the current conversation while preserving long-term memory and global configuration. In other words: “start this conversation over, but keep the important things you already remembered for me.”
When to use it:
How to use it:
Send this directly in the current chat window:
/reset
After it runs:
MEMORY or other important saved records, will remain.
/new: Start a Brand-New ConversationPurpose:
Create a truly fresh session from scratch, similar to opening a new chat tab.
When to use it:
How to use it:
Send this directly in the current chat window:
/new
After execution, OpenClaw will talk to you as a brand-new session. In practice, this is usually more token-efficient than continuing to pile new questions onto a long-running thread.

When you start using OpenClaw for multiple tasks at once—such as writing documentation, coding, operations, and team management—and pile all of that into a single Agent’s brain, two obvious problems appear:
A more sensible approach is to split Agents the way you would organize a real team:
The benefits include:
If you want to implement the “Lark group → independent Agent” model, refer to this detailed tutorial, which already includes command-line examples and binding strategies: 👉 Tutorial on Integrating Custom Large Models with OpenClaw (Clawdbot)
In real projects, the best approach is often a combination: first split work across multiple Agents to cleanly separate responsibilities at the architectural level, then use /compact and /reset inside each Agent to control the context length of individual conversations.
memory-search Instead of “Infinite Conversations”Besides directly controlling context itself, there is a smarter way to reduce token usage: do not stuff everything into the same conversation. Instead, let the Agent learn to look things up.
OpenClaw provides a memory-search capability that allows the Agent to actively retrieve past memory when needed, instead of forcing all conversation history into the context every time. A common pattern is:
memory-search.This gives you several advantages:
memory-search is enabled by default in OpenClaw. What you need to do is build the habit of asking OpenClaw to remember important information after a complete round of discussion or after it finishes a task. You can simply tell it directly in the chat box.
These three approaches can be combined:
/compact, /reset, and /new in day-to-day work to control the length of the current conversation.memory-search for precise retrieval instead of brute-force context stuffing.When you combine these methods well, you will quickly notice two changes: responses become more stable with fewer off-topic drifts, and your monthly token bill stops climbing month after month.
Unlock advanced tips on Discord
Click to join the community
Note: After joining, you can get the latest plugin templates and deployment playbooks
Join WhatsApp / WeCom for dedicated technical support
| Channel | Scan / Click to join |
|---|---|
| WhatsApp Channel |
|
| WeCom (Enterprise WeChat) |
|
Learn more on the official page: Tencent Cloud OpenClaw