A reference architecture from Tencent Cloud International — production-ready in under an hour.
The "where does my AI assistant live" question is back on the agenda in 2026.
In 2024–2025 the default answer was "just use the vendor's web app." But in Q2 2026, three forces are pushing development teams toward a chat-first path:
Taking these three signals seriously, we built and open-sourced a 2026 reference architecture for a self-hosted Telegram AI assistant. It runs on Tencent Cloud Lighthouse with the open-source OpenClaw framework — and the cloud choice itself matters far more than your average "small VPS reference" post would suggest. Three traits unique to Lighthouse let this architecture run end-to-end without console busywork: from a blank cloud account to a production-grade team copilot in under an hour.
Most "1vCPU cloud reference" articles treat the cloud as a fungible commodity. For a 2026 Q2 Telegram AI agent, it isn't.
1. APAC region density. Telegram users span the globe; a bot's tail latency is bounded by its slowest hop — Telegram edge → your cloud instance → model API. At this price tier, Lighthouse offers 12 APAC regions (Hong Kong, Tokyo, Seoul, Singapore, Bangkok, Jakarta, Mumbai, and more). If your audience is mostly in APAC, picking the right region cuts p99 latency materially compared to falling back to US-East.
2. Bandwidth that fits AI bot workloads. Telegram AI bot egress comes from two things: streaming LLM responses and voice/file uploads. The monthly 1.5TB included with the Lighthouse 2vCPU / 4GB tier is calibrated for exactly this profile — generous for a single user, and not easy to blow through with small-team chat.
3. A pre-built application image makes "5 minutes from boot to SSH" real. The "30 lines of Python plus docker-compose up" promise only works when the cloud already ships Docker, ssh-keygen, and sane firewall defaults. Lighthouse's pre-built OpenClaw application image delivers exactly that: pick image → click create → in 1–2 minutes the OpenClaw web dashboard is reachable on port :3210. No VPC wiring. No reverse proxy setup. No apt install round-trip. This is the difference between "bot running in 5 minutes" and "I'll get to it Saturday."
Telegram's platform-level rate limits (1 msg/s per chat, 30 msg/s globally) become a bottleneck long before the hardware does.
Two components:
@BotFather, free, 2 minutes.:3210, multi-model routing across cloud APIs and local Ollama, and Lighthouse SkillHub. Light personal use can drop to 2vCPU / 2GB; multi-Skill teams should go to 4vCPU / 8GB.The stack itself is light. Telegram's rate limits will throttle you long before the box does.
Forward a PR link to your Telegram bot; before your coffee is done, you have an AI review plus Approve / Refine / Reject inline buttons — and the diff never leaves to a third-party SaaS.
Three things specific to the DevOps scenario:
:3210.@-mentioned to fire (no noise); destructive actions (closing a PR, silencing alerts, calling external APIs) require a second tap on an inline button; every Skill invocation produces a structured audit log piped to a private channel watched by the SRE/ops lead.In the Lighthouse console, choose a region close to your team and downstream services. Use the 2 vCPU / 4 GB price-performance tier. For image type, pick Application Image → OpenClaw.
The instance enters Running in 1–2 minutes. No apt install, no hand-written docker-compose, no nginx config.

Open the model configuration page and paste an API key — OpenAI's sk-xxxx, Anthropic Claude, Hy3 Preview, DeepSeek, Gemini, or any compatible endpoint. You can also point OpenClaw at Tencent Cloud TokenHub (a unified multi-model gateway) to call all the above providers through a single key. Send a test message in the chat panel to confirm the round trip.

In the OpenClaw dashboard, go to App Channels → Telegram. In Telegram, search @BotFather (the official account with the blue verified checkmark) → Start → send /newbot → set a display name → set a globally unique username ending in bot → copy the HTTP API Token that BotFather returns.
⚠️ Minimum OpenClaw version: 2026.2.3. Upgrade first if your image is older.

Paste the HTTP API Token into the OpenClaw dashboard.

From Lighthouse SkillHub, install Web Search, File Reader, and Scheduler. Hold off on API Caller for now. For each Skill, paste the "never echo more than 20 lines of code" rule into the system prompt; configure the multi-model router so prompts containing PII, internal repo paths, or proprietary algorithms route to local Ollama instead of a cloud LLM.

Add the bot to your team's Telegram group chat. Enable @-mention-only triggering.

Wire destructive actions through inline-button confirmation. Pipe every Skill invocation's structured audit log to a private channel for the ops lead.

For data persistence, follow OpenClaw's recommended 3-2-1 backup: three copies of the data, two media types (local + cloud), one off-site copy.


The full cron, docker-compose.backup.yml, and COS upload script are in the Techpedia tutorial — copy-paste ready.
| Stage | Time | Key dependency |
|---|---|---|
| Step 1–3 — basic forward-and-reply working | ≤ 15 min | Follow Techpedia verbatim |
| Step 4 — DevOps Skills config | 10–20 min | Custom system prompts |
| Step 5 — team permissions + backup | 10–15 min | cron + COS setup |
| Full production deployment | ≤ 1 hour | — |
OpenClaw + Telegram bot's main load isn't local inference — it's ingress and orchestration: pulling PR diffs, reading logs, forwarding webhook/Telegram messages, dispatching Skills, and calling model APIs. Across a 30-minute mixed load covering PR review, log triage, and /test, bottlenecks consistently appear in external model APIs, Git remotes, and Telegram rate limits — not in the Lighthouse instance itself.
For solo developers and small teams, the 2vCPU / 4GB tier is enough for daily review-only, log triage, and test triggering. If you turn on local Ollama inference, run concurrent multi-repo reviews, or use a heavyweight code-execution sandbox, move those tasks to a separate higher-memory instance.
| Component | Cost |
|---|---|
| Lighthouse 2 vCPU / 4 GB / 1.5 TB bandwidth | $4 / month |
| Cloudflare Tunnel | $0 |
| Telegram Bot API | $0 |
| Item | Lighthouse | AWS Lightsail | Hetzner Cloud | DigitalOcean |
|---|---|---|---|---|
| 2vCPU / 4GB monthly | $4 | $24 | $4.6 | $24 |
| Monthly bandwidth | 1.5 TB | 4 TB | 20 TB | 4 TB |
| APAC regions | 12 | 7 | 1 | 3 |
Hetzner ships more bandwidth — well-suited for egress-heavy workloads in Europe. Lightsail and DigitalOcean both bundle larger bandwidth too. Lighthouse's edge here isn't being the cheapest on any single line — it's the combination: APAC region density + a ready-to-use OpenClaw image + bandwidth that fits AI bot workloads, all at the same price tier. For a 2026 Q2 Telegram AI agent, the combination is what matters.
Bot messages traverse Telegram's servers; cloud LLM API calls go to the model provider. Teams with strict data-residency requirements should run local Ollama on a higher-spec Lighthouse instance and route through the dispatcher.
We recommend:
A DevOps copilot accumulates real value over weeks — refined prompts, a RAG index of internal docs, an audit log of every action taken. Step 5's 3-2-1 backup is how you protect that value.
If your team is already running a chat-first AI workflow on Lighthouse + OpenClaw, we'd love to hear from you — we're collecting case studies for upcoming deep-dive content. Reach out through any of the channels below.
Join the Tencent Cloud International developer community to submit a case study, swap notes with other builders, and unlock more advanced playbooks.
Unlock advanced tips on Discord
Dedicated Support
| Channel | Scan / Click to join |
|---|---|
| WhatsApp Channel | ![]() |
| WeCom (Enterprise WeChat) | ![]() |
— Tencent Cloud International Developer Team