A WhatsApp AI Agent for Your Business

Our WhatsApp AI Agent is a private, self-hosted assistant that runs on its own server. It remembers conversations, handles scheduled tasks, reads receipts with OCR, integrates with Google Calendar and Gmail, generates and edits images, and responds in voice or text. Fully owned infrastructure, no monthly subscription, no third-party access to your chats.

In November 2025, Peter Steinberger launched Clawdbot on GitHub. Hermes Agent by Nous Research followed soon after. After using both, I decided to build my own ‘Claw’ agent harness from scratch, to cut bloat and unnecessary features and to improve the UX, UI, and security hardening.

Most businesses already run on WhatsApp. The assistant meets your team and your customers exactly where they are. It holds context across a conversation, addresses each person by name, and follows the tone and rules you set. Configure a global personality, then adjust it per contact or per group when the situation calls for something different. It feels like a person because it is designed to behave like one.
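As a rough illustration of the global-plus-override idea, here is a minimal Python sketch. The class name, field names, sample prompts, and the precedence order (contact first, then group, then global) are all assumptions made for the example, not the product's actual configuration format.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class PersonalityConfig:
    """Hypothetical settings store: one global prompt plus optional overrides."""
    global_prompt: str
    per_group: Dict[str, str] = field(default_factory=dict)
    per_contact: Dict[str, str] = field(default_factory=dict)

    def resolve(self, contact_id: str, group_id: Optional[str] = None) -> str:
        # Assumed precedence: contact override, then group override, then global.
        if contact_id in self.per_contact:
            return self.per_contact[contact_id]
        if group_id is not None and group_id in self.per_group:
            return self.per_group[group_id]
        return self.global_prompt


# Illustrative values only.
config = PersonalityConfig(
    global_prompt="Friendly and concise. Address people by name.",
    per_group={"sales-team": "Formal tone. Always quote prices with currency."},
    per_contact={"contact-001": "Casual tone. Keep replies short."},
)

print(config.resolve("contact-001", "sales-team"))  # contact override applies
print(config.resolve("contact-002", "sales-team"))  # group override applies
print(config.resolve("contact-002"))                # falls back to global
```

Resolving the personality per message, rather than storing one merged prompt, keeps each override easy to edit or remove without touching the global setting.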

Runs on a local LLM for full privacy.

Self-Improving Memory

The assistant pays attention. It picks up names, preferences, and details automatically during conversation and builds a profile for each contact over time. You can review, edit, or clear these records from the dashboard at any point. You also define a personality file that shapes how the assistant thinks, what it knows about your business, and how it represents you. The longer it runs, the more useful it becomes.

Knowledge Base

Upload your product guides, FAQs, price lists, or any internal documents, and the assistant will reference them accurately when answering questions. No manual retrieval, no copy-pasting. Add files from the dashboard or send them directly over WhatsApp. Your team stops repeating themselves. Your customers get answers faster.

Scheduling and Automation

Set up tasks that run without anyone managing them. Morning briefings, email summaries, calendar reminders, Drive folder alerts. Once configured, they simply run. Missed reminders catch up automatically on restart. Your assistant works the hours you set, including the ones you do not.
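One plausible way to implement catch-up on restart is to store each task's last run time and compare it with the clock at startup. The sketch below assumes exactly that; `ScheduledTask`, `run_catch_up`, and the choice to run an overdue task once rather than once per missed slot are illustrative assumptions, not a description of the actual scheduler.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Callable, List


@dataclass
class ScheduledTask:
    name: str
    interval: timedelta
    last_run: datetime


def overdue(tasks: List[ScheduledTask], now: datetime) -> List[ScheduledTask]:
    """Return tasks whose next run time passed while the assistant was offline."""
    return [t for t in tasks if t.last_run + t.interval <= now]


def run_catch_up(tasks: List[ScheduledTask], now: datetime,
                 run: Callable[[ScheduledTask], None]) -> List[str]:
    """Run each overdue task once and record the new run time."""
    ran = []
    for task in overdue(tasks, now):
        run(task)
        task.last_run = now
        ran.append(task.name)
    return ran


# Simulated restart at 09:30 after the machine was off over the weekend.
now = datetime(2025, 1, 6, 9, 30)
tasks = [
    ScheduledTask("morning-briefing", timedelta(days=1), datetime(2025, 1, 4, 8, 0)),
    ScheduledTask("email-summary", timedelta(hours=4), datetime(2025, 1, 6, 8, 0)),
]
ran = run_catch_up(tasks, now, run=lambda task: None)
print(ran)
```

Here only the morning briefing is overdue, so it runs once on startup; the email summary is not due until 12:00 and is left alone.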

Vision and Voice

Send a photo and the assistant will describe or analyse what it sees. Send a voice note and it will transcribe it and respond to what you said. Both run locally using models on your own machine. If your primary model does not support vision, the assistant routes image requests to a dedicated vision model and returns to the primary model afterwards, with no interruption to the conversation.
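The routing described above can be pictured as a per-message decision rather than a global switch. The following sketch assumes a simple `Model` record with a `supports_vision` flag; the model names and the lambda stand-ins for local inference are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass
class Model:
    name: str
    supports_vision: bool
    # Hypothetical callable standing in for a real local inference call.
    generate: Callable[[str, Optional[bytes]], str]


def pick_model(primary: Model, vision: Model, image: Optional[bytes]) -> Model:
    """Choose a model for one message; the primary model stays the default."""
    if image is not None and not primary.supports_vision:
        return vision
    return primary


def reply(primary: Model, vision: Model, text: str,
          image: Optional[bytes] = None) -> Tuple[str, str]:
    model = pick_model(primary, vision, image)
    return model.name, model.generate(text, image)


primary = Model("text-llm", False, lambda text, image: f"text reply to: {text}")
vision = Model("vision-llm", True, lambda text, image: f"image described for: {text}")

first = reply(primary, vision, "What is in this photo?", image=b"\x89PNG")
second = reply(primary, vision, "Thanks, now draft a caption.")
print(first[0], second[0])
```

Because the choice is made per message, the follow-up text message goes straight back to the primary model without any explicit "switch back" step.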

AI Image Generation

A quick note: the article you're reading is by Xavier Oon, Founder and CTO of MT Labs, where he oversees swarms of AI agents doing proactive and recursive engineering. He also leads Critica, a branding and motion design studio with over 20 years of work for Fortune 500 companies.

And now back to the article...

Generate images directly from WhatsApp using ComfyUI on your local hardware. No subscription, no usage caps, no images leaving your office. Describe what you need in plain language and it delivers. When you are happy with the result, ask the assistant to email it to whoever needs it.

For marketing teams, product teams, or anyone who needs visuals without the wait.

Receipt OCR

Photograph a receipt or invoice and the assistant extracts the expense data and writes it to a spreadsheet. A task that used to take minutes per receipt now takes seconds. For businesses handling high volumes of expenses, the time savings compound quickly.

Google Integration

Your calendar, inbox, and Drive, accessible from WhatsApp in plain language.
Create and update events, read and send emails, and get notified when new files arrive in a folder.
For business owners and teams who live between meetings and messages, it removes the friction of switching between apps.

Remote Terminal and Claude Code API

Run system commands on your PC and check on office IT infrastructure directly from WhatsApp, without being at a desk. The assistant can optionally connect to the OpenAI API or to Anthropic’s Claude Code to run parallel agents and build websites and apps while you are on the go.

By the time you are back at your desk, the work is done.

Teammates

Teammates are independent, email-driven AI assistants that work alongside your bot. Each one is a “person” with its own real email address, inbox credentials, AI model, personality, and contact list, so you can spin up specialists like a scheduling assistant, a research aide, or a customer-support persona.

Teammates remember each person they talk to, run their own scheduled tasks (like weekly digests), and share your Google Calendar and Drive, giving you a small AI team that handles email autonomously, around the clock.

Security

Admin commands are restricted to the owner. Allowlist-only access means unknown contacts cannot reach the assistant. It is designed to resist prompt injection attacks, so instructions hidden in a message are treated as content to read rather than commands to follow.
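The first two rules can be expressed as a small gate that every incoming message passes through before it reaches the model. This is a sketch under assumptions: the `AccessPolicy` class, the `/admin` prefix, and the sender identifiers are invented for illustration and do not reflect the real command syntax.

```python
from dataclasses import dataclass
from typing import FrozenSet


@dataclass(frozen=True)
class AccessPolicy:
    owner: str
    allowlist: FrozenSet[str]
    admin_prefix: str = "/admin"  # hypothetical command prefix

    def decide(self, sender: str, message: str) -> str:
        # Unknown senders are dropped before any model sees the message.
        if sender != self.owner and sender not in self.allowlist:
            return "ignore"
        # Admin commands are honoured only when they come from the owner.
        if message.startswith(self.admin_prefix):
            return "admin" if sender == self.owner else "deny"
        return "chat"


policy = AccessPolicy(owner="owner-001", allowlist=frozenset({"staff-001"}))

print(policy.decide("owner-001", "/admin restart"))  # admin
print(policy.decide("staff-001", "/admin restart"))  # deny
print(policy.decide("staff-001", "What is on my calendar?"))  # chat
print(policy.decide("stranger-001", "hello"))  # ignore
```

Checking the sender in plain code, outside the language model, means the decision does not depend on anything written in the message text.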

Conclusion

What makes this different from a standard chatbot is not any single feature. It is the combination of where it lives, what it remembers, and the fact that it runs entirely on your own hardware. No data leaving your office. No monthly API bill that scales with usage (unless you opt for the CLI coding feature). No dependency on someone else’s infrastructure.

Your team is already on WhatsApp. Your customers are already on WhatsApp. The assistant simply makes that space work harder for you, quietly and consistently, in the background.

A WhatsApp AI that knows your people, speaks in your voice, and gets more useful the longer it runs.

Get in touch and let’s figure out how it can help your business.

FAQ

Where is the data stored?

On the server. The memory, message history, images and OCR outputs all live on infrastructure you control. We do not route your chats through any third party.

Does it work with WhatsApp Business?

Yes.

What does the memory system actually do?

The agent maintains a self-improving memory file that records who you are, what matters to your team and how you prefer to work. Over time it learns your tone, frequent contacts, recurring tasks and project context. You can inspect and edit the memory file directly.

Can it run on a modest server?

We recommend a single modern PC running 24/7 with at least 32 GB of RAM and either an NVIDIA RTX 4070 with 12 GB of VRAM or a Radeon GPU with 16 GB of VRAM. Image generation and larger LLMs benefit most from the GPU. We handle the hardware part for you.

What is the setup timeline?

Standard deployment takes 3-5 working days once the hardware is ready. This includes Google Cloud integration and authentication, OCR setup, and memory tuning. Custom integrations extend the timeline.

What does it cost?

Pricing depends on user count, integrations and hardware specs. There is no per-message fee. Get in touch for a quote.