What Howell is
Howell is not a single chatbot. It is a layered local system for persistent memory and identity, grounded reasoning and verification, task continuity across sessions, safe writeback into project files, and clear operational health and receipts.
Its practical goal is durable cognition: the system should still make sense after a restart, after a long pause, and after a new instance boots. In other words, Howell is meant to preserve the thread of work with care, clarity, and continuity rather than simply generating a clever response in one moment.
The real runtime is a three-service stack: a daemon on port 7777 for continuity and local control, a cortex on port 7778 for reasoning and verdicts, and a cluster/writeback surface on port 7788 for constrained file updates. The persist root is the durable memory substrate that carries live state, logs, receipts, and knowledge artifacts across time.
It is best understood as a durable thinking environment, not a generic chat app.
How it works
At startup, start.sh resolves the persist root, exports local config paths, and launches the three services in order. Once the stack is up, status.sh is the health summary that tells you whether the runtime is coherent and alive. The actual work path is simple: the daemon owns continuity and queue state, the cortex adds grounded reasoning and verdicts, and the cluster/writeback path applies safe updates into the project environment.
There are three operating paths to keep distinct: the private local runtime, the public worker shell at howell.help, and the CF tunnel surface at brain.rlv.lol. The public pages are interfaces; the real continuities live in the daemon, cortex, cluster, and persist root.
- Daemon (7777) is the local control plane. It owns sessions, identity state, queue and task operations, notes and memory writes, search and status endpoints, and the bridge/MCP-facing behavior that makes the system feel continuous across time.
- Cortex (7778) is the reasoning layer. It owns model prompts, verification, verdicts, dream logic, research, and health/readiness judgments. This is the layer that tries to make the system think in a grounded way rather than merely generate text.
- Cluster / writeback (7788) is the constrained write gate. It owns authenticated write access to approved project files, rate limiting, backups, and the safer bridge between the local runtime and project artifacts.
- Persist root is the real memory substrate. Code files define behavior; the persist directory holds live memory, receipts, logs, KG data, and operational history that survive across sessions.
- Public surfaces include howell.help (worker-backed shell for chat/status/freshness/cockpit-lite) and brain.rlv.lol (CF Tunnel into the daemon with an allowlist and key gate). These are windows onto the system, not the whole system itself.
What it feels like
It feels calm, procedural, and evidence-aware. It is useful when you want a system that can remember, confirm, and surface what matters without turning every interaction into noise. It behaves more like a careful operator and notebook than a polished consumer product.