Lobs AI | Standalone Agent Runtime

The Origin

One dev.
An entire team of agents.

What started as a structured task framework became a fully standalone agent runtime. Eight versions, each driven by a hard production ceiling.

Built by Rafe because it is genuinely useful and interesting to build. The goal was never just an AI assistant. It was a workforce: specialized agents that run autonomously, pick up tasks, and deliver results around the clock.

Eight versions. Each one forced by a hard ceiling hit in production. Every limitation became the next version's motivation.

Custom Task Framework, Jan 2026

Single agent inside OpenClaw — a powerful AI coding tool — with a custom task framework for managing work. OpenClaw was the launchpad: a capable, well-designed platform that made it possible to start building immediately. Gave structure to AI work beyond raw chat prompts.

Why it wasn't enough

One agent could not handle complex multi-step work. There was no parallelism. The ceiling appeared fast. As soon as more than one thing needed to happen at once, progress stalled.

Multi-Agent System, Early Feb 2026

Multiple specialized agents: Programmer, Writer, Researcher, Reviewer, Architect. Agent spawning and task routing made parallel work possible.

Why it broke down

There was no memory between sessions. Agents repeated the same mistakes. Every session started cold, and lessons disappeared as soon as the session ended.

Reflections & Learning, Mid Feb 2026

An agent reflection system captured completed work. Shared memory persisted lessons across sessions. Agents improved over time instead of starting over.

Why it wasn't enough

Still running as scripts and manual orchestration. No proper server or API. Couldn't scale coordination or run reliably in the background without babysitting.

Web Server Era, Late Feb 2026

lobs-server: FastAPI backend with REST API, task queue, worker management, health monitoring. Mission Control (SwiftUI macOS) and Lobs Mobile (iOS) joined the ecosystem.

Why it broke down

38K lines of Python. Task execution relied on linear if/else chains with no branching, rollback, or complex workflows. The codebase became genuinely hard to maintain.

DAG Workflow Engine, Late Feb 2026

Node-based DAG workflow system. State machines, conditional branching, rollback, event-driven triggers, cron scheduling. 20 workflow definitions shipped.

Why it needed replacing

The Python codebase was massive and fragile. Running a separate server process added architectural complexity — that was a consequence of the design, not a flaw in OpenClaw. The glue layer was heavier than the logic it connected.

OpenClaw Plugin, Mar 2026

Full rewrite as an OpenClaw TypeScript plugin. Ran inside OpenClaw — a great host — with no separate server process. Everything from v1–v5 preserved but dramatically cleaner. The plugin phase was the right consolidation step.

The breakthrough

The glue layer disappeared. The system became a single process. Circuit breaker, five-tier model chooser, and budget guard were first-class parts of the runtime. That rewrite replaced a much larger Python stack with a leaner TypeScript core.

38,000 lines Python → leaner TS core one process

Nexus Dashboard & Custom Domain, Mar 2026

Nexus: React + Vite web dashboard replacing the SwiftUI macOS app. Self-hosted at lobslab.com with Caddy auto-TLS + Cloudflare Tunnel. The system got a real home on the web.

The breakthrough

Cross-platform access from any device. Public site at lobslab.com, private Nexus dashboard over Tailscale. Workers autonomously fix UI bugs. The system builds its own interface.

Standalone Runtime (lobs-core), Mar 2026 [CURRENT]

Lobs graduates to its own standalone runtime. lobs-core now runs as its own Node.js process with a custom LLM execution loop, built-in HTTP server, Discord bot, memory supervision, and an integrated dashboard. The custom runtime exists because building agent systems is the career path I'm on — first-semester CSE MS at the University of Michigan, GSI for EECS 281 and 291, and building this to deeply understand every layer. OpenClaw was the launchpad that made it all possible.

The breakthrough

No external platform dependency remains in the runtime path. lobs-core calls provider APIs directly, manages its own workers, serves Nexus from its own HTTP server, runs lobs-memory under supervision, and starts from a native CLI. One lobs start command boots the system.

born in OpenClaw → fully custom runtime no host dependency

Live System

Live Pulse

Real telemetry from the production system. Updated daily. No synthetic data. This is real.

Fetching telemetry…

Sub-Projects

The Building Blocks

Interlocking projects that make up the Lobs AI ecosystem. Current owned-code footprint by cloc: lobs-core 106k, companion 20k, lobs-nexus 22k, agentic 9k, lobs-mobile 5k, jot 5k, lobs-memory 4k, lobs-vim 3k, lobs-sentinel 2k, lobs-voice 1k, website 5k.

lobs-core

The standalone agent runtime. TypeScript monorepo that runs as its own Node.js process. Orchestrator, agent runner, context engine, workflow system, HTTP server, Discord bot, voice pipeline, and live meeting transcription — all in one. Current owned source footprint: about 106k lines of code by cloc.

TypeScriptSQLiteNode.js106k LOCCurrent

Workflow Engine

DAG execution with state machines, conditional branching, rollback, event-driven triggers, and cron scheduling.

Model Chooser

Five-tier cost-aware model selection with per-agent fallback chains. The cheapest model that can handle the task wins.

Circuit Breaker

Tracks worker failure rates, quarantines bad actors, and prevents cascading failures.

Task Database

SQLite via drizzle-orm. Full task history, worker run records, workflow state, and research memos.

CLI

Native CLI

lobs init, lobs start, lobs status, and lobs logs manage the system without a host platform.

API

Direct Provider Calls

The runtime calls Anthropic, OpenRouter, LM Studio, and other OpenAI-compatible endpoints directly through its own runner.

MEM

Memory Supervision

lobs-memory runs under lobs-core supervision, so semantic search is part of the runtime instead of an external bolt-on.

Post-Success Validation

Artifact validation and selective review triggers catch phantom completions before bad output gets marked done.

Dynamic Tools

Agents can create, edit, and delete their own tools at runtime — shell scripts, TypeScript modules, or procedural steps that persist across sessions.

Self-Learning Loop

Corrections and preferences are captured in persistent memory automatically, so the same mistake never happens twice.

lobs-nexus

Nexus Dashboard

React + Vite web dashboard served directly by lobs-core. Real-time monitoring of agents, tasks, workflow runs, and system health. Includes agent chat, live meeting transcription with AI activity feed, and brain dump task capture. Current owned frontend footprint: about 22k lines of code by cloc.

ReactViteTailwind22k LOCCurrent

Agent Status Monitoring

View all six agent types in idle, running, or error state, with worker history and success rates.

Workflow Visualization

Active workflow steps, completed nodes, and pending branches are visible in the dashboard.

Polling-Based Updates

Periodic polling tracks task state changes, worker completions, and system alerts. WebSocket support is planned.

Task Inspector

Inspect notes, agent assignment, model tier, worker run log, output, and timing in one view.

Live Meeting Transcription

Record meetings from the browser with real-time transcription and an AI activity feed that extracts action items, decisions, flags, and open questions as the meeting happens.

lobs-vim

Neovim Plugin

AI coding agent for Neovim. Reasoning runs on lobs-core, tools execute locally in your editor. File editing, shell commands, search — all with an approval system for dangerous operations. About 500 lines across Lua and TypeScript.

LuaTypeScriptNeovim3k LOCCurrent

Local Tool Execution

The LLM reasons on the server; file edits, shell commands, and searches run locally in Neovim. No round-trip for tool execution.

Exec Approval System

Dangerous commands require explicit approval. Safe operations (reads, searches) run automatically. Configurable trust levels.

CTX

Project Context Injection

Automatically sends project context on new sessions so the agent understands the codebase from the first message.

WebSocket Streaming

Real-time streaming of agent responses and tool calls over WebSocket. Cookie-based auth ties into lobs-core's session system.

lobs-sentinel

Sentinel Agents

Persistent single-purpose AI agents for GitHub. Each sentinel runs in Docker, watches repos, and performs one job: reviewing PRs, triaging issues, or auto-labeling. Multi-provider LLM support (Anthropic + OpenAI). About 2k lines of TypeScript.

TypeScriptDockerGitHub API2k LOCCurrent

PR Reviewer

Reviews pull requests with full context — diffs, comments, commit history. Posts code review comments and approve/request-changes decisions.

Auto-Labeler

Classifies issues and PRs by content and applies labels automatically. Keeps repos organized without manual triage.

Issue Triage

Categorizes new issues, assigns priority, and asks clarifying questions. First-responder for incoming work.

Multi-Provider LLM

Supports Anthropic and OpenAI models. Per-deployment config lets you pick the right model for the task and budget.

lobs-voice

Voice Pipeline

Local STT + TTS sidecar services for Discord voice integration. Whisper.cpp handles speech-to-text, Chatterbox handles text-to-speech. The voice module in lobs-core bridges Discord voice channels to the agent pipeline. About 1k lines of Python and shell scripts.

PythonWhisper.cppChatterbox TTS1k LOCCurrent

STT

Speech-to-Text

Whisper.cpp with CoreML acceleration on Apple Silicon. Converts Discord voice input to text for agent processing.

TTS

Text-to-Speech

Chatterbox TTS server generates natural speech from agent responses. Streams audio back into Discord voice channels.

Voice Bridge

The voice module in lobs-core connects Discord voice channels to the STT → Claude → TTS pipeline. Fully local, no cloud speech APIs.

OpenAI Realtime

Alternative speech-to-speech bridge using OpenAI's Realtime API for lower-latency conversational voice interaction.

iOS

lobs-mobile

Lobs Mobile

iOS companion app for staying connected to your agents on the go. Check task status, receive completions, and interact with Lobs from your phone.

SwiftiOSPush notifications5k LOCCurrent

iOS Push Notifications

Get notified when blockers arise, urgent tasks complete, or the system needs attention without checking Discord.

Task Creation On The Go

Create tasks from anywhere. The orchestrator picks them up within ten seconds.

Task Queue View

Browse active, completed, and blocked tasks. Filter by agent type or model tier, and see what is running now.

Chat Interface

Message Lobs directly from the app. Same coordinator, same routing, just on mobile instead of Discord.

squad

Squad

Open-source multi-agent orchestration platform. Gateway-centric, connector-based, Docker-first. Built to replace brittle script-based orchestration with a proper abstraction layer for running coordinated multi-agent workflows in production.

TypeScriptDockerGatewayCurrent

Gateway Architecture

Central gateway handles agent registration, message routing, and session management. All agents communicate through the gateway — no direct peer connections.

Connector Model

Pluggable connectors for Discord, Slack, web, CLI. Add new platforms without touching core logic.

Docker-First

Each agent runs in its own container. Isolation, reproducibility, and easy deployment with docker-compose.

Workflow Orchestration

DAG-based workflow definition with state management, retries, and rollback support.

lobs-memory

Memory Search

Hybrid search server for agent memory. Combines BM25 keyword search with vector embeddings for semantic retrieval across all agent context, learnings, and project documentation. Runs on port 7420 under lobs-core supervision. About 3k lines of TypeScript and Python.

TypeScriptPythonEmbeddings3k LOCCurrent

Hybrid Search

BM25 keyword matching combined with dense vector search. Best of both worlds for finding relevant context across hundreds of memory files.

Embeddings

Local embedding generation for all indexed content. No external API calls needed for search — everything runs on the same machine.

Supervised Process

Runs under lobs-core process supervision with automatic restart on failure. Health checks ensure search is always available to agents.

Multi-Collection Index

Indexes workspace files, shared memory, daily logs, project docs, and ADRs as separate searchable collections.

lobs-shared-memory

Shared Memory

A Git-backed cross-project knowledge base. ADRs, research memos, runbooks, and agent instructions that persist across sessions and sync across the entire system. The supporting lobs-memory search service provides hybrid BM25 + vector search over the knowledge base.

GitMarkdownADRsRunbooks

ADR

Architecture Decision Records

Every major technical decision is recorded as an ADR: what was decided, why, and what alternatives were rejected. Agents read these before acting.

Git-Backed Sync

All memory files live in a git repo. Changes are committed and synced. Workers can read, agents can write, and everything is auditable.

Research Memos

Researcher agents write structured memos that persist across sessions. When a similar question returns, the answer is already there.

Agent Runbooks

Operational playbooks for each agent type explain edge cases, preferred tools, and failure modes. They keep evolving with the system.

lobslab-apps

LobsLab Apps

Web apps and mobile projects hosted on lobslab.com. Crapuler (UMich course watchlist), Ballz (physics toy), Stellar Siege (cosmic RTS), Grandmas Stories (family story recording), Over the Horizon (iOS AR). Each app runs in Docker with Caddy reverse proxy.

JavaScriptSwiftDockerCaddyCurrent

Crapuler

UMich course section watchlist. Monitors registration status and alerts when seats open. Backed by 50k lines of course catalog JSON data.

Ballz

Physics sandbox with 10k lines of vanilla JavaScript. Bouncing balls with collision, gravity, and interactive controls.

Stellar Siege

Cosmic real-time strategy game. Conquer star systems, build fleets, and manage resources across a procedurally generated galaxy.

Grandma's Stories

iOS app for recording and sharing family stories. Voice recording, tagging, and a beautiful timeline for replaying memories.

Over the Horizon

iOS AR app that overlays location names on the live camera feed. Point your camera at a landmark and see its name floating in 3D space above it.

Docker + Caddy Stack

Each app containerized independently. Caddy handles automatic TLS and subdomain routing. Add a new app with a Dockerfile and one compose entry.

Course

EECS 498-016

Applied Agentic Software Engineering — hands-on course building autonomous AI agents that do real work. Covers agent architectures, tool use, memory systems, multi-agent orchestration, and deployment. Uses OpenClaw as the runtime. Co-developed with Marcus Darden at the University of Michigan.

University of MichiganEECSOpenClaweecs498-aase.github.io ↗

Agent Architectures

Designing and implementing autonomous agents: reasoning loops, tool use, and goal-directed behavior.

Tool Use & Memory Systems

Building agents that use tools effectively and maintain persistent memory across sessions.

Multi-Agent Orchestration

Coordinating multiple agents to work in parallel on complex, multi-step tasks.

Deployment & Runtime

Taking agents from prototype to production using OpenClaw as the runtime platform.

agentic

Agentic Toolkit

Five composable TypeScript packages extracted from lobs-core: multi-provider LLM client, tool executor with safety validation, hybrid memory search, agent runner loop, and config management. About 9k lines, all battle-tested in production before becoming a library.

TypeScriptESM9k LOCOpen SourceCurrent

LLM

Multi-Provider Client

Provider-prefixed model strings route to Anthropic, OpenAI, OpenRouter, or LM Studio. Built-in retry, fallback chains, key rotation, and sticky session keys for prompt cache hits.

9 Built-in Tools

read, write, edit, exec, grep, glob, ls, code-search, find-files. Each has an Anthropic-compatible input schema. Pluggable registry — add tools by name without touching the runner.

Safety Validation

mtime-based staleness detection on edits, binary file rejection, blocked device paths, and quote normalization. Every check exists because something went wrong without it.

MEM

Hybrid Memory

SQLite + file-backed memory with vector and keyword search. LLM-powered fact extraction and session continuity across runs.

Runner Loop

Think/act/observe loop with hook system for observability, loop detection, and context compaction. Same pattern used in lobs-core and Companion.

companion

Lobs Companion

Local-first macOS AI collaborator built with Tauri + React + Node.js. Permanently remembers your projects, preferences, and past sessions through three memory layers. Edits files alongside you in real time. No server, no cloud, no latency. About 20k lines of TypeScript.

TauriReactTypeScriptmacOS20k LOCCurrent

MEM

Three Memory Layers

Session memory (active transcript), project memory (per-repo context from AGENTS.md and git state), and permanent memory (facts extracted at session end). Context accumulates over time without re-explaining.

Unix Socket Bridge

Tauri and the Node.js runtime talk over a Unix domain socket at ~/.lobs-companion/run/agent.sock via newline-delimited JSON-RPC 2.0. Clean bidirectional channel, no HTTP overhead.

Native macOS UI

Dark glass morphism design, SF Pro typography, SF Symbols, spring physics on panel transitions. Three panels: project context, conversation, and live tool output. Keyboard-first navigation.

Bidirectional Editing

Both I and the agent edit files on disk in real time. mtime staleness detection prevents silent conflicts when both sides touch the same file.

Permission Guards

Tool calls that write, delete, or exec shell commands surface a confirmation in the UI. Reads and searches run automatically. Configurable trust levels per tool category.

jot

Jot

Local AI note-taking CLI. Capture notes in plain English and they save to SQLite instantly. A background worker sends each note to a local model (LM Studio or Ollama) for async enrichment: tags, action items, linked notes, urgency flags. All offline, all private. About 5k lines of TypeScript.

TypeScriptSQLiteLM StudioOllama5k LOCCurrent

Instant Capture, Async Analysis

Notes land in SQLite immediately. A detached background worker handles enrichment — tags, action items, linked notes — without blocking capture. Zero AI latency on add.

Local Models Only

LM Studio or Ollama handle all AI analysis. No API keys, no per-query cost, no data leaving the machine. Local models are reliable for structured extraction tasks.

Three-Tier Daemon

Tier 1 (every 15 min): analyze notes, check todos. Tier 2 (7:30am daily): morning digest to Discord or terminal. Tier 3 (event-driven): overdue todos and urgent note flags.

Learning File

All prompts inject ~/.jot/user.md — a two-section file with a manual profile and an auto-maintained section the model appends to over time. Tagging accuracy improves as context accumulates.

Structured Search

Full-text search across raw note content and extracted fields. Summarize action items and tag frequency across the full database with a single command.

API

lobs-server, Legacy / Retired

lobs-server

The FastAPI backend that powered v4 and v5: REST API, task queue, worker management, and health monitoring. Replaced by lobs-core. 38K lines of Python that taught us everything.

PythonFastAPISQLiteRetired

API

REST API

Full task management API to create, read, update, and delete tasks and projects. It backed Mission Control and Lobs Mobile before the standalone era.

Python Orchestrator

Multi-agent spawning with aiohttp. It taught us the cost of race conditions, session leaks, and if/else workflow logic at scale.

Health Monitoring

Worker status tracking, failure rates, and uptime metrics. Those patterns directly influenced lobs-core's circuit breaker design.

Why It Was Retired

The Python stack grew large and hard to maintain. The glue was heavier than the logic it connected. lobs-core replaced it completely.

Changelog

Build Timeline

Eight versions. Three months. A system that actually runs in production, every day.

Jan 2026

v1, Custom Task Framework

Lobs comes online. A custom task framework gives structure to AI work beyond raw chat prompts. Single agent, single session, but a real foundation.

v1 · task framework

Early Feb

v2, Multi-Agent System

Multiple specialized agents: Programmer, Writer, Researcher, Reviewer, Architect. Agent spawning and task routing. Parallel work becomes possible for the first time.

v2 · multi-agent

Mid Feb

v3, Reflections & Learning

Workers reflect on completed tasks. Lessons get written to shared memory. The system starts getting smarter over time, not just faster, without manual updates.

v3 · learning

Late Feb

v4, Web Server Era

lobs-server FastAPI backend deployed. Mission Control (SwiftUI macOS) and Lobs Mobile (iOS) join the ecosystem. Real infrastructure: APIs, monitoring, and mobile access.

v4 · server + apps

Late Feb

v5, DAG Workflow Engine

Multi-step tasks now run as structured workflows with state, branching, and rollback. 20 workflow definitions covering code, research, review, reporting. Complex work becomes a pipeline, not a prompt.

v5 · DAG workflows

Mar 2026

v6, OpenClaw Plugin Rewrite (TypeScript)

The big one. Full rewrite as an OpenClaw TypeScript plugin — and a great move. Running inside OpenClaw as a first-class plugin eliminated the separate server entirely. Everything from v1-v5 stayed intact, but cleaner. The glue layer disappeared, and the system became dramatically smaller than the Python stack it replaced.

v6 · OpenClaw plugin

Mar 2026

v7, Nexus Dashboard & Custom Domain

React + Vite web dashboard replaces SwiftUI macOS app. Self-hosted at lobslab.com with Caddy auto-TLS and Cloudflare Tunnel. Private Nexus dashboard over Tailscale. Workers autonomously implement UI fixes.

v7 · Nexus dashboard

Mar 2026

v8, Standalone Runtime (lobs-core)

Lobs graduates to its own custom runtime. lobs-core runs as its own Node.js process with custom LLM execution, built-in HTTP server, Discord bot, integrated Nexus dashboard, and supervised memory server. Built from scratch to deeply understand every layer — from LLM calls to tool orchestration to memory management — and to be ready to teach it. One lobs start boots the whole system.

v8 · standalone runtime

Mar 13

Standalone Milestone, Self-Contained v2

The architecture shifted from plugin dependency to self-contained runtime. Direct provider calls, a native Discord service, browser-based web tools, and a slimmer orchestrator became the plan of record.

standalone milestone · ADR-010

Mid Mar

Standalone Milestone, Native CLI and Process Control

lobs init, lobs start, lobs stop, lobs status, and lobs logs gave the system its own operational surface. No host app needed to boot, inspect, or manage the runtime.

standalone milestone · native ops

Mid Mar

Standalone Milestone, Runtime Owns the Stack

lobs-core now starts its own HTTP server, serves Nexus directly, supervises lobs-memory, runs its own Discord bot, and validates worker output after completion. The runtime is fully custom — a deliberate choice to understand every layer, not a rejection of OpenClaw.

standalone milestone · fully custom

Late Mar

Voice Pipeline & Live Meeting Transcription

Lobs joins Discord voice channels. Local Whisper.cpp for STT, Chatterbox for TTS, with the voice module in lobs-core bridging it all to the agent pipeline. Live meeting transcription with speaker detection followed — real-time notes from any voice call.

voice · meetings · STT/TTS

Early Apr

Dynamic Tools, Self-Learning & Memory Refactor

Agents can now create their own tools at runtime — shell scripts, TypeScript, or procedural steps that persist across sessions. A self-learning loop captures corrections into permanent memory. The memory system got a major refactor: SessionWatcher removed, compaction-triggered reflection, and 75 pre-existing test failures fixed in one push.

dynamic tools · self-learning · you are here

Field Notes

Wins & Hard-Won Lessons

Real systems break in real ways. Every incident below happened in production. None were fun. All were instructive.

WM Wins & Milestones

Multi-Agent Parallelism

Multiple specialized agents work on different tasks at the same time. Programmer codes, Researcher investigates, Writer documents. The ceiling on what one person can accomplish expanded dramatically.

Agent Reflection & Learning

Workers reflect on completed tasks, capture lessons, and feed back into shared memory. The system gets meaningfully better week-over-week without manual updates. Mistakes stop repeating.

Workflow Engine as Control Plane

Multi-step tasks now run as structured DAG workflows with state, branching, and rollback, not one-shot prompts. That made work possible that previously required constant babysitting.

Model Tier System

Tasks automatically route to the cheapest model that can handle them. First week of operation: 60% of tasks handled by the free local model. Cost stays manageable at scale.

Standalone Runtime Win

Going standalone removed an entire class of problems: platform dependency, IPC overhead, session handshake failures, sync issues, and deployment complexity. One process. No glue.

Fully Custom Runtime

The current system owns its process model, CLI, HTTP server, Discord integration, provider calls, memory supervision, and worker lifecycle. What started inside OpenClaw is now a ground-up custom runtime — built to teach, built to learn, built to last.

LS Losses & War Stories

The Restart Loop Incident HIGH

Workers edited plugin source code, then called gateway restart. The restart spawned fresh workers. Who picked up active tasks. Who edited source and called restart. Infinite loop. The system was restarting itself every 30 seconds for 20 minutes before it was caught.

What we learned

Gateway restart is now hard-denied in all worker agent tool configs. Workers can read source but cannot modify it. Defense in depth over clever permission systems.

Heartbeat Spam MEDIUM

Background exec commands in workers triggered event-driven heartbeats on completion, flooding the main session with "Exec completed" noise. At peak, 40+ heartbeat messages arrived in 10 minutes. Actual notifications disappeared in the noise.

What we learned

Banned all background exec patterns (&, nohup, sleep &&) in worker configs. Direct, synchronous commands only. The AGENTS.md now has a whole section on this.

JSON Double-Conversion Bug MEDIUM

Swift's .convertFromSnakeCase was set on the decoder. Then manual CodingKeys were added for the same mapping. The decoder applied both. Fields silently dropped. No errors, just missing data. It took two sessions to diagnose.

What we learned

Never add CodingKeys for simple snake→camel conversions when .convertFromSnakeCase is active. Now documented as permanent institutional knowledge in TOOLS.md.

Session Validation Errors LOW

"No tool call found" errors came from workers that completed their task without making any tool calls. Session validation expected at least one tool call, so workers answering purely from context failed and got retried, wasting tokens.

What we learned

Stricter session validation distinguishing "no tools needed" from "something went wrong." Simple tasks that don't need tools are now explicitly tagged at creation time.

Reflection Spam LOW

Agent reflection was too eager. Workers reflected after every minor action: reading a file, running a test, making a small edit. Shared memory grew by hundreds of low-quality entries in a single day. Signal got buried in noise.

What we learned

Reflection now gates behind strict completion criteria and rate limits. Only significant task completions trigger reflection. Quality over volume, every time.

The open-source platform for multi-agentorchestration.

One dev.An entire team of agents.

Custom Task Framework, Jan 2026

Multi-Agent System, Early Feb 2026

Reflections & Learning, Mid Feb 2026

Web Server Era, Late Feb 2026

DAG Workflow Engine, Late Feb 2026

OpenClaw Plugin, Mar 2026

Nexus Dashboard & Custom Domain, Mar 2026

Standalone Runtime (lobs-core), Mar 2026 [CURRENT]

The Stats

Live Pulse

How It All Fits Together

Squad Under the Hood

Message arrives via a connector

Gateway routes to the right agent

Task primitives execute in parallel

LLM adapter routes to your provider

Plugins extend capability

Results route back through the gateway

The Building Blocks

lobs-core

Workflow Engine

Model Chooser

Circuit Breaker

Task Database

Native CLI

Direct Provider Calls

Memory Supervision

Post-Success Validation

Dynamic Tools

Self-Learning Loop

Nexus Dashboard

Agent Status Monitoring

Workflow Visualization

Polling-Based Updates

Task Inspector

Live Meeting Transcription

Neovim Plugin

Local Tool Execution

Exec Approval System

Project Context Injection

WebSocket Streaming

Sentinel Agents

PR Reviewer

Auto-Labeler

Issue Triage

Multi-Provider LLM

Voice Pipeline

Speech-to-Text

Text-to-Speech

Voice Bridge

OpenAI Realtime

Lobs Mobile

iOS Push Notifications

Task Creation On The Go

Task Queue View

Chat Interface

Squad

Gateway Architecture

Connector Model

Docker-First

Workflow Orchestration

Memory Search

Hybrid Search

Embeddings

Supervised Process

Multi-Collection Index

Shared Memory

Architecture Decision Records

Git-Backed Sync

Research Memos

Agent Runbooks

LobsLab Apps

Crapuler

Ballz

Stellar Siege

Grandma's Stories

Over the Horizon

Docker + Caddy Stack

The open-source platform
for multi-agent
orchestration.

One dev.
An entire team of agents.