Slots are lost
Dealers lose nearly a third of bookable slots to no-shows, mis-scheduling, and missed callbacks. Margin evaporates.
Work / Enterprise & Security / VSBS
No. 28 · In progress · Mobility · Research-gradeBooking your car for service is still a 1990s phone call. VSBS is the opposite. A LangGraph supervisor with a verifier on every tool call, a deterministic safety engine, a wellbeing score with ten peer-reviewed sub-scores, and a payment state machine that runs the same in sim and in production. Zero human operators in the loop. Currently in active development · CARLA-driven driving simulation work is blocked on a dedicated GPU host (10GB+ VRAM); local laptop hardware cannot carry the simulation load. If your organisation is willing to sponsor a GPU host for the simulation suite, or would prefer to engage me to integrate VSBS on your fleet platform, I would be glad to discuss the arrangement that suits you.
Act I · The Problem
Vehicle service is still synchronous, opaque, and built around the dealer's diary. Three problems compound, and customers absorb every one of them.
Dealers lose nearly a third of bookable slots to no-shows, mis-scheduling, and missed callbacks. Margin evaporates.
The owner repeats the symptom three times, on hold, on email, in person. The mechanic still gets the wrong context.
"Grinding when I brake" should never wait three days for an appointment. Today, it does. Nobody is checking.
Act II · The Promise
Type a symptom. The agent does the rest. Every step is auditable, deterministic where it must be, and shows its working.
The wizard captures intent in plain language. GET /v1/vin/:vin hits the real NHTSA vPIC database, free, no key.
A deterministic red-flag gate runs before the LLM. "Grinding when braking" returns severity HIGH and a "do not drive" message before any model speaks.
Each tool call gets verified by a cheap second model. Hallucinated tool calls fail the verifier and are dropped.
A composite score with ten peer-reviewed sub-scores ranks dispatch options by stress, trust, and transparency, not just distance.
If the vehicle is autonomous, a signed capability token authorises the journey. Post-quantum hybrid envelope: ML-KEM-768 + X25519. Time-bounded. Revocable.
order → intent → authorise → capture → settle. Identical wire flow in sim and in production. Idempotent. Webhook-driven.
Act III · The Composition
Any one of these is a research project. VSBS composes all six in a repo you can clone, build, and read end to end on a Sunday afternoon.
01 · Supervisor
Every tool call is checked by a cheap second model for grounding before the result is trusted. Hallucinated calls are dropped, not executed.
02 · Provider-agnostic
Run on Gemini, Vertex Claude, OpenAI, Anthropic direct, or a scripted sim driver. LLM_PROFILE=sim needs zero keys.
03 · Sensors
Deterministic simulator. Distinguishes sensor faults from actual events. Aligned with ISO 21448.
04 · Prognostics
Refuses autonomous operation when a tier-1 safety-critical sensor is dead. Fails closed, by design.
05 · Privacy
Append-only consent log with evidence hashes. 72-hour breach runbook checked into the repo. FRIA documented.
06 · Crypto
Cloud KMS ML-KEM-768 + X25519 for long-lived secrets. Today's threats and tomorrow's quantum, both covered.
Act IV · The Stack
Every line is in the open. pnpm install, pnpm -r typecheck, six packages clean, full repo build, agent loop running in under a minute. Driving-simulation suite (CARLA) is paused pending dedicated GPU infrastructure.
Act V · Proof
POST /v1/concierge/turn streams typed events: tool-call → verifier → tool-result → delta → final. Every arrow is real code you can read today.
Replay mode runs in CI without a CARLA install. Live mode drives a Tesla Model 3 simulation at 10 Hz: 8 cameras, 4D radar, LiDAR, thermal, IMU, V2X, 96-cell HV pack.
BEV view, perception event log, OOD score, UNECE R157 takeover ladder, MRM, capability budget, DMS gaze. Every channel a real production AV stack publishes off-vehicle.
DPIA, FRIA, gap-audit, defensive publication, simulation policy, safety notice. Every architectural decision traces to a paper or a published standard.
Honest about what it is: research-grade reference architecture. Not a certified safety-of-life system. The repo says so on line one.
Published as prior art so nobody can patent the loop and lock OEMs out. docs/defensive-publication.md.
I build agentic workflows that ground every tool call, fail closed on safety, cite their sources, and run on a slow phone. Every architectural decision has a paper or a standard behind it.