Ambient AI Agent

Hawk

An ambient AI agent that sees what you see, remembers your day, and acts — live across your glasses, your phone, and a powerful backend brain.

Hawk wears your world: an iOS frontend on iPhone and Ray-Ban Meta glasses is the eyes and ears, while a long-lived backend agent does the thinking and the acting. One system, always circling — and six concrete flows it already runs.

One system, two halves

Eyes on your world, a brain behind them

Everything Hawk does comes back to three things — what it wears, what it runs, and what it can already do.

Worn on your world

The Hawk iOS app turns your iPhone and Ray-Ban Meta glasses into a first-person camera and microphone. Live conversation, visual memory, a multi-panel carousel, and a lock-screen widget — the eyes and ears you carry into the room.

iOS frontend

A brain that never sleeps

The Hawk gateway is a long-lived process running a powerful coding-agent core. Agent loop, sessions, hybrid memory, heartbeat, cron, MCP tools, and sub-agents — always on, always watching, ready to act.

Backend agent

Six flows, one agent

Not a promise deck — real flows. From whispering names across a noisy room to shipping a feature you asked for out loud, Hawk runs six concrete demos today, with more circling in.

Multiple demo flows

Six flows, one agent

What Hawk actually does

Six flows from the same agent: three live, two in progress, one in preview. Five are real recordings. The last is wired and circling in.

Cocktail Meeting

Live

In a noisy party or meeting, Hawk listens, picks out who's talking, and quietly briefs you — names, context, and what to say next.

Safety

Live

Hawk watches your surroundings and warns you about hazards in real time, before they reach you.

Reminder

In progress

Hawk notices the commitments you make in conversation and surfaces the right reminder at the right moment.

Silent

In progress

A hands-free, no-audio mode. Hawk reads the scene and assists without speaking — a glance at your glasses display or lock screen is all it takes.

Coding

Live

Ask Hawk out loud to build a feature. The backend agent writes and ships the code while you keep moving.

Visual Memory

Preview

"What did I do yesterday?" "What did I just do?" Hawk answers from a searchable visual record of your day — down to "how many times did my dog appear in the last 10 minutes?"

Capabilities

Everything under one agent

Perception, memory, and action — the pieces that let Hawk see, remember, and strike.

Glasses support

Ray-Ban Meta glasses become a live input to the agent — first-person camera and microphone, streaming what you see and hear. Hawk's vision is whatever yours is pointed at, hands free.

Visual Memory

Hawk keeps a searchable record of what you saw and did. Ask in plain language or query the video directly — "what did I just do," "where did I leave it," "how often did that appear" — and get an answer from your own day.

Memory distillation

Each realtime conversation is written to a session log, and a single Haiku call then consolidates that session log into a daily log. At the start of each realtime session, Hawk's prompt is rebuilt from soul, identity, global memory, and daily memory, so it always wakes up knowing who it is and what just happened.
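
As a rough illustration of the rebuild step, the sketch below assembles a session prompt from the four sources named above. The `MemorySources` shape, the section labels, and `buildSessionPrompt` are hypothetical names chosen for this example, not Hawk's actual API. The Haiku consolidation call is left out because its interface is not described here.

```typescript
// Hypothetical shape: the four prompt sources named in the text.
interface MemorySources {
  soul: string;         // prompt-defined character
  identity: string;     // who the agent is
  globalMemory: string; // long-lived memory
  dailyMemory: string;  // consolidated daily log
}

// Assemble the prompt in a fixed order, skipping sections that are empty.
function buildSessionPrompt(sources: MemorySources): string {
  const sections: Array<[string, string]> = [
    ["Soul", sources.soul],
    ["Identity", sources.identity],
    ["Global memory", sources.globalMemory],
    ["Daily memory", sources.dailyMemory],
  ];
  return sections
    .filter(([, body]) => body.trim().length > 0)
    .map(([title, body]) => `## ${title}\n${body.trim()}`)
    .join("\n\n");
}

// Illustrative values only.
const sessionPrompt = buildSessionPrompt({
  soul: "Observant, brief, and calm.",
  identity: "You are Hawk, an ambient agent.",
  globalMemory: "The wearer prefers short answers.",
  dailyMemory: "Promised to send the slides after the 10:00 meeting.",
});
console.log(sessionPrompt);
```

Rebuilding from plain text sources at the start of every session keeps the agent stateless between sessions: whatever persists lives in the memory files rather than in the conversation itself.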

Interactivity

Four modes for whatever the moment needs: meeting and cocktail briefings, real-time safety, in-the-moment reminders, and a silent, no-audio mode. Hawk shifts how it helps without you breaking stride.

Agentic

Speak a request and the backend agent goes and does it — writing and shipping a real feature while you carry on. Hawk isn't a chat window; it's an agent with hands.

Multi-stream

Phone mic, glasses camera, and more: Hawk ingests multiple input streams concurrently and routes them to the agent. Many eyes, many ears, one decision loop.

Personality

A prompt-defined soul plus an identity give Hawk a consistent character across every session. It doesn't reset between conversations — it stays itself.

Architecture

Thin clients, one relentless brain

Your glasses and phone stream everything you see and hear to the Hawk gateway as multiple concurrent streams over a single WebSocket, tunneled through Tailscale. There the agent loop, memory, and background services do the thinking, driving a pluggable runtime: Hermes, Codex, or Claude Code.
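
One way to carry several streams over a single WebSocket is to prefix each chunk with a small binary header. Hawk's actual wire format is not described here, so the sketch below is an assumption throughout: the stream ids, the 5-byte header layout, and the names `encodeFrame`, `decodeFrame`, and `StreamRouter` are all hypothetical.

```typescript
// Hypothetical stream ids; the real set of streams is not specified.
const PHONE_MIC = 1;
const GLASSES_CAMERA = 2;

interface StreamFrame {
  stream: number;      // which input produced this chunk
  seq: number;         // per-stream sequence number
  payload: Uint8Array; // raw audio or video bytes
}

const HEADER_BYTES = 5; // 1 byte stream id + 4 bytes sequence number

// Pack a frame as [stream id][seq as big-endian uint32][payload].
function encodeFrame(frame: StreamFrame): Uint8Array {
  const out = new Uint8Array(HEADER_BYTES + frame.payload.length);
  const view = new DataView(out.buffer);
  view.setUint8(0, frame.stream);
  view.setUint32(1, frame.seq);
  out.set(frame.payload, HEADER_BYTES);
  return out;
}

// Inverse of encodeFrame; the payload is a view, not a copy.
function decodeFrame(bytes: Uint8Array): StreamFrame {
  if (bytes.length < HEADER_BYTES) throw new Error("frame too short");
  const view = new DataView(bytes.buffer, bytes.byteOffset, bytes.byteLength);
  return {
    stream: view.getUint8(0),
    seq: view.getUint32(1),
    payload: bytes.subarray(HEADER_BYTES),
  };
}

// Route each decoded frame to the handler registered for its stream.
class StreamRouter {
  private handlers = new Map<number, (frame: StreamFrame) => void>();

  on(stream: number, handler: (frame: StreamFrame) => void): void {
    this.handlers.set(stream, handler);
  }

  dispatch(bytes: Uint8Array): boolean {
    const frame = decodeFrame(bytes);
    const handler = this.handlers.get(frame.stream);
    if (!handler) return false; // unknown stream: drop the frame
    handler(frame);
    return true;
  }
}

const router = new StreamRouter();
const received: string[] = [];
router.on(PHONE_MIC, (frame) => {
  received.push(`mic #${frame.seq}: ${frame.payload.length} bytes`);
});
router.dispatch(
  encodeFrame({ stream: PHONE_MIC, seq: 7, payload: new Uint8Array([1, 2, 3]) }),
);
console.log(received);
```

On the gateway side, each binary WebSocket message would be passed to `dispatch`, so audio and video chunks interleave on one connection while each stream keeps its own ordering through `seq`.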

Figure: Hawk system architecture, in five stages (Capture, Transport, Gateway, Runtimes, Models).

  • Capture (iOS frontend): Ray-Ban Meta glasses (camera, mics, speaker; live POV capture) and the Hawk iPhone app (multi-panel live view, lock-screen widget) send concurrent audio and video streams.
  • Transport: a WebSocket stream over Tailscale's encrypted mesh tunnel.
  • Gateway (backend agent): the long-lived Hawk gateway runs the agent loop, sessions, hybrid memory (BM25 + vector, FTS5), heartbeat, cron, MCP, sub-agents, and push. Background services cover memory consolidation, heartbeat, cron, and push notifications dispatched to the iPhone. A web PWA and a terminal UI connect to the same gateway as thin clients.
  • Runtimes: Hermes, Codex, and Claude Code, as interchangeable adapters.
  • Models: Claude and other LLMs as the reasoning core.

Capture the wearer's world → stream over an encrypted WebSocket/Tailscale link → the gateway runs the agent loop against hybrid memory → a pluggable runtime drives the LLM, then acts and pushes back.
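
The diagram labels the gateway's hybrid memory as "BM25 + vector" but does not say how the two rankings are combined. One common option is reciprocal rank fusion (RRF), sketched below as an assumption rather than Hawk's confirmed method. The function name, the note ids, and the constant `k = 60` (a conventional default) are illustrative.

```typescript
// Reciprocal rank fusion: score(d) = sum over rankings of 1 / (k + rank(d)),
// where rank starts at 1. Items missing from a ranking contribute nothing.
function reciprocalRankFusion(rankings: string[][], k = 60): string[] {
  const scores = new Map<string, number>();
  for (const ranking of rankings) {
    ranking.forEach((id, index) => {
      const rank = index + 1;
      scores.set(id, (scores.get(id) ?? 0) + 1 / (k + rank));
    });
  }
  // Highest fused score first.
  return Array.from(scores.entries())
    .sort((a, b) => b[1] - a[1])
    .map(([id]) => id);
}

// Hypothetical results from a keyword (BM25) search and a vector search.
const bm25Hits = ["note-keys", "note-dog", "note-meeting"];
const vectorHits = ["note-dog", "note-park", "note-keys"];
console.log(reciprocalRankFusion([bm25Hits, vectorHits]));
```

RRF only needs ranks, not raw scores, so it sidesteps the problem that BM25 scores and vector similarities live on different scales. A memory that ranks well in both lists, like `note-dog` here, rises above one that tops only a single list.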

Install

Two tracks: the brain, and you

Stand up the backend in one line, then point the iOS app at it. The URLs and names below are placeholders; swap in your own when you deploy.

Agent — backend (Mac / server)

The long-lived brain. Requires Bun ≥ 1.3.

# install
bun install -g hawk            # requires Bun >= 1.3
# or one-line:
curl -fsSL https://hawk.sh/install.sh | bash

# run
hawk gateway                   # start the long-lived backend
hawk                           # open the terminal client

Human — iOS frontend

The eyes and ears you carry.

  1. Install the Hawk iOS app on your iPhone.
  2. Point it at your gateway: ws://<your-tailscale-ip>:4242
  3. Pair your Ray-Ban Meta glasses in the Glasses tab.

Placeholder URLs and names (hawk.sh, ws://<your-tailscale-ip>:4242) — swap for your own when you deploy.

Roadmap

Always circling forward

Where Hawk's vision and reach widen next.

  • 01

    Benchmark + agent gym

    Realtime-agent benchmark plus an auto-testing gym with synthetic tests.

  • 02

    Recursive self-improvement

    Agent builds agent: prompt optimization, sharper tooling, live dev.

  • 03

    CarPlay

    Hawk in the car — briefing and acting from the dashboard.

  • 04

    Latency-adaptive sampling

    Sample perception at a rate tuned to live latency.

  • 05

    Device support

    Android, Apple Watch, and more glasses beyond Ray-Ban Meta.

  • 06

    Perception pipeline

    Face recognition and speaker diarization to map audio to who's talking.

  • 07

    Personality & memory

    Tunable soul/identity and shareable memory presets.

Cite / Source

Use it, cite it, read the source

Hawk is open at the link below. If it helps your work, cite it — the BibTeX is ready to copy.

github.com/hawk-agent/hawk
@software{hawk_ambient_agent,
  title  = {Hawk: An Ambient AI Agent},
  author = {The Hawk Team},
  year   = {2026},
  url    = {https://github.com/hawk-agent/hawk},
  note   = {Placeholder URL — see project page}
}