vemem — visual entity memory

Visual entity memory — remember faces, objects, and places across sessions with persistent identity. Use when the user asks who is in an image, when you need...

SkillRank score: 8.2 / 10

evaluated by implexa, claude-haiku-4-5 · 2026-06-21

vemem provides persistent visual identity across sessions by embedding faces and objects into a local vector store and returning named entity references with attached facts for reasoning. It bridges vision and text by turning raw images into queryable entities.

structure: 9.0
trigger phrases: 9.0
procedure: 8.0
edge cases: 8.0
documentation: 8.0

original SKILL.md from clawhub

---
name: vemem
description: Visual entity memory — remember faces, objects, and places across sessions with persistent identity. Use when the user asks who is in an image, when you need to resolve an image to a specific known person/thing, when identifying someone from a photo, when labeling a new face for future recognition, or when maintaining knowledge (facts, events, relationships) keyed by appearance rather than by text. Bridges vision models and text LLMs by turning raw images into named entity references with attached context the text side can reason about.
license: MIT
compatibility: Requires Python 3.12+ and the vemem package (pip install vemem). InsightFace model weights (~200MB) download on first use.
metadata:
  homepage: https://github.com/linville-charlie/vemem
  version: "0.1"
---

# vemem — visual entity memory

## Before you install — what this skill touches on your system

Read this first. This skill handles biometric data, so transparency up front beats surprises later.

### What this skill is, exactly

**The skill itself is instruction-only.** It's the markdown you're reading — no scripts, no executables, no automatic installation of anything. Adding this skill to your ClawHub / Claude Code / Hermes / OpenClaw install does not by itself run code on your machine.

The skill *instructs the agent* to install and use the `vemem` Python package separately. That package is the component that actually reads images and writes to disk.

### If you install the vemem package, here is everything it does

**Local state it creates or reads:**
- `~/.vemem/` (override with `VEMEM_HOME`) — the LanceDB store holding face embeddings, entity bindings, facts, and the event log. This is where biometric vectors live.
- `~/.insightface/models/buffalo_l/` — InsightFace model weights (~200MB), downloaded from InsightFace's official distribution on the first face observation.
- **Images you explicitly pass** to `observe_image` / `identify_image` — either as base64 or as a filesystem path the library reads. **vemem does not scan your disk for images on its own.** It only sees bytes you hand it.

**Network activity vemem itself produces:**
- **First-use only**: downloads InsightFace `buffalo_l` weights. After that, zero network activity from the library.
- The MCP server (`vemem-mcp-server`) uses **stdio only** — no network ports opened.
- The optional OpenClaw sidecar (`vemem-openclaw-sidecar`) binds to **localhost only**.

**What vemem does NOT do on its own:**
- Does not call remote LLM APIs. Example recipes in `references/examples.md` show how to *compose* vemem with OpenAI / Anthropic APIs if you choose to. Those are your calls, with your API keys. If you stay local (Ollama etc.), nothing leaves your machine.
- Does not process images automatically. Every `observe_image` is an explicit invocation by the agent or you.
- Does not train on your data, phone home, or send embeddings anywhere.

### The OpenClaw automatic-processing concern is a separate opt-in

The ClawHub review correctly flagged that vemem has a first-party OpenClaw integration that can auto-process every image attachment. **That integration is a separate install** (`vemem-openclaw-sidecar` + registering a specific OpenClaw plugin) and is NOT enabled by adding this skill or by installing the base `vemem` package.

Enable it only if you understand you're granting an always-on face-recognition layer over every image your agent sees. Disable at any time by stopping the sidecar process.

### Verification & provenance

- Source: [github.com/linville-charlie/vemem](https://github.com/linville-charlie/vemem) · MIT license
- Release tags are signed commits on `main`; pin a version in production (e.g. `vemem==0.1.0`) rather than tracking `latest`
- `pip show -f vemem` lists every file the install adds to your environment
- To audit what the MCP server or sidecar actually touches at runtime:
  - Linux/macOS: `lsof -p <pid>` (open files + sockets)
  - Linux only: `strace -e trace=file,network -p <pid>`
  - macOS Instruments File Activity trace for a GUI view
- The GDPR-style `forget()` is [test-verified](https://github.com/linville-charlie/vemem/blob/main/tests/storage/test_lancedb_specific.py) to physically remove embeddings from LanceDB version history. Reproduce locally before trusting for regulated data.

### Compliance context

vemem stores biometric identifiers. **If you deploy it to users other than yourself, YOU are the data controller under GDPR / BIPA / CCPA.** The library provides primitives (`forget` / `restrict` / `export`) but does not enforce consent capture — that's your app's responsibility. Full deployer checklist: [COMPLIANCE.md](https://github.com/linville-charlie/vemem/blob/main/COMPLIANCE.md).

### Recommended first-run posture

1. Install into a dedicated venv, not your system Python.
2. Use a test `VEMEM_HOME` path (e.g. `/tmp/vemem-test`) for your first session so you can inspect + delete the store wholesale.
3. Use a local VLM/LLM (Ollama) for the first integration test, not remote APIs, to confirm no images leave your machine.
4. Enable the OpenClaw sidecar integration only after you've seen vemem behave as a manually-invoked tool.
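
The scratch-store advice in step 2 can be sketched as below. This is a minimal sketch that only manipulates the `VEMEM_HOME` variable named above. The helper names are hypothetical, and it assumes the variable is read when the store is opened, so set it before constructing `Vemem()` or starting the MCP server.

```python
import os
import shutil
import tempfile
from pathlib import Path


def make_scratch_home(prefix: str = "vemem-test-") -> Path:
    """Create a throwaway directory and point VEMEM_HOME at it."""
    home = Path(tempfile.mkdtemp(prefix=prefix))
    os.environ["VEMEM_HOME"] = str(home)
    return home


def wipe_scratch_home(home: Path) -> None:
    """Delete the scratch store wholesale and unset VEMEM_HOME."""
    shutil.rmtree(home, ignore_errors=True)
    os.environ.pop("VEMEM_HOME", None)


if __name__ == "__main__":
    home = make_scratch_home()
    print(f"VEMEM_HOME={home}")
    # ... run a first session here, inspect the directory, then:
    wipe_scratch_home(home)
```

Using a fresh temporary directory per session keeps test embeddings away from any real store under `~/.vemem/`.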

---

## What this skill does

vemem is the identity layer that sits between a vision model (face/object detector) and a text LLM. It turns "an image of a person" into a **named, stable entity ID** — same person across sessions, same face across angles, same object across lighting.

It keeps track of facts, events, and relationships **per entity**, much like Mem0-style stores, but keyed on visual identity rather than on a `user_id` you have to know in advance.

## When to activate

Activate this skill when the user:
- asks "who is this?" / "who is in this picture?" / "do you recognize them?"
- wants to remember someone or something for later ("that's Charlie, he runs marathons")
- tells you to correct an identity ("no, that's Dana not Charlie")
- wants to forget an entity for privacy ("remove all data about X")
- asks about entities they've previously introduced ("what do you know about Charlie?")
- wires you into a camera/photo pipeline that needs persistent visual identity

Do NOT activate for:
- general text memory (use the standard memory skill / mem0 / etc.)
- image generation or editing (vemem doesn't touch pixels)
- OCR, captioning, scene description (those are VLM jobs — vemem consumes their output)

## Setup

### Quick check — is vemem available?

Run `python -c "import vemem; print(vemem.__version__)"`. If that fails, install:

```bash
pip install vemem
# or with uv:
uv pip install vemem
```

First-time face encoding triggers a ~200MB InsightFace model download into `~/.insightface/`. Warn the user if their network is constrained.
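
The same availability check can be done from inside Python, which is handy when an agent needs a structured answer rather than a traceback. This is a sketch using only the standard library. `package_status` and `python_is_new_enough` are hypothetical helpers, and the 3.12 floor comes from the compatibility line in the frontmatter.

```python
import importlib.util
import sys
from importlib import metadata
from typing import Optional, Tuple


def package_status(name: str) -> Optional[str]:
    """Return the installed version, "unknown" if the module is importable
    but has no distribution metadata, or None if it cannot be found."""
    if importlib.util.find_spec(name) is None:
        return None
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return "unknown"


def python_is_new_enough(minimum: Tuple[int, int] = (3, 12)) -> bool:
    """Compare the running interpreter against a minimum (major, minor)."""
    return sys.version_info >= minimum


if __name__ == "__main__":
    print("python ok:", python_is_new_enough())
    print("vemem:", package_status("vemem") or "not installed")
```

Neither helper imports vemem itself, so running the check does not trigger the model download.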

### Run the MCP server (preferred for agents)

```bash
python -m vemem.mcp_server
```

This exposes 14 tools over stdio. Wire it into your host's MCP config. For Claude Desktop, the ready-to-paste config lives at `docs/examples/claude_desktop_config.json` in the vemem repo.
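
If you would rather generate a config entry than copy one, the sketch below builds it in Python. It assumes the `mcpServers` layout (a server name mapped to `command` and `args`) used by Claude Desktop-style hosts; that layout and the `vemem` server key are assumptions, not taken from the vemem repo, so prefer the repo's ready-made file where it applies.

```python
import json
import sys
from typing import Optional


def mcp_server_entry(python_executable: Optional[str] = None) -> dict:
    """Build a config fragment that launches the server over stdio.

    Defaults to the interpreter running this script, so a venv install
    of vemem is picked up without relying on PATH.
    """
    return {
        "mcpServers": {
            "vemem": {  # server key is an arbitrary, hypothetical label
                "command": python_executable or sys.executable,
                "args": ["-m", "vemem.mcp_server"],
            }
        }
    }


if __name__ == "__main__":
    print(json.dumps(mcp_server_entry(), indent=2))
```

Merge the printed fragment into the host's existing config rather than overwriting it, since other servers may already be registered there.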

### Use directly (preferred for scripting in Python)

```python
from vemem import Vemem
vem = Vemem()  # LanceDB store at ~/.vemem, InsightFace encoder
```

Store path is overridable via `VEMEM_HOME` env var or `Vemem(home="/path/to/store")`.

## Core operations — the mental model

There are 13 operations. The ones you'll use most:

### Writing identity into the store

| Op | When |
|---|---|
| `observe(image_bytes)` | A new image came in. Detect faces/objects, embed, persist. Returns a list of Observations, each with a stable content-hash id. |
| `label(observation_ids, name)` | The user just told you who someone is. Creates the entity if new, binds those observations to it. This is the moment identity becomes permanent. |
| `remember(entity_id, fact)` | Attach a fact to a known entity — "Charlie runs marathons", "the red mug lives in the kitchen". |

### Reading identity out

| Op | When |
|---|---|
| `identify(image_bytes, k=5)` | Return candidate entities matching the image, ranked by similarity. Each candidate already includes attached facts — you don't need a separate recall call. |
| `recall(entity_id)` | All known facts, events, and relationships for an entity. Use when the user references someone by name. |

### Correcting mistakes

| Op | When |
|---|---|
| `relabel(observation_id, new_name)` | "That's not Charlie, that's Dana" — reassigns the observation and emits a negative binding so the clusterer won't re-attach it. |
| `merge(entity_ids)` | Two entities turn out to be the same person. Folds them together, preserving facts with provenance. |
| `split(entity_id, groups)` | One entity turns out to be two people. Separates them with cross-wise negative bindings. |
| `forget(entity_id)` | Privacy delete — hard-removes observations, embeddings, bindings, facts. Physically prunes from storage version history (GDPR Art. 17 compliant). **Not reversible.** |
| `undo(event_id=None)` | Undoes the most recent reversible op by you (within a 30-day window). Does not work on `forget`. |

## Common patterns

### Pattern A: camera frame comes in

```python
observations = vem.observe(image_bytes)
candidates = vem.identify(image_bytes, k=3, min_confidence=0.4)

if candidates:
    names = [f"{c.entity.name} (conf {c.confidence:.2f})" for c in candidates]
    print(f"Visible: {', '.join(names)}")
else:
    print(f"Unknown face(s) detected: {len(observations)}. Label with vem.label(obs_ids, name=...).")
```

### Pattern B: user says "that's Charlie"

```python
observations = vem.observe(image_bytes)
charlie = vem.label([o.id for o in observations], name="Charlie")
vem.remember(charlie.id, "we met at the coffee shop on 2026-04-17")
```

### Pattern C: agent needs context for a reply

```python
candidates = vem.identify(image_bytes, k=3)
context_parts = []
for c in candidates:
    fact_strs = "; ".join(f.content for f in c.facts)
    context_parts.append(f"{c.entity.name} (conf {c.confidence:.2f}): {fact_strs}")
context = ("People visible: " + " | ".join(context_parts)) if context_parts else "No known faces."

# Feed `context` into your LLM's system message or context block.
```

### Pattern D: correction

```python
# identify() said "Charlie" at 0.71 confidence, but user says it's Dana
candidates = vem.identify(image_bytes)
wrong_obs_id = candidates[0].matched_observation_ids[0]
vem.relabel(wrong_obs_id, "Dana")
# A negative binding against Charlie is now recorded — the clusterer won't re-assign.
```
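
To make the negative-binding idea concrete, here is a toy model in plain Python. It is not vemem's clusterer: the `Candidate` type, the set-of-pairs representation, and `filter_candidates` are all hypothetical, chosen only to show how recording "this observation is not that entity" keeps a rejected match from resurfacing.

```python
from dataclasses import dataclass
from typing import Iterable, List, Set, Tuple


@dataclass(frozen=True)
class Candidate:
    entity_name: str
    confidence: float


def filter_candidates(
    observation_id: str,
    candidates: Iterable[Candidate],
    negative_bindings: Set[Tuple[str, str]],
) -> List[Candidate]:
    """Drop candidates the user has explicitly rejected for this observation."""
    return [
        c for c in candidates
        if (observation_id, c.entity_name) not in negative_bindings
    ]


if __name__ == "__main__":
    ranked = [Candidate("Charlie", 0.71), Candidate("Dana", 0.64)]
    negatives = {("obs_1", "Charlie")}  # user said: obs_1 is not Charlie
    print(filter_candidates("obs_1", ranked, negatives))
    print(filter_candidates("obs_2", ranked, negatives))  # other observations unaffected
```

The binding is scoped to one observation, so Charlie can still match other images.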

### Pattern E: privacy request

```python
# User says "forget everything about Sarah"
sarah = vem.store.find_entity_by_name("Sarah")
if sarah is not None:
    counts = vem.forget(sarah.id)
    print(f"Deleted: {counts}")
    # Pruned from LanceDB version history — actually gone, GDPR-compliant.
    # This is NOT reversible. Warn the user before calling.
```
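
Because `forget()` cannot be undone, it can help to put a confirmation gate in front of it. The wrapper below is a sketch: `forget_with_confirmation` and its type-the-name check are hypothetical, and the delete function is injected so the example runs without vemem installed. In real use you would pass `vem.forget`.

```python
from typing import Callable, Optional


def forget_with_confirmation(
    forget_fn: Callable[[str], dict],
    entity_id: str,
    entity_name: str,
    typed_confirmation: str,
) -> Optional[dict]:
    """Call forget_fn only if the user typed the entity's name exactly.

    Returns whatever forget_fn returns, or None if confirmation failed.
    """
    if typed_confirmation.strip() != entity_name:
        return None
    return forget_fn(entity_id)


if __name__ == "__main__":
    # Stand-in for vem.forget so the sketch runs without vemem installed.
    def fake_forget(entity_id: str) -> dict:
        return {"forgotten": entity_id}

    print(forget_with_confirmation(fake_forget, "ent_xyz", "Sarah", "sarah"))  # rejected
    print(forget_with_confirmation(fake_forget, "ent_xyz", "Sarah", "Sarah"))  # proceeds
```

Requiring the exact name is stricter than a yes/no prompt, which seems proportionate for an operation with no undo.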

## Important constraints

- **Identity is the entity ID, not the name.** `label(..., name="Charlie")` re-uses an existing Charlie entity by name — but renaming an entity doesn't merge it with another same-named one. If the user has two "Charlie"s, use `merge()` explicitly.
- **`forget()` is irreversible.** Ask for confirmation before calling. 30-day `undo` does not cover it.
- **Encoder version is part of identity-of-evidence.** If you try `identify()` with a different encoder than the one used when building the gallery, you get an empty result — not a false match. This is by design.
- **Facts are free-form text.** vemem does not LLM-extract facts from conversations. That's the caller's job (or use Mem0 in parallel, keyed by `entity_id` as the `user_id`).
- **Composable with text memory systems.** The `entity_id` vemem returns is a perfect `user_id` for Mem0 / Letta / Supermemory. They own text conversational memory; vemem owns visual identity.
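
The encoder constraint above is easier to see in a toy model. The sketch below is not vemem's implementation: `GalleryEntry`, `toy_identify`, and the encoder labels are hypothetical. It only illustrates the stated behaviour, where a query is compared solely against gallery vectors from the same encoder, so a mismatch yields an empty list rather than a misleading score.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass(frozen=True)
class GalleryEntry:
    entity_name: str
    encoder_id: str
    vector: Tuple[float, ...]


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def toy_identify(
    query: Sequence[float],
    query_encoder_id: str,
    gallery: List[GalleryEntry],
    min_confidence: float = 0.4,
) -> List[Tuple[str, float]]:
    """Rank gallery entries, considering only those from the same encoder."""
    comparable = [e for e in gallery if e.encoder_id == query_encoder_id]
    scored = [(e.entity_name, cosine(query, e.vector)) for e in comparable]
    kept = [s for s in scored if s[1] >= min_confidence]
    return sorted(kept, key=lambda s: s[1], reverse=True)


if __name__ == "__main__":
    gallery = [GalleryEntry("Charlie", "enc_v1", (1.0, 0.0))]
    print(toy_identify((1.0, 0.0), "enc_v1", gallery))  # same encoder: match
    print(toy_identify((1.0, 0.0), "enc_v2", gallery))  # different encoder: empty
```

Vectors from different encoders live in unrelated spaces, so filtering before scoring avoids comparing numbers that only look comparable.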

## Compliance note

vemem stores biometric identifiers. If the host app is deployed to users, the **deployer is the data controller** under GDPR / BIPA / CCPA. Key primitives:
- `forget(entity_id)` = Art. 17 erasure (with prune)
- `restrict(entity_id)` = Art. 18 restriction
- `export(entity_id)` = Art. 20 portability

Full checklist: `COMPLIANCE.md` in the vemem repo.

## Not this skill's job

- **No general chat memory** — use Mem0 / Letta / Supermemory in parallel for text conversational memory.
- **No image generation / editing** — this is a read-and-remember layer.
- **No autonomous clustering commits** in v0 — auto-suggestions exist but require explicit `label()` to commit. This keeps the hot path deterministic.

## Troubleshooting

| Symptom | Cause | Fix |
|---|---|---|
| `identify()` returns `[]` on a face you labeled earlier | Different encoder version, or the face isn't being detected | Check `encoder.id` hasn't changed; try `min_confidence=0.2` to see raw scores |
| `RuntimeError: image pipeline unavailable` | InsightFace weights not installed | First call downloads ~200MB from InsightFace to `~/.insightface/`; ensure network access on first use |
| `ModalityMismatchError` on merge | Trying to merge a face entity with an object entity | v0 keeps modalities separate; create an `instance_of` relationship instead |
| `OperationNotReversibleError` on undo | Past 30 days, or op was `forget` | Not fixable — `forget` is deliberately irreversible; window is configurable via `DEFAULT_UNDO_WINDOW` |

## Deeper references (bundled with this skill)

Loaded on demand when the agent needs the detail — keep them out of the
hot context path.

- [`references/mcp-tools.md`](references/mcp-tools.md) — every MCP tool's
  exact input/output shape. Read when deciding parameter names for a
  specific tool call.
- [`references/examples.md`](references/examples.md) — copy-paste code
  recipes for Ollama, OpenAI, and Claude; correction flows; privacy
  flows; composition with Mem0 / Letta.
- [`references/troubleshooting.md`](references/troubleshooting.md) —
  expanded error matrix with diagnostic commands. Read when a tool
  raises something unexpected.

## Upstream references (in the vemem repo)

- Full spec (load-bearing): `docs/spec/identity-semantics.md`
- Architecture: `docs/ARCHITECTURE.md`
- Real-world VLM+LLM recipes: `docs/examples/real_bridge.md`
- MCP tool reference: `docs/examples/mcp_usage.md`
- Compliance checklist: `COMPLIANCE.md`

Repo: https://github.com/linville-charlie/vemem

restructured original into implexa's 6-part template, made decision logic explicit, documented env vars and external connections, added edge cases (encoder version mismatch, network constraints, compliance), preserved original author's intent and all 13 operations.

Item: vemem — visual entity memory
Rating: 8.2
Author: Implexa

intent

vemem turns images into named, stable entity IDs so you can remember who/what you've seen before. Use it when the user asks "who is this?", wants to label someone for future recognition, needs to correct an identity, or asks about someone they've introduced before. It bridges vision models and text LLMs by converting raw image data into named entity references with attached facts that the text side can reason about. vemem does not scan your disk or call remote APIs on its own; every image observation is explicit.

inputs

Python environment:

Python 3.12+
vemem package (install via pip install vemem or uv pip install vemem)
InsightFace model weights (~200MB), auto-downloaded to ~/.insightface/models/buffalo_l/ on first face encoding

Configuration (all optional):

VEMEM_HOME environment variable to override default store location (default: ~/.vemem/). store contains face embeddings, entity bindings, facts, and event log in LanceDB format.
VEMEM_ENCODER_ID to pin encoder version (default: insightface/buffalo_l). changing this breaks similarity matching against existing gallery.

Image sources:

image bytes (base64 or filesystem path). vemem does not scan your disk; you pass bytes explicitly.
each image can contain 0 to N detectable faces or objects depending on content and detector quality.

External connections (optional, composable, not required):

OpenAI / Anthropic APIs if you want to compose vemem output with remote LLMs (your API keys, your calls, not vemem's responsibility).
Ollama or other local LLM/VLM for fully offline integration.
Mem0, Letta, or Supermemory as text memory layer (keyed by entity_id that vemem returns).

MCP server setup (for Claude / agents):

run python -m vemem.mcp_server to expose 14 tools over stdio.
wire into host MCP config (Claude Desktop config at docs/examples/claude_desktop_config.json in vemem repo).

procedure

1. check and install vemem

input: system Python environment
step: run python -c "import vemem; print(vemem.__version__)". if import fails, install: pip install vemem or uv pip install vemem.
output: vemem importable, version confirmed. on first face encoding call, InsightFace weights download (~200MB) to ~/.insightface/. warn user if network is constrained.

2. initialize the store

input: optional VEMEM_HOME path or env var
step: instantiate Vemem() object in Python or start MCP server with python -m vemem.mcp_server. store location defaults to ~/.vemem/, overridable via env var or constructor arg.
output: LanceDB store ready at specified path. empty on first run.

3. observe image and detect entities

input: image bytes (base64 string, filepath, or raw bytes)
step: call vem.observe(image_bytes). encoder detects faces/objects, computes embeddings, writes observations to store. returns list of Observation objects, each with stable content-hash id.
output: observations list. example: [Observation(id='obs_abc123', embedding=[...], modality='face', detected_bbox=(...)), ...]. if no faces detected, observations is empty list.

4. identify entities in image (match against gallery)

input: image bytes, optional k=5 (top-k results), optional min_confidence=0.4
step: call vem.identify(image_bytes, k=5, min_confidence=0.4). runs embedding similarity search, returns ranked candidates from known entities.
output: list of IdentifyResult objects. example: [IdentifyResult(entity=Entity(id='ent_xyz', name='Charlie'), confidence=0.92, facts=[Fact(content='runs marathons'), ...], matched_observation_ids=[...]), ...]. includes attached facts per candidate. empty list if no match above threshold.

5. label observations as a named entity

input: observation_ids list (from step 3), name string
step: call vem.label(observation_ids, name="Charlie"). if entity named Charlie exists, binds observations to it. if not, creates new entity. this is the moment identity becomes permanent in the store.
output: Entity object with id, name, created_at timestamp. example: Entity(id='ent_xyz', name='Charlie', created_at=2026-04-17T..., ...).

6. attach facts to entity

input: entity_id, fact text string
step: call vem.remember(entity_id, "runs marathons") or vem.remember(entity_id, "lives in Brooklyn"). attaches free-form text fact to entity.
output: Fact object with id, content, timestamp, provenance. example: Fact(id='fact_123', content='runs marathons', created_at=2026-04-17T...).

7. recall all facts for named entity (when user references by name)

input: entity_id or entity name
step: call vem.recall(entity_id) or vem.store.find_entity_by_name("Charlie") then pass id to recall. returns all facts, events, relationships attached to that entity.
output: Entity object with full facts list. example: Entity(id='ent_xyz', name='Charlie', facts=[Fact(...), Fact(...), ...], relationships=[...]).

8. correct identity (relabel single observation)

input: observation_id (from identify result), new_name string
step: call vem.relabel(observation_id, "Dana"). reassigns observation to Dana entity (create if new). emits negative binding so clusterer won't re-attach to Charlie.
output: Entity object (Dana). example: Entity(id='ent_new', name='Dana', ...). a negative binding is now recorded in the event log.

9. merge entities (two people turn out to be same)

input: entity_ids list (at least 2)
step: call vem.merge([ent_id_1, ent_id_2]). folds entities together, preserving facts with provenance tracking. resulting entity keeps the first id.
output: merged Entity object. example: Entity(id='ent_xyz', name='Charlie', facts=[Fact(provenance='merged from ent_abc'), ...]).

10. split entity (one entity turns out to be two people)

input: entity_id, groups list (each group is observation_id list)
step: call vem.split(entity_id, [[obs_1, obs_2], [obs_3, obs_4]]). separates observations into N new entities. cross-wise negative bindings prevent re-merging.
output: list of new Entity objects. example: [Entity(id='ent_new1', name=None, ...), Entity(id='ent_new2', name=None, ...)]. user should relabel each new entity.
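
Before calling split, it may be worth checking that no observation id appears in more than one group. The helper below is a hypothetical pre-flight check, not part of vemem; whether the library itself rejects overlapping groups is not stated in this document.

```python
from collections import Counter
from typing import List, Sequence


def duplicated_observation_ids(groups: Sequence[Sequence[str]]) -> List[str]:
    """Return observation ids that occur more than once across all groups."""
    counts = Counter(obs_id for group in groups for obs_id in group)
    return sorted(obs_id for obs_id, n in counts.items() if n > 1)


if __name__ == "__main__":
    groups = [["obs_1", "obs_2"], ["obs_2", "obs_3"]]
    overlap = duplicated_observation_ids(groups)
    if overlap:
        print(f"fix these before splitting: {overlap}")
```

An empty return value means the groups are disjoint and each observation will land in exactly one new entity.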

11. forget entity (privacy delete, irreversible)

input: entity_id
step: call vem.forget(entity_id). hard-removes all observations, embeddings, bindings, facts for that entity. physically prunes from LanceDB version history (GDPR Art. 17 compliant). NOT reversible.
output: dict with counts. example: {'observations_deleted': 5, 'facts_deleted': 3, 'embeddings_pruned': 5}. warn user this is irreversible before calling.

12. undo recent operation (reversible ops only)

input: optional event_id (defaults to most recent reversible op)
step: call vem.undo() or vem.undo(event_id='evt_123'). rolls back last reversible op (label, remember, relabel, merge, split). does NOT work on forget. 30-day window (configurable).
output: success message. example: UndoResult(event_id='evt_123', reverted_op='label', status='success').
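
The undo rules in this step can be expressed as a small predicate, which is useful if an agent wants to tell the user in advance whether an undo is possible. This is a sketch: `can_undo`, `UNDO_WINDOW`, and the inclusive window edge are assumptions; only the list of reversible ops and the 30-day default come from the text above.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional

# Hypothetical stand-in for the library's configurable window.
UNDO_WINDOW = timedelta(days=30)
REVERSIBLE_OPS = {"label", "remember", "relabel", "merge", "split"}


def can_undo(
    op: str,
    performed_at: datetime,
    now: Optional[datetime] = None,
    window: timedelta = UNDO_WINDOW,
) -> bool:
    """True if the op is reversible and still inside the undo window."""
    now = now or datetime.now(timezone.utc)
    return op in REVERSIBLE_OPS and (now - performed_at) <= window


if __name__ == "__main__":
    now = datetime(2026, 5, 1, tzinfo=timezone.utc)
    print(can_undo("label", now - timedelta(days=29), now))
    print(can_undo("label", now - timedelta(days=31), now))
    print(can_undo("forget", now, now))
```

Both timestamps should be timezone-aware; mixing aware and naive datetimes raises a `TypeError` on subtraction.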

13. export entity data (for portability / audit)

input: entity_id
step: call vem.export(entity_id). returns json-serializable dict with entity metadata, facts, observation ids, and embedding vectors.
output: dict. example: {'entity': {'id': '...', 'name': '...'}, 'facts': [...], 'observations': [...], 'embeddings': [...]}.

decision points

if identify() returns empty list:

encoder version has changed since gallery was built (embeddings no longer comparable). solution: rebuild gallery or downgrade encoder.
face confidence below min_confidence threshold. solution: lower threshold (e.g. min_confidence=0.2) to see raw scores.
face not detected at all (detector quality or lighting issue). solution: inspect raw detector output or check image quality.

if user says "that's not Charlie, that's Dana":

if Charlie and Dana are already separate entities: call vem.relabel(observation_id, "Dana") to reassign and record negative binding.
if Dana doesn't exist yet: relabel creates Dana entity automatically.
if it's the same person with two names: user should confirm merge intent before calling vem.merge([charlie_id, dana_id]).

if identify() returns multiple candidates with similar confidence:

return top-k candidates and let user pick ("is this Charlie or Dana?"). do not auto-commit without user confirmation.
if user confirms a non-top candidate: call vem.relabel() to correct, emitting negative binding against the wrong candidate.
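
One way to decide when candidates count as "similar confidence" is a margin between the top two scores. The sketch below is an assumption about how a caller might implement that check: `needs_user_choice` and the 0.05 default margin are hypothetical and should be tuned against your own data.

```python
from typing import Iterable


def needs_user_choice(confidences: Iterable[float], margin: float = 0.05) -> bool:
    """True when the two highest scores are within `margin` of each other."""
    ranked = sorted(confidences, reverse=True)
    return len(ranked) >= 2 and (ranked[0] - ranked[1]) < margin


if __name__ == "__main__":
    print(needs_user_choice([0.71, 0.69]))  # close call: ask the user
    print(needs_user_choice([0.92, 0.40]))  # clear winner
```

With zero or one candidate there is nothing to disambiguate, so the function returns False.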

if user asks to forget an entity:

always ask for explicit confirmation before calling vem.forget(). emphasize irreversibility.
check compliance rules if deploying to regulated users (GDPR, BIPA, CCPA). deployer is data controller.

if InsightFace encoder version changes (library update):

new observations use new encoder. old gallery observations won't match.
solution: pin encoder version in VEMEM_ENCODER_ID or rebuild gallery.
do not mix encoder versions in same store.

if user has OpenClaw sidecar enabled:

vemem-openclaw-sidecar (separate optional install) auto-processes every image attachment. this is opt-in, not default.
disable by stopping sidecar process or removing OpenClaw plugin registration.
base vemem package does NOT enable this.

if network access is constrained:

first-run InsightFace download (~200MB) requires internet. warn user.
after first run, vemem uses zero network (local LanceDB store only, no phone-home).
if deploying offline, pre-download InsightFace weights or use different encoder.
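
For offline deployments, a quick check that the weights directory is already populated can avoid a surprise download attempt. This sketch only tests for the presence of files under the path given earlier in this document; `weights_present` is a hypothetical helper and it does not verify file names or integrity.

```python
from pathlib import Path
from typing import Optional


def weights_present(models_dir: Optional[Path] = None) -> bool:
    """True if the model directory exists and contains at least one entry."""
    if models_dir is None:
        models_dir = Path.home() / ".insightface" / "models" / "buffalo_l"
    return models_dir.is_dir() and any(models_dir.iterdir())


if __name__ == "__main__":
    if weights_present():
        print("weights found; first face observation should not need the network")
    else:
        print("weights missing; expect a ~200MB download on first face observation")
```

The optional argument makes it easy to point the check at a pre-seeded directory on an air-gapped machine.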

if user wants to compose vemem with a text LLM:

if using remote API (OpenAI, Anthropic): you control that API key. vemem doesn't call it.
if using local LLM (Ollama): no network activity at all.
build context from the facts attached to each candidate returned by identify(image), then pass it as part of the LLM system message or context block. example in references/examples.md.

output contract

successful observe():
json object wrapping an array of Observation objects:

{
  "observations": [
    {
      "id": "obs_abc123",
      "embedding": [0.123, 0.456, ...],
      "modality": "face",
      "detected_bbox": [x1, y1, x2, y2],
      "confidence": 0.95
    }
  ]
}

successful identify():
json object wrapping an array of IdentifyResult objects:

{
  "candidates": [
    {
      "entity": {
        "id": "ent_xyz",
        "name": "Charlie",
        "created_at": "2026-04-17T14:30:00Z"
      },
      "confidence": 0.92,
      "matched_observation_ids": ["obs_abc123"],
      "facts": [
        {
          "id": "fact_123",
          "content": "runs marathons",
          "created_at": "2026-04-15T10:00:00Z"
        }
      ]
    }
  ]
}
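
A payload of this shape can be turned into a one-line context string for the text LLM. The sketch below assumes the JSON layout shown above (a `candidates` array with `entity`, `confidence`, and `facts`); `context_from_identify` is a hypothetical helper and the sample payload is illustrative, not captured output.

```python
import json

SAMPLE = """
{
  "candidates": [
    {
      "entity": {"id": "ent_xyz", "name": "Charlie"},
      "confidence": 0.92,
      "matched_observation_ids": ["obs_abc123"],
      "facts": [{"id": "fact_123", "content": "runs marathons"}]
    }
  ]
}
"""


def context_from_identify(payload: str) -> str:
    """Summarise identify() JSON as a short context line for an LLM prompt."""
    candidates = json.loads(payload).get("candidates", [])
    if not candidates:
        return "No known faces."
    parts = []
    for c in candidates:
        facts = "; ".join(f["content"] for f in c.get("facts", []))
        parts.append(f'{c["entity"]["name"]} (conf {c["confidence"]:.2f}): {facts}')
    return "People visible: " + " | ".join(parts)


if __name__ == "__main__":
    print(context_from_identify(SAMPLE))
```

Keeping the summary to names, scores, and fact text means no embeddings or image bytes reach the LLM prompt.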

successful label():
Entity object:

{
  "entity": {
    "id": "ent_xyz",
    "name": "Charlie",
    "created_at": "2026-04-17T14:30:00Z",
    "num_observations": 3,
    "num_facts": 2
  }
}

successful recall():
Entity object with all facts:

{
  "entity": {
    "id": "ent_xyz",
    "name": "Charlie",
    "created_at": "2026-04-17T14:30:00Z",
    "facts": [
      {
        "id": "fact_123",
        "content": "runs marathons",
        "created_at": "2026-04-15T10:00:00Z"
      },
      {
        "id": "fact_124",
        "content": "met at coffee shop on 2026-04-17",
        "created_at": "2026-04-17T14:30:00Z"
      }
    ],
    "relationships": []
  }
}

successful forget():
counts object:

{
  "counts": {
    "observations_deleted": 5,
    "facts_deleted": 3,
    "embeddings_pruned": 5
  }
}

store location:
~/.vemem/ (default) or path specified in VEMEM_HOME env var. contains LanceDB tables: entities, observations, facts, embeddings, bindings, events.

outcome signal

the user knows vemem worked when:

first identify after label: calling identify(same_image) returns the labeled entity as the top candidate. example: "Charlie (confidence 0.92)".
facts appear in identify results: after remember(ent_id, "runs marathons"), calling identify() on Charlie's photo includes that fact in the candidate result.
correction sticks: after relabel(wrong_obs_id, "Dana"), re-running identify() on that same image returns Dana, not Charlie. Charlie's confidence drops because negative binding is recorded.
recall returns all facts: calling recall(charlie_id) returns all facts you've attached to Charlie across sessions. example: "runs marathons", "met at coffee shop", "has a red jacket".
forget is irreversible: after forget(charlie_id), identify() on Charlie's photo returns empty (no match). calling recall(charlie_id) returns not found.
merge consolidates facts: after merge([ent1_id, ent2_id]), one entity has facts from both. example: "Charlie" now includes facts you attached to "Charles" separately.
LanceDB store exists and grows: ls ~/.vemem/ shows populated LanceDB table directories (typically *.lance). ls ~/.insightface/models/buffalo_l/ shows model weights downloaded.
MCP tools respond: if running MCP server, agent can invoke 14 tools and get json results over stdio (no timeout, no network errors).
