Item: ln-522-manual-tester
Rating: 5.2
Author: Implexa
Performs manual testing of Story AC via executable bash scripts in tests/manual/. Use when Story implementation needs hands-on AC verification.
SKILL.md

Paths: File paths (references/, ../ln-*) are relative to this skill directory.

MANDATORY READ: Load references/ci_tool_detection.md — compact output flags, pipefail, and failure-artifact policy for bash/curl/Puppeteer scripts.

Inputs

Input
Required
Source
Description

storyId
Yes
args, git branch, kanban, user
Story to process

Resolution: Story Resolution Chain.
Status filter: To Review

Manual Tester

Type: L3 Worker

Manually verifies Story AC on running code and reports structured results for the quality gate.
Purpose & Scope

Create executable test scripts in tests/manual/ folder of target project.

Run AC-driven checks via bash/curl (API) or puppeteer (UI).

Save scripts permanently for regression testing (not temp files).

Document results via the configured tracker provider (addComment) with pass/fail per AC and script path.

No status changes or task creation.

When to Use

Use when a Story needs hands-on acceptance-criteria verification before automated planning

Research comment "## Test Research:" exists on Story (from ln-521)

All implementation tasks in Story status = Done

Test Design Principles

1. Fail-Fast - No Silent Failures

CRITICAL: Tests MUST return 1 (fail) immediately when any criterion is not met.

Never use: print_status "WARN" + return 0 for validation failures, graceful degradation without explicit flags, silent fallbacks that hide errors.

Exceptions (WARN is OK): Informational warnings that don't affect correctness, optional features (with clear justification in comments), infrastructure issues (e.g., missing Nginx in dev environment).

2. Expected-Based Testing - The Golden Standard

CRITICAL: Tests MUST compare actual results against expected reference files, not apply heuristics or algorithmic checks.

Directory structure:

tests/manual/NN-feature/
├── samples/               # Input files
├── expected/              # Expected output files (REQUIRED!)
│   └── {base_name}_{source_lang}-{target_lang}.{ext}
└── test-*.sh

Heuristics acceptable ONLY for: dynamic/non-deterministic data (timestamps, UUIDs, tokens - normalize before comparison; JSON with unordered keys - use jq --sort-keys).

3. Results Storage

Test results saved to tests/manual/results/ (persistent, in .gitignore). Named: result_{ac_name}.{ext} or response_{ac_name}.json. Inspectable after test completion for debugging.

4. Expected File Generation

To create expected files:

Run test with current implementation

Review output in results/ folder

If correct: copy to expected/ folder with proper naming

If incorrect: fix implementation first, then copy

IMPORTANT: Never blindly copy results to expected. Always validate correctness first.

Workflow

Phase 0: Resolve Inputs

MANDATORY READ: Load references/input_resolution_pattern.md

Resolve storyId: Run Story Resolution Chain per guide (status filter: [To Review]).

Phase 1: Setup tests/manual structure

Read docs/project/infrastructure.md — get port allocation, service endpoints, base URLs. Read docs/project/runbook.md — get Docker commands, test prerequisites, environment setup

Check if tests/manual/ folder exists in project root

If missing, create structure:

tests/manual/config.sh — shared configuration (BASE_URL, helpers, colors)

tests/manual/README.md — folder documentation (see README.md template below)

tests/manual/test-all.sh — master script to run all test suites (see test-all.sh template below)

tests/manual/results/ — folder for test outputs (add to .gitignore)

Add tests/manual/results/ to project .gitignore if not present

If exists, read existing config.sh to reuse settings (BASE_URL, tokens)

Phase 2: Create Story test script

Fetch Story, parse AC into Given/When/Then list (3-5 expected)

Check for research comment (from ln-521-test-researcher) — incorporate findings into test cases

Detect API vs UI (API → curl, UI → puppeteer). IF UI: MANDATORY READ: Load references/puppeteer_patterns.md

Create test folder structure:

tests/manual/{NN}-{story-slug}/samples/ — input files (if needed)

tests/manual/{NN}-{story-slug}/expected/ — expected output files (REQUIRED for deterministic tests)

Generate test script: tests/manual/{NN}-{story-slug}/test-{story-slug}.sh

Use appropriate template: TEMPLATE-api-endpoint.sh (direct calls) or TEMPLATE-document-format.sh (async jobs)

Header: Story ID, AC list, prerequisites

Test function per AC + edge/error cases

diff-based validation against expected files (PRIMARY)

Results saved to tests/manual/results/

Summary table with timing

Make script executable (chmod +x)

Phase 3: Update Documentation

Update tests/manual/README.md:

Add new test to "Available Test Suites" table

Include Story ID, AC covered, run command

Update tests/manual/test-all.sh:

Add call to new script in SUITES array

Maintain execution order (00-setup first, then numbered suites)

Phase 4: Execute and report

MANDATORY READ: Load references/test_result_format_v1.md

Rebuild Docker containers (no cache), ensure healthy

Run generated script, capture output

Parse results (pass/fail counts)

Post tracker comment (addComment, per test_result_format_v1.md) with:

AC matrix (pass/fail per AC)

Script path: tests/manual/{NN}-{story-slug}/test-{story-slug}.sh

Rerun command: cd tests/manual && ./{NN}-{story-slug}/test-{story-slug}.sh

Critical Rules

Scripts saved to project tests/manual/, NOT temp files.

Rebuild Docker before testing; fail if rebuild/run unhealthy.

Keep language of Story (EN/RU) in script comments and tracker comment.

No fixes or status changes; only evidence and verdict.

Script must be idempotent (can rerun anytime).

Runtime Summary Artifact

MANDATORY READ: Load references/test_planning_summary_contract.md, references/test_planning_worker_runtime_contract.md

Runtime profile:

family: test-planning-worker

worker: ln-522

summary kind: test-planning-worker

payload fields used by coordinators: worker, status, warnings, manual_result_path

Invocation rules:

standalone: omit runId and summaryArtifactPath

managed: pass both runId and exact summaryArtifactPath

always write the validated summary before terminal outcome

Test scripts always go to tests/manual/, never to the project root.

Monitor Integration (Claude Code 2.1.98+)

MANDATORY READ: Load references/monitor_integration_pattern.md

When running test scripts expected to take >30 seconds:
Monitor(command="bash tests/manual/{suite}/test-{slug}.sh 2>&1", timeout_ms=300000, description="manual test: {slug}")

Fallback: if Monitor is unavailable (Bedrock/Vertex), use Bash(run_in_background=true).

Definition of Done

 tests/manual/ structure exists (config.sh, README.md, test-all.sh, results/ created if missing).

 tests/manual/results/ added to project .gitignore.

 Test script created at tests/manual/{NN}-{story-slug}/test-{story-slug}.sh.

 expected/ folder created with at least 1 expected file per deterministic AC.

 Script uses diff-based validation against expected files (not heuristics).

 Script saves results to tests/manual/results/ for debugging.

 Script is executable and idempotent.

 README.md updated with new test suite in "Available Test Suites" table.

 test-all.sh updated with call to new script in SUITES array.

 App rebuilt and running; tests executed.

 Verdict and tracker comment posted with script path and rerun command.

Script Templates

README.md (created once per project)

# Manual Testing Scripts

> **SCOPE:** Bash scripts for manual API testing. Complements automated tests with CLI-based workflows.

## Quick Start

```bash
cd tests/manual
./00-setup/create-account.sh  # (if auth required)
./test-all.sh                 # Run ALL test suites

Prerequisites

Docker containers running (docker compose ps)

jq installed (apt-get install jq or brew install jq)

Folder Structure

tests/manual/
├── config.sh          # Shared configuration (BASE_URL, helpers, colors)
├── README.md          # This file
├── test-all.sh        # Run all test suites
├── 00-setup/          # Account & token setup (if auth required)
│   ├── create-account.sh
│   └── get-token.sh
└── {NN}-{topic}/      # Test suites by Story
    └── test-{slug}.sh

Available Test Suites

Suite
Story
AC Covered
Run Command

—
—
—
—

Adding New Tests

Create script in {NN}-{topic}/test-{slug}.sh

Update this README (Available Test Suites table)

Update test-all.sh (add to SUITES array)

### test-all.sh (created once per project)

```bash
#!/bin/bash
# =============================================================================
# Run all manual test suites
# =============================================================================
set -e
SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
source "$SCRIPT_DIR/config.sh"

echo "=========================================="
echo "Running ALL Manual Test Suites"
echo "=========================================="

check_jq
check_api

# Setup (if exists)
[ -f "$SCRIPT_DIR/00-setup/create-account.sh" ] && "$SCRIPT_DIR/00-setup/create-account.sh"
[ -f "$SCRIPT_DIR/00-setup/get-token.sh" ] && "$SCRIPT_DIR/00-setup/get-token.sh"

# Test suites (add new suites here)
SUITES=(
    # "01-auth/test-auth-flow.sh"
    # "02-translation/test-translation.sh"
)

PASSED=0; FAILED=0
for suite in "${SUITES[@]}"; do
    echo ""
    echo "=========================================="
    echo "Running: $suite"
    echo "=========================================="
    if "$SCRIPT_DIR/$suite"; then
        ((++PASSED))
        print_status "PASS" "$suite"
    else
        ((++FAILED))
        print_status "FAIL" "$suite"
    fi
done

echo ""
echo "=========================================="
echo "TOTAL: $PASSED suites passed, $FAILED failed"
echo "=========================================="
[ $FAILED -eq 0 ] && exit 0 || exit 1

config.sh (created once per project)

#!/bin/bash
# Shared configuration for manual testing scripts
export BASE_URL="${BASE_URL:-http://localhost:8080}"
export RED='\033[0;31m'
export GREEN='\033[0;32m'
export YELLOW='\033[1;33m'
export NC='\033[0m'

print_status() {
    local status=$1; local message=$2
    case $status in
        "PASS") echo -e "${GREEN}[PASS]${NC} $message" ;;
        "FAIL") echo -e "${RED}[FAIL]${NC} $message" ;;
        "WARN") echo -e "${YELLOW}[WARN]${NC} $message" ;;
        "INFO") echo -e "[INFO] $message" ;;
    esac
}

check_jq() {
    command -v jq &> /dev/null || { echo "Error: jq required"; exit 1; }
}

check_api() {
    local response=$(curl -s -o /dev/null -w "%{http_code}" "$BASE_URL/health" 2>/dev/null)
    if [ "$response" != "200" ]; then
        echo "Error: API not reachable at $BASE_URL"
        exit 1
    fi
    print_status "INFO" "API reachable at $BASE_URL"
}

SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
export SCRIPT_DIR

Test Script Templates

See: references/templates/

Template
Use Case
Location

template-api-endpoint.sh
API endpoint tests (NO async jobs)
template-api-endpoint.sh

template-document-format.sh
Document/file processing (WITH async jobs)
template-document-format.sh

Quick start:

cp references/templates/template-api-endpoint.sh {NN}-feature/test-{feature}.sh      # Endpoint tests
cp references/templates/template-document-format.sh {NN}-feature/test-{format}.sh    # Document tests

Reference Files

Script format reference: prompsit-api tests/manual/ (production example)

AC format: references/templates/test_task_template.md (or local docs/templates/ in target project)

Risk-based context: references/risk_based_testing_guide.md

Research findings: ln-521-test-researcher creates "## Test Research" comment on Story

Puppeteer patterns: references/puppeteer_patterns.md

Test result format: references/test_result_format_v1.md

Version: 1.0.0
Last Updated: 2026-01-15
ln-522-manual-tester

SKILL.md

related skills