Releases: gabelul/pixelslop

pixelslop: v0.3.5

16 Jun 10:41
36ec962

0.3.5 (2026-06-16)

Features

  • add a doctor self-check so stale/broken installs self-diagnose (92e20fb)
  • add a doctor self-check so stale/broken installs self-diagnose (6c88fb6)

pixelslop: v0.3.4

10 Jun 13:56
eb86371

0.3.4 (2026-06-10)

Bug Fixes

  • rewrite skill paths on install so /pixelslop works in any project (59394da)
  • rewrite skill paths on install so /pixelslop works in any project (a09e43a)

pixelslop: v0.3.3

10 Jun 10:03
852a481

0.3.3 (2026-06-10)

Features

  • add the design-director pass for subjective findings (2fb915e)
  • exhaustive-by-default scans with a design-judgment layer (7c8d0b3)
  • generate and evaluate project-specific personas (6dcd540)
  • generate native Codex TOML agents at install time (92cfb7e)
  • make every option discoverable from the skill, guarded against drift (5c01132)
  • make scans exhaustive by default (26468b9)
  • make the skill advise the user, not just run (011391b)
  • split report findings into measured and judgment layers (aeaaf6f)

Bug Fixes

  • make agent spawning work under Codex (inline fallback + native TOML) (5f9f850)
  • make the user-asking protocol work under Codex, not just Claude Code (3bc3920)
  • run agents inline when the harness can't spawn them (Codex) (c7dd90e)

pixelslop: v0.3.2

09 Jun 14:26
77b8a8b

0.3.2 (2026-06-09)

Features

  • add broken-image, oversized-h1, edge-text, and clipped-content detectors (1d09945)
  • add humanName to persona schema for narrative reports (5271af0)
  • add scan save-results command for deterministic data handoff (4c24136)
  • classify content pages and harden loopback resolution (34afc4e)
  • extended detectors and per-target score trends (2247768)
  • fix toward the project's design tokens (f66114e)
  • generate HTML report with plan data after fix loop (ac260a3)
  • measure typography readability and score it (e0a581d)
  • multi-section HTML report with CSS-only tab navigation (3c19d29)
  • narrative persona report format with human names (ce3be25)
  • page-type heuristic for smart persona selection (f803be2)
  • persist per-issue details and show them in the HTML report (15ca62c)
  • report generate command in pixelslop-tools (6b5982e)
  • run-time config step with --quick bypass and precedence spec (7468029)
  • self-contained HTML report template (b4be84c)
  • show pillar evidence and slop patterns in HTML report (c20110b)
  • support checkpoint operations without git for non-repo projects (2bf093d)
  • track per-target score trends across runs (4ec641d)
  • typography readability metrics and fix-toward-tokens (93dadcb)

Bug Fixes

  • force light mode on HTML report, drop broken dark mode (d4c92c0)
  • harden checkpoints against path traversal, fix scan-results resolution (adb4239)
  • harden HTML report against XSS, path traversal, and unsafe URLs (ba548de)
  • pin playwright-core to an exact version for deterministic installs (915903b)
  • template resolution for installed layout, symlink refresh, test coverage (ee9a228)
  • update installer resource count for report-template.html (3357b3c)

Refactoring

  • redesign HTML report with Command Folio design system (61cb281)

pixelslop: v0.3.1

30 Mar 13:44
5a52e51

0.3.1 (2026-03-30)

Features

  • add interactive settings flow to /pixelslop skill (db13482)
  • add settings commands to pixelslop-tools (d70a8d7)
  • interactive project settings for /pixelslop (9015c5e)

Bug Fixes

  • address Codex review — 5 settings bugs + test gaps (369b7c9)
  • correct action-semantic-pull-request commit hash pin (60b2ce6)
  • fence-aware section parser for settings read/write (9827117)

pixelslop: v0.3.0

30 Mar 11:52

Pixelslop 0.3.0 is a ground-up rework of how design quality gets measured. The scanner that used to be one monolithic agent is now a collector feeding 6 specialist evaluators. The browser runtime that depended on Playwright MCP is now direct Playwright execution. And the collector doesn't just look at static screenshots anymore — it scrolls, hovers, tabs through elements, and clicks interactive widgets to verify they actually work.

The /20 scoring model hasn't changed. The 5 pillars are the same. But the evidence behind each score is substantially deeper, and the evaluators are sharper about what they penalize.

Scanner Architecture

The old scanner was a single agent that captured screenshots, extracted styles, and scored everything in one pass. That's gone.

  • Evidence collector — captures screenshots, computed styles, contrast ratios, typography, spacing, a11y snapshots, and persona checks across 3 viewports (1440px, 768px, 375px). Outputs a structured evidence bundle.
  • 6 specialist evaluators — hierarchy, typography, color, responsiveness, accessibility, and slop. Each reads the evidence bundle, applies its rubric, and returns a scored JSON finding. They run in parallel and can't see each other's work.
  • Evidence schema — formal contract between collector and evaluators. Defines every field, its type, which evaluator owns it, and confidence flags.

Direct Browser Runtime

Replaced Playwright MCP tool declarations with direct Playwright execution via pixelslop-tools browser * commands. No MCP middleware, no tool-call overhead.

Interaction Evidence Layer

The collector now runs 4 interaction passes after static evidence capture:

  • Scroll pass — fold-by-fold screenshots, sticky element tracking, lazy image detection, below-fold typography sampling
  • Hover pass — before/after computed style diffs on up to 15 interactive elements
  • Focus pass — keyboard Tab-through with focus indicator detection and non-semantic clickable identification
  • Promise verification — click→verify for mobile menus, anchor links, tabs/accordions. Binary pass/fail.
  • --deep flag — doubles budgets and raises caps for complex pages

Each pass has its own time budget with graceful bailout. Skipped probes (those with ambiguous triggers) are classified as unverifiable, not broken.
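
The skipped-versus-failed distinction can be sketched roughly as follows. This is an illustration only: the `ProbeResult` shape, its field names, and the `classifyProbe` helper are hypothetical and are not taken from the pixelslop evidence schema.

```typescript
// Hypothetical shapes for illustration; the real evidence schema may differ.
type ProbeOutcome = "passed" | "failed" | "skipped";

interface ProbeResult {
  triggerFound: boolean;     // was a trigger element located at all?
  triggerAmbiguous: boolean; // several plausible triggers, so the probe cannot be trusted
  clicked: boolean;          // did the click action actually fire?
  verified: boolean;         // did the promised effect happen (menu opened, anchor scrolled)?
}

// A probe only counts as "failed" when a real click happened and the promise was not kept.
// Everything the collector could not exercise is "skipped" (unverifiable), not broken.
function classifyProbe(r: ProbeResult): ProbeOutcome {
  if (!r.triggerFound || r.triggerAmbiguous || !r.clicked) return "skipped";
  return r.verified ? "passed" : "failed";
}

// Evaluators would penalize only the genuine failures.
function countFailures(results: ProbeResult[]): number {
  return results.filter((r) => classifyProbe(r) === "failed").length;
}
```

Under this sketch, an ambiguous hamburger trigger contributes nothing to the failure count, while a menu that was clicked and did not open does.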

Evaluator Wiring

  • Accessibility — focus indicator gaps (>30% missing = cap at 2), non-semantic clickables (>3 = cap at 2), broken widget ARIA
  • Responsiveness — broken mobile menu (cap at 2), anchor failures scoped to mobile/no-sticky-nav
  • Hierarchy — scroll data for content priority signals
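
As a minimal sketch of the accessibility caps listed above, assuming the evaluator receives simple counts from the focus pass: the `AccessibilityEvidence` interface, its field names, and `capAccessibilityScore` are made up for this example and may not match the actual evaluator.

```typescript
// Hypothetical evidence shape; field names are assumptions, not the real schema.
interface AccessibilityEvidence {
  focusedElements: number;        // elements reached during the Tab-through
  missingFocusIndicator: number;  // of those, how many showed no visible focus indicator
  nonSemanticClickables: number;  // divs/spans acting as buttons
}

const SCORE_CAP = 2; // cap value stated in the release notes

function capAccessibilityScore(rawScore: number, e: AccessibilityEvidence): number {
  // Guard against division by zero when nothing was focusable.
  const missingRatio =
    e.focusedElements > 0 ? e.missingFocusIndicator / e.focusedElements : 0;

  // Either condition alone triggers the cap: >30% missing indicators, or >3 non-semantic clickables.
  const shouldCap = missingRatio > 0.3 || e.nonSemanticClickables > 3;

  // A cap never raises a score that is already below it.
  return shouldCap ? Math.min(rawScore, SCORE_CAP) : rawScore;
}
```

Both thresholds are strict, so exactly 30% missing indicators or exactly 3 non-semantic clickables would not trigger the cap in this sketch.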

Interactive Installer

npx pixelslop install is an interactive wizard. Detects runtimes, picks scope, rewrites paths, configures MCP. npx pixelslop@latest update upgrades with backup + diff.

Release Infrastructure

  • PR title linting via amannn/action-semantic-pull-request
  • Draft release PRs for changelog review
  • Changelog section grouping (features/fixes/refactoring visible, noise hidden)

Tests

781 tests (was 470 at 0.2.0). 7 interaction fixtures, prompt contract tests, evidence schema validation, browser runtime integration tests.

Breaking Changes

None. /20 scoring model, report format, plan format, CLI interface all unchanged.

pixelslop: v0.2.7

30 Mar 08:45
58f092f

Release infrastructure: PR title linting, changelog section grouping, draft release PRs for review before publish. No functional changes.

pixelslop: v0.2.6

30 Mar 08:32
b9861fe

Interaction evidence inside the existing evaluators. The browser collector now scrolls, hovers, tabs, and clicks interactive elements — then feeds what it finds into the accessibility, responsiveness, and hierarchy evaluators. No new pillar, no scoring model change. The existing /20 score gets sharper because evaluators can see things they couldn't before: missing focus indicators, divs masquerading as buttons, broken hamburger menus.

Features

  • Scroll pass — fold-by-fold page analysis with screenshots, sticky element tracking, lazy image detection, and below-fold typography sampling. Pages that scroll for 8+ viewport heights get flagged for content priority issues.
  • Hover pass — before/after computed style diffs on up to 15 interactive elements at desktop. Detects elements with no hover feedback.
  • Focus pass — keyboard Tab-through that tests up to 30 elements for visible focus indicators. Identifies non-semantic clickables (divs/spans with cursor:pointer or onclick that should be buttons).
  • Promise verification — click→verify loop for mobile menus, anchor links, and tabs/accordions. Binary pass/fail outcomes — if the nav doesn't open or the anchor doesn't scroll, that's a measurable failure.
  • --deep flag — doubles all time budgets and raises element caps for extended collection on complex pages.
  • Evaluator wiring — accessibility evaluator now caps score at 2 when >30% of focused elements lack visible indicators, or when >3 non-semantic clickables are found. Responsiveness evaluator caps at 2 for broken mobile menus. Hierarchy evaluator uses scroll data for content priority.
  • Skipped probe handling — ambiguous or unclickable triggers classified as "skipped" (unverifiable), not "failed" (broken). Evaluators only penalize real click-action failures.

Architecture

  • Element ref system assigns stable selectors to interactive elements with semantic classification.
  • Probe isolation: state reset between every interaction and every pass. A noisy probe never contaminates subsequent collection.
  • Per-pass time budgets (scroll 8s, hover 5s, focus 3s, promises 12s) with graceful bailout.
  • Evidence schema updated with formal field specifications and evaluator routing rules.
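
One way to picture the per-pass budgets and the --deep doubling is the sketch below. The budget values come from the list above; `budgetFor`, `runWithBudget`, and the step-based structure are assumptions for illustration and do not describe how the collector is actually implemented.

```typescript
type Pass = "scroll" | "hover" | "focus" | "promises";

// Base budgets in milliseconds, as listed in the release notes.
const BASE_BUDGET_MS: Record<Pass, number> = {
  scroll: 8000,
  hover: 5000,
  focus: 3000,
  promises: 12000,
};

// --deep doubles every time budget.
function budgetFor(pass: Pass, deep: boolean): number {
  return BASE_BUDGET_MS[pass] * (deep ? 2 : 1);
}

// Hypothetical runner: executes steps in order and stops starting new ones once
// the budget is spent, returning whatever was collected (graceful bailout).
async function runWithBudget<T>(
  steps: Array<() => Promise<T>>,
  budgetMs: number,
  now: () => number = Date.now,
): Promise<{ results: T[]; bailedOut: boolean }> {
  const deadline = now() + budgetMs;
  const results: T[] = [];
  for (const step of steps) {
    if (now() >= deadline) return { results, bailedOut: true };
    results.push(await step());
  }
  return { results, bailedOut: false };
}
```

In this sketch a pass that runs out of time keeps its partial results instead of throwing, which fits the idea that an unfinished probe is unverifiable and not a failure.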

Tests

781 tests (was 616). 7 new test fixtures, 3 new test files, prompt contract tests for accessibility and responsiveness evaluators.

pixelslop: v0.2.5

26 Mar 22:47
5bcac70

0.2.5 (2026-03-26)

Features

  • replace Playwright MCP with direct browser runtime (0a9d42e)

Bug Fixes

  • CI installs playwright-core and Chromium before running tests (7a7461c)

pixelslop: v0.2.4

26 Mar 12:03
0813874

0.2.4 (2026-03-26)

Features

  • evidence schema, 6 specialist evaluator agents, evaluator tests (35c6e93)
  • scanner decomposition into evidence collector + specialist fan-out (4e89083)
  • specialist agent architecture — scanner decomposition (67faf02)

Bug Fixes

  • add pixelslop-code-scanner to installer AGENT_FILES (ecd2116)
  • address Codex review — paths, schema fields, slop coverage, responsiveness evidence (91c7b49)
  • Codex round 2 — schema/snippet alignment, test lockdown, evaluator field refs (19fdb7d)
  • remaining Codex review items — schema drift, title, uninstall safety (2034284)