Research / Design Tooling

Open-Source Design Tooling:
Gaps & Ideas

A survey of where the design tooling ecosystem stands in mid-2026 — and five concrete tools nobody has built well yet.

Method Web research + ecosystem analysis
Model Claude Sonnet 4.6 via OpenClaw
Ideas 5 concrete proposals

Where things stand

Design Tokens — W3C spec stable since Oct 2025. Style Dictionary + Tokens Studio dominate. Integration is still bespoke glue code per team.
Figma MCP — Official server launched late 2024. Exposes design context and variable defs to AI agents. No drift detection on top of it.
Accessibility — Pa11y, axe-core, IBM Equal Access handle DOM-level auditing. Stark covers Figma (paid). Nothing at the token layer.
Visual Regression — Chromatic (cloud/paid), BackstopJS (config-heavy), Playwright (dev-focused). Designer-friendly options: zero.
Design System Diff — Existing tools compare frameworks (React vs. Angular), not Figma file versions. No structured changelog tooling.
Color / Typography — Open Color, Leonardo, Coolors cover generation. Auditing a full token set for consistency: nothing.
The recurring pattern: tooling exists at the extremes — inside Figma (plugins, mostly paid) or at the DOM/code layer (axe, Chromatic). Almost nothing lives in the middle — the handoff layer, the token governance layer, or the design-drift detection layer.

Five ideas worth building

01 / 05

token-a11y

Design Token Accessibility Linter

What it does

A CLI that reads any W3C-compliant .tokens.json file (or Tokens Studio export) and cross-checks all color tokens for WCAG AA/AAA compliance across every realistic foreground × background combination. Outputs a structured report: which token pairs fail, which modes (light/dark/high-contrast) break, which components are likely affected. No browser. No DOM. Pure token-level audit.

Why it doesn't exist

  • Stark checks accessibility inside Figma but is paid and not scriptable
  • axe-core and Pa11y operate on a rendered DOM — you can't run them in CI without a browser and a full build
  • No tool consumes the W3C token format (just stabilized Oct 2025) and emits a11y reports in a pipeline-safe way

Build scope

  • Parse W3C token JSON or Style Dictionary output
  • Build foreground/background permutations from semantic token relationships
  • Apply WCAG 2.1 / APCA contrast algorithms
  • Output SARIF or JSON for CI integration

~3–4 weeks · Node.js or Rust · zero heavy deps

02 / 05

figma-drift

Design-to-Code Drift Detector

What it does

A CLI + optional MCP server that compares a Figma component's declared styles (via Figma REST API or the existing MCP get_design_context) to what's actually in your codebase — CSS variables, Tailwind classes, component props. Outputs a semantic diff: "Button background in Figma is --color-brand-500, but your code uses #2563EB hardcoded" or "Figma says border-radius is 8px; code says 6px."

Why it doesn't exist

  • Figma MCP exposes design context but no tool systematically runs it against the codebase
  • Visual regression tools catch screenshot differences, not semantic token drift
  • The gap between "designer updates Figma" and "developer updates code" is invisible until someone notices visually
  • Building this per team is reported as a significant engineering effort — everyone's doing it in isolation

Build scope

  • Figma API / MCP wrapper to extract token refs from a node
  • CSS / Tailwind / CSS-in-JS parser for computed values per component
  • Diff engine with configurable tolerance (pixel vs. semantic token name)
  • GitHub Action wrapper for CI

~4–6 weeks · can reuse Figma MCP SDK as foundation

03 / 05

ds-changelog

Design System Changelog Generator

What it does

A CLI that polls Figma's version history API and generates a structured, human-readable CHANGELOG for your design system. Detects: token renames, token value changes, component deletion, variant additions/removals, library style changes. Outputs Markdown changelogs, JSON diffs, or posts to Slack/GitHub Discussions. Essentially git log for your Figma file.

Why it doesn't exist

  • Figma has version history in the UI but no structured diff API between versions
  • Designers have no equivalent of git log or CHANGELOG.md — changes are communicated manually via Slack or Notion pages
  • Design system consumers (engineers) often find out about breaking changes by noticing visual bugs in production
  • No open-source project traverses Figma's node tree across versions to detect semantic changes

Build scope

  • Figma REST API version history traversal
  • Node-tree diffing (components, styles, variables)
  • Change categorization: breaking vs. additive vs. cosmetic
  • Output adapters: Markdown, JSON, Slack webhook

~3–5 weeks · complexity is efficient large-file traversal

04 / 05

token-validate

Multi-Brand Token Inheritance Validator

What it does

A CLI for design systems that support multiple brands sharing a base token set. Validates the inheritance tree: catches circular references, ensures every brand has all required overrides, detects tokens that fall back to base values unintentionally, and flags orphaned tokens. Think eslint for your token architecture.

Why it doesn't exist

  • Style Dictionary can transform tokens but doesn't validate inheritance correctness
  • The W3C spec defines the format but not validation rules for multi-brand systems
  • Teams building white-label or multi-brand design systems hit these issues at scale with no tooling to guard against them
  • Authoring vs. consuming token gaps are a known open problem in the W3C community group

Build scope

  • Token graph builder from W3C format files
  • Cycle detection and required-override schema definitions
  • Reporter with rule-level configuration (like ESLint rules)
  • GitHub Action or pre-commit hook integration

~2–3 weeks · highly composable with token-a11y

05 / 05

type-audit

Typography System Auditor

What it does

A CLI that extracts all font sizes, weights, and line heights from a Figma file or a CSS codebase, then compares them against the declared type token scale — flagging rogue values that don't fit the system, violations of vertical rhythm consistency, and WCAG readability standards. Answers the question: "Does the code actually match the type scale we defined?"

Why it doesn't exist

  • Typography auditing lives inside Figma plugins (paid) or browser DevTools (manual)
  • No CLI tool extracts type values from both Figma and code and cross-references them
  • Large codebases accumulate rogue font sizes over time with no automated enforcement beyond a linter rule that covers only new code
  • Vertical rhythm and modular scale validation aren't covered by any existing open-source tool

Build scope

  • Figma API extractor for text style nodes
  • CSS / Tailwind parser for font-size, font-weight, line-height usage
  • Scale validator: detect values off the declared type ramp
  • WCAG minimum text size and contrast check

~3–4 weeks · natural companion to figma-drift

Summary

Tool Fills the gap between Scope
token-a11y Token files and WCAG auditing 3–4 weeks
figma-drift Figma design and live codebase 4–6 weeks
ds-changelog Figma versions and developer awareness 3–5 weeks
token-validate Token specs and multi-brand correctness 2–3 weeks
type-audit Declared type scale and what's in the code 3–4 weeks