feat(fallbacks): add census-driven fallback rows #63

Merged: caio-pizzol merged 1 commit into main from caio-pizzol/census-driven-fallbacks on Jun 10, 2026

@caio-pizzol

Contributor

Adds census-driven fallback evidence for document fonts surfaced by corpus analysis: Times, Courier, Century Gothic, Segoe UI, Japanese Mincho/Gothic rows, and Arial aliases.

Why:

  • The initial CJK sample used only full-width characters, so unrelated Japanese designs could score 0.00%.
  • This enriches cjk-jp with half-width katakana, Latin, digits, punctuation, and Han.
  • Exact CJK width matches are capped at cell_width_only, so width evidence does not pretend to prove visual shape similarity.
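A minimal sketch of the reasoning above, assuming the score is a mean advance-width difference expressed as a percentage of the em size. The function names, the advance tables, and every evidence level other than `cell_width_only` are hypothetical and are not taken from the repository.

```typescript
// Hypothetical illustration: only "cjk-jp" and "cell_width_only" appear in the PR.
type AdvanceTable = Record<string, number>; // character -> advance in font units

// Look up an advance width, failing loudly if the sample uses an unmapped character.
function advance(table: AdvanceTable, ch: string): number {
  const width = table[ch];
  if (width === undefined) throw new Error(`missing advance for ${ch}`);
  return width;
}

// Mean absolute advance difference, as a percentage of the em size.
function meanAdvanceDeltaPct(
  a: AdvanceTable,
  b: AdvanceTable,
  sample: string,
  unitsPerEm = 1000,
): number {
  const chars = Array.from(sample);
  const total = chars.reduce(
    (sum, ch) => sum + Math.abs(advance(a, ch) - advance(b, ch)),
    0,
  );
  return (total / chars.length / unitsPerEm) * 100;
}

// Two made-up designs: identical full-width cells, different Latin and digit advances.
const minchoLike: AdvanceTable = { "あ": 1000, "漢": 1000, "ｱ": 500, A: 650, "1": 500 };
const gothicLike: AdvanceTable = { "あ": 1000, "漢": 1000, "ｱ": 500, A: 600, "1": 550 };

// Full-width glyphs all fill one em cell, so this sample cannot separate the designs.
const fullWidthDelta = meanAdvanceDeltaPct(minchoLike, gothicLike, "あ漢");
// Adding half-width katakana, Latin, and digits exposes the difference.
const enrichedDelta = meanAdvanceDeltaPct(minchoLike, gothicLike, "あ漢ｱA1");

// Evidence levels other than "cell_width_only" are placeholders.
type EvidenceLevel = "none" | "cell_width_only" | "metric_compatible";

// An exact width match on a CJK sample is capped: it proves cell width, not shape.
function evidenceFor(sampleId: string, deltaPct: number): EvidenceLevel {
  if (deltaPct !== 0) return "none";
  return sampleId.startsWith("cjk") ? "cell_width_only" : "metric_compatible";
}
```

With these made-up tables, the full-width-only sample reports no difference at all, while the enriched sample does. Even an exact match on `cjk-jp` yields at most `cell_width_only`.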

Impact:

  • Liberation-backed aliases can activate for consumers that already ship those families.
  • SuperDoc still needs assets for URW Gothic, Selawik, BIZ UDMincho, and BIZ UDGothic after a DocFonts release.
  • Arial Unicode MS stays customer_supplied because no broad Unicode substitute was confirmed.
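The activation rule described above could be sketched as follows. The row shape, the `activeRows` helper, and the specific font pairings are assumptions for illustration. The PR names the families but does not state which substitute backs which document font, and only the `customer_supplied` status is taken from the text.

```typescript
// Hypothetical row shape; the real fallback table may differ.
interface FallbackRow {
  documentFont: string;
  substitute: string | null; // null when no substitute was confirmed
  status: "substitute" | "customer_supplied";
}

// Pairings below are guesses for illustration, not confirmed by the PR.
const rows: FallbackRow[] = [
  { documentFont: "Times", substitute: "Liberation Serif", status: "substitute" },
  { documentFont: "Century Gothic", substitute: "URW Gothic", status: "substitute" },
  { documentFont: "Arial Unicode MS", substitute: null, status: "customer_supplied" },
];

// A row activates only when the consumer already ships the substitute family.
function activeRows(all: FallbackRow[], shipped: Set<string>): FallbackRow[] {
  return all.filter((row) => row.substitute !== null && shipped.has(row.substitute));
}

// A consumer that ships only Liberation Serif activates just the Times row.
const active = activeRows(rows, new Set(["Liberation Serif"]));
```

Under this sketch, a consumer gains the URW Gothic row only after adding that asset, and the Arial Unicode MS row never activates because it has no substitute.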

Verified:

  • bun run format
  • bun run check
  • git diff --check
  • private-string and em dash scan

@caio-pizzol caio-pizzol marked this pull request as ready for review June 10, 2026 19:03
@caio-pizzol caio-pizzol merged commit 164d7bd into main Jun 10, 2026
1 check passed
@caio-pizzol caio-pizzol deleted the caio-pizzol/census-driven-fallbacks branch June 10, 2026 19:03