feat: vision-worthy detection and token-savings estimate by sid732 · Pull Request #6 · sid732/LocalContextRouter

sid732 · 2026-06-19T17:04:15Z

Completes the routing brain. Even pages with a clean text layer can lose their meaning when flattened — tables, charts, diagrams — so those go to a vision model. Every page now carries a token estimate, making the cost avoided explicit.

What

Vision-worthy detection (detect.py): is_vision_worthy(features) routes a page to vision when raster images cover >= 40% of it, vector paths cover >= 30%, or there are >= 25 vector paths (ruled tables, charts, diagrams).
Layout features (Pdf.page_features): counts raster image and vector path objects and their coverage via pypdfium2 — no rendering, no ML, and no AGPL PyMuPDF dependency.
Token estimator (tokens.py): claude_image_tokens (28px patches, 1568/4784 caps), openai_image_tokens (tile counting), estimate_text_tokens.
Router: adds the Source.VISION branch and attaches a TokenEstimate to each page; RouteResult.tokens_saved totals the tokens avoided versus sending every page as an image.

Why pypdfium2 over PyMuPDF

PyMuPDF (get_drawings/find_tables) is AGPL, which would be imposed on this MIT package. pypdfium2 page objects give the same image/vector signals under a permissive license.

Tests

Detector: synthetic features for each rule, plus a real table PDF (>= 25 paths) versus prose.
Tokens: formulas asserted against documented provider examples (1296, 1521, 765, 1105, 3888 tokens).
Router: a table page routes to vision; a text page reports positive savings.

On a mixed 3-page document (prose / table / scan) the router saves ~3077 tokens versus sending every page as an image.

Verified locally: ruff, ruff format, mypy (strict), pytest (37) all pass.

Add PageFeatures (image/path counts and coverage), TokenEstimate with a saved property, the Source.VISION case, the tokens field on PageRoute, and RouteResult.tokens_saved.

Add Pdf.page_features, which counts raster image and vector path objects and their page coverage via pypdfium2 — the signals that flag charts, tables, and diagrams without rendering.

Add is_vision_worthy: route a page to a vision model when images cover much of it, vectors cover a large area, or many vector paths suggest a table or chart.

Add token estimators following each provider's documented tokenization: Claude 28px patches with resolution caps, OpenAI tile counting, and a text estimate.

route_pdf now sends visually-dominant pages to vision and attaches a token estimate to every page, so RouteResult.tokens_saved shows the cost avoided.

Test the detector on synthetic and real page features, the token formulas against documented provider examples, and routing of a table page to vision.

sid732 added 6 commits June 19, 2026 13:03

feat(core): add layout features and token types

42f105f

Add PageFeatures (image/path counts and coverage), TokenEstimate with a saved property, the Source.VISION case, the tokens field on PageRoute, and RouteResult.tokens_saved.

feat(core): extract page layout features

a4191e6

Add Pdf.page_features, which counts raster image and vector path objects and their page coverage via pypdfium2 — the signals that flag charts, tables, and diagrams without rendering.

feat(core): detect vision-worthy pages

17cbc28

Add is_vision_worthy: route a page to a vision model when images cover much of it, vectors cover a large area, or many vector paths suggest a table or chart.

feat(core): estimate image vs text token cost

106356d

Add token estimators following each provider's documented tokenization: Claude 28px patches with resolution caps, OpenAI tile counting, and a text estimate.

feat(core): route vision-worthy pages and report savings

925d1cb

route_pdf now sends visually-dominant pages to vision and attaches a token estimate to every page, so RouteResult.tokens_saved shows the cost avoided.

test(core): cover detection, tokens, and vision routing

a915c63

Test the detector on synthetic and real page features, the token formulas against documented provider examples, and routing of a table page to vision.

sid732 merged commit a8e4b6e into main Jun 19, 2026
6 checks passed

sid732 deleted the feat/vision-routing branch June 23, 2026 03:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: vision-worthy detection and token-savings estimate#6

feat: vision-worthy detection and token-savings estimate#6
sid732 merged 6 commits into
mainfrom
feat/vision-routing

sid732 commented Jun 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

sid732 commented Jun 19, 2026

What

Why pypdfium2 over PyMuPDF

Tests

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant