Binary file added PBIF Policy Library.docx
Binary file not shown.
112 changes: 112 additions & 0 deletions PBIF Policy Library.txt

Large diffs are not rendered by default.

14 changes: 7 additions & 7 deletions src/content/pbif/01-executive-summary.md
@@ -11,17 +11,17 @@ PolicyEngine Policy Library
## Executive Summary (limit 250 words)
*"In a concise summary, describe the core problem your project addresses, the proposed technical solution, the target beneficiaries, and the anticipated impact."*

**Word Count: 212/250**
**Word Count: 239/250**

The Policy Library creates the missing infrastructure layer that transforms how America's safety net operates. Currently, 18% of benefit policy URLs from 2019 are dead, forcing every organization to independently maintain fragile document collections. When tools access our comprehensive, permanently archived source documents, we enable transformative capabilities: AI assistants accurately determining multi-program eligibility without hallucinations, caseworkers confidently navigating complex rules with authoritative sources, researchers tracking policy evolution across jurisdictions, and innovations we haven't imagined.

We combine AI-powered monitoring (Claude/GPT-5) with PolicyEngine's open-source rules engine creating intelligent infrastructure. Our system understands document relationships, surfaces non-obvious connections (TANF-SNAP categorical eligibility), providing authoritative ground truth for accurate benefits determination.
We'll combine AI-powered monitoring (Claude/GPT-5) with PolicyEngine's open-source rules engine to create intelligent infrastructure. Our system will understand document relationships and surface non-obvious connections like TANF-SNAP categorical eligibility, providing authoritative ground truth for accurate benefits determination. We'll use agentic AI with comprehensive metadata rather than fine-tuning, enabling flexible and accurate document retrieval at scale.

**PBIF Priority Impact:** Income verification via state-specific disregards; reduced SNAP errors through current criteria; confident beneficiary communication with source documents; backlog reduction saving staff time.
**PBIF Priority Impact:** We'll enable income verification via state-specific disregards, reduce SNAP errors through current criteria, support confident beneficiary communication with source documents, and reduce backlogs to save staff time.

**Not starting from scratch:** Collaboration with Federal Reserve Bank of Atlanta, Georgia Center for Opportunity, NBER, Prenatal-to-3, Better Government Lab, USC, MyFriendBen, and Benefit Navigator continues—we seed the library with documents covering nationwide scope. NBER and Prenatal-to-3 use PolicyEngine for tax credit modeling; Mirza and Impactica use our API. We'll add document display to API requests, integrate caseworker training. Colorado users and Riverside caseworkers see sources with calculations.
**Not starting from scratch:** Our collaboration with Federal Reserve Bank of Atlanta, Georgia Center for Opportunity, NBER, Prenatal-to-3, Better Government Lab, USC, MyFriendBen, and Benefit Navigator continues. We'll seed the library with documents covering nationwide scope. NBER and Prenatal-to-3 already use PolicyEngine for tax credit modeling, while Mirza and Impactica use our API. We'll add document display to API requests and integrate caseworker training. Colorado users and Riverside caseworkers will see sources alongside calculations.

**12-month timeline:** Months 1-3: Launch 5,000+ documents, 10 states; Months 4-6: API v1 with partners; Months 7-9: 30 states; Months 10-12: Full production covering 50+ jurisdictions.
**12-month production timeline:** Months 1-3: Launch 5,000+ documents, 10 states; Months 4-6: API v1 with partners; Months 7-9: 30 states; Months 10-12: Full production covering 50+ jurisdictions.

## Stage of Development
**Status:** Pilot ready / Active pilot
@@ -30,6 +30,6 @@ Our collaboration with Atlanta Fed and Georgia Center for Opportunity archives f

## Project Timeline & Funding
**Start Date:** November 15, 2025
**End Date:** November 14, 2027 (24 months)
**End Date:** November 14, 2026 (12 months)
**Total Grant Request:** $675,059
**Other Funding:** [To be determined]
**Other Funding:** PolicyEngine operational support, partner in-kind contributions
12 changes: 6 additions & 6 deletions src/content/pbif/02-value-proposition.md
@@ -9,19 +9,19 @@ The benefits ecosystem lacks infrastructure for policy documentation, forcing or

Partners validated this need. MyFriendBen and Benefit Navigator waste resources maintaining documents. Georgetown and Michigan researchers lack document foundations. Atlanta Fed's collaboration shows even sophisticated institutions need shared infrastructure. Families face inconsistent information without comprehensive documentation.

Archive.org can't solve this: captures pages indiscriminately without understanding policy documents, no API for "Colorado SNAP rules" queries, can't identify document relationships, no metadata or semantic search. Benefits platforms need purpose-built infrastructure: structured data, reliable APIs, intelligent understanding. AI uniquely enables: (1) Intelligent crawling understands which documents matter, (2) LLMs identify changes requiring preservation, (3) AI surfaces non-obvious connections like TANF-SNAP categorical eligibility. Claude and GPT-5 excel at document extraction. This infrastructure amplifies human expertise, enabling impossible innovations.
Archive.org can't solve this problem because it captures pages indiscriminately without understanding policy documents, provides no API for queries like "Colorado SNAP rules," can't identify document relationships, and lacks metadata or semantic search capabilities. Benefits platforms need purpose-built infrastructure with structured data, reliable APIs, and intelligent understanding. AI uniquely enables this solution: (1) Intelligent crawling understands which documents matter, (2) LLMs identify changes requiring preservation, (3) AI surfaces non-obvious connections like TANF-SNAP categorical eligibility. We'll use agentic AI approaches with robust metadata rather than fine-tuning, as Claude and GPT-5 excel at document extraction and understanding when given proper context. This infrastructure amplifies human expertise, enabling innovations that weren't previously possible.

## Solution & Target Beneficiaries (250 words)

**Word Count: 244/250**

The Policy Library solves document disappearance through four integrated components: (1) Bulk ingestion where partners contribute thousands of documents—AI extracts metadata while humans verify; (2) AI-powered crawlers using Claude/GPT-5 monitor government websites weekly, understanding context and relationships; (3) Human review via GitHub pull requests ensures accuracy; (4) Stable API serves documents with permanent source IDs. We'll launch with 5,000+ documents bulk-uploaded from participating organizations—PolicyEngine, Atlanta Fed, GCO, NBER (TAXSIM MOU), Prenatal-to-3 at Vanderbilt, Better Government Lab, USC, MyFriendBen, Benefit Navigator—ensuring comprehensive coverage from day one.

Vulnerable families navigating benefits currently lose access when documents disappear—they are our primary beneficiaries. We involve them through partnerships with direct service organizations that serve these populations daily. MyFriendBen and Benefit Navigator staff provide continuous feedback on document needs and usability, ensuring we capture what families actually need.
Vulnerable families navigating benefits currently lose access when documents disappear—they are our primary beneficiaries. We'll delegate outreach and trust-building to our direct service partners who already have established relationships with these communities. MyFriendBen serves 3,500+ Colorado families monthly and will integrate document displays directly into their existing screeners. Benefit Navigator's caseworkers in LA and Riverside Counties will provide continuous feedback on document needs while serving their clients. These partners will handle beneficiary outreach through their existing channels, ensuring authentic engagement rather than top-down communication.

Organizations serving these families also benefit significantly. Direct service providers save hours they currently waste maintaining broken links. Our system proactively monitors all document URLs and sends immediate alerts when links break, allowing partners to update references before users encounter errors. Benefits navigators access reliable documentation instantly. University researchers gain the ability to conduct longitudinal policy analysis. Government agencies benefit from permanent archives of their own historical documents.

We involve beneficiaries throughout the project via: Monthly feedback sessions with partner organizations, open GitHub discussions for document requests, public dashboards showing coverage gaps, and direct integration with tools families already use. This participatory approach ensures we're building infrastructure that serves real needs, not theoretical ones.
We'll involve beneficiaries throughout the project via multiple channels. Our partners will conduct monthly feedback sessions with their users and relay insights to us. We'll maintain open GitHub discussions for document requests, publish public dashboards showing coverage gaps, and ensure direct integration with tools families already use. MyFriendBen and Benefit Navigator will serve as our primary conduits for beneficiary feedback, leveraging their existing trust relationships to gather authentic input. This delegated, participatory approach ensures we're building infrastructure that serves real needs identified by those closest to the beneficiaries, not theoretical ones.

## Proposed Benefit and Impact Evaluation (250 words)

@@ -35,7 +35,7 @@ Specific measurable metrics include:

**Reliability metrics:** 99.9% API uptime, under 100ms retrieval speed, 99.5% accuracy via human verification.

**Impact metrics:** MyFriendBen's 3,500 monthly Colorado users see primary sources; Riverside County's 500+ caseworkers access real-time verification. Rules engine integration ensures ALL relevant documents including non-obvious connections (TANF-SNAP eligibility). Track: document retrievals per partner, broken link reduction, time to resolve eligibility questions. LLM accuracy improvement of 24pp through test cases, including rules-as-code generation experiments (Beeck Center approach) comparing AI performance with/without primary sources—expecting significant improvement generating accurate PolicyEngine parameter files when LLMs reference actual statutes.
**Impact metrics:** MyFriendBen's 3,500 monthly Colorado users will see primary sources; Riverside County's 500+ caseworkers will access real-time verification. Our rules engine integration will ensure ALL relevant documents are found, including non-obvious connections like TANF-SNAP eligibility. We'll track document retrievals per partner, broken link reduction, and time to resolve eligibility questions. We'll measure API response times, document accuracy rates through human verification, and partner satisfaction scores through monthly surveys.

We track progress through automated dashboards, monthly partner surveys, and API analytics. We publish quarterly reports sharing findings publicly. Success means families never hear "we can't find that document" when applying for benefits.

@@ -65,7 +65,7 @@ Community organization adoption follows a tiered approach. Tier 1: Direct integr

Sustainability comes through diversified support. Enterprise API subscriptions from large platforms generate recurring revenue. Government contracts for official preservation services provide stable funding. Foundation support maintains free access for nonprofits. Open-source model enables community contributions reducing costs.

Scalability is built into architecture. Cloud infrastructure handles growth automatically. Crawler architecture is jurisdiction-agnostic, adding new states requires configuration not code. Community contributors can add coverage through pull requests. By Month 24, we'll cover all 50 states plus federal programs, becoming essential infrastructure for America's safety net.
Scalability is built into our architecture. Cloud infrastructure will handle growth automatically. Our crawler architecture is jurisdiction-agnostic: adding new states requires only configuration changes, not new code. Community contributors can add coverage through pull requests. Within 12 months, we'll achieve full production with 50+ jurisdictions covered. Post-grant, we'll continue expanding internationally, becoming essential infrastructure for America's safety net and beyond.

## Dissemination & Learning (250 words)

@@ -77,7 +77,7 @@ Knowledge sharing maximizes impact across the benefits ecosystem.

**Public data access:** Document corpus via API with free tier. Bulk exports for researchers. Public dashboards showing coverage. Weekly Internet Archive dumps for preservation.

**Learning dissemination:** Quarterly reports on policy patterns, preservation challenges, adoption metrics. LLM benchmark results showing accuracy improvements (baseline, with documents, with tools, full stack), including rules-as-code generation experiments demonstrating how primary sources enable accurate PolicyEngine parameter generation—extending Beeck Center's work. Academic papers on AI-powered benefits navigation. Conference presentations at Code for America Summit, Benefits Data Trust convening. Webinars for navigators and developers.
**Learning dissemination:** Quarterly reports on policy patterns, preservation challenges, and adoption metrics. Academic papers on AI-powered benefits navigation and document preservation methodologies. Conference presentations at Code for America Summit and Benefits Data Trust convening. Webinars for navigators and developers on using the Policy Library API and integrating document access into their tools.

**Community engagement:** Monthly calls for feedback. GitHub discussions for requests. Newsletter with updates. Documentation wiki.
