How We Score
The Republic Score uses an AI-powered, time-adjusted system. We evaluate "are they doing what's expected at THIS point in time?" — not "have they delivered everything?"
Key Concept
A government that's only 9 days old shouldn't get a D for not having delivered everything. The time-adjusted system understands each commitment's complexity and timeline — quick wins are expected in the first week, but structural reforms take years.
1. Every Commitment Has Its Own Timeline
AI classifies each of the 109 commitments into one of 4 complexity tiers:
Executive orders, directives, appointments — things a PM can do immediately.
e.g.: Form cabinet with 18 ministries, issue anti-corruption directive
Policy reform, program launches, budget allocations — requires coordination across departments.
e.g.: Launch digital land registry, reform procurement rules
Legislation, infrastructure, institutional reform — requires sustained effort over months.
e.g.: Build new highways, pass constitutional amendments, education reform
Fundamental institutional change — deep reform that takes years to implement.
e.g.: Federal restructuring, tax system overhaul, judicial reform
2. We Measure 3 Types of Effort
Every signal (news article, speech, budget document) is AI-classified into an effort tier:
Speeches, announcements, plans, committee formations — talk, not yet action.
Budget allocations, policy changes, bills tabled, contracts signed — concrete steps.
Infrastructure completed, services launched, laws enacted — tangible results citizens can see.
3. Weights Shift Over Time
In the early days, intent signals (speeches, plans) are valuable. As time passes, only delivery matters.
| Phase | Intent | Action | Delivery |
|---|---|---|---|
| First 2 weeks | 60% | 35% | 5% |
| Month 1-2 | 30% | 55% | 15% |
| Month 3-6 | 10% | 50% | 40% |
| Month 6-12 | 5% | 25% | 70% |
| Past deadline | 2% | 13% | 85% |
4. Every Commitment Gets a Trajectory
5. Overall Republic Score
The overall score is a weighted average across complexity tiers. The weights shift based on how far into the term:
6. Our Data Sources
AI automatically collects and classifies signals from 80+ sources every 12 hours. No human intervention needed.
Data Confidence
We don't show scores without sufficient verified data. "Too early" commitments score 50 (neutral) and are excluded from grading.
Transparency
- • All scoring code is open-source — anyone can inspect the logic
- • AI models: Qwen 3.6+ (free classification), GPT-4.1-mini (deep analysis)
- • Score auto-recomputes every sweep (2x daily)
- • Admins can override AI timelines — overrides are flagged
- • Total AI cost under $5/month (free models used first)
Current Limitations
- • AI occasionally misclassifies signals — admin review corrects this
- • Budget execution data is still limited
- • Citizen voting just launched — sample size is growing
- • Some commitments have "insufficient" data confidence — more signals needed