Data product brief for education researchers — the largest entity-attributed email security dataset covering K-12 districts and higher education institutions across all 50 states.
| Total domains DNS-profiled | 406,680 |
| Domains alive (resolving) | 391,984 (96.4%) |
| Public sector entities | 69,892 |
| For-profit businesses (SAM.gov) | 273,713 |
| Nonprofits (IRS BMF) | 84,272 |
| States covered | 50 + DC |
| DNS record types per domain | 7 |
| Entity schema fields | 40 |
| K-12 districts + charter LEAs | 22,741 |
| Higher education institutions | 3,246 |
| Total education entities | 25,987 |
| K-12 with website / domain | 18,274 |
| K-12 with phone + address | 19,281 (100%) |
| K-12 with NCES LEAID (join key) | 19,281 |
| K-12 data freshness | 2024-2025 |
The U.S. Census Bureau's Census of Governments counts entities but provides no internet domains or email provider data. NCES CCD/IPEDS cover enrollment and finance but no technology infrastructure. This is the only dataset linking every U.S. K-12 district and accredited institution to its full DNS profile, email provider, security gateway status, and authentication posture.
Unlike threat intelligence feeds that start from domain lists, this dataset starts from authoritative government rosters and works forward to DNS. Every record is attributed to a named entity with type, subtype, state, county, and source provenance. Researchers can join to NCES, Census, or SAIPE datasets by district name, state, and county.
Each domain is profiled across the complete email authentication stack: SPF record presence and qualifier • DKIM public key across common selectors • DMARC policy level • MX provider classification • Gateway proxy detection • Underlying provider inference via SPF analysis
| Segment | Google Workspace | Microsoft 365 | Ratio |
|---|---|---|---|
| K-12 school districts | 8,611 | 2,159 | 4.0 : 1 |
| Higher education | 579 | 1,680 | 1 : 2.9 |
| Protocol | K-12 | Higher Ed | National Avg |
|---|---|---|---|
| MX records | 85.7% | 93.3% | 81.0% |
| SPF | 79.1% | 90.5% | 75.9% |
| DMARC (any) | 51.7% | 86.1% | 47.2% |
| DMARC reject | ~25% | ~55% | ~22.6% |
| Entity Type | Domains with MX | Using Gateway | Rate |
|---|---|---|---|
| Higher education | 2,767 | 564 | 20.4% |
| K-12 | 12,607 | 1,033 | 8.2% |
Districts with gateways show +22.7pp DMARC uplift and +8.1pp SPF uplift over non-proxied domains.
The dataset covers all 50 states + DC with per-state breakdowns. States with centralized IT governance show higher adoption; states with fragmented district structures show lower adoption. Rural vs urban district size correlates with security posture.
| Provider Category | SPF | DMARC | Gap |
|---|---|---|---|
| Email Security Proxy | 97.3% | 75.7% | 21.6pp |
| Enterprise Cloud (Google/M365) | 93.2% | 60.4% | 32.8pp |
| Budget Hosting (GoDaddy, IONOS) | 51.3% | 23.8% | 27.5pp |
Districts using GoDaddy email: 7.3% SPF — effectively unprotected. The vendor ecosystem determines the security floor.
| Field | Type | Description |
|---|---|---|
entity_name | string | Official district or institution name |
entity_type | enum | k12 or higher_ed |
entity_subtype | string | school_district, charter_org, community_college, university, etc. |
state | string | Two-letter USPS code |
county | string | County name |
primary_domain | string | Apex domain (e.g., springisd.org) |
mx_provider | string | Classified email provider |
has_spf / has_dkim / has_dmarc | boolean | Protocol presence flags |
dmarc_policy | string | none, quarantine, reject |
email_proxy | string | Security gateway service (Barracuda, Proofpoint, etc.) |
underlying_provider | string | Real mailbox platform behind gateway |
dns_score | integer | 0–100 composite security score |
grade | string | A / B / C / D / F |
source_name / source_url | string | Authoritative collection source with URL |
| NEW — CCD 2024-2025 Fields | ||
nces_leaid | string | NCES LEA ID — canonical join key to NCES finance, enrollment, demographics |
phone | string | District main phone number |
physical_address | string | Physical street address |
mailing_address | string | Mailing address |
grade_low / grade_high | string | Grade span (e.g., PK to 12) |
operational_schools | integer | Number of operational schools in district |
lea_type | string | LEA type (regular, charter agency, regional, specialized, etc.) |
charter_flag | string | Charter status (NOTCHR, CHRTIDEAESEA, etc.) |
| Configuration | Points |
|---|---|
-all (hard fail) | 30 |
~all (soft fail) | 15 |
?all or +all | 5 |
| No SPF record | 0 |
| Configuration | Points |
|---|---|
| DKIM public key published | 30 |
| No DKIM key found | 0 |
Selectors checked: google, mail, selector1, selector2, s1, s2, k1
| Configuration | Points |
|---|---|
p=reject (full enforcement) | 40 |
p=quarantine | 20 |
p=none (monitoring only) | 10 |
| No DMARC record | 0 |
DMARC receives 40% weight because it is the only protocol that prevents spoofing in the From: header — the field end users actually see.
superintendent@district.org and it lands in inboxes.The ESI scoring rubric is applied uniformly across our full entity registry. Researchers who need broader context can access scored datasets for:
How does K-12 email security compare to the nonprofit sector? Do SAM.gov-registered contractors outperform the school districts they serve? Education-only purchasers can add sector packs later without re-licensing.
| CSV / Parquet | Flat files, one row per entity |
| SQLite database | Pre-indexed by state, type, provider, grade |
| API access | RESTful query with filtering & aggregation |
| Interactive dashboard | Web-based explorer with drill-down |
All deliveries include methodology docs, source provenance, data dictionary, and reproducibility scripts.
| Tier | Scope |
|---|---|
| State Pack | Single state, all education entities |
| K-12 National | All 22,741 K-12 districts + charters, 50 states |
| Higher Ed National | All 3,246 institutions, 50 states |
| Full Education | K-12 + Higher Ed combined |
| Full Public Sector | All 69,892 entities (all types) |
| API + Updates | Quarterly re-scan, API access |
| Capability | This Dataset | NCES CCD/IPEDS | Threat Intel | Censys/Shodan |
|---|---|---|---|---|
| Entity-attributed (named districts) | Yes | Yes | No | No |
| All 50 states + DC | Yes | Yes | Partial | — |
| Email provider classification | Yes | No | No | Partial |
| DMARC/SPF/DKIM posture | Yes | No | Yes | No |
| Email gateway detection | Yes | No | Partial | No |
| Underlying provider inference | Yes | No | No | No |
| K-12 + Higher Ed combined | Yes | Separate | No | No |
| Education-sector benchmarks | Yes | No | No | No |
Try a free email security scorecard for any education domain
Get Your Free Scorecard →Contact: research@monitorworkspace.com