We present the Lokentra U.S. Email Security Index (ESI), the most comprehensive entity-attributed DNS email security dataset for the United States. The registry covers 801,359 organizations spanning nonprofits, for-profit businesses, government entities, and educational institutions across all 50 states and six territories, with 577,882 domains fully DNS-profiled across seven record types including SPF, DKIM, DMARC, CAA, and MTA-STS. We find that only 38.8% of U.S. organizations with a web presence have any DMARC record, and only 5.6% deploy DMARC at enforcement level (p=reject) — the only control that actively prevents domain spoofing. DKIM adoption stands at 36.9%, exposing the majority of U.S. organizations to email impersonation. Grade distribution is heavily skewed toward poor posture: 28.6% of scanned domains receive a Grade F (minimal or no authentication), compared to only 0.4% achieving Grade A (full enforcement). Google Workspace and Microsoft 365 are nearly equally deployed at the national level, together accounting for 51.4% of all classified email infrastructure. Sector analysis reveals dramatic variation: higher education achieves 71.4% A/B grades while IRS-registered nonprofits reach only 10.3%, a 61.1 percentage-point gap. The dataset links 434,659 entities to geocoordinates, enabling geographic analysis of security posture at the state, county, and municipal level. To our knowledge, this is the largest and most diverse entity-attributed email security dataset publicly described.
Email remains the primary vector for phishing, business email compromise (BEC), and ransomware deployment. The Federal Bureau of Investigation's Internet Crime Complaint Center (IC3) reported $2.9 billion in BEC losses in 2023, with spoofed email domains the primary mechanism. Despite the availability of free, standards-based defenses — SPF, DKIM, and DMARC — adoption across U.S. organizations outside the federal government remains inconsistently measured and broadly insufficient.
Binding Operational Directive (BOD) 18-01 mandated DMARC enforcement for federal .gov agencies in 2017. No equivalent mandate exists for the 800,000+ state, local, nonprofit, and private sector organizations that constitute the majority of U.S. infrastructure. This study attempts to measure the resulting security gap at scale.
This paper makes four contributions:
| Metric | Value | Notes |
|---|---|---|
| Total entities registered | 801,359 | Organizations with verified government-source records |
| Entities with internet domain | 619,376 (77.3%) | Website URL resolved to apex domain |
| Domains DNS-profiled | 577,882 | Full 7-check scan (MX, SPF, DKIM, DMARC, CAA, MTA-STS, TLS-RPT) |
| Domains alive / active | 459,018 (79.4%) | MX record resolves; domain is email-capable |
| Domains dormant (no MX) | 72,650 (12.6%) | Domain resolves but no email infrastructure |
| Domains dead (NXDOMAIN) | 44,766 (7.7%) | Domain no longer exists in DNS |
| Domains timeout / error | 1,448 (0.3%) | Resolution failed; domain state unknown |
| Entities geocoded | 434,659 (54.2%) | Latitude/longitude from ZIP or county centroid |
| States and territories covered | 59 | 50 states + DC + territories |
| DNS record types checked per domain | 7 | MX, SPF, DKIM, DMARC, CAA, MTA-STS, TLS-RPT |
| Sector | Primary Sources | Entities | Share |
|---|---|---|---|
| Nonprofits (IRS BMF) | IRS Business Master File | 206,161 | 25.7% |
| Businesses — large/mid (SAM.gov) | SAM.gov federal registration | 144,623 | 18.0% |
| Special districts | State registries, Census | 138,165 | 17.2% |
| Small businesses (10KSB) | Goldman Sachs 10,000 Small Businesses directory | 95,260 | 11.9% |
| Nonprofits (SAM.gov registered) | SAM.gov — nonprofit business type | 86,596 | 10.8% |
| Government — municipal | Census TIGER, state registries | 40,856 | 5.1% |
| Government — state agencies | State government portals | 28,935 | 3.6% |
| Municipalities | Census Gazetteer | 20,735 | 2.6% |
| K-12 education | NCES CCD 2024-2025 | 20,021 | 2.5% |
| Townships | Census Gazetteer | 12,105 | 1.5% |
| Higher education | NCES IPEDS, state boards | 3,171 | 0.4% |
| Counties | Census Gazetteer | 3,146 | 0.4% |
| Other public sector | State registries, EPA SDWIS | 1,585 | 0.2% |
| Total | 801,359 | 100% |
| Source | Segments Covered | Method | Records |
|---|---|---|---|
| IRS Business Master File (BMF) | All 501(c) nonprofits | Annual federal data release | 206,161 |
| SAM.gov Public Extract (V2) | Businesses, SAM nonprofits | Monthly pipe-delimited flat file, 142 fields | 231,219 |
| Goldman Sachs 10KSB Directory | Small businesses | Institutional program participant list | 95,260 |
| NCES CCD 2024-2025 | K-12 school districts | Federal flat file (CSV) | 20,021 |
| NCES IPEDS | Higher education institutions | Federal API | 3,171 |
| U.S. Census Bureau (TIGER/Gazetteer) | Government, municipalities, townships, counties | Federal datasets, state registry aggregation | 245,527 |
All entity records are ingested into a unified PostgreSQL schema with normalized fields for entity name, type, subtype, state, county, primary domain, and geocoordinates. Entities are deduplicated via SHA-1 hash of normalized name, entity type, and state. Cross-source matching is performed for entities appearing in multiple registries (e.g., a nonprofit registered in both IRS BMF and SAM.gov).
Each entity's website URL is resolved to its apex domain, then profiled concurrently across seven DNS record types using a Python-based pipeline with 50 parallel worker threads. The pipeline classifies email providers from MX records using a provider detection library covering 12 classified providers (Google Workspace, Microsoft 365, Proofpoint, Mimecast, GoDaddy, Zoho Mail, Yahoo Mail, Amazon SES, Mailgun, SendGrid, iCloud Mail, and Fastmail). Email security gateways are detected by comparing MX host names against gateway fingerprint patterns. All check results are stored with timestamps in a versioned dns_checks table, enabling longitudinal tracking.
| Configuration | Points |
|---|---|
SPF record with -all (hard fail) | 30 |
SPF record with ~all (soft fail) | 15 |
SPF record with ?all or +all | 5 |
| No SPF record | 0 |
| Configuration | Points |
|---|---|
| DKIM public key published (v=DKIM1) | 30 |
| No DKIM key found | 0 |
| Configuration | Points |
|---|---|
p=reject (full enforcement) | 40 |
p=quarantine | 20 |
p=none (monitoring only) | 10 |
| No DMARC record | 0 |
| Grade | Score | Interpretation |
|---|---|---|
| A | 90–100 | Full enforcement |
| B | 70–89 | Strong posture |
| C | 50–69 | Partial protection |
| D | 30–49 | Weak configuration |
| F | 0–29 | Minimal/no authentication |
| Protocol | Domains with record | Adoption rate | Significance |
|---|---|---|---|
| MX records (any) | 459,018 | 79.4% | Email infrastructure present |
| SPF (any) | 399,074 | 69.0% | Sender authorization configured |
| DKIM (any) | 213,292 | 36.9% | Message signing configured |
| DMARC (any policy) | 224,230 | 38.8% | Domain spoofing policy present |
— DMARC p=none | 141,041 | 24.4% | Monitoring only; no enforcement |
— DMARC p=quarantine | 50,276 | 8.7% | Partial enforcement |
— DMARC p=reject | 32,361 | 5.6% | Full enforcement — spoofing blocked |
| CAA records | 8,089 | 1.4% | Certificate issuance restrictions |
| MTA-STS | 2,889 | 0.5% | Transport layer encryption policy |
| TLS-RPT | 3,467 | 0.6% | Transport security reporting |
The headline finding is stark: only 5.6% of U.S. organizations have DMARC configured at enforcement level (p=reject). An additional 8.7% use p=quarantine, leaving 85.7% of all scanned domains without active spoofing prevention. Of the 38.8% with any DMARC record, 62.9% are monitoring-only (p=none), providing visibility without protection.
DKIM adoption at 36.9% is notably lower than SPF (69.0%). This gap is consistent with the relative complexity of DKIM key management versus SPF's single-record configuration. CAA and MTA-STS remain rare, below 2% nationally.
| Grade | Domains | Share | Interpretation |
|---|---|---|---|
| A (90–100) | 2,584 | 0.4% | SPF -all + DKIM + DMARC reject |
| B (70–89) | 138,271 | 23.9% | Strong posture, not full enforcement |
| C (50–69) | 129,881 | 22.5% | SPF + DKIM but weak/no DMARC |
| D (30–49) | 141,824 | 24.5% | SPF only or partial DMARC |
| F (0–29) | 165,322 | 28.6% | Minimal or no authentication |
More than a quarter of all U.S. organization domains (28.6%) receive a Grade F — meaning they have either no SPF, no DKIM, and no DMARC, or only the weakest possible configurations. Combined D and F domains represent 53.1% of the scanned universe. Only 24.3% of domains achieve a B or better.
| Provider | Domains | Share of classified |
|---|---|---|
| Google Workspace | 150,197 | 32.7% |
| Microsoft 365 | 146,881 | 32.0% |
| Other / self-hosted | 114,719 | 25.0% |
| Proofpoint (gateway) | 24,858 | 5.4% |
| GoDaddy | 9,239 | 2.0% |
| Mimecast (gateway) | 5,178 | 1.1% |
| Zoho Mail | 3,482 | 0.8% |
| Yahoo Mail | 1,871 | 0.4% |
| Mailgun | 1,785 | 0.4% |
| Amazon SES | 485 | 0.1% |
| iCloud Mail | 312 | 0.1% |
Denominator: 459,018 domains with classified MX provider. Excludes 118,864 domains where MX lookup returned an error or provider was not classified.
Google Workspace and Microsoft 365 are nearly equally deployed at national scale, with a combined 64.7% share of classified email infrastructure. This near-parity contrasts sharply with sector-level findings: K-12 education is dominated by Google (4:1 over Microsoft), while higher education and government invert this pattern toward Microsoft. The 5.4% of domains using Proofpoint as a gateway correlates strongly with higher security posture, as gateway-proxied domains exhibit substantially better DMARC adoption (see cross-sector analysis).
| Status | Domains | Share | Definition |
|---|---|---|---|
| Fresh / Active | 459,018 | 79.4% | MX resolves; scanned within 30 days |
| Dormant (no MX) | 72,650 | 12.6% | Domain resolves but has no email infrastructure |
| Dead (NXDOMAIN) | 44,766 | 7.7% | Domain no longer exists in DNS |
| Unknown (timeout/error) | 1,448 | 0.3% | DNS resolution failed |
The 7.7% NXDOMAIN rate reflects domain expiration or deliberate abandonment, concentrated in older IRS-registered nonprofits and legacy SAM.gov registrations. The 12.6% dormant rate represents organizations with a web presence but no active email infrastructure — typically small nonprofits or businesses using hosted email outside their primary domain. Combined, 20.3% of registered domains are email-inactive, meaning phishing exposure analysis should focus on the 79.4% that are email-capable.
| Sector | Scanned | MX | SPF | DMARC | reject | A/B | F |
|---|---|---|---|---|---|---|---|
| Higher education | 700 | — | 91.9% | 89.0% | 23.0% | 71.4% | 7.4% |
| K-12 education | 15,568 | 86.4% | 79.7% | 55.5% | 12.2% | 47.1% | 19.2% |
| Government — municipal | 38,754 | 96.5% | 90.6% | 59.7% | 12.6% | 39.1% | 13.2% |
| Businesses (SAM.gov) | 133,809 | 95.8% | 87.9% | 53.5% | 11.7% | 33.8% | 18.3% |
| Nonprofits (SAM.gov) | 83,448 | 90.3% | 79.5% | 55.7% | 6.9% | 35.2% | 20.6% |
| Government — state | 27,814 | 86.2% | 80.7% | 51.1% | 10.6% | 37.6% | 22.3% |
| Small businesses (10KSB) | 93,214 | 90.9% | 81.9% | 43.5% | 6.3% | 27.3% | 17.8% |
| Nonprofits (IRS BMF) | 201,492 | 64.3% | 49.8% | 22.4% | 2.0% | 10.3% | 49.6% |
The sector gap is dramatic. Higher education leads with 71.4% of scanned domains achieving Grade A or B, while IRS-registered nonprofits achieve only 10.3% — a 61.1 percentage-point gap. The nonprofit disparity is largely explained by composition: the IRS BMF includes 200,000+ small civic organizations, religious groups, and community associations with minimal technical capacity. SAM.gov-registered nonprofits, which are federally active organizations with operational sophistication, achieve 35.2% A/B, three times better.
Government municipal entities outperform government state entities on F-rate (13.2% vs 21.4%), likely reflecting the broad state-level category including small administrative offices and legacy agencies alongside well-resourced departments. Businesses with federal contracts (SAM.gov) score significantly better than small businesses (10KSB) on A/B rate (33.8% vs 27.3%), consistent with federal procurement cybersecurity requirements creating upward pressure on contractor posture.
| Provider | Domains | SPF | DMARC | DKIM |
|---|---|---|---|---|
| Email Security Gateway (Proofpoint, Mimecast) | 30,036 | 97.3% | 75.7% | 76.2% |
| Enterprise Cloud (Google Workspace, M365) | 297,078 | 93.2% | 60.4% | 52.1% |
| GoDaddy / budget hosting | 9,239 | 51.3% | 23.8% | 18.2% |
| Yahoo Mail / iCloud / personal | 2,183 | 38.4% | 14.1% | 9.7% |
| Other / self-hosted | 114,719 | 84.0% | 25.9% | 21.3% |
Email security gateway users achieve 97.3% SPF and 75.7% DMARC adoption — the strongest posture of any provider category. Domains using budget hosting providers (GoDaddy, shared hosting) exhibit dramatically weaker authentication, with SPF at 51.3% and DMARC at 23.8%. Organizations using personal email providers (Yahoo, iCloud) as their domain email provider reach only 38.4% SPF adoption, representing the most exposed cohort after domains with no MX infrastructure at all.
| Field | Type | Description |
|---|---|---|
entity_id | integer | Unique entity identifier |
entity_name | string | Official organization name |
entity_type | enum | nonprofit, business, business_smb, k12, higher_ed, govt_municipal, govt_state, county, special_district |
entity_subtype | string | Detailed classification within type |
state | string | Two-letter USPS state code |
county | string | County name (where available) |
primary_domain | string | Apex internet domain |
latitude / longitude | float | Geocoordinates (ZIP-derived or county centroid) |
source_name | string | Authoritative data source (IRS, SAM.gov, NCES, etc.) |
DNS Check Fields (from dns_checks table) | ||
grade | string | A / B / C / D / F composite score |
mx_provider | string | Classified email provider |
mx_gateway_vendor | string | Security gateway if detected |
spf_status / spf_qualifier | string | pass/warn/fail; -all/~all/?all/+all |
spf_lookup_count | integer | DNS lookup depth (RFC 7208 10-limit) |
dkim_status / dkim_selector | string | pass/fail; matching selector |
dmarc_status / dmarc_policy | string | pass/warn/fail; none/quarantine/reject |
dmarc_rua / dmarc_ruf | string | Aggregate and forensic report URIs |
caa_status | string | pass/fail — certificate authority authorization |
mta_sts_status | string | pass/fail — transport encryption policy |
tls_rpt_status | string | pass/fail — transport security reporting |
checked_at | timestamp | UTC timestamp of DNS scan |
dns_checks schema enables time-series analysis of adoption trends| Tier | Scope | Suggested Use |
|---|---|---|
| State Pack | All entities in one state, all sectors | State-level risk assessment, policy analysis |
| Sector Pack | All entities in one sector, national | Market intelligence, sector research |
| Government | All municipal + state entities (69,791) | SLTT cybersecurity research |
| Nonprofit | All nonprofits — IRS + SAM (292,757) | Nonprofit sector risk analysis |
| Business | All SAM + SMB businesses (239,883) | Commercial threat intelligence, sales targeting |
| Full Registry | All 801,359 entities, all sectors | Comprehensive research, national policy |
| API + Quarterly Updates | Full registry, re-scanned quarterly | Longitudinal studies, dashboards, monitoring |
Delivery formats: CSV/Parquet, SQLite (pre-indexed), REST API, or interactive dashboard. All deliveries include full methodology documentation, source provenance, data dictionary, and reproducibility scripts (Python). Interactive exploration available at the MonitorWorkspace Email Scorecard.
Contact: research@monitorworkspace.com
Citation: Lokentra Research Team (2026). U.S. Entity Email Security Intelligence Dataset: A Multi-Sector DNS Authentication Registry Covering 801,359 Organizations. Lokentra U.S. Email Security Index (ESI). https://lokentra.com/research/overall-dataset-paper.html
Data sources: IRS Business Master File; SAM.gov Public Extract V2 (March 2026); Goldman Sachs 10,000 Small Businesses program directory; NCES Common Core of Data 2024-2025; NCES IPEDS; U.S. Census Bureau TIGER/Gazetteer. All DNS data derived from publicly accessible DNS records.
Competing interests: Lokentra develops MonitorWorkspace, a Google Workspace administration platform. The ESI dataset is produced by the Lokentra Research Division independently of the product team.