From raw DVLA data to actionable vehicle intelligence — here's exactly what happens when you search a registration.

| Aspect | Details |
|---|---|
| Source | Official DVLA MOT data, the same data that powers the government's MOT check service |
| Volume | 50M+ individual MOT test records covering millions of UK vehicles |
| Freshness | Updated daily via an automated pipeline. New MOT results are typically available within 24 hours |
| Storage | Columnar Parquet format with DuckDB query engine — designed for fast analytical queries over large datasets |
| Derived tables | Pre-aggregated lookup tables for vehicle groups, defect patterns, and comparison statistics. This is why results are instant rather than taking minutes |
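
To make the derived-table idea concrete, here is a minimal sketch of pre-aggregation in plain Python. It is not the production code: the record fields (`make`, `model`, `year`, `passed`) and the sample rows are hypothetical, and a dictionary stands in for the Parquet-backed tables described above. The point is only that the expensive grouping happens once, ahead of time, so a search becomes a lookup.

```python
from collections import defaultdict
from typing import NamedTuple


class MotTest(NamedTuple):
    # Hypothetical record shape; the real schema is not given in the text.
    make: str
    model: str
    year: int
    passed: bool


class GroupStats(NamedTuple):
    tests: int
    pass_rate: float


def build_group_stats(tests: list[MotTest]) -> dict[tuple[str, str, int], GroupStats]:
    """Aggregate raw tests once into a lookup table keyed by vehicle group."""
    counts: dict[tuple[str, str, int], list[int]] = defaultdict(lambda: [0, 0])
    for t in tests:
        entry = counts[(t.make, t.model, t.year)]
        entry[0] += 1              # total tests in the group
        entry[1] += int(t.passed)  # passes in the group
    return {
        key: GroupStats(tests=total, pass_rate=passes / total)
        for key, (total, passes) in counts.items()
    }


# Made-up sample rows for illustration only.
raw_tests = [
    MotTest("FORD", "FIESTA", 2015, True),
    MotTest("FORD", "FIESTA", 2015, False),
    MotTest("FORD", "FIESTA", 2015, True),
    MotTest("VAUXHALL", "CORSA", 2014, False),
]

# Built ahead of time (for example, after each daily update), not per search.
group_stats = build_group_stats(raw_tests)

# At search time the answer is a single dictionary lookup.
fiesta = group_stats[("FORD", "FIESTA", 2015)]
print(fiesta.tests, round(fiesta.pass_rate, 2))
```

The trade-off is the usual one for derived tables: a little extra work at build time in exchange for constant-time reads at search time.
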

| Component | What it does |
|---|---|
| DuckDB + Parquet | In-process analytical database that reads columnar files directly. There is no separate database server and no network hop to reach one: queries run against the data files inside the application process. |
| MiniLM embeddings | Sentence-transformer model that converts defect text into semantic vectors. Enables similarity matching that keyword search can't achieve — "tyre tread depth insufficient" and "tyre worn beyond legal limit" are understood as the same issue. |
| Redis caching | Results are cached so repeat lookups are instant. Comparable vehicle data, defect patterns, and full analyses are stored for fast retrieval. |
| Async worker pipeline | AI analysis runs in a dedicated background worker process. You see progress updates in real time as each stage completes — typically 2–4 seconds total. |
| Daily delta pipeline | Every morning, new DVLA data is downloaded, validated, converted, and merged into our tables automatically. No manual intervention, no stale data. |
| Pre-computed aggregations | Group statistics, defect patterns, and comparison baselines are pre-built into derived tables. When you search, we look up results — we don't recalculate from scratch. |
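
The embedding row above describes matching defects by meaning rather than by shared keywords. Once each defect text has been turned into a vector, a common way to compare two of them is cosine similarity. The sketch below shows only that comparison step. The three-number vectors are hand-made stand-ins for real model output, and the third defect text is an illustrative example, so nothing here reflects actual MiniLM values.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors: 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a zero vector has no direction to compare
    return dot / (norm_a * norm_b)


# Hand-made stand-ins for embeddings; a real model would produce these vectors.
embeddings = {
    "tyre tread depth insufficient": [0.90, 0.10, 0.05],
    "tyre worn beyond legal limit": [0.85, 0.15, 0.10],
    "brake pipe excessively corroded": [0.05, 0.20, 0.95],
}


def most_similar(query: str) -> str:
    """Return the other defect text whose vector is closest to the query's."""
    query_vec = embeddings[query]
    others = (text for text in embeddings if text != query)
    return max(others, key=lambda text: cosine_similarity(query_vec, embeddings[text]))


print(most_similar("tyre tread depth insufficient"))
```

The two tyre descriptions share only one word, yet their vectors point in nearly the same direction, which is the property that lets vector matching succeed where keyword matching would miss.
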
Our data comes from official DVLA sources, not scraped from other websites.
No reviews from owners, no forum posts, no subjective ratings. Everything is derived from structured test data.
Your searches are not shared with dealers, insurers, or third parties.
MOT history and vehicle details are free. AI analysis and comparisons are included.
Our predictions are statistical likelihoods, not guarantees. We always use comparative language ("higher risk than average") rather than absolute statements ("this car will fail").
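
As an illustration of the comparative-language rule, the sketch below turns two failure rates into a relative phrase. The function name, the 5-percentage-point tolerance, and the exact wording of the "lower" and "about average" phrases are assumptions made for this example, not the thresholds or copy used in production.

```python
def describe_risk(vehicle_fail_rate: float, group_fail_rate: float,
                  tolerance: float = 0.05) -> str:
    """Phrase a failure rate relative to a comparison group, never as a certainty."""
    for rate in (vehicle_fail_rate, group_fail_rate):
        if not 0.0 <= rate <= 1.0:
            raise ValueError("rates must be between 0 and 1")
    difference = vehicle_fail_rate - group_fail_rate
    if difference > tolerance:
        return "higher risk than average"
    if difference < -tolerance:
        return "lower risk than average"
    return "about average risk"


print(describe_risk(0.40, 0.25))  # well above the group baseline
print(describe_risk(0.27, 0.25))  # within the tolerance band
```

Note that the function has no branch that returns an absolute statement: every possible output is phrased against the group baseline.
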
< 1 second (pre-built derived tables)
< 1 second (pre-aggregated group statistics)
2–4 seconds (parallel pipeline with real-time progress)
Updated daily (automated delta pipeline)
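
The parallel pipeline with real-time progress and the cached repeat lookups can be sketched together. This is a simplified, single-process illustration using `asyncio`: the stage functions, the registration string, and the in-memory dictionary (standing in for Redis) are all hypothetical, and the real system is described as running analysis in a separate worker process.

```python
import asyncio
from typing import Awaitable, Callable

# In-memory stand-in for a shared cache such as Redis (assumption for this sketch).
_cache: dict[str, dict[str, str]] = {}


async def fetch_history(reg: str) -> str:
    await asyncio.sleep(0.01)  # simulated I/O
    return f"history for {reg}"


async def fetch_comparables(reg: str) -> str:
    await asyncio.sleep(0.01)  # simulated I/O
    return f"comparables for {reg}"


async def analyse(reg: str, report: Callable[[str], None]) -> dict[str, str]:
    """Run independent stages concurrently, reporting each as it completes."""
    if reg in _cache:
        report("cache hit")
        return _cache[reg]

    async def run_stage(name: str, stage: Awaitable[str]) -> tuple[str, str]:
        result = await stage
        report(f"{name} done")  # progress update as soon as this stage finishes
        return name, result

    pairs = await asyncio.gather(
        run_stage("history", fetch_history(reg)),
        run_stage("comparables", fetch_comparables(reg)),
    )
    result = dict(pairs)
    _cache[reg] = result  # later lookups for the same registration skip the work
    return result


progress: list[str] = []
first = asyncio.run(analyse("AB12CDE", progress.append))
second = asyncio.run(analyse("AB12CDE", progress.append))
print(progress)
```

The first call reports each stage as it finishes; the second call for the same registration reports only a cache hit and returns the stored result.
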
Explore our Reviews, Comparisons, MOT History, and AI Hunches pages to see each feature in detail.