Public chain data should be public
Querying Ethereum’s history shouldn’t require a corporate budget. The data is already on-chain, yet getting it clean, decoded, and usable still means archive nodes, tracing pipelines, or per-megabyte export bills from analytics platforms. So we did the heavy lifting and gave it away. 10+ years of Ethereum, indexed, decoded, and lineage-tagged: free to download, no credit card, no sales call, no API quota.
Get the datasets on Hugging Face
BlockDB Full EVM Research Datasets — 10+ years. Browse, preview, and download every published set.
Every row carries a _tracing_id that links back to on-chain evidence. You don’t have to trust us: you can verify it through the Lineage API.
Files are partitioned Parquet, so you can pull a single token, month, or exchange instead of the full archive.
Load it
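Because the files are partitioned, you only need to fetch the slice you care about. Here is a minimal sketch of one way to do that in Python. It assumes a Hive-style folder layout (`token=.../month=.../*.parquet`). The repo id, the partition keys, and the helper names `partition_pattern`, `download_partition`, and `load_partition` are all placeholders for illustration, so check the dataset page on Hugging Face for the real layout before running it.

```python
from fnmatch import fnmatch

# ASSUMPTIONS (placeholders, not taken from the dataset itself):
# replace both with the values shown on the dataset's Hugging Face page.
REPO_ID = "your-org/your-dataset"   # hypothetical repo id
LAYOUT = ("token", "month")         # hypothetical partition keys, in path order


def partition_pattern(layout, **wanted):
    """Build a glob for Hive-style paths; keys left out match any value."""
    unknown = set(wanted) - set(layout)
    if unknown:
        raise KeyError(f"not a partition key: {sorted(unknown)}")
    parts = [f"{key}={wanted.get(key, '*')}" for key in layout]
    return "/".join(parts + ["*.parquet"])


def download_partition(repo_id, pattern, local_dir="blockdb_data"):
    """Download only the files matching `pattern`; returns the local folder."""
    # Imported lazily so the helper above works without the dependency.
    from huggingface_hub import snapshot_download  # pip install huggingface_hub

    return snapshot_download(
        repo_id=repo_id,
        repo_type="dataset",
        allow_patterns=[pattern],
        local_dir=local_dir,
    )


def load_partition(local_dir):
    """Read the downloaded Parquet files into a pandas DataFrame."""
    import pyarrow.dataset as ds  # pip install pyarrow pandas

    dataset = ds.dataset(local_dir, format="parquet", partitioning="hive")
    return dataset.to_table().to_pandas()


# Build the pattern for one token across all months (no network needed).
pattern = partition_pattern(LAYOUT, token="WETH")
print(pattern)

# Sanity-check the pattern against an example path before downloading.
print(fnmatch("token=WETH/month=2024-01/part-0.parquet", pattern))

# With a real repo id and layout, the download and load steps would be:
# folder = download_partition(REPO_ID, pattern)
# df = load_partition(folder)
```

Building the glob first and testing it against a sample path is a cheap way to confirm you are about to pull one partition rather than the full archive.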
Licensing
- Free to use for research, backtesting, model training, and commercial products built on top of the data.
- Keep the _tracing_id column if you redistribute; it preserves provenance back to on-chain evidence.
Need more than this snapshot?
These are historical snapshots. When you need to go live:
- Filters, other tokens, or recent history: BlockDB Historic REST API.
- Real-time streams: BlockDB WebSocket feed.
- Custom extracts (anything we can derive from on-chain data): support@blockdb.io.