← Rabbit Holes

Bitcoin Blockchain Rabbit Hole

Everyone’s heard the word “blockchain.” Almost nobody knows what it actually is. This is the thread that fixes that.

15 questions · ~24 min

1.How does the Bitcoin blockchain actually work — what is it, really?

The Bitcoin blockchain is a chronological record of every Bitcoin transaction ever made, stored simultaneously on thousands of computers worldwide, where each new page of transactions is permanently sealed to the previous one in a chain that makes history practically impossible to alter.

Strip away the buzzword and it’s a ledger. A long, public, shared record of who sent what to whom, going all the way back to January 3, 2009 — the day Satoshi mined the first block. Every transaction ever made on Bitcoin is in there. Every one.

What makes it unusual isn’t the ledger — banks have ledgers too. What makes it unusual is that nobody owns it. No company runs it. No server hosts the “official” copy. Thousands of computers around the world each hold a complete copy, and they all agree on the same history through a set of rules enforced by the software they’re running.

The “chain” part is the security mechanism. Each new batch of transactions — a block — is cryptographically sealed to the previous block. To change anything in the past, you’d have to redo not just that block but every block that came after it, faster than the entire rest of the network is building forward. In practice, this becomes impossible within hours of a transaction being confirmed.

Understanding that requires going one level deeper — into what a block actually contains.


2.What’s actually inside a Bitcoin block?

A Bitcoin block contains a header (with technical metadata including the previous block’s hash, a timestamp, and the winning nonce) and a body (the list of transactions confirmed in that block, starting with the coinbase transaction that pays the miner’s reward).

The header is the block’s identity card. It contains: the hash of the previous block (the cryptographic link that forms the chain), a timestamp, the difficulty target the miner had to beat, the nonce (the number miners iterated through billions of times to win), and the Merkle root (a compact mathematical summary of every transaction in the block).

The body is the list of transactions. The first transaction in every block is always the coinbase transaction — a special transaction with no sender, which creates new Bitcoin from nothing and pays it to the winning miner. This is the only mechanism by which new Bitcoin enters existence. After the coinbase, the remaining transactions are the regular user transactions the miner selected from the mempool, ordered by feerate.

A typical block contains anywhere from a few hundred to a few thousand transactions, depending on their sizes. The block size limit — currently around 4 million weight units after SegWit — caps how much can fit. This cap is what creates the fee market: when demand for block space exceeds supply, fees rise.

The header’s most important element — the one that creates the “chain” — is the previous block’s hash.


3.Why are Bitcoin blocks “chained” — what does that actually mean?

Bitcoin blocks are chained because each block’s header contains the cryptographic hash of the previous block — meaning any change to a past block produces a completely different hash, which breaks every block that came after it and makes history tampering immediately detectable.

A hash is the output of a one-way mathematical function — feed it any input and it produces a fixed-length fingerprint. Change one character of the input and the fingerprint looks completely different. There’s no pattern, no gradual change — it’s a completely new value.

Each block’s header includes the hash of the block before it. So block 800,001 contains the hash of block 800,000. Block 800,002 contains the hash of 800,001. And so on, all the way back to block 0 — the genesis block Satoshi mined in 2009.

This creates a dependency chain. If you wanted to alter a transaction in block 800,000 — say, pretend you never sent Bitcoin to someone — the change would produce a different hash for that block. But block 800,001 contains the old hash of 800,000. Now 800,001 is broken too. To fix it, you’d have to recalculate 800,001, which breaks 800,002, and so on. You’d have to redo the proof-of-work for every single block from your change all the way to the present — faster than the entire honest network is building forward. It becomes computationally impossible within hours.

The chain of hashes is what turns a ledger into an immutable history. But within each block, there’s another layer of mathematical protection worth understanding.


4.What is a Bitcoin Merkle tree and why do we need it?

A Bitcoin Merkle tree is a mathematical structure that summarizes all transactions in a block into a single hash called the Merkle root — allowing anyone to verify a specific transaction was included in a block without downloading every transaction in it.

Here’s how it works. Take all the transactions in a block. Hash each one. Then pair them up and hash each pair. Then pair those hashes and hash again. Keep halving until you have one final hash — the Merkle root — that sits in the block header.

The result is a compact proof. If you want to verify that transaction X was in block 800,000, you don’t need to download all 3,000 transactions in that block. You just need the small set of hashes along the branch from your transaction to the root — called a Merkle proof. A few kilobytes of data instead of megabytes.

This is what makes lightweight Bitcoin wallets possible. Your phone doesn’t download the full blockchain — it downloads block headers and uses Merkle proofs to verify its own transactions are confirmed. The security comes from the mathematical guarantee: if the Merkle root in the header is valid, and your transaction’s Merkle proof checks out, the transaction is genuinely in that block.

Merkle trees are also tamper detection. Change any single transaction in the block and the Merkle root changes — which changes the block header hash — which breaks every block that follows. The tree and the chain work together as one protection system.

Which brings the real question into focus: what does “immutable” actually mean in practice?


5.What does “Bitcoin’s ledger is immutable” actually mean — can records really never change?

Bitcoin’s immutability means that altering any confirmed transaction becomes exponentially more expensive with every new block added on top — it’s not technically impossible, but within hours it requires more computational power than exists on earth, making it practically permanent.

Immutable doesn’t mean physically carved in stone. It means economically and computationally prohibitive to reverse.

A transaction confirmed one block deep is theoretically reversible — an attacker with enormous hash rate could redo that one block. This is why exchanges wait for multiple confirmations before crediting deposits. A transaction six blocks deep requires redoing six blocks of proof-of-work simultaneously while the honest network keeps adding more on top. The computational cost grows with every new block.

After a few days, a transaction is buried under hundreds of blocks. To reverse it would require an impossible amount of energy — more than the entire Bitcoin network has produced since the transaction was confirmed. At this depth, “immutable” is functionally accurate.

This is also why the commonly cited “51% attack” is more nuanced than it sounds. Even with 51% of hash rate, an attacker can only reverse recent transactions — and each block they need to redo costs real electricity. For transactions more than a few hours old, the cost of reversal exceeds any plausible benefit. The economic impracticality IS the immutability.

The blockchain records transactions. But transactions aren’t “balances” — and this is where most people’s mental model breaks down completely.


6.What is a Bitcoin UTXO — and why doesn’t Bitcoin track balances like a bank?

A Bitcoin UTXO (Unspent Transaction Output) is a discrete chunk of Bitcoin that has been received but not yet spent — Bitcoin doesn’t track account balances at all, it tracks individual chunks of value, and your “balance” is just the sum of all UTXOs assigned to your addresses.

Banks track balances. Your account has a number, and transactions change that number up or down. Bitcoin works completely differently.

Every Bitcoin transaction consumes inputs (previous UTXOs) and creates outputs (new UTXOs). When someone sends you 0.5 BTC, that creates a UTXO — a discrete parcel of 0.5 BTC assigned to your address. It sits there, unspent, as a complete unit. When you spend it, that UTXO is consumed entirely and new UTXOs are created: one for the recipient and usually one back to yourself as change.

There’s no “account” in Bitcoin. The blockchain just contains a list of all UTXOs that haven’t been spent yet — called the UTXO set. Your wallet scans the blockchain for any UTXOs assigned to addresses it controls and adds them up. That sum is your “balance.” But the balance isn’t stored anywhere — it’s calculated on the fly from the UTXO set.

This has real implications. Each UTXO is like a separate coin or bill. You can have one UTXO worth 1 BTC or ten UTXOs each worth 0.1 BTC — same total, very different structure. Transaction fees depend partly on how many UTXOs you’re spending, not just the amount. Sending Bitcoin that’s spread across many small UTXOs costs more in fees than sending the same amount from one large UTXO.

Which immediately leads to the obvious question about what your wallet is actually showing you.


7.So when my Bitcoin wallet shows a balance, where does that number actually come from?

Your Bitcoin wallet balance is calculated by scanning the blockchain for all unspent transaction outputs (UTXOs) assigned to addresses your wallet controls and adding them up — the number isn’t stored anywhere, it’s derived fresh from the blockchain every time.

Your wallet knows which addresses it controls — because it generated them from your private key. It queries the blockchain (either directly if you run a full node, or through a trusted server if you use a light wallet) asking: are there any unspent outputs assigned to these addresses?

The blockchain returns the list of UTXOs. The wallet adds them up. That sum appears as your balance. If you receive new Bitcoin, a new UTXO appears on the blockchain assigned to one of your addresses. Your wallet recalculates. The balance updates.

This is why Bitcoin wallets can be fully restored from a seed phrase. The wallet doesn’t store your balance — it derives it. Give the wallet your seed phrase, it regenerates all your private keys and addresses, scans the blockchain for UTXOs assigned to those addresses, and your balance reappears. Nothing was “stored” in the old wallet that needs to be recovered. The blockchain held the truth the whole time.

It also explains why old, unused wallets can still hold Bitcoin decades later. The UTXOs assigned to those addresses sit on the blockchain unchanged. As long as someone holds the private key, the Bitcoin is accessible — even if the original device is long gone.

That blockchain — the one being scanned — doesn’t live in one place.


8.How many people have a copy of the Bitcoin blockchain? Where does it live?

The Bitcoin blockchain lives on approximately 15,000–20,000 publicly reachable full nodes worldwide, plus many more private nodes — no single server, no central host, every full node holds the complete identical history independently.

There is no Bitcoin headquarters where “the blockchain” is kept. Every full node operator — anyone running Bitcoin Core or compatible software with a full copy of the chain — is hosting it. They downloaded every block from the genesis block forward, verified every transaction against Bitcoin’s rules independently, and now hold the complete history on their own hardware.

These nodes are in people’s homes, in data centers, on cloud servers, on Raspberry Pis. The geographic distribution spans dozens of countries. The United States, Germany, France, and the Netherlands host significant clusters, but nodes exist across every major region.

When a new block is found, it propagates to every node within seconds through the gossip protocol. Every node independently verifies it and adds it to their copy. All of them agree on the same chain — not because they trust each other, but because they all apply the same rules and valid blocks pass the same tests everywhere.

The redundancy is total. To destroy the Bitcoin blockchain you’d have to simultaneously destroy every copy on every node in every country at the same moment. Even then, archival copies exist on drives, tapes, and satellites. The history is essentially indestructible.

All those copies of the same data raise a practical question.


9.How big is the Bitcoin blockchain — and does it keep growing forever?

The Bitcoin blockchain is approximately 650GB as of 2025 and grows by roughly 50–80GB per year — it does grow indefinitely, but pruned nodes can discard old block data and keep only the current UTXO set, reducing storage requirements to a few gigabytes.

Every ten minutes a new block — roughly 1–2MB — is added permanently. That’s about 52,000 blocks per year, accumulating steadily since 2009. The full chain is currently around 650GB and growing.

For most users this doesn’t matter — light wallets don’t download the full chain at all. For full node operators, the growth is manageable on modern hardware. A 2TB drive holds many years of blockchain growth and costs very little.

Pruning is the practical solution for node operators who want to verify without storing everything. A pruned node downloads and verifies each block fully — checking every transaction, every signature, every rule — then discards the old block data after verification. It keeps only the UTXO set (the current state of all unspent coins) and the block headers. The UTXO set runs to a few gigabytes. The node still fully validates; it just doesn’t archive history it no longer needs.

Full archival nodes — the ones keeping everything — are maintained by researchers, exchanges, blockchain explorers, and enthusiasts who want the complete record. The network benefits from their existence, but most nodes don’t need to be archival to do their security job.

Those archival nodes are what power the tools that let anyone look at Bitcoin’s history directly.


10.What can I actually see on a Bitcoin blockchain explorer?

A Bitcoin blockchain explorer lets you look up any transaction, address, or block in Bitcoin’s entire history — seeing amounts sent, addresses involved, fees paid, confirmation status, and the complete chain of where any Bitcoin came from and went to.

Blockchain explorers — mempool.space, blockchair.com, blockchain.com — are essentially search interfaces over the full Bitcoin blockchain. Enter a transaction ID, an address, or a block number and you see everything Bitcoin recorded about it.

For a transaction: the sending addresses, receiving addresses, amount, fee paid, size in bytes, which block it was confirmed in, how many confirmations it now has, and the exact timestamp. If it’s still unconfirmed, you can see its feerate and roughly where it sits in the mempool queue.

For an address: every transaction that ever sent to or received from it, the current UTXO balance, total received and sent over its lifetime. The full financial history of that address, from the first transaction to the last, is publicly visible to anyone.

For a block: which miner found it (usually identifiable by a tag in the coinbase transaction), the exact timestamp, the difficulty it was mined at, every transaction it contains, and the total fees collected.

All of this is public. No account required. No permission needed. The transparency is absolute — which leads to something most new Bitcoin holders find surprising.


11.Can I trace where any Bitcoin has been — all the way back to when it was first mined?

Yes — every Bitcoin can theoretically be traced back through the blockchain to the coinbase transaction that created it, because every transaction in Bitcoin’s history references the outputs it’s spending, creating an unbroken chain of custody from creation to present.

Every UTXO was created by a transaction. That transaction consumed previous UTXOs. Those came from earlier transactions. Follow the chain backwards and you eventually reach a coinbase transaction — the special transaction in a mined block that created the Bitcoin from nothing. The whole history is there, permanently, in the blockchain.

Chain analysis firms — Chainalysis, Elliptic, CipherTrace — do exactly this professionally. They trace Bitcoin flows, cluster addresses believed to belong to the same entity, and map fund movements across the network. Exchanges use their reports for compliance. Law enforcement uses them for investigations.

The practical implication: Bitcoin received from a regulated exchange (which collected your identity) is linked to you. Bitcoin that passed through a known darknet market or sanctions-linked address carries that history with it. Some exchanges apply “taint analysis” and flag or reject Bitcoin with suspicious histories, even if the current holder was an innocent downstream recipient.

This traceability is Bitcoin’s transparency working as designed. But it’s also why serious privacy advocates go further than just pseudonymous addresses.

All of this — blocks, chains, UTXOs, history — describes one version of the blockchain. But it’s not guaranteed there will only ever be one.


12.What’s the difference between a Bitcoin soft fork and a hard fork?

A Bitcoin soft fork is a backward-compatible rule change where old nodes still accept new blocks as valid, while a hard fork is an incompatible rule change where old and new nodes reject each other’s blocks — permanently splitting the chain if both sides have significant support.

Soft forks tighten or add rules without breaking backward compatibility. Nodes running old software see new blocks as valid — they just don’t understand the new features. Nodes running new software enforce the new rules more strictly. The network stays together because everyone agrees on validity, even if not everyone understands the new features. SegWit was a soft fork. Taproot was a soft fork.

Hard forks change rules in ways old nodes reject. An old node sees a new-rule block as invalid and ignores it. A new node sees old-rule blocks as invalid and ignores them. If both camps have enough miners and nodes to keep their version alive, two separate chains emerge — each with its own transaction history from the fork point forward, each with its own coin, permanently diverged.

Bitcoin Cash was a hard fork in August 2017 — a faction of the community insisted on larger blocks, changed the rules, and created a new chain that old Bitcoin nodes rejected. Both chains have continued independently ever since.

The Bitcoin community learned from these events to strongly prefer soft forks — changes that can be adopted incrementally without forcing everyone to upgrade simultaneously or risk a split.

When a fork does happen, what physically occurs at the blockchain level is worth understanding clearly.


13.When Bitcoin forks, what actually happens to the blockchain — does it split?

When a Bitcoin hard fork happens, the blockchain literally splits at a specific block — both chains share identical history up to that block, then diverge permanently, with each chain adding its own blocks forward under its own rules.

Up to the fork block, both chains are identical. Every transaction, every block, every balance — the same history on both. From the fork block forward, they’re different ledgers maintained by different communities under different rules.

Everyone who held Bitcoin at the moment of the fork automatically holds coins on both chains — because their addresses and UTXOs exist in the shared history. This is why Bitcoin Cash holders were given BCH equal to their BTC holdings at the fork date. The coins existed on both chains simultaneously until each chain’s subsequent blocks diverged.

Technically: nodes running old software keep building on the old chain. Nodes running new software build on the new chain. Miners choose which chain to mine. The chain with more cumulative proof-of-work is considered “main” by each camp’s own standards — but there’s no universal arbiter. The market decides which coin has value.

Most forks fade quickly — the minority chain loses miner support, blocks become infrequent, and the coin loses value. Occasionally, like Bitcoin Cash, the minority chain persists long-term with its own community and ecosystem.

All of this — fork or no fork — happens on a completely transparent public ledger. Which raises a question most new users find unsettling at first.


14.If the Bitcoin blockchain is completely public, does that mean it has no privacy at all?

Bitcoin is pseudonymous rather than anonymous — addresses aren’t tied to real identities by default, but all transaction amounts and flows are permanently public, and once an address is linked to a real identity, the entire transaction history of that address becomes visible.

The blockchain records addresses, not names. “1A1zP1eP5QGefi2DMPTfTL5SLmv7Divf” sent 0.5 BTC to “3J98t1WpEZ73CNmQviecrnyiWrnqRhWNLy” — that’s what the ledger shows. No names, no emails, no locations.

The pseudonymity breaks down at connection points. Buy Bitcoin on a regulated exchange that collected your passport — your identity is now linked to your withdrawal address. Receive Bitcoin at an address you posted publicly. Pay for something with Bitcoin and give a shipping address. Each connection point links an address to a real identity, and once linked, every transaction to and from that address is visible forever.

Privacy-conscious users employ several techniques: using a new address for every transaction (wallets do this automatically), using CoinJoin to mix transactions with other users, routing through the Lightning Network (which doesn’t record individual payments on-chain), or using dedicated privacy tools.

The privacy situation is comparable to cash that leaves permanent receipts. The receipt shows the amount and the account numbers but not the name — unless you’ve connected your name to an account somewhere.

Bitcoin’s transparency, which creates the privacy challenge, is also the reason the system works without a trusted third party.


15.So all my Bitcoin transactions are public forever — should I be worried?

Yes, all Bitcoin transactions are permanently public — but your real identity isn’t attached by default, and the same transparency that makes your transactions visible is exactly what makes the money supply trustworthy and the system work without a bank.

Banks operate on private ledgers. You trust that your bank’s internal records accurately reflect your balance. You trust that the bank isn’t creating money it shouldn’t. You trust that the numbers you see in your account app match what the bank has actually recorded. This trust is enforced by regulation and audits — external checks on systems you cannot inspect yourself.

Bitcoin’s ledger is public precisely to eliminate that trust requirement. Anyone can download the full blockchain and verify independently: every transaction ever made, every coin ever created, the exact current supply (you can count every existing UTXO), and whether any rule has ever been broken. No trust required. No auditor needed. The math is the audit.

This is also why Bitcoin’s 21 million supply cap is credible in a way no central bank’s promises can be. You don’t have to believe anyone — you can verify it. Count the UTXOs. Count the coinbase rewards. Check the halving schedule. The supply is mathematically verifiable by anyone with an internet connection and a computer.

The transparency that makes privacy complicated is the same transparency that makes the money honest. That tradeoff is not accidental. It’s the design.


Related Deep Dive Threads

Still curious? The blockchain records history, but understanding who controls Bitcoin’s rules — and how changes actually get made — is its own rabbit hole. That thread is here.

One of 9 Bitcoin rabbit holes — pick a topic and fall in.