Merkle tree

In cryptography and computer science, a hash tree or Merkle tree is a tree in which every "leaf" node is labelled with the cryptographic hash of a data block, and every node that is not a leaf (called a branch, inner node, or inode) is labelled with the cryptographic hash of the labels of its child nodes. A hash tree allows efficient and secure verification of the contents of a large data structure. A hash tree is a generalization of a hash list and a hash chain.

Demonstrating that a leaf node is part of a given binary hash tree requires computing a number of hashes proportional to the logarithm of the number of leaf nodes in the tree.^[1] In a hash list, by contrast, the number of hashes is proportional to the number of leaf nodes itself. A Merkle tree is therefore an efficient example of a cryptographic commitment scheme, in which the root of the tree is seen as a commitment and leaf nodes may be revealed and proven to be part of the original commitment.^[2]
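
The logarithmic cost can be illustrated with a minimal sketch. It assumes SHA-256 as the hash, a number of blocks that is a power of two, and plain concatenation of child labels; the function names (build_levels, make_proof, verify) are hypothetical and not taken from any particular library.

```python
import hashlib

def h(data: bytes) -> bytes:
    """SHA-256 is an assumption; any cryptographic hash could be used."""
    return hashlib.sha256(data).digest()

def build_levels(blocks):
    """Return every level of the tree, leaves first.

    Assumes len(blocks) is a power of two.
    """
    level = [h(b) for b in blocks]
    levels = [level]
    while len(level) > 1:
        level = [h(level[i] + level[i + 1]) for i in range(0, len(level), 2)]
        levels.append(level)
    return levels

def make_proof(levels, index):
    """Collect one sibling hash per level on the path from a leaf to the root."""
    proof = []
    for level in levels[:-1]:
        sibling = index ^ 1  # the other child of the same parent
        proof.append((level[sibling], sibling < index))  # (hash, sibling_is_left)
        index //= 2
    return proof

def verify(block, proof, root):
    """Recompute the root from one block and its sibling hashes."""
    node = h(block)
    for sibling, sibling_is_left in proof:
        node = h(sibling + node) if sibling_is_left else h(node + sibling)
    return node == root

blocks = [bytes([i]) * 4 for i in range(8)]  # eight hypothetical data blocks
levels = build_levels(blocks)
root = levels[-1][0]
proof = make_proof(levels, 5)
print(len(proof), verify(blocks[5], proof, root))
```

With eight leaves the proof holds three sibling hashes, one per level below the root, so the verifier computes four hashes in total rather than rehashing all eight blocks.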

The concept of a hash tree is named after Ralph Merkle, who patented it in 1979.^[3]^[4]

Uses

Hash trees can be used to verify any kind of data stored, handled and transferred in and between computers. They can help ensure that data blocks received from other peers in a peer-to-peer network arrive undamaged and unaltered, and can even be used to check that the other peers do not lie and send fake blocks.

Hash trees are used in:

hash-based cryptography;
the InterPlanetary File System (IPFS);
BitTorrent;
hashtree;^[5]
the ZFS file system^[6] (to counter data degradation^[7]);
the Dat protocol;
the Apache Wave protocol;^[8]
the Git and Mercurial distributed revision control systems (although, strictly speaking, they use directed acyclic graphs, not trees);
the Tahoe-LAFS backup system;
ZeroNet;
OpenZFS;^[9]
the Bitcoin and Ethereum peer-to-peer networks;^[10]
the Certificate Transparency framework;
the Nix package manager and descendants like GNU Guix;^[11]
a number of NoSQL systems such as Apache Cassandra, Riak, and Dynamo.^[12]

Hash trees have also been proposed for use in trusted computing systems.^[13]

Overview

A hash tree is a tree of hashes in which the leaves (i.e., leaf nodes, sometimes also called "leafs") are hashes of data blocks in, for instance, a file or set of files. Nodes farther up in the tree are the hashes of their respective children. For example, in a binary tree over four data blocks whose leaf hashes are labelled hash 0-0, hash 0-1, hash 1-0 and hash 1-1, hash 0 is the result of hashing the concatenation of hash 0-0 and hash 0-1. That is, hash 0 = hash( hash 0-0 + hash 0-1 ), where "+" denotes concatenation.
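
The construction can be written out directly for a four-block tree. This is only a sketch: the block contents are hypothetical placeholders, and SHA-256 is assumed as the hash function.

```python
import hashlib

def sha256(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()

# Four hypothetical data blocks; their contents are placeholders.
blocks = [b"L1", b"L2", b"L3", b"L4"]

# Leaf labels: the hash of each data block.
hash_0_0, hash_0_1, hash_1_0, hash_1_1 = (sha256(b) for b in blocks)

# Inner labels: the hash of the concatenated child labels.
hash_0 = sha256(hash_0_0 + hash_0_1)
hash_1 = sha256(hash_1_0 + hash_1_1)

# The label of the root covers every block in the tree.
top_hash = sha256(hash_0 + hash_1)
print(top_hash.hex())
```

Changing any single block changes its leaf label, and therefore every label on the path from that leaf up to the top hash.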

Most hash tree implementations are binary (two child nodes under each node) but they can just as well use many more child nodes under each node.
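
A tree with more than two children per node can be sketched by making the fan-out a parameter. The function name merkle_root and the arity parameter are hypothetical, SHA-256 is assumed, and a final group with fewer than arity members is simply hashed as it is, which is one possible policy among several.

```python
import hashlib

def sha256(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()

def merkle_root(blocks, arity=2):
    """Root of a tree in which each inner node has up to `arity` children."""
    if arity < 2:
        raise ValueError("arity must be at least 2")
    if not blocks:
        raise ValueError("at least one block is required")
    level = [sha256(b) for b in blocks]
    while len(level) > 1:
        # Hash each group of `arity` consecutive labels into one parent label.
        level = [
            sha256(b"".join(level[i:i + arity]))
            for i in range(0, len(level), arity)
        ]
    return level[0]

blocks = [bytes([i]) for i in range(16)]  # sixteen hypothetical blocks
print(merkle_root(blocks, arity=2).hex())  # four levels above the leaves
print(merkle_root(blocks, arity=4).hex())  # two levels above the leaves
```

A larger fan-out makes the tree shallower, but a membership proof must then carry up to arity - 1 sibling hashes per level instead of one.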

Usually, a cryptographic hash function such as SHA-2 is used for the hashing. If the hash tree only needs to protect against unintentional damage, non-cryptographic checksums such as CRCs can be used.
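
As an illustration of the checksum variant, the sketch below builds a binary tree with CRC-32 from Python's zlib module in place of a cryptographic hash. The function names are hypothetical and the number of blocks is assumed to be a power of two. Such a tree can reveal accidental corruption but offers no protection against deliberate tampering.

```python
import zlib

def crc32_bytes(data: bytes) -> bytes:
    """CRC-32 of the data as four big-endian bytes."""
    return zlib.crc32(data).to_bytes(4, "big")

def checksum_root(blocks):
    """Root of a binary checksum tree; assumes a power-of-two block count."""
    level = [crc32_bytes(b) for b in blocks]
    while len(level) > 1:
        level = [
            crc32_bytes(level[i] + level[i + 1])
            for i in range(0, len(level), 2)
        ]
    return level[0]

good = [b"block-%d" % i for i in range(4)]  # hypothetical data blocks
damaged = list(good)
damaged[2] = bytes([good[2][0] ^ 0x01]) + good[2][1:]  # flip a single bit

print(checksum_root(good) != checksum_root(damaged))
```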

At the top of a hash tree there is a top hash (also called the root hash or master hash). Before downloading a file on a P2P network, in most cases the top hash is acquired from a trusted source, for instance a friend or a website known to provide good recommendations of files to download. Once the top hash is available, the hash tree itself can be received from any non-trusted source, such as any peer in the P2P network. The received hash tree is then checked against the trusted top hash; if the hash tree is damaged or fake, another hash tree from another source is tried until the program finds one that matches the top hash.^[14]
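
The checking procedure described above might look roughly as follows. The sketch assumes that each untrusted source supplies the list of leaf hashes, that SHA-256 is the hash, and that an unpaired node is promoted unchanged to the next level; the function names are hypothetical and do not describe any specific P2P client.

```python
import hashlib

def sha256(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()

def root_from_leaves(leaf_hashes):
    """Recompute the top hash from a non-empty list of leaf hashes."""
    level = list(leaf_hashes)
    while len(level) > 1:
        nxt = [sha256(level[i] + level[i + 1])
               for i in range(0, len(level) - 1, 2)]
        if len(level) % 2:
            nxt.append(level[-1])  # assumption: promote an unpaired node
        level = nxt
    return level[0]

def first_valid_tree(candidates, trusted_root):
    """Return the first candidate whose recomputed root matches, else None."""
    for leaf_hashes in candidates:
        if leaf_hashes and root_from_leaves(leaf_hashes) == trusted_root:
            return leaf_hashes
    return None

def block_is_valid(block, index, leaf_hashes):
    """Check one downloaded block against the already verified leaf hashes."""
    return sha256(block) == leaf_hashes[index]

blocks = [b"part-%d" % i for i in range(5)]    # hypothetical file contents
good = [sha256(b) for b in blocks]
trusted_root = root_from_leaves(good)          # obtained from a trusted source
fake = [sha256(b"junk")] + good[1:]            # tree sent by a dishonest peer

tree = first_valid_tree([fake, good], trusted_root)
print(tree == good, block_is_valid(blocks[3], 3, tree))
```

Once a matching tree has been found, each block can be verified on arrival, so a damaged or fake block is detected without waiting for the whole file.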

Second preimage attack

The Merkle hash root does not indicate the tree depth, enabling a second-preimage attack in which an attacker creates a document other than the original that has the same Merkle hash root. For the example above, an attacker can create a new document containing two data blocks, where the first is hash 0-0 + hash 0-1, and the second is hash 1-0 + hash 1-1.^[15]^[16]
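
The collision can be reproduced in a few lines. The sketch assumes SHA-256, plain concatenation of child labels with no marker distinguishing leaves from inner nodes, and four hypothetical data blocks; naive_root is an illustrative name.

```python
import hashlib

def sha256(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()

def naive_root(blocks):
    """Top hash with no distinction between leaf and inner-node hashing.

    Assumes len(blocks) is a power of two.
    """
    level = [sha256(b) for b in blocks]
    while len(level) > 1:
        level = [sha256(level[i] + level[i + 1])
                 for i in range(0, len(level), 2)]
    return level[0]

original = [b"L1", b"L2", b"L3", b"L4"]  # hypothetical four-block document
leaf = [sha256(b) for b in original]     # hash 0-0, 0-1, 1-0, 1-1

# The forged document has two blocks, each the concatenation of two
# leaf hashes of the original document.
forged = [leaf[0] + leaf[1], leaf[2] + leaf[3]]

print(forged != original)                          # different documents
print(naive_root(forged) == naive_root(original))  # same top hash
```

Hashing the forged blocks yields exactly hash 0 and hash 1 of the original tree, so both documents end at the same top hash even though their contents and depths differ.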

One simple fix is defined in Certificate Transparency: when computing leaf node hashes, a 0x00 byte is prepended to the hash data, while 0x01 is prepended when computing internal node hashes.^[14] Limiting the hash tree size is a prerequisite of some formal security proofs, and helps in making some proofs tighter. Some implementations limit the tree depth using hash tree depth prefixes before hashes, so any extracted hash chain is defined to be valid only if the prefix decreases at each step and is still positive when the leaf is reached.
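
A minimal sketch of the prefix approach is shown below. It follows the 0x00 and 0x01 prefixes described above with SHA-256, but it assumes a power-of-two number of blocks and is not a complete implementation of the Certificate Transparency tree, which also defines how other sizes are split. The function names are hypothetical.

```python
import hashlib

def leaf_hash(data: bytes) -> bytes:
    """Leaf label: hash of a 0x00 byte followed by the block."""
    return hashlib.sha256(b"\x00" + data).digest()

def node_hash(left: bytes, right: bytes) -> bytes:
    """Inner label: hash of a 0x01 byte followed by both child labels."""
    return hashlib.sha256(b"\x01" + left + right).digest()

def prefixed_root(blocks):
    """Top hash with leaf and inner-node hashing kept separate.

    Assumes len(blocks) is a power of two.
    """
    level = [leaf_hash(b) for b in blocks]
    while len(level) > 1:
        level = [node_hash(level[i], level[i + 1])
                 for i in range(0, len(level), 2)]
    return level[0]

original = [b"L1", b"L2", b"L3", b"L4"]  # hypothetical four-block document
leaf = [leaf_hash(b) for b in original]

# The same forgery attempt as in the attack: blocks built from leaf labels.
forged = [leaf[0] + leaf[1], leaf[2] + leaf[3]]

print(prefixed_root(forged) == prefixed_root(original))
```

The forged blocks are now hashed with the 0x00 prefix, whereas the inner nodes they imitate were hashed with 0x01, so the two top hashes no longer coincide.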

Tiger tree hash

The Tiger tree hash is a widely used form of hash tree. It uses a binary hash tree (two child nodes under each node), usually has a data block size of 1024 bytes and uses the Tiger hash.^[17]

Tiger tree hashes are used in Gnutella,^[18] Gnutella2, and Direct Connect P2P file sharing protocols^[19] and in file sharing applications such as Phex,^[20] BearShare, LimeWire, Shareaza, DC++^[21] and gtk-gnutella.^[22]

References

[1] Becker, Georg (2008-07-18). "Merkle Signature Schemes, Merkle Trees and Their Cryptanalysis". Ruhr-Universität Bochum. p. 16. http://www.emsec.rub.de/media/crypto/attachments/files/2011/04/becker_1.pdf.
[2] "Handbook of Applied Cryptography". Section 13.4.1. https://cacr.uwaterloo.ca/hac/.
[3] Merkle, R. C. (1988). "A Digital Signature Based on a Conventional Encryption Function". Advances in Cryptology – CRYPTO '87. Lecture Notes in Computer Science. 293. pp. 369–378. doi:10.1007/3-540-48184-2_32. ISBN 978-3-540-18796-7.
[4] Ralph Merkle, "Method of providing digital signatures", US patent 4309569, published Jan 5, 1982, assigned to The Board of Trustees of the Leland Stanford Junior University.
[5] "hashtree developer page". https://hashtree.cc/#/dev.
[6] Bonwick, Jeff (2005-12-08). "ZFS End-to-End Data Integrity". https://blogs.oracle.com/bonwick/entry/zfs_end_to_end_data.
[7] Likai Liu. "Bitrot Resistance on a Single Drive". http://lifecs.likai.org/2014/06/bitrot-resistance-on-single-drive.html.
[8] "General Verifiable Federation". http://www.waveprotocol.org/whitepapers/wave-protocol-verification.
[9] "Introduction to ZFS — openzfs latest documentation". https://openzfs.readthedocs.io/en/latest/introduction.html.
[10] Koblitz, Neal; Menezes, Alfred J. (January 2016). "Cryptocash, cryptocurrencies, and cryptocontracts". Designs, Codes and Cryptography 78 (1): 87–102. doi:10.1007/s10623-015-0148-5.
[11] Dolstra, E. The Purely Functional Software Deployment Model. PhD thesis, Faculty of Science, Utrecht, The Netherlands. January 2006. p. 21. ISBN 90-393-4130-3.
[12] Adam Marcus. "The NoSQL Ecosystem". http://www.aosabook.org/en/nosql.html. "When a replica is down for an extended period of time, or the machine storing hinted handoffs for an unavailable replica goes down as well, replicas must synchronize from one another. In this case, Cassandra and Riak implement a Dynamo-inspired process called anti-entropy. In anti-entropy, replicas exchange Merkle trees to identify parts of their replicated key ranges which are out of sync. A Merkle tree is a hierarchical hash verification: if the hash over the entire keyspace is not the same between two replicas, they will exchange hashes of smaller and smaller portions of the replicated keyspace until the out-of-sync keys are identified. This approach reduces unnecessary data transfer between replicas which contain mostly similar data."
[13] Kilian, J. (1995). "Improved Efficient Arguments". Advances in Cryptology — CRYPTO '95. Lecture Notes in Computer Science. 963. pp. 311–324. doi:10.1007/3-540-44750-4_25. ISBN 978-3-540-60221-7. https://link.springer.com/content/pdf/10.1007/3-540-44750-4_25.pdf.
[14] Laurie, B.; Langley, A.; Kasper, E. (June 2013). "Certificate Transparency". IETF. doi:10.17487/rfc6962. https://www.rfc-editor.org/info/rfc6962.
[15] Elena Andreeva; Charles Bouillaguet; Orr Dunkelman; John Kelsey (January 2009). "Herding, Second Preimage and Trojan Message Attacks beyond Merkle-Damgård". Selected Areas in Cryptography. Lecture Notes in Computer Science. 5867. SAC. pp. 393–414. doi:10.1007/978-3-642-05445-7_25. ISBN 978-3-642-05443-3.
[16] Elena Andreeva; Charles Bouillaguet; Pierre-Alain Fouque; Jonathan J. Hoch; John Kelsey; Adi Shamir; Sebastien Zimmer (2008). "Second Preimage Attacks on Dithered Hash Functions". In Smart, Nigel. Advances in Cryptology – EUROCRYPT 2008. Lecture Notes in Computer Science. 4965. Istanbul, Turkey. pp. 270–288. doi:10.1007/978-3-540-78967-3_16. ISBN 978-3-540-78966-6.
[17] Chapweske, J.; Mohr, G. (March 4, 2003). "Tree Hash EXchange format (THEX)". http://open-content.net/specs/draft-jchapweske-thex-02.html.
[18] "tigertree.c File Reference". https://gtk-gnutella.sourceforge.net/doxygen/tigertree_8c.html.
[19] "Audit: P2P DirectConnect Application". https://www.symantec.com/security_response/attacksignatures/detail.jsp?asid=21587.
[20] "Phex 3.0.0 released". 7 Jan 2007. http://www.phex.org/mambo/content/view/80/2/.
[21] "DC++'s feature list". http://dcplusplus.sourceforge.net/features.html.
[22] "Development". http://gtk-gnutella.sourceforge.net/en/?page=devel.

External links

A C implementation of a dynamically re-sizeable binary SHA-256 hash tree (Merkle tree)
Merkle tree implementation in Java
Tiger Tree Hash (TTH) source code in C#, by Gil Schmidt
Tiger Tree Hash (TTH) implementations in C and Java
RHash, an open source command-line tool, which can calculate TTH and magnet links with TTH

Original source: https://en.wikipedia.org/wiki/Merkle tree.
