Abstract
Blockchain is believed to be able to build trust among multiple parties and improve the operational efficiency of economy and society, which is attributed to its decentralization property. However, the endless growth of Blockchain data keeps challenging the storage capacity of Blockchain nodes, compromising decentralization, and revealing the issue that the state-of-the-art storage systems of Blockchain are not scalable. From the perspective of improving the scalability of Blockchain storage systems, this paper introduces the logical and physical data structure used by Blockchain storage systems, surveys the current schemes of Blockchain storage systems in terms of the approaches to reducing data redundancy, the corresponding influence on its degree of decentralization and data reliability, and conducts quantitative analysis on data redundancy for schemes of Blockchain storage systems. The study finds that the key to realizing scalable Blockchain storage systems is to deal with the contradiction between data redundancy and its decentralization characteristic. Based on the findings, the node-based scalable model for Blockchain storage systems (SMBSS) is proposed, experimental analysis on the prototypes of the SMBSS is carried out to verify its validity, and future directions for scalable Blockchain storage systems are concluded.
Similar content being viewed by others
References
Yuan Y, Wang FY (2016) Blockchain: the state of the art and future trends. Acta Autom Sin 42(4):481–494
Nakamoto S (2009) Bitcoin: a peer-to-peer electronic cash system. Preprint at https://bitcoin.org/bitcoin.pdf
Buterin V (2014) Ethereum white paper: a next generation smart contract & decentralized application platform. Preprint at https://ethereum.org/en/whitepaper/
Androulaki E, Manevich Y, Muralidharan S, Murthy C, Laventman G (2018) Hyperledger fabric: a distributed operating system for permissioned blockchains. In: Paper presented at the 13th European conference on computer systems, 23–26 April 2018
Hayes A (2020) Investopedia. figshare https://www.investopedia.com/terms/s/scalability.asp
BtcBitaps. figshare https://btc.bitaps.com (2021)
Bitnodes. figshare https://bitnodes.io/dashboard/?days=730 (2021)
EthBitaps. figshare https://eth.bitaps.com/ (2021)
BitcoinJ. figshare https://bitnodes.io/dashboard/?days=730
Mckinney J, Light client protocol. figshare https://github.com/ethereum/wiki/wiki/Light-client-protocol
Zhang Z, Wang G, Xu J, Du X (2020) Survey on data management in blockchain systems. J Softw 31(9):2903–2925
Yuan Y, Wang FY (2020) Editable blockchain: models, techniques and methods. Acta Autom Sin 46(5):831–846
Merkle RC (1980) Protocols for public key cryptosystems. In: Paper presented at 1980 IEEE symposium on security and privacy, 14–16 April 1980
Buterin V, Merkling in ethereum. figshare https://blog.ethereum.org/2015/11/15/merkling-in-ethereum
Xu J, Wei L, Zhang Y, Wang A, Zhou F, Gao CZ (2018) Dynamic fully homomorphic encryption-based Merkle tree for lightweight streaming authenticated data structures. J Netw Comput Appl 107:113–124
Shao QF, Jin CQ, Zhang Z, Qian WN, Zhou AY (2018) Blockchain: architecture and research progress. J Softw 41(5):969–988
Buterin V, Thoughts on UTXOs. figshare https://medium.com/@ConsenSys/thoughts-on-utxo-by-vitalik-buterin-2bb782c67e53
Bitcoin core 0.11. figshare https://en.bitcoin.it/wiki/Bitcoin_Core_0.11_(ch_2):_Data_Storage
Xie J, Yu FR, Huang T, Xie R, Liu J, Liu Y (2019) A survey on the scalability of blockchain systems. IEEE Net 33(5):166–173
Shuai Z, Yong Y, Xiao-Chun N, Fei-Yue W (2019) Scaling blockchain towards bitcoin: key technologies, constraints and related issues. Acta Autom Sin 45(6):1015–1030
Politou E, Casino F, Alepis E, Patsakis C (2019) Blockchain mutability: challenges and proposed solutions. IEEE Trans Emerg Top Comput, arXiv:1907.07099
Zhou Q, Huang H, Zheng Z (2020) Solutions to scalability of blockchain: a survey. IEEE Access 8:16440–16455. https://doi.org/10.1109/ACCESS.2020.2967218
Sun Z, Zhang X, Xiang F, Chen L (2021) Survey of storage scalability on blockchain. J Softw 32(1):1–20
Zhao YL, Niu BN, Li P, Fan X (2019) A novel enhanced lightweight node for blockchain. In: Paper presented at the 1st blockchain and trustworthy systems, 7–8 December 2019
Shard wiki. figshare https://en.wikipedia.org/wiki/Shard_(database_architecture
Yu B, Li X, Zhao H (2020) Virtual block group: a scalable blockchain model with partial node storage and distributed hash table. Comput J 63(10):1524–1536
Zhang XH, Niu BN, Gong T (2021) Account-based blockchain scalable storage model. Preprint at https://kns.cnki.nct/kcms/detail/11.2625.V.20210316.1345.002.html
Wang YW (2020) A blockchain system with lightweight full node based on dew computing. Internet of Things 10(3):100184
Wang Y (2016) Definition and categorization of dew computing. Open J Cloud Comput 3(1):1–7
Wang YW (2015) Cloud-dew architecture. Int J Cloud Comput 4:199–210
Jia DY, Xin JC, Wang ZQ, Wei GG, Wang GR (2018) ElasticChain: support very large blockchain by reducing data redundancy. In: Paper presented at the 2018 Asia-Pacific Web (APWeb) and web-age information management (WAIM) joint international conference on web and big data, 23–25 July 2018
Jia DY, Xin JC, Wang ZQ, Wei GG, Wang GR (2018) Mitigating bitcoin node storage size by DHT. In: Paper presented at the 2018 Asian internet engineering conference, 12–14 November 2018
Kaneko Y, Asaka T (2018) DHT clustering for load balancing considering blockchain data size. In: Paper presented at the 6th international symposium on computing and networking workshops (CANDARW), 27–30 November 2018
Perard D, Lacan J, Bachy Y, Detchart J (2018) Erasure code-based low storage blockchain node. In: Paper presented at the 2018 Asian internet engineering conference, 30 July–3 August 2018
Li J, Li B (2013) Erasure coding for cloud storage systems: a survey. Tsinghua Sci Technol 3:259–272
Li J, Li B (2018) A low storage room requirement framework for distributed ledger in blockchain. IEEE Access 2018:22970–22975
Palm E, Schelen O, Bodin U (2018) Selective blockchain transaction pruning and state derivability. In: Paper presented at the 2018 crypto valley conference on blockchain technology (CVCBT), 20–22 June 2018
Florian M, Henningsen S, Beaucamp S, Scheuermann B (2019) Erasing data from blockchain nodes. In: Paper presented at the 2019 IEEE European symposium on security and privacy workshops (EuroS&PW), 17–19 June 2019
Bruce JD, The mini-blockchain scheme. figshare http://cryptonite.info/files/mbc-scheme-rev3.pdf
Chepurnoy A, Larangeira M, Ojiganov A (2019) Rollerchain, a blockchain with safely pruneable full blocks. Preprint at http://arxiv.org/abs/1603.07926
Feng X, Ma J, Miao Y, Meng Q, Liu X, Jiang Q, Li H (2019) Pruneable sharding-based blockchain protocol. Peer-to-Peer Netw Appl 12:934–950
Buterin V, State Tree Pruning. figshare https://blog.ethereum.org/2015/06/26/state-tree-pruning/
Ehrenhofer J, Monero adds blockchain pruning and improves transaction efficiency. figshare https://web.getmonero.org/zh-cn/2019/02/01/pruning.html
Dai X, Xiao J, Yang W, Wang C, Jin H (2019) Jidar: a jigsaw-like data reduction approach without trust assumptions for bitcoin system. In: Paper presented at the 2019 IEEE 39th international conference on distributed computing systems (ICDCS) , 7–10 July 2019
Block file pruning. figshare https://github.com/bitcoin/bitcoin/blob/v0.11.0/doc/release-notes.md#block-file-pruning
Luu L, Narayanan V, Zheng C, Baweja K, Saxena P (2016) A secure sharding protocol for open blockchains. In: Paper presented at the 2016 ACM SIGSAC conference, 24–28 October 2016
Kokoris-kogias E, Jovanovic P, Gasser L, Gailly N, Syta E, Ford B (2016) OmniLedger: a secure, scale-out, decentralized ledger via sharding. In: Paper presented at the 2018 IEEE symposium on security and privacy (SP), 20–24 May 2018
Xu Z, Han S, Lei C (2018) CUB, a consensus unit-based storage scheme for blockchain system. In: Paper presented at the 2018 IEEE 34th international conference on data engineering (ICDE), 16–19 April 2018
Kokoris-kogias E, Jovanovic P, Gasser L, Gailly N, Syta E, Ford B (2016) RapidChain: scaling blockchain via full sharding. In: Paper presented at the 2018 ACM SIGSAC conference on computer and communications security, 15–19 October 2018
Pamies-Juarez L, Sanchez-Artigas M, Garcia-Lopez P, Mondejar R, Chaabouni R (2014) On the interplay between data redundancy and retrieval times in p2p storage systems. Comput Netw 59(11):1–16
Segura SD, Perez-Sola C, Navarro-Arribas G, Herrera-Joancomarti J (2016) Analysis of the Bitcoin UTXO set. In: Paper presented at the 22nd international conference on financial cryptography and data security, 26 Feb 2018
Xu J (2011) (ed.): Research on data reliability in Peer-to-peer network storage system. Harbin: Harbin Engineering University
Borel E (1962) (ed.): Probabilities and life. Maurice Baudin Dover Publications, Inc., New York
Bitcoin-ETL developer. figshare https://github.com/blockchain-etl/bitcoin-etl
Ethereum Network Status. figshare https://ethstats.net
PeerSim. figshare http://peersim.sourceforge.net/
Acknowledgements
This research is supported by National Natural Science Foundation of China (62072326) and the International Cooperation Project of the Major Research, Development Program of Shanxi (201903D421007) and Hubei Key Laboratory of Optical Information and Pattern Recognition, Wuhan Institute of Technology (201903).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Fan, X., Niu, B. & Liu, Z. Scalable blockchain storage systems: research progress and models. Computing 104, 1497–1524 (2022). https://doi.org/10.1007/s00607-022-01063-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00607-022-01063-8
Keywords
- Blockchain
- Schemes of Blockchain storage systems
- Data redundancy
- Decentralization
- Node-based scalable model for Blockchain storage systems