Skip to main content
Log in

Scalable blockchain storage systems: research progress and models

  • Regular Paper
  • Published:
Computing Aims and scope Submit manuscript

Abstract

Blockchain is believed to be able to build trust among multiple parties and improve the operational efficiency of economy and society, which is attributed to its decentralization property. However, the endless growth of Blockchain data keeps challenging the storage capacity of Blockchain nodes, compromising decentralization, and revealing the issue that the state-of-the-art storage systems of Blockchain are not scalable. From the perspective of improving the scalability of Blockchain storage systems, this paper introduces the logical and physical data structure used by Blockchain storage systems, surveys the current schemes of Blockchain storage systems in terms of the approaches to reducing data redundancy, the corresponding influence on its degree of decentralization and data reliability, and conducts quantitative analysis on data redundancy for schemes of Blockchain storage systems. The study finds that the key to realizing scalable Blockchain storage systems is to deal with the contradiction between data redundancy and its decentralization characteristic. Based on the findings, the node-based scalable model for Blockchain storage systems (SMBSS) is proposed, experimental analysis on the prototypes of the SMBSS is carried out to verify its validity, and future directions for scalable Blockchain storage systems are concluded.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  1. Yuan Y, Wang FY (2016) Blockchain: the state of the art and future trends. Acta Autom Sin 42(4):481–494

    MathSciNet  Google Scholar 

  2. Nakamoto S (2009) Bitcoin: a peer-to-peer electronic cash system. Preprint at https://bitcoin.org/bitcoin.pdf

  3. Buterin V (2014) Ethereum white paper: a next generation smart contract & decentralized application platform. Preprint at https://ethereum.org/en/whitepaper/

  4. Androulaki E, Manevich Y, Muralidharan S, Murthy C, Laventman G (2018) Hyperledger fabric: a distributed operating system for permissioned blockchains. In: Paper presented at the 13th European conference on computer systems, 23–26 April 2018

  5. Hayes A (2020) Investopedia. figshare https://www.investopedia.com/terms/s/scalability.asp

  6. BtcBitaps. figshare https://btc.bitaps.com (2021)

  7. Bitnodes. figshare https://bitnodes.io/dashboard/?days=730 (2021)

  8. EthBitaps. figshare https://eth.bitaps.com/ (2021)

  9. BitcoinJ. figshare https://bitnodes.io/dashboard/?days=730

  10. Mckinney J, Light client protocol. figshare https://github.com/ethereum/wiki/wiki/Light-client-protocol

  11. Zhang Z, Wang G, Xu J, Du X (2020) Survey on data management in blockchain systems. J Softw 31(9):2903–2925

    Google Scholar 

  12. Yuan Y, Wang FY (2020) Editable blockchain: models, techniques and methods. Acta Autom Sin 46(5):831–846

    Google Scholar 

  13. Merkle RC (1980) Protocols for public key cryptosystems. In: Paper presented at 1980 IEEE symposium on security and privacy, 14–16 April 1980

  14. Buterin V, Merkling in ethereum. figshare https://blog.ethereum.org/2015/11/15/merkling-in-ethereum

  15. Xu J, Wei L, Zhang Y, Wang A, Zhou F, Gao CZ (2018) Dynamic fully homomorphic encryption-based Merkle tree for lightweight streaming authenticated data structures. J Netw Comput Appl 107:113–124

    Article  Google Scholar 

  16. Shao QF, Jin CQ, Zhang Z, Qian WN, Zhou AY (2018) Blockchain: architecture and research progress. J Softw 41(5):969–988

    Google Scholar 

  17. Buterin V, Thoughts on UTXOs. figshare https://medium.com/@ConsenSys/thoughts-on-utxo-by-vitalik-buterin-2bb782c67e53

  18. Bitcoin core 0.11. figshare https://en.bitcoin.it/wiki/Bitcoin_Core_0.11_(ch_2):_Data_Storage

  19. Xie J, Yu FR, Huang T, Xie R, Liu J, Liu Y (2019) A survey on the scalability of blockchain systems. IEEE Net 33(5):166–173

    Article  Google Scholar 

  20. Shuai Z, Yong Y, Xiao-Chun N, Fei-Yue W (2019) Scaling blockchain towards bitcoin: key technologies, constraints and related issues. Acta Autom Sin 45(6):1015–1030

    Google Scholar 

  21. Politou E, Casino F, Alepis E, Patsakis C (2019) Blockchain mutability: challenges and proposed solutions. IEEE Trans Emerg Top Comput, arXiv:1907.07099

  22. Zhou Q, Huang H, Zheng Z (2020) Solutions to scalability of blockchain: a survey. IEEE Access 8:16440–16455. https://doi.org/10.1109/ACCESS.2020.2967218

    Article  Google Scholar 

  23. Sun Z, Zhang X, Xiang F, Chen L (2021) Survey of storage scalability on blockchain. J Softw 32(1):1–20

    Google Scholar 

  24. Zhao YL, Niu BN, Li P, Fan X (2019) A novel enhanced lightweight node for blockchain. In: Paper presented at the 1st blockchain and trustworthy systems, 7–8 December 2019

  25. Shard wiki. figshare https://en.wikipedia.org/wiki/Shard_(database_architecture

  26. Yu B, Li X, Zhao H (2020) Virtual block group: a scalable blockchain model with partial node storage and distributed hash table. Comput J 63(10):1524–1536

    Article  MathSciNet  Google Scholar 

  27. Zhang XH, Niu BN, Gong T (2021) Account-based blockchain scalable storage model. Preprint at https://kns.cnki.nct/kcms/detail/11.2625.V.20210316.1345.002.html

  28. Wang YW (2020) A blockchain system with lightweight full node based on dew computing. Internet of Things 10(3):100184

    Article  Google Scholar 

  29. Wang Y (2016) Definition and categorization of dew computing. Open J Cloud Comput 3(1):1–7

    Article  Google Scholar 

  30. Wang YW (2015) Cloud-dew architecture. Int J Cloud Comput 4:199–210

    Article  Google Scholar 

  31. Jia DY, Xin JC, Wang ZQ, Wei GG, Wang GR (2018) ElasticChain: support very large blockchain by reducing data redundancy. In: Paper presented at the 2018 Asia-Pacific Web (APWeb) and web-age information management (WAIM) joint international conference on web and big data, 23–25 July 2018

  32. Jia DY, Xin JC, Wang ZQ, Wei GG, Wang GR (2018) Mitigating bitcoin node storage size by DHT. In: Paper presented at the 2018 Asian internet engineering conference, 12–14 November 2018

  33. Kaneko Y, Asaka T (2018) DHT clustering for load balancing considering blockchain data size. In: Paper presented at the 6th international symposium on computing and networking workshops (CANDARW), 27–30 November 2018

  34. Perard D, Lacan J, Bachy Y, Detchart J (2018) Erasure code-based low storage blockchain node. In: Paper presented at the 2018 Asian internet engineering conference, 30 July–3 August 2018

  35. Li J, Li B (2013) Erasure coding for cloud storage systems: a survey. Tsinghua Sci Technol 3:259–272

    Article  Google Scholar 

  36. Li J, Li B (2018) A low storage room requirement framework for distributed ledger in blockchain. IEEE Access 2018:22970–22975

    Google Scholar 

  37. Palm E, Schelen O, Bodin U (2018) Selective blockchain transaction pruning and state derivability. In: Paper presented at the 2018 crypto valley conference on blockchain technology (CVCBT), 20–22 June 2018

  38. Florian M, Henningsen S, Beaucamp S, Scheuermann B (2019) Erasing data from blockchain nodes. In: Paper presented at the 2019 IEEE European symposium on security and privacy workshops (EuroS&PW), 17–19 June 2019

  39. Bruce JD, The mini-blockchain scheme. figshare http://cryptonite.info/files/mbc-scheme-rev3.pdf

  40. Chepurnoy A, Larangeira M, Ojiganov A (2019) Rollerchain, a blockchain with safely pruneable full blocks. Preprint at http://arxiv.org/abs/1603.07926

  41. Feng X, Ma J, Miao Y, Meng Q, Liu X, Jiang Q, Li H (2019) Pruneable sharding-based blockchain protocol. Peer-to-Peer Netw Appl 12:934–950

    Article  Google Scholar 

  42. Buterin V, State Tree Pruning. figshare https://blog.ethereum.org/2015/06/26/state-tree-pruning/

  43. Ehrenhofer J, Monero adds blockchain pruning and improves transaction efficiency. figshare https://web.getmonero.org/zh-cn/2019/02/01/pruning.html

  44. Dai X, Xiao J, Yang W, Wang C, Jin H (2019) Jidar: a jigsaw-like data reduction approach without trust assumptions for bitcoin system. In: Paper presented at the 2019 IEEE 39th international conference on distributed computing systems (ICDCS) , 7–10 July 2019

  45. Block file pruning. figshare https://github.com/bitcoin/bitcoin/blob/v0.11.0/doc/release-notes.md#block-file-pruning

  46. Luu L, Narayanan V, Zheng C, Baweja K, Saxena P (2016) A secure sharding protocol for open blockchains. In: Paper presented at the 2016 ACM SIGSAC conference, 24–28 October 2016

  47. Kokoris-kogias E, Jovanovic P, Gasser L, Gailly N, Syta E, Ford B (2016) OmniLedger: a secure, scale-out, decentralized ledger via sharding. In: Paper presented at the 2018 IEEE symposium on security and privacy (SP), 20–24 May 2018

  48. Xu Z, Han S, Lei C (2018) CUB, a consensus unit-based storage scheme for blockchain system. In: Paper presented at the 2018 IEEE 34th international conference on data engineering (ICDE), 16–19 April 2018

  49. Kokoris-kogias E, Jovanovic P, Gasser L, Gailly N, Syta E, Ford B (2016) RapidChain: scaling blockchain via full sharding. In: Paper presented at the 2018 ACM SIGSAC conference on computer and communications security, 15–19 October 2018

  50. Pamies-Juarez L, Sanchez-Artigas M, Garcia-Lopez P, Mondejar R, Chaabouni R (2014) On the interplay between data redundancy and retrieval times in p2p storage systems. Comput Netw 59(11):1–16

    Article  Google Scholar 

  51. Segura SD, Perez-Sola C, Navarro-Arribas G, Herrera-Joancomarti J (2016) Analysis of the Bitcoin UTXO set. In: Paper presented at the 22nd international conference on financial cryptography and data security, 26 Feb 2018

  52. Xu J (2011) (ed.): Research on data reliability in Peer-to-peer network storage system. Harbin: Harbin Engineering University

  53. Borel E (1962) (ed.): Probabilities and life. Maurice Baudin Dover Publications, Inc., New York

  54. Bitcoin-ETL developer. figshare https://github.com/blockchain-etl/bitcoin-etl

  55. Ethereum Network Status. figshare https://ethstats.net

  56. PeerSim. figshare http://peersim.sourceforge.net/

Download references

Acknowledgements

This research is supported by National Natural Science Foundation of China (62072326) and the International Cooperation Project of the Major Research, Development Program of Shanxi (201903D421007) and Hubei Key Laboratory of Optical Information and Pattern Recognition, Wuhan Institute of Technology (201903).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Baoning Niu.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Fan, X., Niu, B. & Liu, Z. Scalable blockchain storage systems: research progress and models. Computing 104, 1497–1524 (2022). https://doi.org/10.1007/s00607-022-01063-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00607-022-01063-8

Keywords

Mathematics Subject Classification

Navigation