You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 4 Next »


Modularization of Hyperledger Besu - can we make Besu more flexible by factoring it into decoupled components which can be exchanged for alternate implementations?

Goal of this document: 

  • Starting a conversation about modularizing Besu.
  • Keeping track of the discussions.

General context


We are getting various signals that the future of blockchain technologies is all about modularity.  If L2 chains on top of L1 chains are the future, how can we make an L1 client that can be composed from various implementations of sub-components?  We also see evidence of this elsewhere - The Merge separated consensus from execution. MEV actors like Flashbots separate proposing a block from building it. Even within clients, we see teams like Erigon re-writing their client in different languages, and combining the best performing subcomponents regardless of language.

Apart from the general direction of blockchain, software has been trending away from monolithic implementations, in order to maximize developer efficiency, and reduce change fatigue. Smaller components can reach stability more easily than large monoliths can. 

Potential Benefits

Releases - finer grained components could have a finer grained release process, speeding up the release cycle.

Reduces cognitive complexity - better defined scope for contributors to target a specific part of the codebase.  New developers can focus more narrowly, and get up to speed faster with fewer distractions.

Increases pace of innovation - experiments and prototypes become much easier, faster, and lower risk to pursue.

End User Control - software modularity should lend itself easily to greater customizability for the end user. Whether this is exposed to them has yet to be discussed.

General Concerns and Challenges, Possible Mitigations

  1. Engineering effort around Besu
    1. Large engineering effort - we will need to always prefer incremental delivery over greenfield or big-bang approaches.
    2. Series of workshops to define the work
  2. Technical project organization
    1. Communication planning
      1. Internal - how do we make sure all Besu contributors can keep their finger on the pulse of this initiative.
      2. External - do we need to convey this to external users or interested parties. If so, how?
    2. multi stakeholders discussion, federating people around modular besu. Examples of stakeholders this would benefit:
      1. MEV searchers
      2. rollup implementers
      3. infrastructure providers like Infura or Alchemy
      4. developers

Besu Minimum Useful Components

Hypothetical situations that would benefit from component composition:

  • EVM and state are needed and not the Consensus. Ex: Rollups, Hedera Hashgraph, EVM testing tools
  • Transaction pool and block gossip needed. Mev searchers. 
    • Possibly EVM needed to for gas use analysis.
  • All-in-one mainnet client that provides ethereum proof-of-stake as its only consensus mechanism.
  • State Synch Testbed, rapid prototyping for data stores which can be populated with state changes from a moving chain.

Potential First Steps

  • Catalog all components 
  • Test approach on one or more situation listed above. 
  • Extrapolate out rough timeline on MVP scope and modules timing vs the catalog.
  • Scope MVP (minimum viable platform)



Debrief of meeting with Erigon

Meeting #1 - 9/14/21

Participants: Alexey, Madeline, Sajida

  • Sentry component
  • C++ and rust implementation are being done
  • Each reimplem takes less time than the precedent
  • Contrary to popular belief, it’s not hard to rewrite things from scratch. Might even be easier.
  • Alexey wants to start a Java reimplementation, and they don’t have anyone to do it in java
  • Besu in ⅔ years - he sees a dead end for the monolith model like besu, nethermind, openE
  • Geth snapshotter; Geth realised that traversing the tree
  • Collaboration would be:
    • Join their family of product
    • Reimplement core product like evm
    • Make them compatible with their others components
    • That will be a 4th compatible implement to their portfolio



  • Erigon is funded by EF, gnosis and small amount from various org 
  • They are hiring for the go implementation, they have 2 active dev, they might bring couple other, it is a small team
    • Cpp team : ⅘ ppl
    • Rust team: 2,5 ppl , some of them are not employed but just contributing part time


  • Cycles of modularization
    • 1st rewrite: 2017 - 4 years or 3,5 years
    • 2st rewrite may 2020 - c++ w/ couple ppl , now they are almost finish the core component (1 year and half) might get the core component roughly finished end of 2021
    • 3rd rewrite jan 2021 - rust, could get to the same level as the other by the end of 2021, so 1 year; Rust will be ahead of the C++ implementation
    • He predicts that with Besu in 6 months because we already have a codebase, we don’t start from scratch.
  • Should we join the effort ? should we invest in Erigon?




Meeting #2 - 10/6/21

Participants: Artem +1, Gary, Sajida

  • Starting from scratch is easier than refactoring existing code into Erigon architecture.
  • Artem used to work on OE and is now working on Acula (rust) mainly alone for 4 months and it’s already passing consensus.
  • Modularization
    • Breaking the monolith - reusable parts: tx pool, consensus engine, sync module
    • Sync module is interesting alone to process by block or by stage
    • might require a change of database, stage sync require MVCC database  (LMDB, Badger LSMbased, B+2
    • it might be possible to start module by module. 
    • Data model could be a good start (might reduce space consumption). 
    • We already have a pluggable storage engine that we could Interface of the pluggable storage resembles MDB/LMDB/DBX Peer 2 peer part (sentry) of Geth was re-used by Erigon but the plumbing is totally different
    • Erigon is heavily optimized toward sequential writes. Random reads / Sequential write - very fast for MDBX.
  • EVM bug leveraging a hole in the memory as triggered by a tx, that was broadcasted everywhere and affected all clients (even on Binance smart chain) - spreads like wildfire.
  • If they have a clique ethereum, fork the module, modify it and connect to JRPC and connect the rest of Erigon. You just had to invest time in creating a module and you get the rest of the client for free.
  • Erigon can be run as a Kubernetes cluster.
  • Transaction pool should get EVM inside and be able to be part of the consensus. It is a security parameter. If we have a DOS attack, the tx pool should guard the blockchain from an attack. Having multiple tx pools that could coexist: one for MEV, on maybe getting DOS in this scenario and one running smoothly. And then you can pick the one that can do the work. Any tx pool could go down while the node is still up. Node is behind the “forest” of other P2P nodes. Ex: Besu sentry (x10 instances), all sentries go down but the core that runs the database/blockchain and stores the chain stays up.
  • The idea of modularity; you make the core, the spec, and the rest is up to you.
  • Andrew: maintainer of yellow paper, has an enum that maps to yellow paper parts. He runs silkworm - very good resource to start the work. Should be interesting to Justin.




  • Very fruitful to invest R&D in this because lots of work has been done so the cycle of reimplementation are getting smaller
  • Refactor: use case -> modularity for l2 , rollups, pluggable, MEV
  • Argument:
    • database - we (besu) have a trie in a trie MPT (access complexity is multiplied). so just switching to another data model would increase our performance. 
    • Erigon threw out the MPT (merkle patricia trie) completely and computes state root post execution and other than that we have a flat state. Plain state table: value = account, key = account address. We are almost there with bonsai on the flat storage but we should work on simplifying
    • using JRPC sure adds communication overhead but it brings so much value in other places that they (erigon) can live with it - JRPC could be replaced of course by something else, like jar(?)



  • No labels