feat: replace soft tx cache with revalidation-based validation cache by mpskowron · Pull Request #744 · midnightntwrk/midnight-node

mpskowron · 2026-02-22T17:54:08Z

Overview

Remove the soft transaction cache and introduce transaction revalidation: when the ledger state changes, cached VerifiedTransaction entries are reused by revalidating them against a RevalidationReference instead of re-running full ZK proof verification. This PR also adds cache metrics (miss, strict hit, revalidation hit) and tests covering the full validation lifecycle. The metrics are not backward compatible, so the Grafana dashboards need to be updated.

Revalidation skips the most costly part of transaction verification, the ZK proofs, which are independent of ledger state. This should be a significant performance improvement. We now also rerun apply on every mempool revalidation. This is slower than the previous solution (cache the result once and never rerun it in the mempool), but on the other hand it lets us invalidate transactions in the mempool earlier.

🗹 TODO before merging

Benchmark performance
Ready

📌 Submission Checklist

Changes are backward-compatible (or flagged if breaking): grafana metrics need updating
Pull request description explains why the change is needed
Self-reviewed the diff
I have included a change file, or skipped for this reason:
If the changes introduce a new feature, I have bumped the node minor version
Update documentation (if relevant)
Updated AGENTS.md if build commands, architecture, or workflows changed
No new todos introduced

🧪 Testing Evidence

Please describe any additional testing aside from CI:

TBD

Additional tests are provided (if possible)

🔱 Fork Strategy

Node Runtime Update
Node Client Update
Other:
N/A

Links

Jira1: https://shielded.atlassian.net/browse/PM-21736
Jira2: https://shielded.atlassian.net/browse/PM-18691

github-actions · 2026-02-22T17:55:22Z

KICS version: v2.1.16

Category	Results
CRITICAL	0
HIGH	0
MEDIUM	96
LOW	12
INFO	83
TRACE	0
TOTAL	191

Metric	Values
Files scanned	31
Files parsed	31
Files failed to scan	0
Total executed queries	73
Queries failed to execute	0
Execution time	9

mpskowron · 2026-02-23T11:39:31Z

ledger/src/versions/common/mod.rs


-#[derive(PartialEq, Eq, Hash)]
-pub struct StrictTxValidationKey {
-	state_hash: Hash,


Twox128 has no cryptographic collision resistance, so using it as a cache key seems risky. Was there a specific reason it was chosen over tx.hash(), or is replacing it with the transaction's own hash straightforwardly safe here?
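
To make the question concrete, here is a minimal sketch of a key built from the transaction's own hash plus the runtime version. Everything in it is an assumption for illustration: `TxHash`, `TxValidationKey`, `KeyedCache`, and the field names are hypothetical and not the types used in midnight-node. The idea is that `HashMap` compares full keys on lookup, so two transactions can only share an entry if their full 32-byte hashes (and versions) are equal.

```rust
use std::collections::HashMap;

/// Hypothetical 32-byte transaction hash, assumed to come from a
/// collision-resistant hash function (not the real midnight-node type).
type TxHash = [u8; 32];

/// Illustrative cache key: the transaction's own hash plus the runtime
/// spec version. Field names are assumptions made for this sketch.
#[derive(Clone, Copy, Debug, PartialEq, Eq, Hash)]
struct TxValidationKey {
    tx_hash: TxHash,
    spec_version: u32,
}

/// Toy cache mapping a key to a placeholder "verified" marker.
struct KeyedCache {
    entries: HashMap<TxValidationKey, &'static str>,
}

impl KeyedCache {
    fn new() -> Self {
        Self { entries: HashMap::new() }
    }

    /// Stores a value under (tx_hash, spec_version).
    fn insert(&mut self, tx_hash: TxHash, spec_version: u32, value: &'static str) {
        self.entries.insert(TxValidationKey { tx_hash, spec_version }, value);
    }

    /// Looks up a value; both the hash and the version must match exactly.
    fn get(&self, tx_hash: TxHash, spec_version: u32) -> Option<&'static str> {
        self.entries
            .get(&TxValidationKey { tx_hash, spec_version })
            .copied()
    }
}
```

With a key like this, a lookup for the same transaction under a different runtime version is a miss, which is also what the tests in this PR rely on.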

mpskowron · 2026-02-23T11:42:11Z

primitives/ledger/src/lib.rs

These changes require updating the Grafana dashboards, but I think they will give us much better insight into the performance impact of this change. Where can I update them?

mpskowron · 2026-02-23T11:45:42Z

ledger/src/versions/common/mod.rs

+			tx_validation_key,
+			Arc::new(TxValidationValue {
+				verified_tx: verified_tx.clone(),
+				state: ledger.state.clone(),


Ledger state is a bag of pointers, so caching it directly should be fine. However, I'm wondering: does keeping multiple clones of the ledger state alive affect ledger performance?
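
As a rough illustration of the "bag of pointers" point, the sketch below uses a hypothetical `LedgerStateSketch` whose fields are `Arc`s. It is not the real ledger state type, and the field names are made up. Cloning such a struct only bumps reference counts, but every cached clone keeps the data it points to alive until the clone is dropped.

```rust
use std::sync::Arc;

/// Hypothetical stand-in for a ledger state built from shared pointers.
/// The real type in the ledger crate is assumed to behave similarly.
#[derive(Clone)]
struct LedgerStateSketch {
    // Large immutable data shared between clones.
    contracts: Arc<Vec<u8>>,
    utxos: Arc<Vec<u8>>,
}

/// Builds a state whose two components each own `size` bytes.
fn new_state(size: usize) -> LedgerStateSketch {
    LedgerStateSketch {
        contracts: Arc::new(vec![0u8; size]),
        utxos: Arc::new(vec![1u8; size]),
    }
}

/// Returns true when both states point at the same underlying allocations,
/// i.e. the clone copied pointers rather than data.
fn shares_storage(a: &LedgerStateSketch, b: &LedgerStateSketch) -> bool {
    Arc::ptr_eq(&a.contracts, &b.contracts) && Arc::ptr_eq(&a.utxos, &b.utxos)
}
```

Under this assumption the cost of a clone is small, and the open question is mostly about memory retention: old state data cannot be freed while a cache entry still references it.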

mpskowron · 2026-02-23T11:48:28Z

ledger/src/versions/common/mod.rs


 		// Dry-run apply to validate guaranteed execution against current state
 		let ctx = ledger.get_transaction_context(block_context.clone())?;
 		let (_next_state, result) = ledger.state.apply(&verified_tx, &ctx);


Previously, on a cache hit (even against an old state), we didn't rerun apply. Rerunning it might impact performance. Should we revert to the previous behavior or always run it? (We could also rerun it only on a strict cache hit, but strict hits should almost never happen here.)

mpskowron · 2026-02-23T11:51:02Z

pallets/midnight/src/tests.rs


+static NEXT_SPEC_VERSION: AtomicU32 = AtomicU32::new(1_000_000);
+
+fn unique_spec_version() -> u32 {


The cache is a lazy static and is therefore shared between tests. To give each test distinct cache entries, we force a different runtime version per test.
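
The diff hunk above is cut off after the function signature. One plausible way to complete it, shown here only as an assumed sketch and not as the actual body from the PR, is an atomic counter that hands out a fresh value on each call:

```rust
use std::sync::atomic::{AtomicU32, Ordering};

// Starting value taken from the diff; everything below the signature is an
// assumed completion of the truncated snippet, not the code from the PR.
static NEXT_SPEC_VERSION: AtomicU32 = AtomicU32::new(1_000_000);

/// Returns a spec version that no other caller in this process has received.
fn unique_spec_version() -> u32 {
    // fetch_add increments atomically and returns the previous value,
    // so tests running in parallel never observe the same number.
    NEXT_SPEC_VERSION.fetch_add(1, Ordering::Relaxed)
}
```

Because the runtime version is part of the cache key, each test that calls this gets its own slice of the shared cache.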

mpskowron · 2026-02-23T11:52:19Z

pallets/midnight/src/tests.rs

I couldn't figure out a better way to test the cache than reading Prometheus metrics. I'm open to other ideas.
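
For readers unfamiliar with this testing style, the sketch below shows the general pattern of asserting on counter deltas. It swaps the real Prometheus metrics for plain `AtomicU64` counters so it stays self-contained. The counter names, `CacheCounts`, `snapshot`, and `delta` are all hypothetical and only mirror the three metrics named in the PR description.

```rust
use std::sync::atomic::{AtomicU64, Ordering};

// Hypothetical stand-ins for the three cache counters (miss, strict hit,
// revalidation hit). The PR uses Prometheus metrics instead.
static CACHE_MISS: AtomicU64 = AtomicU64::new(0);
static CACHE_STRICT_HIT: AtomicU64 = AtomicU64::new(0);
static CACHE_REVALIDATION_HIT: AtomicU64 = AtomicU64::new(0);

/// Point-in-time reading of all three counters.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
struct CacheCounts {
    miss: u64,
    strict_hit: u64,
    revalidation_hit: u64,
}

/// Reads the current counter values.
fn snapshot() -> CacheCounts {
    CacheCounts {
        miss: CACHE_MISS.load(Ordering::Relaxed),
        strict_hit: CACHE_STRICT_HIT.load(Ordering::Relaxed),
        revalidation_hit: CACHE_REVALIDATION_HIT.load(Ordering::Relaxed),
    }
}

/// Difference between two snapshots; counters only ever increase.
fn delta(before: CacheCounts, after: CacheCounts) -> CacheCounts {
    CacheCounts {
        miss: after.miss - before.miss,
        strict_hit: after.strict_hit - before.strict_hit,
        revalidation_hit: after.revalidation_hit - before.revalidation_hit,
    }
}

/// Called by the code under test on a cache miss.
fn record_miss() {
    CACHE_MISS.fetch_add(1, Ordering::Relaxed);
}

/// Called by the code under test on a revalidation hit.
fn record_revalidation_hit() {
    CACHE_REVALIDATION_HIT.fetch_add(1, Ordering::Relaxed);
}
```

A test would take a snapshot, run the validation call, take a second snapshot, and assert on the delta. Since the counters are global, exact deltas only hold if no other test touches them at the same time.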

mpskowron · 2026-02-23T12:14:03Z

ledger/src/versions/common/mod.rs

-	///
-	/// Uses `tx_hash` only for quick revalidation of transactions already in the pool.
-	/// The soft cache prevents redundant ZK proof verification for mempool housekeeping.
+	fn revalidate_transaction(


@tkerber could you please double-check that we're introducing revalidation correctly in this PR? The intended flow is:

On cache miss: run full well_formed(), cache the VerifiedTransaction + current LedgerState keyed by (tx_hash, runtime_version)

On cache hit with a different state hash: run revalidate_transaction against (cached_state, new_state) instead of full verification — this covers all three entry points: mempool (validate_transaction), block proposal (those are all possible places, aren't they?)

On success: update the cache entry with the new VerifiedTransaction + new LedgerState, so subsequent state changes can revalidate incrementally again
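
The three steps above can be sketched as a small state machine. This is a simplified model under stated assumptions, not the implementation in this PR: state is reduced to a `u64` hash, the key is a `(tx_hash, runtime_version)` tuple of integers, `full_verify` stands in for `well_formed()`, and `revalidate` stands in for `revalidate_transaction` on `(cached_state, new_state)`. Evicting the entry when revalidation fails is also an assumption of this sketch.

```rust
use std::collections::HashMap;

/// Outcome of a lookup, mirroring the three cache metrics.
#[derive(Debug, PartialEq, Eq)]
enum CacheOutcome {
    Miss,
    StrictHit,
    RevalidationHit,
}

/// Hypothetical entry: the state hash the transaction was last validated against.
struct Entry {
    state_hash: u64,
}

/// Hypothetical key: (tx_hash, runtime_version), both reduced to integers.
type Key = (u64, u32);

struct ValidationCache {
    entries: HashMap<Key, Entry>,
}

impl ValidationCache {
    fn new() -> Self {
        Self { entries: HashMap::new() }
    }

    /// Validates a transaction against `state_hash`, doing as little work as possible.
    fn validate(
        &mut self,
        key: Key,
        state_hash: u64,
        full_verify: impl FnOnce() -> bool,
        revalidate: impl FnOnce(u64, u64) -> bool,
    ) -> Result<CacheOutcome, &'static str> {
        // Copy the cached state hash out so the map can be mutated below.
        let cached = self.entries.get(&key).map(|e| e.state_hash);
        match cached {
            // Miss: run full verification and cache the result with the current state.
            None => {
                if !full_verify() {
                    return Err("full verification failed");
                }
                self.entries.insert(key, Entry { state_hash });
                Ok(CacheOutcome::Miss)
            }
            // Strict hit: same state as last time, nothing to redo.
            Some(old) if old == state_hash => Ok(CacheOutcome::StrictHit),
            // State changed: revalidate against (cached_state, new_state).
            Some(old) => {
                if !revalidate(old, state_hash) {
                    // Assumption of this sketch: drop entries that fail revalidation.
                    self.entries.remove(&key);
                    return Err("revalidation failed");
                }
                // Store the new state so the next change revalidates incrementally.
                self.entries.insert(key, Entry { state_hash });
                Ok(CacheOutcome::RevalidationHit)
            }
        }
    }
}
```

In this model the expensive closure runs only on a miss, which is the property the PR is after. Whether the dry-run apply should additionally run on every path is the separate question raised earlier in the thread.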

mpskowron force-pushed the skowron/tx-revalidation branch from 747cee2 to a55f163 Compare February 23, 2026 11:32

mpskowron force-pushed the skowron/tx-revalidation branch from a55f163 to 0009021 Compare February 23, 2026 11:56

mpskowron marked this pull request as ready for review February 23, 2026 11:57

mpskowron requested a review from a team as a code owner February 23, 2026 11:57

feat: replace soft tx cache with revalidation-based validation cache #744
mpskowron wants to merge 1 commit into main from skowron/tx-revalidation