Sharing Model Updates with the Model Owner and Verifying Authenticity of Training
Objective: To securely share the final, aggregated model updates with the model owner while verifying that training was conducted on the correct dataset. Using Merkle Trees with Hash Logging enables data integrity verification by ensuring that training accessed only the committed dataset.
Process Overview
Merkle Tree Construction and Commitment: Each node constructs a Merkle Tree for its dataset and commits to the Merkle root as a cryptographic fingerprint of the data.
Hash Logging During Training: Nodes log the hashes of dataset chunks accessed during training. These logs ensure that the model trained on the correct dataset.
Verification of Hash Logs and Merkle Proofs: The model owner verifies the Merkle root, Merkle proofs, and hash logs to confirm that training was conducted on the committed dataset.
Last updated