gguf_hash.py: Add sha256 #8470

mofosyne · 2024-07-13T14:25:42Z

Found out that Python's hashlib, which this script already uses, provides sha256 in addition to the sha1 we already rely on for the sha1 and uuid hashes.

We have already implemented this in the C version (llama-gguf-hash), so we might as well include it in the Python version.
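
As a rough illustration of the idea above (not the script's actual code), the sketch below feeds the same byte chunks to SHA-1, SHA-256, and a SHA-1-based UUIDv5 in a single pass. The function name `hash_chunks` and the namespace value `EXAMPLE_NAMESPACE` are hypothetical and exist only for this example; only the standard-library `hashlib` and `uuid` modules are used.

```python
import hashlib
import uuid
from typing import Dict, Iterable

# Hypothetical namespace for illustration only; the real script defines its own.
EXAMPLE_NAMESPACE = uuid.uuid5(uuid.NAMESPACE_URL, "example.invalid/gguf-hash-sketch")


def hash_chunks(chunks: Iterable[bytes],
                namespace: uuid.UUID = EXAMPLE_NAMESPACE) -> Dict[str, str]:
    """Hash a stream of byte chunks with SHA-1, SHA-256, and a UUIDv5-style digest."""
    sha1 = hashlib.sha1()
    sha256 = hashlib.sha256()
    # UUIDv5 is SHA-1 over the namespace bytes followed by the name bytes,
    # so the UUID hasher is seeded with the namespace before any data.
    uuid_sha1 = hashlib.sha1(namespace.bytes)

    for chunk in chunks:
        sha1.update(chunk)
        sha256.update(chunk)
        uuid_sha1.update(chunk)

    return {
        "sha1": sha1.hexdigest(),
        "sha256": sha256.hexdigest(),
        # version=5 sets the version and variant bits on the first 16 digest bytes.
        "uuid": str(uuid.UUID(bytes=uuid_sha1.digest()[:16], version=5)),
    }
```

Because all three hashers consume the same chunks, adding sha256 costs one extra `update()` call per chunk and no second pass over the data.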

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

Cross Checking `gguf_hash.py` with the C implementation `llama-gguf-hash`

$ ~/llama.cpp/gguf-py/scripts/gguf_hash.py Maykeye_Tinyllama-5M-v0.2-F16.gguf --no-layer
sha1      a9de42f2bbeee1eba49bc39b25cf69ff7a0937f6  Maykeye_Tinyllama-5M-v0.2-F16.gguf
sha256    8b3e00226cc2a55398b1ffbda7af8464040f9cd7b22ccbef8ba60b227924a2b1  Maykeye_Tinyllama-5M-v0.2-F16.gguf
uuid      86b1ebff-d754-50fc-9245-d23fe329817c  Maykeye_Tinyllama-5M-v0.2-F16.gguf

$ ~/llama.cpp/llama-gguf-hash --all --no-layer --uuid Maykeye_Tinyllama-5M-v0.2-F16.gguf
xxh64     cbd383cfd4c897e6  Maykeye_Tinyllama-5M-v0.2-F16.gguf
sha1      a9de42f2bbeee1eba49bc39b25cf69ff7a0937f6  Maykeye_Tinyllama-5M-v0.2-F16.gguf
sha256    8b3e00226cc2a55398b1ffbda7af8464040f9cd7b22ccbef8ba60b227924a2b1  Maykeye_Tinyllama-5M-v0.2-F16.gguf
uuid      86b1ebff-d754-50fc-9245-d23fe329817c  Maykeye_Tinyllama-5M-v0.2-F16.gguf
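
The manual comparison above can also be automated. The following is a minimal sketch, assuming both tools print lines of the form `<hash type>  <digest>  <filename>` as shown in the two listings; the helper names `parse_hash_lines` and `cross_check` are hypothetical and not part of either tool.

```python
from typing import Dict


def parse_hash_lines(output: str) -> Dict[str, str]:
    """Map each hash type (e.g. 'sha256') to its digest from tool output."""
    digests: Dict[str, str] = {}
    for line in output.splitlines():
        parts = line.split()
        # Skip blank lines and echoed shell prompts such as "$ ./tool ...".
        if len(parts) < 3 or parts[0] == "$":
            continue
        digests[parts[0]] = parts[1]
    return digests


def cross_check(python_output: str, c_output: str) -> Dict[str, bool]:
    """Compare digests for every hash type reported by both tools."""
    py_digests = parse_hash_lines(python_output)
    c_digests = parse_hash_lines(c_output)
    shared = py_digests.keys() & c_digests.keys()
    return {name: py_digests[name] == c_digests[name] for name in sorted(shared)}
```

Hash types reported by only one tool (here `xxh64`, which only the C tool prints) are ignored rather than treated as mismatches.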

(Side thought: would it be worth having a general.sha256 key so that gguf files can validate themselves? It sounds interesting, but it likely only makes sense in the context of cryptographically signing tensor data, which would require more thought on standardizing how tensor weights are authenticated and verified as not having been tampered with. In my opinion, llama.cpp is a bit too new to do something like that.)

gguf-py/scripts/gguf_hash.py

Co-authored-by: compilade <[email protected]>

* gguf_hash.py: Add sha256
* gguf_hash.py: rename string UUIDv5 --> uuid
* Apply suggestions from code review

Co-authored-by: compilade <[email protected]>

---------

Co-authored-by: compilade <[email protected]>

github-actions bot added the python label (python script changes) Jul 13, 2024

gguf_hash.py: Add sha256

77b0ca4

mofosyne force-pushed the gguf-hash-py-add-sha256 branch from 7265315 to 77b0ca4 July 13, 2024 14:30

gguf_hash.py: rename string UUIDv5 --> uuid

f2d6eb0

mofosyne added the "Review Complexity : Low" label (trivial changes to code that most beginner devs, or those who want a break, can tackle, e.g. a UI fix) Jul 13, 2024

compilade approved these changes Jul 13, 2024

Apply suggestions from code review

6cd47fd

Co-authored-by: compilade <[email protected]>

mofosyne added the "merge ready" label (indicates that this may be ready to merge soon and is just holding out in case of objections) Jul 14, 2024

mofosyne merged commit e236528 into ggml-org:master Jul 14, 2024
8 checks passed

mofosyne deleted the gguf-hash-py-add-sha256 branch July 14, 2024 06:47
