[P] Versioning code & large models together in GitHub

semicausal@alien.top · 10 months ago

Good questions:

- DVC: unlike DVC, there are no new commands to learn (we extend Git), and you don’t need S3.

- Git LFS: unlike Git LFS, we inject useful views of your large files into GitHub itself, in commits and PRs (e.g. check this model diff: https://youtu.be/lAyymscJUvI?t=87). We also scale to much larger sizes (100 terabytes) and deduplicate better: Git LFS treats a one-line change to a large CSV file as an entirely new file, whereas our technique captures the differences.
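
To make the deduplication point concrete, here is a rough sketch of whole-file versus chunk-level addressing. This is not our actual implementation, just a generic illustration; `CHUNK_LINES`, `chunk_keys`, `new_chunks`, and `make_csv` are hypothetical names made up for the example. With whole-file hashing, a one-line edit produces a brand-new object, while with chunk hashing only the chunk containing the edit is new.

```python
import hashlib

CHUNK_LINES = 1000  # hypothetical chunk size, chosen only for illustration


def whole_file_key(data: bytes) -> str:
    """Whole-file addressing: any change at all yields a new object key."""
    return hashlib.sha256(data).hexdigest()


def chunk_keys(data: bytes, chunk_lines: int = CHUNK_LINES) -> list[str]:
    """Chunk-level addressing: hash fixed groups of lines independently."""
    lines = data.splitlines(keepends=True)
    keys = []
    for start in range(0, len(lines), chunk_lines):
        chunk = b"".join(lines[start:start + chunk_lines])
        keys.append(hashlib.sha256(chunk).hexdigest())
    return keys


def new_chunks(old: bytes, new: bytes) -> int:
    """Count chunks of `new` that are not already stored for `old`."""
    stored = set(chunk_keys(old))
    return sum(1 for key in chunk_keys(new) if key not in stored)


def make_csv(rows: int) -> bytes:
    """Build a synthetic two-column CSV for the demonstration."""
    return "".join(f"{i},{i * 2}\n" for i in range(rows)).encode()


v1 = make_csv(10_000)
v2 = v1.replace(b"5000,10000\n", b"5000,-1\n")  # change a single line

print("whole-file key changed:", whole_file_key(v1) != whole_file_key(v2))
print("chunks in file:", len(chunk_keys(v1)))
print("chunks that must be stored again:", new_chunks(v1, v2))
```

Fixed line groups only handle in-place edits well; inserting or deleting a row shifts every later chunk. Real systems usually deal with that through content-defined chunking, which this sketch leaves out.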
