mirror of https://github.com/gilbertchen/duplicacy synced 2025-12-11 05:43:23 +00:00

Duplicacy: A new generation cloud backup tool based on Lock-Free Deduplication

Duplicacy works with major cloud storage providers and at the same time offers all essential features of a modern backup tool:

  • Incremental backup: only back up what has been changed
  • Full snapshot: although each backup is incremental, it must appear as a full snapshot independent of the others
  • Deduplication: identical files must be stored as one copy (file-level deduplication), and identical parts from different files must be stored as one copy (block-level deduplication)
  • Encryption: encrypt not only file contents but also file paths, sizes, times, etc.
  • Deletion: every backup can be deleted independently without affecting others
  • Concurrency: multiple clients can back up to the same storage at the same time
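The deduplication requirement above can be illustrated with a small sketch. It assumes SHA-256 as the hash and uses a hypothetical in-memory `ChunkStore` in place of real cloud storage; neither the type nor its methods come from Duplicacy itself.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
)

// ChunkStore is a hypothetical in-memory stand-in for a storage backend.
// Keys are hex-encoded SHA-256 digests of the stored content.
type ChunkStore struct {
	chunks map[string][]byte
}

// NewChunkStore returns an empty store.
func NewChunkStore() *ChunkStore {
	return &ChunkStore{chunks: make(map[string][]byte)}
}

// Put stores data under a name derived from its hash and reports whether
// a new copy was actually written. Identical content maps to the same
// name, so it is stored only once.
func (s *ChunkStore) Put(data []byte) (name string, written bool) {
	sum := sha256.Sum256(data)
	name = hex.EncodeToString(sum[:])
	if _, exists := s.chunks[name]; exists {
		return name, false
	}
	s.chunks[name] = append([]byte(nil), data...) // keep a private copy
	return name, true
}

// Len returns the number of distinct pieces of content in the store.
func (s *ChunkStore) Len() int { return len(s.chunks) }

func main() {
	store := NewChunkStore()
	n1, w1 := store.Put([]byte("same content"))
	n2, w2 := store.Put([]byte("same content"))
	_, w3 := store.Put([]byte("different content"))
	// prints: true true false true 2
	fmt.Println(n1 == n2, w1, w2, w3, store.Len())
}
```

The second `Put` of identical content writes nothing, which is the behavior the deduplication bullet asks for, whether the stored unit is a whole file or a block.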

The key idea behind Duplicacy is a technique called Lock-Free Deduplication. There are three elements of lock-free deduplication:

  • Use a variable-size chunking algorithm to split files into chunks
  • Store each chunk in the storage using a file name derived from its hash, and rely on the file system API to manage chunks without using a centralized indexing database
  • Apply a two-step fossil collection algorithm to remove chunks that become unreferenced after a backup is deleted
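The first two elements can be sketched as follows. This is a minimal illustration, not Duplicacy's actual chunker: the polynomial rolling hash, the constants (`window`, `minSize`, `maxSize`, `mask`, `base`), and the helper names `Split`, `ChunkName`, and `sampleData` are all assumptions chosen for the example. Chunk boundaries are decided by the content of a sliding window rather than by fixed offsets, so an edit near the start of a file tends to leave later chunks, and therefore their hash-derived names, unchanged.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
)

// Illustrative constants only; a real tool would use much larger chunks.
const (
	window  = 16       // bytes covered by the rolling hash
	minSize = 64       // no boundary before this many bytes
	maxSize = 1024     // force a boundary at this size
	mask    = 1<<7 - 1 // boundary when the low 7 bits of the hash are zero
	base    = 257      // multiplier of the polynomial hash
)

// Split cuts data into variable-size chunks. A boundary is placed where
// the rolling hash of the last `window` bytes matches the mask, subject
// to the minimum and maximum chunk sizes.
func Split(data []byte) [][]byte {
	// pow = base^(window-1): weight of the byte that leaves the window.
	var pow uint32 = 1
	for i := 0; i < window-1; i++ {
		pow *= base
	}
	var chunks [][]byte
	start := 0
	var h uint32
	for i, b := range data {
		n := i - start // bytes of the current chunk already hashed
		if n >= window {
			h -= uint32(data[i-window]) * pow // drop the oldest byte
		}
		h = h*base + uint32(b) // add the newest byte
		size := n + 1
		if (size >= minSize && h&mask == 0) || size == maxSize {
			chunks = append(chunks, data[start:i+1])
			start = i + 1
			h = 0
		}
	}
	if start < len(data) {
		chunks = append(chunks, data[start:])
	}
	return chunks
}

// ChunkName derives a storage file name from the chunk's SHA-256 hash.
func ChunkName(chunk []byte) string {
	sum := sha256.Sum256(chunk)
	return hex.EncodeToString(sum[:])
}

// sampleData produces deterministic pseudo-random bytes for the demo.
func sampleData(n int) []byte {
	out := make([]byte, n)
	x := uint32(1)
	for i := range out {
		x = x*1664525 + 1013904223
		out[i] = byte(x >> 24)
	}
	return out
}

func main() {
	original := sampleData(8192)
	edited := append([]byte{0xFF}, original...) // insert one byte at the front

	stored := make(map[string]bool)
	for _, c := range Split(original) {
		stored[ChunkName(c)] = true
	}
	editedChunks := Split(edited)
	shared := 0
	for _, c := range editedChunks {
		if stored[ChunkName(c)] {
			shared++
		}
	}
	fmt.Printf("%d of %d chunks in the edited input already exist in storage\n",
		shared, len(editedChunks))
}
```

Because a chunk's name depends only on its content, a client can decide whether a chunk needs uploading by checking whether a file with that name exists, with no shared index to lock or update.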

Getting Started
