Updated Erasure Coding for warm storage (markdown)

Chris Lu 2019-06-11 00:43:36 -07:00
parent 713221908f
commit f2c84f15aa

@ -15,7 +15,7 @@ However, data can become warm or cold after a period of time. They are accessed
## Architecture
SeaweedFS implemented 10.4 Reed-Soloman Erasure Coding (EC). The large volumes are split into chunks of 1GB, and every 10 data chunks are also encoded into 4 parity chunks. So a 30 GB data volume will be encoded into 14 EC shards, each shard is of size 3 GB and has 3 EC blocks.
Since the data is split into 1GB chunks, usually one small file is contained in shard, or possibly two shards in edge cases. So most reads still only cost O(1) disk read.
Since the data is split into 1GB chunks, usually one small file is contained in one shard, or possibly two shards in edge cases. So most reads still only cost O(1) disk read.
For smaller volumes less than 10GB, and for edge cases, the volume is split into smaller 1MB chunks.