Updated Remote Storage Architecture (markdown)

Chris Lu 2021-08-10 01:19:24 -07:00
parent ff6e2961c3
commit fd3129e75b

@ -67,6 +67,7 @@ The asynchronous write back will not slow down any local operations.
* Machine learning training jobs need to repeatedly visit a large set of files. Increase training speed and reduce API cost and network cost.
* Saving data files. With cloud capacity and storage tiering, saving data files there may be a good idea. The cache can save the programming effort.
* Run Spark/Flink jobs on mounted folders for faster computation.
* Multiple access methods, HDFS/HTTP/S3/WebDav/Mount, to access remote storage. No need to use one specific way to access remote storage.
* If you plan to move off cloud, you can start with SeaweedFS Remote Storage Cache. When you are happy with it, just stop the write back process (and cancel the monthly payment to the cloud vendor!).