From fd3129e75b492d521179591bbbf3a9446c6f7a9d Mon Sep 17 00:00:00 2001 From: Chris Lu Date: Tue, 10 Aug 2021 01:19:24 -0700 Subject: [PATCH] Updated Remote Storage Architecture (markdown) --- Remote-Storage-Architecture.md | 1 + 1 file changed, 1 insertion(+) diff --git a/Remote-Storage-Architecture.md b/Remote-Storage-Architecture.md index 45a1c23..abddf85 100644 --- a/Remote-Storage-Architecture.md +++ b/Remote-Storage-Architecture.md @@ -67,6 +67,7 @@ The asynchronous write back will not slow down any local operations. * Machine learning training jobs need to repeatedly visit a large set of files. Increase training speed and reduce API cost and network cost. * Saving data files. With cloud capacity and storage tiering, saving data files there may be a good idea. The cache can save the programming effort. +* Run Spark/Flink jobs on mounted folders for faster computation. * Multiple access methods, HDFS/HTTP/S3/WebDav/Mount, to access remote storage. No need to use one specific way to access remote storage. * If you plan to move off cloud, you can start with SeaweedFS Remote Storage Cache. When you are happy with it, just stop the write back process (and cancel the monthly payment to the cloud vendor!).