mirror of
https://github.com/seaweedfs/seaweedfs.git
synced 2024-01-19 02:48:24 +00:00
Created Cloud Cache Quick Setup (markdown)
parent
3654d84de6
commit
99f2775b9b
167
Cloud-Cache-Quick-Setup.md
Normal file
167
Cloud-Cache-Quick-Setup.md
Normal file
|
@ -0,0 +1,167 @@
|
|||
To users not familiar with SeaweedFS, there seems many things to learn.
|
||||
But for SeaweedFS Cloud Cache, the setup is easy.
|
||||
|
||||
# Setup a simple SeaweedFS cluster
|
||||
|
||||
To act as a cache, the high availability requirement is not that hard. You can start with a simple SeaweedFS cluster.
|
||||
|
||||
Since very likely you may want to use S3, the following will have S3 setup.
|
||||
|
||||
Just run this to have a SeaweedFS cluster:
|
||||
```
|
||||
$ weed server -s3
|
||||
```
|
||||
|
||||
## Setup S3 credentials
|
||||
Start a `weed shell`
|
||||
```
|
||||
$ weed shell
|
||||
master: localhost:9333 filer: localhost:8888
|
||||
> s3.configure -h
|
||||
Usage of s3.configure:
|
||||
-access_key string
|
||||
specify the access key
|
||||
-actions string
|
||||
comma separated actions names: Read,Write,List,Tagging,Admin
|
||||
-apply
|
||||
update and apply s3 configuration
|
||||
-buckets string
|
||||
bucket name
|
||||
-delete
|
||||
delete users, actions or access keys
|
||||
-secret_key string
|
||||
specify the secret key
|
||||
-user string
|
||||
user name
|
||||
> s3.configure -user me -access_key=any -secret_key=any -buckets=bucket1 -actions=Read,Write,List,Tagging,Admin
|
||||
{
|
||||
"identities": [
|
||||
{
|
||||
"name": "me",
|
||||
"credentials": [
|
||||
{
|
||||
"accessKey": "any",
|
||||
"secretKey": "any"
|
||||
}
|
||||
],
|
||||
"actions": [
|
||||
"Read:bucket1",
|
||||
"Write:bucket1",
|
||||
"List:bucket1",
|
||||
"Tagging:bucket1",
|
||||
"Admin:bucket1"
|
||||
]
|
||||
}
|
||||
]
|
||||
}
|
||||
```
|
||||
|
||||
# Configure Remote Storage
|
||||
|
||||
This step will configure a remote storage and how to access it.
|
||||
|
||||
For this particular demo, the following command created a remote storage named "s5", which actually uses the credential we just created locally. So this remote storage is actually just a loop back to another local s3 account.
|
||||
|
||||
In `weed shell`:
|
||||
```
|
||||
> remote.configure -h
|
||||
Usage of remote.configure:
|
||||
-delete
|
||||
delete one remote storage by its name
|
||||
-name string
|
||||
a short name to identify the remote storage
|
||||
-s3.access_key string
|
||||
s3 access key
|
||||
-s3.endpoint string
|
||||
endpoint for s3-compatible local object store
|
||||
-s3.region string
|
||||
s3 region (default "us-east-2")
|
||||
-s3.secret_key string
|
||||
s3 secret key
|
||||
-type string
|
||||
storage type, currently only support s3 (default "s3")
|
||||
|
||||
> remote.configure -name=s5 -type=s3 -s3.access_key=any -s3.secret_key=any -s3.endpoint=http://localhost:8333
|
||||
|
||||
> remote.configure
|
||||
{
|
||||
"type": "s3",
|
||||
"name": "s5",
|
||||
"s3AccessKey": "any",
|
||||
"s3Region": "us-east-2",
|
||||
"s3Endpoint": "http://localhost:8333"
|
||||
}
|
||||
|
||||
```
|
||||
|
||||
# Mount Remote Storage
|
||||
|
||||
The remote storage can be mounted to any directory. Here is an example:
|
||||
```
|
||||
> remote.mount -dir=/buckets/b2 -remote=s5/bucket1 -nonempty
|
||||
|
||||
```
|
||||
|
||||
# Test the setup
|
||||
|
||||
In the example, the remote source folders are empty.
|
||||
In reality, your remote folder should have some files already.
|
||||
|
||||
Right now you can already try to read or write to folder `/buckets/b2`.
|
||||
|
||||
# Setup write back
|
||||
|
||||
This step is only needed if you want local changes go back to the remote storage.
|
||||
|
||||
For this example, just start one process as this:
|
||||
```
|
||||
$ weed filer.remote.sync -dir=/buckets/b2
|
||||
```
|
||||
|
||||
This command will continuously write back changes of this mounted directory to the cloud storage.
|
||||
|
||||
This command is designed to run as a background process. It can be paused by `ctl+c`. It can also try to re-connect to filer if disconnected.
|
||||
|
||||
# Setup cache and uncache process
|
||||
|
||||
Since only metadata are pulled and there are no file content cache, reading remote files are somewhat slow.
|
||||
|
||||
You may want to cache a group of files, to make sure the first read is always fast.
|
||||
|
||||
You may want to uncache a group of files, to save some local storage.
|
||||
|
||||
These cache or uncache jobs can vary wildly. Here are some examples:
|
||||
|
||||
```
|
||||
# cache a whole folder
|
||||
> remote.cache -dir=/buckets/b2/a/b/c
|
||||
# cache all parquet files
|
||||
> remote.cache -dir=/buckets/b2 -include=*.parquet
|
||||
# cache file size between 1024 and 10240 bytes inclusively
|
||||
> remote.cache -dir=/buckets/b2 -minSize=1024 -maxSize=10240
|
||||
|
||||
# uncache file size older than 3600 seconds
|
||||
> remote.uncache -dir=/buckets/b2 -maxAge=3600
|
||||
# uncache file size more than 10240 bytes
|
||||
> remote.cache -dir=/buckets/b2 -minSize=10240
|
||||
|
||||
```
|
||||
|
||||
These jobs can be setup as scheduled cron jobs also.
|
||||
|
||||
# Detect Cloud Data Updates
|
||||
|
||||
If the cloud storage has other processes writing to it, the mounted folder needs to know the new files.
|
||||
|
||||
Very likely you want to setup cron jobs to run `remote.meta.sync` regularly.
|
||||
|
||||
```
|
||||
> remote.meta.sync -h
|
||||
Usage of remote.meta.sync:
|
||||
-dir string
|
||||
a directory in filer
|
||||
> remote.meta.sync -dir=/buckets/b2
|
||||
|
||||
```
|
||||
|
||||
|
Loading…
Reference in a new issue