Installation

Cube Builder depends mainly on:

Python Client Library for STAC (stac.py)
Flask
Celery
rasterio
NumPy
scikit-image
RabbitMQ
marshmallow-SQLAlchemy
Brazil Data Cube Catalog Module

Development Installation

Clone the software repository:

$ git clone https://github.com/brazil-data-cube/cube-builder.git

Go to the source code folder:

$ cd cube-builder

Install in development mode:

$ pip3 install -e .[all]

Note

If you have problems with the librabbitmq installation, see [1].
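After the installation finishes, one quick way to confirm that the main Python dependencies can be imported is to probe them with the standard library. The snippet below is a minimal sketch and not part of Cube Builder: `find_missing_modules` is a hypothetical helper, and the import names (`flask`, `celery`, `rasterio`, `numpy`, `skimage`) are assumed to be the usual module names of the packages listed above.

```python
from importlib.util import find_spec


def find_missing_modules(module_names):
    """Return the names in module_names that cannot be found for import."""
    return [name for name in module_names if find_spec(name) is None]


if __name__ == "__main__":
    # Assumed import names for some of the dependencies listed above.
    expected = ["flask", "celery", "rasterio", "numpy", "skimage"]
    missing = find_missing_modules(expected)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All expected modules are importable.")
```

An empty result only shows that the modules are discoverable; it does not check versions or native libraries.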

Running in Development Mode

Launch the RabbitMQ Container

You need a running RabbitMQ instance in order to launch the cube-builder Celery workers.

The Cube Builder repository includes a docker-compose configuration for a RabbitMQ container. Follow the steps below:

docker-compose up -d mq

After running that command, check which host ports were bound to the container:

$ docker container ls

CONTAINER ID   IMAGE                  COMMAND                  CREATED         STATUS         PORTS                    NAMES
a3bb86d2df56   rabbitmq:3-management  "docker-entrypoint.s…"   3 minutes ago   Up 3 minutes   4369/tcp, 5671/tcp, 0.0.0.0:5672->5672/tcp, 15671/tcp, 25672/tcp, 0.0.0.0:15672->15672/tcp   cube-builder-rabbitmq

Note

In the output above, the RabbitMQ service is bound to port 5672 for client connections and to port 15672 for the RabbitMQ management interface, which you can open at http://127.0.0.1:15672. The default credentials are guest (user) and guest (password).
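If you prefer to confirm the port bindings from a script rather than by reading the container listing, a plain TCP connection attempt is enough. The following is a minimal sketch using only the standard library; `port_is_open` is a hypothetical helper, and the host and ports are the ones from the example output above, so adjust them if your bindings differ.

```python
import socket


def port_is_open(host, port, timeout=1.0):
    """Return True if a TCP connection to (host, port) succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False


if __name__ == "__main__":
    # Ports taken from the `docker container ls` output above.
    for name, port in [("AMQP", 5672), ("management UI", 15672)]:
        state = "open" if port_is_open("127.0.0.1", port) else "closed"
        print(f"{name} port {port}: {state}")
```

An open port only shows that something is listening; it does not verify that the listener is RabbitMQ or that the credentials work.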

Prepare the Database System

Cube Builder uses BDC-DB as the database definition for storing data cube metadata.

Note

If you already have a database instance with the Brazil Data Cube data model, you can skip this section.

To prepare a database with the Brazil Data Cube data model, clone the BDC-DB repository and run its migrations:

git clone https://github.com/brazil-data-cube/bdc-db.git /tmp/bdc-db
(
    cd /tmp/bdc-db
    SQLALCHEMY_DATABASE_URI="postgresql://postgres:password@host:port/bdc" \
    bdc-db db create-db
    SQLALCHEMY_DATABASE_URI="postgresql://postgres:password@host:port/bdc" \
    bdc-db db upgrade
)

After that, apply the Cube Builder migrations with the following commands:

SQLALCHEMY_DATABASE_URI="postgresql://postgres:password@host:port/bdc" \
cube-builder db create-db # Create database and schema

SQLALCHEMY_DATABASE_URI="postgresql://postgres:password@host:port/bdc" \
cube-builder db upgrade # Apply migrations

# Load default functions for cube-builder
SQLALCHEMY_DATABASE_URI="postgresql://postgres:password@host:port/bdc" \
cube-builder load-data
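The commands above all take the same SQLALCHEMY_DATABASE_URI, where `password`, `host` and `port` are placeholders. When the password contains characters such as `@`, `:` or `/`, it has to be percent-encoded before it goes into the URI. The following is a minimal sketch of assembling the value; `build_database_uri` is a hypothetical helper, not part of Cube Builder or BDC-DB, and the values in the example are placeholders.

```python
from urllib.parse import quote


def build_database_uri(user, password, host, port, database):
    """Build a PostgreSQL URI, percent-encoding the user and password."""
    return (
        f"postgresql://{quote(user, safe='')}:{quote(password, safe='')}"
        f"@{host}:{port}/{database}"
    )


if __name__ == "__main__":
    # Placeholder values; replace with those of your PostgreSQL instance.
    print(build_database_uri("postgres", "password", "localhost", 5432, "bdc"))
```

The printed string can then be exported once in the shell instead of being repeated before every command.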

Launch the Cube Builder service

In the source code folder, enter the following command:

$ FLASK_ENV="development" \
  DATA_DIR="/data" \
  SQLALCHEMY_DATABASE_URI="postgresql://postgres:password@host:port/bdc" \
  cube-builder run

You may need to replace the definition of some environment variables:

FLASK_ENV="development": tells Flask to run in debug mode.
DATA_DIR="/data": sets the path where data cubes are stored.
SQLALCHEMY_DATABASE_URI="postgresql://postgres:password@host:port/bdc": sets the connection URI of the PostgreSQL database.
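Since the service depends on these variables, it can help to check them before launching. The following is a minimal sketch of such a pre-flight check; `missing_variables` is a hypothetical helper, and treating DATA_DIR and SQLALCHEMY_DATABASE_URI as mandatory is an assumption of this sketch, since Cube Builder may fall back to defaults of its own.

```python
import os

# Variables used by the `cube-builder run` command shown above.
# Treating both as mandatory is an assumption of this sketch.
REQUIRED_VARIABLES = ("DATA_DIR", "SQLALCHEMY_DATABASE_URI")


def missing_variables(environ, required=REQUIRED_VARIABLES):
    """Return the required variables that are unset or empty in environ."""
    return [name for name in required if not environ.get(name)]


if __name__ == "__main__":
    missing = missing_variables(os.environ)
    if missing:
        print("Set these variables before starting the service:", ", ".join(missing))
    else:
        print("Environment looks complete.")
```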

The above command should print messages in the console similar to those shown below:

* Environment: development
* Debug mode: on
* Running on http://127.0.0.1:5000/ (Press CTRL+C to quit)
* Restarting with stat
* Debugger is active!
* Debugger PIN: 319-592-254

Launch the Cube Builder worker

Enter the following command to start the Cube Builder worker:

DATA_DIR="/data" \
SQLALCHEMY_DATABASE_URI="postgresql://postgres:password@host:port/bdc" \
cube-builder worker -l INFO --concurrency 8

You may need to replace the definition of some parameters:

-l INFO: defines the logging level. You may choose between DEBUG, INFO, WARNING, ERROR, CRITICAL, or FATAL.

--concurrency 8: defines the number of concurrent processes used to generate data cubes. The default is the number of CPUs available on your system.

Note

The cube-builder worker command is a helper that wraps the celery command line with cube_builder as its context, so all celery worker parameters are supported. See the Celery Workers Guide for details.

Warning

Cube Builder can use a lot of memory for each concurrent process, since it opens multiple images in memory. Limit the number of concurrent processes to keep memory usage under control.
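One way to pick a value for --concurrency is to bound it by both the CPU count and the memory you are willing to give the workers. The following is a minimal sketch of that arithmetic; `suggest_concurrency` is a hypothetical helper, and the memory needed per process is a figure you have to measure for your own data, since this document does not state one.

```python
import os


def suggest_concurrency(available_memory_gb, memory_per_process_gb, cpu_count=None):
    """Suggest a worker count bounded by both memory and CPU count."""
    if memory_per_process_gb <= 0:
        raise ValueError("memory_per_process_gb must be positive")
    cpus = cpu_count or os.cpu_count() or 1
    by_memory = int(available_memory_gb // memory_per_process_gb)
    # Always run at least one process, never more than the CPU count.
    return max(1, min(cpus, by_memory))


if __name__ == "__main__":
    # Hypothetical figures: 16 GB available, about 4 GB per process, 8 CPUs.
    print(suggest_concurrency(16, 4, cpu_count=8))  # prints 4
```

With these hypothetical figures, memory is the limiting factor, so the worker would be started with --concurrency 4 rather than 8.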

Using the Cube Builder

Refer to USING.rst for more information on how to use Cube Builder.

Footnotes

[1]

If the librabbitmq installation fails with a build message such as the one shown below:

...
Running setup.py install for SQLAlchemy-Utils ... done
Running setup.py install for bdc-db ... done
Running setup.py install for librabbitmq ... error
ERROR: Command errored out with exit status 1:
 command: /home/gribeiro/Devel/github/brazil-data-cube/bdc-collection-builder/venv/bin/python3.7 -u -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'/tmp/pip-install-1i7mp5js/librabbitmq/setup.py'"'"'; __file__='"'"'/tmp/pip-install-1i7mp5js/librabbitmq/setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(__file__);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, __file__, '"'"'exec'"'"'))' install --record /tmp/pip-record-m9lm5kjn/install-record.txt --single-version-externally-managed --compile --install-headers /home/gribeiro/Devel/github/brazil-data-cube/bdc-collection-builder/venv/include/site/python3.7/librabbitmq
     cwd: /tmp/pip-install-1i7mp5js/librabbitmq/
Complete output (107 lines):
/tmp/pip-install-1i7mp5js/librabbitmq/setup.py:167: DeprecationWarning: 'U' mode is deprecated
  long_description = open(os.path.join(BASE_PATH, 'README.rst'), 'U').read()
running build
- pull submodule rabbitmq-c...
Cloning into 'rabbitmq-c'...
Note: checking out 'caad0ef1533783729c7644a226c989c79b4c497b'.

You are in 'detached HEAD' state. You can look around, make experimental
changes and commit them, and you can discard any commits you make in this
state without impacting any branches by performing another checkout.

If you want to create a new branch to retain commits you create, you may
do so (now or later) by using -b with the checkout command again. Example:

  git checkout -b <new-branch-name>

- autoreconf
sh: 1: autoreconf: not found
- configure rabbitmq-c...
/bin/sh: 0: Can't open configure

You will need to install autoconf:

$ sudo apt install autoconf
