This repository serves as the canonical environment for demonstrating integration of the PASS Ember application with its dependant services. This repository provides two things:
- Docker images that are the basis for the production deployment of PASS, pushed to the
pass
organization in Docker Hub - Provides a
docker-compose
orchestration that configures and launches PASS for developers
These instructions are for starting PASS with docker-compose
. If you have Docker already installed and want to start up the demo ASAP, jump to starting Docker.
- Create a "hosts" entry (
lmhosts
for windows,/etc/hosts
for *nix) that aliases the hostnamepass
to your loopback address (127.0.0.1
) or to your docker-machine address (e.g.192.168.99.100
) - A working Docker installation: Docker for Mac, Docker for Windows, Docker Linux, or Docker Machine
- Checkout (i.e. clone) this repository:
git clone https://github.com/OA-PASS/pass-demo-docker
cd
intopass-demo-docker
Docker Machine users should remember to set the appropriate environment variables in order to select an active machine (e.g.
eval $(docker-machine env default)
), and insure the selected machine is running (e.g.docker-machine ls
,docker-machine start default
)
Configuring the Docker images allows you to:
- change default ports used by the Docker containers
- change the default GitHub repository urls and branches used to build the code included in the images
To configure the Docker images, open up the .env
file and make any necessary changes. A brief description of the variables are below:
- PY_GGI_PORT: an empty HTTP
POST
to this port triggers the deposit of a dummy submission package to an FTP server - FTP_SUBMISSION_DEBUG_PORT: Used for Java debugging
- FTP_SERVER_PORT: The FTP command port, typically port 21
- The FTP submission code base will be downloaded and built from
SUBMISSION_GIT_REPO
, using the branch or tag defined inSUBMISSION_GIT_BRANCH
- EMBER_PORT: the Ember HTTP application is served on this port
- The Ember code base will be downloaded and built from
EMBER_GIT_REPO
, using the branch or tag defined inEMBER_GIT_BRANCH
- PASS_FEDORA_PATH: the URL path that the Fedora REST API is served from (e.g.
fcrepo/rest
) - Pick one of:
- PASS_FEDORA_HOST: the base HTTP url of Fedora (e.g.
http://localhost:8080
), and mutually exclusive withPASS_FEDORA_PORT
.- If
PASS_FEDORA_HOST
is present,PASS_FEDORA_HOST
andPASS_FEDORA_PATH
are combined to form the publicly accessible Fedora REST API URL. - This allows for the Fedora repository to be running on a host distinct from the Ember application
- If
- PASS_FEDORA_PORT: the port Fedora is publicly accessible from (N.B. this is the variable used by this demo)
- If
PASS_FEDORA_PORT
is present, the publicly accessible Fedora REST API URL is formed by the host present in the URL bar of the browser (i.e. the URL used to access the Ember application), the port fromPASS_FEDORA_PORT
, and the path fromPASS_FEDORA_PATH
. - This allows for the Fedora repository to be accessible when a host name for Fedora is not known a priori (e.g. when deploying to a cloud service that dynamically supplies IPs or hostnames)
- A limitation of this approach is that Fedora must be publicly accessible from the same host as the Ember application
- If
- PASS_FEDORA_HOST: the base HTTP url of Fedora (e.g.
- FCREPO_PORT: the port Fedora runs on (may differ from the port in
PASS_FEDORA_PORT
orPASS_FEDORA_HOST
when the infrastructure is behind a proxy) - FCREPO_JMS_PORT: used by Fedora JMS messaging. This is only necessary when using the internal activeMQ broker
- FCREPO_STOMP_PORT: used by Fedora JMS messaging. This is only necessary when using the internal activeMQ broker
- FCREPO_LOG_LEVEL: sets the log level of the Fedora repository
- FCREPO_TOMCAT_REQUEST_DUMPER_ENABLED: if set to
true
, instructs Tomcat to dump the headers for each request/response - FCREPO_TOMCAT_AUTH_LOGGING_ENABLED: if set to
true
, instructs Tomcat to log additional information regarding authentication and authorization - FCREPO_JMS_CONFIGURATION: Defines the JMS broker configuration for Fedora. A value of
classpath:/pass-jms-external.xml
specifies an external activemq.
- DSPACE_HOST: the host name DSpace will use when generating HTTP responses, defaults to
localhost
(docker-machine
users must set this to the IP address of their docker-machine) - DSPACE_PORT: the port that DSpace and its applications are exposed on, defaults to port
8181
- POSTGRES_DB_PORT: the port that the Postgres database is exposed on, defaults to
6543
- ES_PORT: the port that Elasticsearch is exposed on, defaults to
9200
See pass-indexer for more info.
- PI_FEDORA_USER: user to do basic auth with when retrieving Fedora resources
- PI_FEDORA_PASS: password for basic auth when retrieving Fedora resources
- PI_FEDORA_INTERNAL_BASE: Internal URI for Fedora. Used to test if it is up.
- PI_ES_BASE: URL to base of Elasticsearch. Used to test if it is up.
- PI_ES_INDEX: URL to Elasticsearch index where Fedora resources will be indexed.
- PI_ES_CONFIG: URL to Elasticsearch index where Fedora resources will be indexed.
- PI_FEDORA_JMS_BROKER: location of Fedora JMS broker.
- PI_FEDORA_JMS_QUEUE: name of Fedora JMS queue
- PI_TYPE_PREFIX: prefix of Fedora resource type which indicates resource should be indexed
- PI_LOG_LEVEL: log level of pass-indexer
- PI_MAX_ATTEMPTS: set the max attempts to contact Elasticsearch and Fedora (default: 20)
ACTIVEMQ_BROKER_URI
URI for clients to connect to the broker, e.g.failover:tcp://activemq:61616
ACTIVEMQ_JMS_PORT
Openwire wire protocol (ActiveMQ native) port e.g.61616
ACTIVEMQ_STOMP_PORT
= STOMP wire protocol (text-based) port e.g.61613
ACTIVEMQ_WEBCONSOLE_PORT
= HTTP port for the ActiveMQ web console (admin/admin) e.g.8161
Building the Docker Images (optional)
If the images deployed to Docker Hub are up-to-date, then you do not need to build the images.
- Check out this repository
cd
intopass-demo-docker
- Peek at
.env
and change any values you wish; environment variables are documented above - Run
docker-compose build
- Lots of things should fly across, including the Maven build of the FTP submission code
- If the build executes quickly, and you see references to cached image layers, then that means that Docker believes the image is already up-to-date (i.e. nothing has changed in the
Dockerfile
for the image).
Invoke
docker-compose build --no-cache
to insure that content included in the images byADD
,COPY
or GitHub source checkouts is up-to-date
- Run
docker-compose up
If you built the images (or if you already have the images locally from a previous build), services should begin to start right away. If you did not build the images, Docker will first pull the images from Docker Hub.
After starting the demo with the defaults, the following services should work.
- Ember application: https://localhost. See Shibboleth users below for login options
- Internal FTP server:
localhost:21
, username:nihmsftpuser
password:nihmsftppass
- HTTP POST submission trigger:
localhost:8081
- Fedora:
http://localhost:8080/fcrepo/rest
- Same Fedora instance behind a Shibboleth SP:
https://localhost/fcrepo/rest
- DSpace repository, exposed at port
8181
:http://localhost:8181/xmlui
- Login with username
[email protected]
, passwordfoobar
- Not behind the Shibboleth SP
- Login with username
- DSpace SWORD v2 endpoint:
http://localhost:8181/swordv2/servicedocument
- Protected by HTTP basic auth
- Authenticate with username
[email protected]
, passwordfoobar
- Not behind the Shibboleth SP
- Postgres database, exposed at port
6543
- DSpace database name
dspace
, usernamedspace
, no password
- DSpace database name
(N.B.
docker-machine
users will need to substitute the IP address of their Docker machine in place oflocalhost
)
There are four users that can log in via Shibboleth. These can be used to log in to Ember. Each uses the password moo
.
staff1
A staff member who does have grants and therefore does have a User resource already, and is the PI on grants.staff2
A staff member who does not have grants. Because PASS policy is to only allow faculty in, this user will be denied access to the user service, or the repository.faculty1
A faculty member who does have lots of grants, and therefore does have a User resource already, and is a PI on grants.faculty2
A faculty member who does not have grants. Because PASS policy is to give faculty submitter privileges, the first time this person logs to shibboleth and hits the user service, a new User resource is created, but the user should not be allowed to associate grants with his or her submissions.nih-user
Person with NIH grantsed-user
Person with DOE gtantsusaid-user
Person with USAID grantsincomplete-nih-user
Person with incomplete submissions harvested from NIHMS
- Type
CTRL-C
- Optionally, run
docker-compose down
TODO
To trigger a submission without using the PASS Ember UI:
- Start the demo
docker-compose up
- Send an empty HTTP
POST
request to the submission container, by default port8081
or whateverPY_CGI_PORT
is defined as- for
docker-machine
users, this will be an IP address like192.168.99.100
- for
Docker for *
users, this will belocalhost
- e.g.
curl -X POST localhost:8081
- e.g.
curl -X POST 192.168.99.100:8081
- for
You should see some logs flow across your Docker terminal window, and an indication of a successful FTP upload.
Submissions may be FTPed to one of two locations:
- The internal FTP server configured by Docker
- The NIH test FTP server
By default, the internal FTP server is used. To use the NIH test FTP server, export an environment variable named FTP_CONFIGURATION_KEY
with a value of nih
, then (re)start the demo application using docker-compose
. To explicitly configure the internal FTP server, export FTP_CONFIGURATION_KEY
with a value of local
.
To push the images to Docker Hub, you must have a Docker Hub account, and have write permissions to the PASS repository.
Images are identified by: <repository prefix>/<image name>:<image tag>
, where <repository prefix>
is the string pass
: e.g. pass/ember-fcrepo:0.0.1-demo
Image tags are similar to tags in a version control system: arbitrary strings that resolve a stable set of content. Tags in Docker can be overwritten, just like tags in a VCS.
Care should be taken when pushing images: overwriting a tag is something that should be done with caution
Tags are per-image, not per-repository. That means each image can have its own set of tags (i.e. version semantics). For example, the fcrepo image may have tags that correspond to the version of Fedora used by the image:
pass/fcrepo:4.7.1
pass/fcrepo:4.7.2
pass/fcrepo:5.0.0
The Ember application may have its own versioning scheme based on feature set:
pass/ember-fcrepo:0.0.1-demo
pass/ember-fcrepo:0.0.2-demo
pass/ember-fcrepo:1.0.0
It is imperative to know what tags (i.e. versions) need to be updated, and what the new tags are
N.B. it may be completely valid for the tags to remain unchanged, and simply have the images in the Docker Hub be overwritten. This may be the case especially when updating a development image with the latest code
-
Determine the tag for each image to be pushed
- Review and update
docker-compose.yml
, if necessary, to use the proper tag(s) - Again, it may be the case that tags remain unchanged, in which case the semantics of a push operation are: Overwrite existing images in Docker Hub
- Review and update
-
Build the images
-
$
docker images ls | grep 'pass/'
should return a list of images, including the image names and tags chosen in the previous step
-
-
(Optional) Start Docker and test the newly-built images
-
Push each image individually, specified as
pass/<image name>:<image tag>
-
$
docker push pass/ember-fcrepo:0.0.1-demo
-
$
docker push pass/nihms-submission:1.0.0-SNAPSHOT-demo
-
$ # push more images ...
-
-
If
docker-compose.yml
was updated in step (1), be sure to commit those changes to Git, and push.-
N.B. do not push local changes to
.env
-
-
(Optional) Use
git tag
in order to provide traceability for the deployed images-
N.B. don't forget to push the tag
-