1 About
2 Installation
3 Examples
3.1 List of Examples
3.2 Serve data from a minimal Python program
3.3 Serve data from an enhanced Python program
3.4 Serve data from a non-HAPI web service
3.5 Serve data stored in a single file
3.6 Serve data read by Autoplot
4 Usage
5 Server Configuration
5.1 conf/config.json
5.2 Apache
5.3 Nginx
6 Metadata
6.1 server
6.1.1 id and prefix
6.1.2 contact
6.1.3 landingFile and landingPath
6.1.4 catalog-update
6.2 catalog
6.2.1 catalog_inline
6.2.2 catalog_file
6.2.3 catalog_url
6.2.4 catalog_command
7 Development
8 Contact
The intended user for this server-side software is a data provider that wants to serve data through a HAPI API. To be able to serve data from a HAPI API from their server using this software, the data provider only needs
- HAPI metadata, in one of a variety of forms, for a collection of datasets and
- a command-line program that returns at least a headerless HAPI CSV stream for all parameters in the dataset over the full time range of available data. Optionally, the command line program can take inputs of a start and stop time, a list of one or more parameters to output, and a HAPI stream format.
This software handles
- HAPI metadata validation,
- request validation and error responses,
- logging and alerts,
- time and parameter sub-setting (as needed), and
- generation of HAPI JSON or HAPI binary (as needed).
Other features:
- Backpressure is handled by pausing command-line data generator
stdout
. The server requires only ~100 MB of memory. - Unit tests -- tests on the command line program and HAPI URLs can be specified in a configuration file. By default, the server will not start if any test fails.
- Coupling to the the
server-ui
client.
%A list of catalogs that are served using this software is given at http://hapi-server.org/servers.
Requires NodeJS v16+.
git clone https://github.com/hapi-server/server-nodejs.git && cd server-nodejs
npm install; npm update;
node server.js
This server is tested on Linux and OS-X.
The following examples are included in the metadata directory. The examples can be run using
./hapi-server -f metadata/FILENAME.json
where FILENAME.json
is one of the file names listed below (e.g., Example0.json
).
- Example0.json - A Python program dumps an entire dataset in the headerless HAPI CSV format; the server handles time and parameter subsetting and creation of HAPI Binary and JSON.
- Example1.json - Same as Example0 except the Python program handles time subsetting.
- Example2.json - Same as Example1, except the Python program handles time and parameter subsetting and creation of HAPI CSV and Binary.
- Example3.json - Same as Example2 except for HAPI info metadata for each dataset is stored in an external file.
- Example4.json - Same as Example3 except for HAPI info metadata for each dataset is generated by a command-line command.
- Example5.json - Same as Example4 except catalog metadata is stored in an external file.
- Example6.json - Same as Example5 except catalog metadata is generated by a command-line command.
- Example7.json - Pass-through example; Same as Example2 except that data is generated by remote request and catalog metadata is returned from a URL.
- Example8.json - A dataset in headerless HAPI CSV format is stored in a single file; the server handles parameter and time subsetting and creation of HAPI JSON and Binary.
- Example9.json - A URL returns a dataset in headerless HAPI CSV format; the server handles parameter and time subsetting and creation of HAPI JSON and Binary.
- Example10.json - Same as Example9 except catalog is returned by a command.
- AutoplotExample1.json - A dataset is stored in multiple files and AutoplotDataServer is used to subset in time.
- AutoplotExample2.json - A dataset is stored in a CDF file, and AutoplotDataserver is used to generate HAPI CSV.
- TestData2.0.json - A test dataset used to test HAPI clients.
- TestData2.1.json - A test dataset used to test HAPI clients.
- SSCWeb.json - Data from a non-HAPI web service is made available from a HAPI server.
- QinDenton.json - Data in a single ASCII file is converted to headerless HAPI CSV by a Python program. See section 2.5.
In this example, we assume that the command line program that returns a dataset has the minimal capabilities required - when executed, it generates a headerless HAPI CSV file with all parameters in the dataset over the entire time range of available data. The server handles time and parameter subsetting and the generation of HAPI Binary and JSON.
The Python script Example.py returns HAPI-formatted CSV data (with no header) with two parameters. Only a configuration file, Example0.json, is needed to serve this data. The configuration file has information that is used to call the command line program, and it also has HAPI metadata that describes the output of Example.py. Details about the configuration file format are described in the Metadata section.
The Python calling syntax of Example.py is
python Example.py
To run this example locally after installation, execute
./hapi-server --file metadata/Example0.json
and then open http://localhost:8999/Example0/hapi
. You should see the same landing page as that at http://hapi-server.org/servers-dev/Example0/hapi. Note that the --open
command-line switch can be used to automatically open the landing page, e.g.,
./hapi-server --file metadata/Example0.json --open
The Python script Example.py can subset parameters and time and provide binary output. To force the server to use these capabilities, we need to modify the server configuration metadata in Example1.json. The changes are replacing
"command": "python bin/Example.py"
with
"command": "python bin/Example.py --params ${parameters} --start ${start} --stop ${stop} --fmt ${format}"
and adding
"formats": ["csv", "binary"]
The modified file is Example2.json. To run this example locally after installation, execute
./hapi-server --file metadata/Example2.json
and then open http://localhost:8999/Example2/hapi
. The command-line program now produces binary output and performs parameter subsetting as needed and the response time for data should decrease.
The server responses will be identical to those in the previous example. You should see the same landing page at http://hapi-server.org/servers-dev/Example2/hapi.
A non-HAPI server can be quickly made HAPI compliant using this server as a pass-through. Data from SSCWeb, which is available from a REST API, has been made available through a HAPI API at http://hapi-server.org/servers/SSCWeb/hapi. The configuration file is SSCWeb.json and the command line program is SSCWeb.js. Note that the metadata file SSCWeb.json was created using code in metadata/SSCWeb.
To run this example locally after installation, execute
./hapi-server --file metadata/SSCWeb.json --open
You should see the same landing page at http://hapi-server.org/servers/SSCWeb/hapi.
The Qin-Denton dataset contains multiple parameters stored in a single large file.
The command-line program that produces HAPI CSV from this file is QinDenton.py and the metadata is in QinDenton.json.
To run this example, use
./hapi-server --file metadata/QinDenton.json
Nearly data file that can be read by Autoplot can be served using this server.
Serving data requires at most two steps:
- Generating an Autoplot URI for each parameter, and (in some cases)
- Writing (by hand) metadata for each parameter.
Example 1
The first example serves data stored in a single CDF file. The configuration file is AutoplotExample1.json.
In this example, step 2. above (writing metadata by hand) is not required because the data file has metadata in a format that Autoplot can translate to HAPI metadata.
To run this example locally, execute
./hapi-server --file metadata/AutoplotExample1.json
Example 2
The second example serves data stored in multiple ASCII files. The configuration file is AutoplotExample2.json.
To run this example locally, execute
./hapi-server --file metadata/AutoplotExample2.json
List command-line options:
./hapi-server -h
Usage: node server.js [options]
Options:
--help Show help [boolean]
--file, -f Catalog configuration file or file pattern
[string] [default: /metadata/TestData2.0.json"]
--port, -p Server port [number] [default: 8999]
--conf, -c Server configuration file
[string] [default: "conf/server.json"]
--ignore, -i Start server even if metadata validation errors [boolean]
--skipchecks, -s Skip startup metadata validation and command line tests
[boolean] [default: false]
--logdir, -l Log directory [string] [default: "log"]
--open, -o Open web page on start [boolean] [default: false]
--test, -t Run URL tests and exit [boolean] [default: false]
--verify Run verification tests on command line and exit
[boolean] [default: false]
--loglevel info or debug [string] [default: "info"]
--debug set loglevel to 'debug' [boolean] [default: false]
--server-ui-include Also include these servers in server-ui server drop-down.
Use multiple times for more than one list.
[string] [default: null]
--proxy-whitelist Allow proxying of servers in this file (so one can use
server=http://... in address bar of server-ui).
[string] [default: ""]
--verifier Verifier server URL on landing page (__VERIFIER__ in html
is replaced with this value)
[string] [default: "http://hapi-server.org/verify/"]
--plotserver Plot server URL on landing page (__PLOTSERVER__ in html
is replaced with this value)
[string] [default: "http://hapi-server.org/plot/"]
--hapiserverpath Absolute path to use for $HAPISERVERPATH in server
metadata files [string] [default: "."]
--nodejs Location of NodeJS binary to use for $NODEJSEXE in server
metadata files (if needed for command line calls).
[string] [default: "node"]
--python Location of Python binary to use for $PYTHONEXE in server
metadata files (if needed for command line calls).
[string]
--https Start https server [boolean]
--cert https certificate file path [string]
--key https key file path [string]
Basic usage:
./hapi-server --file metadata/TestData.json
Starts HAPI server at http://localhost:8999/TestData/hapi
and serves datasets specified in the catalog ./metadata/TestData.json.
Multiple catalogs can be served by providing multiple catalog files on the command line:
./hapi-server --file CATALOG1.json --file CATALOG2.json
For example
./hapi-server --file metadata/TestData2.0.json --file metadata/Example1.json
will serve the two datasets at
http://localhost:8999/TestData2.0/hapi
http://localhost:8999/Example1/hapi
And the page at http://localhost:8999/
will point to these two URLs.
The variables $HAPISERVERPATH
, $PYTHONEXE
, and $NODEJSEXE
can be used in commands, files, and URLs in the server metadata (which are files passed using the command-line --file
switch). See the examples for use cases.
These variables can be passed as command-line arguments or set in the default configuration file conf/server.json
; an alternative location can be set using a command-line argument, e.g.,
./hapiserver -c /tmp/config.json
Variables set on the command line take precedence over those set in conf/server.json
.
$HAPISERVERPATH
$HAPISERVERPATH
can be used in metadata files to reference the absolute path of a directory. For example,
{
"server": {...},
"data": {...},
"catalog_file": "$HAPISERVERPATH/mymetadata/Data.json"
}
By default, $HAPISERVERPATH
is the installation directory (the directory containing the shell launch script hapi-server
). To use a custom path, set hapiserverpath
on the command line or in conf/server.json
.
All relative paths in commands in metadata files are relative to the directory where hapi-server
was executed.
$PYTHONEXE
This is the command used to start Python. By default, it is python
. If python
is not in the path, this can be set using a relative or absolute path. Python is used by several of the demonstration catalogs.
This variable can also be set using the --python
command line switch, which has the highest precedence.
Example:
data: {
"command": "$PYTHONEXE $HAPISERVERPATH/mybin/Data.py"
}
$NODEJSEXE
This variable can also be set using the --nodejs
command line switch, which has the highest precedence.
This is the command used to call NodeJS. By default, it is the command used to start the server. The start-up script looks for a NodeJS executable in $HAPISERVERPATH/bin
and then tries node
and then nodejs
.
To expose a URL through Apache, (1) enable mod_proxy
and mod_proxy_http
and (2) add the following in a <VirtualHost>
node in a Apache Virtual Hosts file
<VirtualHost *:80>
ProxyPass /TestData2.0 http://localhost:8999/TestData2.0 retry=1
ProxyPassReverse /TestData2.0 http://localhost:8999/TestData2.0
</VirtualHost>
If serving multiple catalogs (--file
used more than once on the command line), use
<VirtualHost *:80>
ProxyPass /servers http://localhost:8999/servers retry=1
ProxyPassReverse /servers http://localhost:8999/servers
</VirtualHost>
For Nginx, add the following to nginx.conf
location /TestData2.0 { proxy_pass http://localhost:8999/TestData2.0;}
If serving multiple catalogs, use
location /servers {proxy_pass http://localhost:8999/servers;}
The metadata required for this server is similar to a HAPI server's /catalog
and /info
response.
The server requires that the /catalog
response is combined with the /info
response for all datasets in the catalog in a single JSON catalog configuration file. Additional information about how to generate data must also be included in this JSON file.
The top-level structure of the configuration file is
{
"server": // See section 6.1
{
"id": "",
"prefix": "",
"HAPI": "",
"contact": "",
"landingFile": "",
"landingPath": "",
"verify": "",
"catalog-update": null
},
"catalog": <object>,
or
"catalog_inline": <object>,
or
"catalog_file": <string>,
or
"catalog_url": <string>,
or
"catalog_command": <string>,
"data":
{
"command": "Command line template", // See metadata/Example{1-7}.json
or // See metadata/Example8.json
"file": "HAPI CSV file"
"fileformat": "one of 'csv', 'binary', 'json'"
or // See metadata/Example9.json
"url": "URL that returns HAPI data" // and
"urlformat": "one of 'csv', 'binary', 'json'"
// __DATACONTACT__ in landingFile is replaced with this value.",
"contact": "Email address if error in command line program."
// Number of milliseconds to wait for a response from command line program
// before sending 1501. See bin/Example.py for example of sending a custom
// error message.
"timeout": 59000,
"testcommands": [
{
"command": <string>,
"Nlines": <integer>,
"Nbytes": <integer>,
"Ncommas": <integer>
},
...
],
"testurls": [
{
"url": <string>,
"Nlines": <integer>,
"Nbytes": <integer>,
"Ncommas": <integer>
},
...
]
}
}
A variety of examples are listed in the Examples section.
The string command
in the data node is a command that produces a headerless HAPI data response that has optional placeholders. See metadata/Example{1-7}.json for examples of usage that includes calling bin/Example.py. For example, in metadata/Example4.json, command
is
python ./bin/Example.py --dataset ${id} --params ${parameters} --start ${start} --stop ${stop} --fmt ${format}"
The server node has the form
"server": {
// Default is file name without extension.
"id": "",
// Default is id.
"prefix": "",
// Required. __SERVERCONTACT__ in landingFile is replaced with this value.
"contact": "",
// HAPI API and schema (e.g., 2.1, 3.0) to use for checking metadata.
// __VERSION__ in landingFile is replaced with this value.
"HAPI": "",
"landingFile": "", // e.g., index.htm file
"landingPath": "", // relative paths in index.htm will resolve to this path
// Only verify DATASET_NAME when --verify is passed on command line
"verify": "DATASET_NAME",
// How often in seconds to re-read metadata in the catalog node
"catalog-update": null
}
The id
is by default the name of the server configuration file, e.g.,
./hapi-server --file metadata/TestData2.0.json
then id=TestData2.0
and prefix=TestData2.0
.
By default, this catalog would be served from
http://localhost:8999/TestData2.0/hapi
TestData2.0
in the URL can be changed to TestData2.0.1
by using prefix=TestData2.0.1
.
It should be, at minimum, the email address of a system administrator.
landingFile
is the file to serve in response to requests for
http://localhost:8999/TestData2.0/hapi
By default, the landing page served is the stand-alone file single.htm from the HAPI server UI codebase. The double underscore variables in this file are replaced using the information in the metadata file (e.g., __CONTACT__
is replaced with the server.contact
value. A different landing page can be served by setting the landingFile
configuration variable, e.g. "landingFile": "$HAPISERVERPATH/public/index.htm"
, where $HAPISERVERPATH
is described in Server Configuration.
If landingFile
has local CSS and JS dependencies, set landingPath
to be the local directory of the referenced files. Several possible settings are
"landingFile": "$HAPISERVERPATH/index.htm",
// $HAPISERVERPATH will be replaced with the location of hapi-server binary
"landingPath": "/var/www/public/" // Location of CSS and JS files
// If index.htm has <script src="index.js">, index.js should be in /var/www/public/
To serve a directory listing, use, e.g.,
"landingFile": "",
"landingPath": "/var/www/public/"
// Server will look for index.htm and index.html in /var/www/public/. If not
// found, directory listing of /var/www/public/ will be served.
This is an integer number of seconds corresponding to how often the catalog
node should be updated. Use this if the catalog
node is not static.
Instead of a catalog
node, nodes of catalog_inline
, catalog_file
, catalog_url
, or catalog_command
can be used.
The catalog array should have the same format as a HAPI /catalog
response (each object in the array has an id
property and optional title
property) with the addition of an info
property that is the HAPI response for that id
, e.g., /info?id=dataset1
.
"catalog": // or "catalog_inline"
[
{
"id": "dataset1",
"title": "a dataset",
"info": {
"startDate": "2000-01-01Z",
"stopDate": "2000-01-02Z",
"parameters": [...]
}
},
{
"id": "dataset2",
"title": "another dataset",
"info": {
"startDate": "2000-01-01Z",
"stopDate": "2000-01-02Z",
"parameters": [...]
}
}
]
Examples of this type of catalog include
Instead of an info
node, nodes of info_inline
, info_file
, info_url
, or info_command
can be used. (info
is treated the same as info_inline
).
"catalog_inline":
[
{
"id": "dataset1",
"title": "a dataset",
"info_file": "relativepath/to/${id}/info_file.json"
},
{
"id": "dataset2",
"title": "another dataset",
"info_file": "/absolutepath/to/${id}/info_file.json"
},
{
"id": "dataset3",
"title": "another dataset",
"info_url": "http://server/hapi/info?dataset=${id}"
},
{
"id": "dataset4",
"title": "another dataset",
"info_command": "bin/program --id ${id}"
}
]
Relative paths in info_file
and info_command
are relative to the location of the directory of server.js
. See Example3.json.
The file should contain an array with content of the form shown above in catalog_inline
.
See also Example3.json.
The URL should return a HAPI catalog response. The content for the info
node is populated automatically. See Example7.json
stdout
should contain an array with content of the form shown above in catalog_inline
. See Example10.json.
Install nodejs (tested with v16+) using either the standard installer or NVM.
# Install Node Version Manager
curl -o- https://raw.githubusercontent.com/nvm-sh/nvm/v0.35.3/install.sh | bash
# Open a new shell (see displayed instructions from above command)
# Install and use node.js version 16
nvm install 16
# Clone the server repository
git clone https://github.com/hapi-server/server-nodejs
# Install dependencies
cd server-nodejs; npm install
# Start server
node server.js
# Run tests; Python 2.7+ required for certain tests.
npm test
Please submit questions, bug reports, and feature requests to the issue tracker.