Skip to content

Latest commit

 

History

History
143 lines (103 loc) · 6.71 KB

index.md

File metadata and controls

143 lines (103 loc) · 6.71 KB
title slug license
Gravitino overview
/
Copyright 2023 Datastrato Pvt Ltd. This software is licensed under the Apache License version 2.

What's Gravitino?

Gravitino is a high-performance, geo-distributed, and federated metadata lake. It manages the metadata directly in different sources, types, and regions. It also provides users with unified metadata access for data and AI assets.

Learn more

Downloading

You can get Gravitino from the GitHub release page, or you can build Gravitino from source code. See How to build Gravitino.

Gravitino runs on both Linux and macOS platforms, and it requires the installation of Java 8, Java 11, or Java 17. Gravitino trino-connector runs with Trino, and requires Java 17. This should include JVMs on x86_64 and ARM64. It's easy to run locally on one machine, all you need is to have java installed on your system PATH, or the JAVA_HOME environment variable pointing to a Java installation.

See How to install Gravitino to learn how to install the Gravitino server.

Gravitino provides Docker images on Docker Hub. Pull the image and run it. For details of the Gravitino Docker image, see Docker image details.

Gravitino also provides a playground to experience the whole Gravitino system with other components. See the Gravitino playground repository and How to use the playground.

Getting started

To get started with Gravitino, see Getting started for the details.

How to use Gravitino

Gravitino provides two SDKs to manage metadata from different catalogs in a unified way: the REST API and the Java SDK. You can use either to manage metadata. See

Also, you can find the complete REST API definition in Gravitino Open API, and the Java SDK definition in Gravitino Javadoc.

Gravitino provides a web UI to manage the metadata. Visit the web UI in the browser via http://<ip-address>:8090. See Gravitino web UI for details.

Gravitino currently supports the following catalogs:

Relational catalogs:

Fileset catalogs:

Gravitino also provides an Iceberg REST catalog service for the Iceberg table format. See the Iceberg REST catalog service for details.

Gravitino playground

To experience Gravitino with other components easily, Gravitino provides a playground to run. It integrates Apache Hadoop, Apache Hive, Trino, MySQL, PostgreSQL, and Gravitino together as a complete environment. To experience all the features, see Getting started and How to use the Gravitino playground.

Where to go from here

Catalogs

Gravitino supports different catalogs to manage the metadata in different sources. Please see:

  • Iceberg catalog: a complete guide to using Gravitino to manage Apache Iceberg data.
  • Iceberg REST catalog service: a complete guide to using Gravitino as an Apache Iceberg REST catalog service.
  • Hive catalog: a complete guide to using Gravitino to manage Apache Hive data.
  • MySQL catalog: a complete guide to using Gravitino to manage MySQL data.
  • PostgreSQL catalog: a complete guide to using Gravitino to manage PostgreSQL data.
  • Hadoop catalog: a complete guide to using Gravitino to manage fileset using Hadoop Compatible File System (HCFS).

Trino connector

Gravitino provides a Trino connector to manage Trino metadata in a unified way. To use the Trino connector, see:

Server administration

Gravitino provides several ways to configure and manage the Gravitino server. See:

  • Security: provides security configurations for Gravitino, including HTTPS and OAuth2 configurations.
  • Gravitino metrics: provides metrics configurations and detailed a metrics list of the Gravitino server.

Programming guides

  • Gravitino Open API: provides the complete Open API definition of Gravitino.
  • Gravitino Javadoc: provides the Javadoc for the Gravitino API.

Development guides