-
Notifications
You must be signed in to change notification settings - Fork 2.9k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
feat(ingest): elasticsearch - add Elasticsearch Source (#3893)
Co-authored-by: Shirshanka Das <[email protected]>
- Loading branch information
1 parent
2daa06a
commit a44b48a
Showing
6 changed files
with
2,845 additions
and
0 deletions.
There are no files selected for viewing
11 changes: 11 additions & 0 deletions
11
metadata-ingestion/examples/recipes/elasticsearch_to_datahub.yml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,11 @@ | ||
source: | ||
type: "elasticsearch" | ||
config: | ||
host: 'localhost:9200' | ||
username: "" | ||
password: "" | ||
|
||
sink: | ||
type: "datahub-rest" | ||
config: | ||
server: "http://localhost:8080" |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,62 @@ | ||
# Elastic Search | ||
|
||
For context on getting started with ingestion, check out our [metadata ingestion guide](../README.md). | ||
|
||
## Setup | ||
|
||
To install this plugin, run `pip install 'acryl-datahub[elasticsearch]'`. | ||
|
||
## Capabilities | ||
|
||
This plugin extracts the following: | ||
|
||
- Metadata for indexes | ||
- Column types associated with each index field | ||
|
||
## Quickstart recipe | ||
|
||
Check out the following recipe to get started with ingestion! See [below](#config-details) for full configuration options. | ||
|
||
For general pointers on writing and running a recipe, see our [main recipe guide](../README.md#recipes). | ||
|
||
```yml | ||
source: | ||
type: "elasticsearch" | ||
config: | ||
# Coordinates | ||
host: 'localhost:9200' | ||
# Credentials | ||
username: "" | ||
password: "" | ||
# Options | ||
env = "prod" | ||
index_pattern: | ||
allow: [".*some_index_name_pattern*"] | ||
deny: [".*skip_index_name_pattern*"] | ||
|
||
sink: | ||
# sink configs | ||
``` | ||
|
||
## Config details | ||
|
||
Note that a `.` is used to denote nested fields in the YAML recipe. | ||
|
||
|
||
| Field | Required | Default | Description | | ||
| --------------------------- | -------- | ---------------- |---------------------------------------------------------------| | ||
| `host` | | "localhost:9092" | The elastic search host URI. | | ||
| `username` | | "" | The username credential. | | ||
| `password` | | "" | The password credential. | | ||
| `env` | | `"PROD"` | Environment to use in namespace when constructing URNs. | | ||
| `index_pattern.allow` | | | List of regex patterns for indexes to include in ingestion. | | ||
| `index_pattern.deny` | | | List of regex patterns for indexes to exclude from ingestion. | | ||
| `index_pattern.ignoreCase` | | `True` | Whether regex matching should ignore case or not | | ||
|
||
## Compatibility | ||
|
||
Coming soon! | ||
|
||
## Questions | ||
|
||
If you've got any questions on configuring this source, feel free to ping us on [our Slack](https://slack.datahubproject.io/)! |
Oops, something went wrong.