Skip to content

Commit

Permalink
README
Browse files Browse the repository at this point in the history
  • Loading branch information
ebrehault committed May 7, 2024
1 parent 863ddae commit a392ba2
Show file tree
Hide file tree
Showing 4 changed files with 29 additions and 1 deletion.
3 changes: 3 additions & 0 deletions extractor/CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
# 1.0.0 (2024-05-07)

- Initial release
21 changes: 21 additions & 0 deletions extractor/README.md
Original file line number Diff line number Diff line change
@@ -1 +1,22 @@
# Web page extractor

Allows to extract the HTML content of a web page.

## Objective

When pushing regular links to your Nuclia Knowledge Box, the content of the page is extracted by the Nuclia processing, but your web pages may not be accessible from the internet. This extractor allows you to extract the content of your web pages locally and so the Nuclia Sync Agent can push the corresponding content to your Nuclia Knowledge Box.

It must be used in conjunction with the [Nuclia Sync agent](https://github.com/nuclia/sync-agent).
It must be deployed on the same machine as the Nuclia Sync agent, and it runs on port 8091.

## Installation

```bash
npm install @nuclia/extractor
```

## Usage

```bash
npm start
```
2 changes: 1 addition & 1 deletion package.json
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
{
"name": "nuclia-sync-agent-app",
"version": "1.4.2",
"version": "1.5.0",
"description": "This is a Nuclia Sync Agent App",
"main": "build/index.js",
"scripts": {
Expand Down
4 changes: 4 additions & 0 deletions server/CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,7 @@
# 1.5.0 (2024-05-07)

- Supports local extraction of web pages

# 1.4.2 (2024-04-30)

- When loading folders, expose `displayPath` when needed and possible, according the connector types.
Expand Down

0 comments on commit a392ba2

Please sign in to comment.