NLP Pipeline Library

⚠️ NOT a working implementation: just an interview task

This library provides a flexible and configurable pipeline architecture for NLP, allowing users to chain together modules for tasks such as text cleaning, entity extraction, sentiment analysis, and text generation.

Project Highlights

Why This Approach?

Modular design:
- Each NLP task is encapsulated in its own module, inheriting from a base interface (NLPModule).
- This promotes reusability, scalability, and clean separation of concerns.
Validation:
- Modules are validated for type compatibility between stages.
- Parameters are validated using pydantic, providing detailed error messages for misconfigurations.
Extensibility:
- Adding new modules requires minimal effort — just implement the NLPModule interface and register the new module into the module registry.

Setup Instructions

Prerequisites

Python 3.8 or higher

Installing Locally

Clone the repository:

git clone https://github.com/lpezzolla/py-nlp-pipelines.git
cd py-nlp-pipelines

(Optional) Create and activate a virtual environment:
```
python -m venv .venv
source .venv/bin/activate
```
Install dependencies:
```
pip install -r requirements.txt
```

Usage

You can just use the run_pipeline.py in the root directory to test the functionality, I included 4 different yaml files covering the most relevant cases.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
src		src
README.md		README.md
invalid_pipeline_module.yaml		invalid_pipeline_module.yaml
invalid_pipeline_ordering.yaml		invalid_pipeline_ordering.yaml
invalid_pipeline_params.yaml		invalid_pipeline_params.yaml
requirements.txt		requirements.txt
run_pipeline.py		run_pipeline.py
valid_pipeline.yaml		valid_pipeline.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NLP Pipeline Library

Project Highlights

Why This Approach?

Setup Instructions

Prerequisites

Installing Locally

Usage

About

Releases

Packages

Languages

lpezzolla/py-nlp-pipelines

Folders and files

Latest commit

History

Repository files navigation

NLP Pipeline Library

Project Highlights

Why This Approach?

Setup Instructions

Prerequisites

Installing Locally

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages