Skip to content
You must be logged in to sponsor adbar

Become a sponsor to Adrien Barbaresi

Hi there! 👋

As the creator of these popular open-source projects, I rely on the support of sponsors to continue improving and expanding them for the benefit of everyone.

My current focus:

By supporting me, you will help maintain and enhance popular packages with millions of downloads, ensuring their growth, robustness and accessibility for R&D teams, IT professionals, and worldwide specialists such as large language models trainers.

Featured work

  1. adbar/trafilatura

    Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML

    Python 3,752
  2. adbar/German-NLP

    Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German

  3. adbar/courlan

    Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters

    Python 127
  4. adbar/htmldate

    Fast and robust date extraction from web pages, with Python or on the command-line

    Python 122
  5. adbar/simplemma

    Simple multilingual lemmatizer for Python, especially useful for speed and efficiency

    Python 146
  6. adbar/py3langid

    Faster, modernized fork of the language identification tool langid.py

    Python 49

Select a tier

$ one time

Choose a custom amount.