Pinned

  1. crawlers Public

    Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or filesystem to various data repositories such as search engines.

    Java 183 67

  2. collector-filesystem Public

    Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data ranging from local hard drives to network locations into various data repositories such as search …

    Java 22 13

  3. importer Public

    Norconex Importer is a Java library and command-line application meant to "parse" and "extract" content out of a file as plain text, whatever its format (HTML, PDF, Word, etc.). In addition, it allo…

    Java 33 23

Repositories

Showing 10 of 22 repositories
  • crawlers Public

    Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or filesystem to various data repositories such as search engines.

    Java 183 Apache-2.0 67 27 3 Updated Nov 19, 2024
  • commons-lang Public

    Generic library shared between several projects.

    Java 12 Apache-2.0 7 0 6 Updated Oct 19, 2024
  • collector-core Public

    Collector-related code shared between different collector implementations

    Java 7 Apache-2.0 15 6 2 Updated Oct 15, 2024
  • importer Public

    Norconex Importer is a Java library and command-line application meant to "parse" and "extract" content out of a file as plain text, whatever its format (HTML, PDF, Word, etc.). In addition, it allows you to perform any manipulation on the extracted text before using it in your own service or application.

    Java 33 Apache-2.0 23 14 1 Updated Oct 15, 2024
  • commons-maven-parent Public

    Maven parent POM for many Norconex Maven projects.

    CSS 0 Apache-2.0 2 0 1 Updated Oct 14, 2024
  • collector-filesystem Public

    Norconex Filesystem Collector is a flexible crawler for collecting, parsing, and manipulating data ranging from local hard drives to network locations into various data repositories such as search engines.

    Java 22 13 9 1 Updated Sep 25, 2024
  • committer-solr Public

    Solr implementation of Norconex Committer. Should also work with any Solr-based products, such as LucidWorks.

    Java 3 Apache-2.0 5 9 2 Updated Mar 12, 2024
  • committer-sql Public

    Implementation of Norconex Committer for SQL (JDBC) databases.

    Java 1 Apache-2.0 6 4 1 Updated Jul 7, 2023
  • committer-core Public

    Norconex Committer is a Java library and command-line application used to route content to local or remote target repositories, such as a search engine index.

    Java 4 Apache-2.0 10 5 0 Updated Feb 8, 2023
  • committer-neo4j Public

    Implementation of Norconex Committer for Neo4j.

    Java 2 Apache-2.0 1 2 0 Updated Jan 4, 2022
