Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
-
Updated
Nov 3, 2024 - Python
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
📜 Extract meaningful content from the chaos of a web page
Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.
A parser library for Go
Parser Building Toolkit for JavaScript
Library to parse and work with the C++ AST
Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time, etc. in multiple languages (ZH, EN, FR, ES, PT, DE, IT, TR, HI, NL. Partial support for JA, KO, AR, SV). Packages available at: https://www.nuget.org/profiles/Recognizers.Text, https://www.npmjs.com/~recognizers.text
Industrial-strength monadic parser combinator library
Portable Executable parsing library (from PE-bear)
📖🔬☕ BioJava is an open-source project dedicated to providing a Java library for processing biological data.
BNF wrangling and railroad diagrams
A parser combinator library for Zig
Dynamic parser combinators in Dart.
A sane rich text parsing and styling library.
竜 TatSu generates Python parsers from grammars in a variation of EBNF
Ksoup is a lightweight Kotlin Multiplatform library for parsing HTML, extracting HTML tags, attributes, and text, and encoding and decoding HTML entities.
🔪 Strictly RFC 3986 compliant URI parsing and handling library written in C89; moved from SourceForge to GitHub
A library to parse C/C++ source as AST
Add a description, image, and links to the parser-library topic page so that developers can more easily learn about it.
To associate your repository with the parser-library topic, visit your repo's landing page and select "manage topics."