GitHub - databyjp/distyll: A personal knowledge base that I can dump information to and help me learn

Installation

Install ffmpeg (brew install ffmpeg on macOS) Install the package with pip install distyll-info

distyll.text.from_pdf(pdf_url) -> pdf_text
distyll.text.from_arxiv_paper(arxiv_url) -> {"title": title, "url": arxiv_url, "text": pdf_text}
distyll.transcripts.from_youtube(youtube_url) -> {"title": title, "date": date, "yt_url": youtube_url, "uploader": uploader, "channel": channel, "transcripts": List[transcript]}
distyll.transcripts.from_local_video(video_url) -> List[transcript]

Please see the docstrings for more information.

OpenAI: Audio -> text functionalities make use of OpenAI's Whisper. You will need an API key to use this functionality.

Sorry! I'm working on making this more streamlined and better. For the old version, please see the distyll_old branch.

Name		Name	Last commit message	Last commit date
Latest commit History 88 Commits
.idea		.idea
src/distyll		src/distyll
tests		tests
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
demo.ipynb		demo.ipynb
docker-compose.yml		docker-compose.yml
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml