VidTranslate

AI-Powered Desktop App for Text Extraction and Translation

A desktop application for macOS and Windows that utilizes local AI models like whisper.cpp for speech-to-text extraction and llama.cpp for text translation. The app can process both audio and video files, converting speech to text and translating the extracted text into multiple languages without requiring cloud services.

Key Features

Audio/Video to Text: Extracts text from audio and video files using whisper.cpp, a local implementation of OpenAI's Whisper for speech-to-text.
Text Translation: Translates the extracted text into various languages using llama.cpp and a local translation model.
Completely Offline: All processing is done locally, so there's no need for an internet connection.
Cross-Platform: Runs on both macOS and Windows.

Whisper Models for stt

https://ggml.ggerganov.com/ggml-model-whisper-large-q5_0.bin
https://ggml.ggerganov.com/ggml-model-whisper-medium-q5_0.bin

Llama models for translating

https://huggingface.co/notjjustnumbers/madlad400-3b-mt-Q4_K_M-GGUF/resolve/main/madlad400-3b-mt-q4_k_m.gguf?download=true

TODO

License

This project is licensed under the GNU General Public License v3.0. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 76 Commits
.vscode		.vscode
docs		docs
llama-t5		llama-t5
public		public
src-tauri		src-tauri
src		src
.eslintrc.cjs		.eslintrc.cjs
.gitignore		.gitignore
.prettierrc.json		.prettierrc.json
LICENSE		LICENSE
README.md		README.md
index.html		index.html
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VidTranslate

AI-Powered Desktop App for Text Extraction and Translation

Key Features

Whisper Models for stt

Llama models for translating

TODO

License

About

Releases

Packages

Languages

License

bruceunx/vidtranslate

Folders and files

Latest commit

History

Repository files navigation

VidTranslate

AI-Powered Desktop App for Text Extraction and Translation

Key Features

Whisper Models for stt

Llama models for translating

TODO

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages