v1.0.4

ggerganov released this 17 Dec 18:34


What's Changed

Core `ggml` / `whisper`

Make ggml compatible with C99 9955fa4 | 0f11759
Fix UB causing asserts in Debug when reading the model vocabulary 124c718
Minor improvements in the Greedy decoding strategy 6a7c825
Add Windows build without OpenBLAS by @ggerganov in #282
Add whisper_tokenize() - basic text tokenization bf69b66
Language auto-detect option by @ggerganov in #286
Add AVX, AVX2 support for ggml_vec_scale_f32 by @katsu560 in #285
Implement extra cases for ggml_compute_forward_dup_f16() a7047b2
Add Roadmap and update F.A.Q. in discussion #126

C-style API

Add whisper_tokenize()
Add whisper_lang_max_id()
Add whisper_lang_str()
Add whisper_lang_auto_detect()
Add whisper_token_lang()

Examples

Improve prompting in "talk" example a613f16
Add "sliding window" mode to "stream" example b0f8013
Add Android sample by @Digipom in #277
Guided mode for the "command" example by @ggerganov in #271
Example "main" supports --prompt option b8065d9
Example "main" supports --print-progress option 32fbc8c
Example "main" supports --lang auto option fba10a4

New Contributors

@Digipom made their first contribution in #277

Full Changelog: 1.0.3...1.0.4

Highlights

Sample Android application: examples/whisper.android

General-purpose, short voice command detection on Raspberry Pi 4 using examples/command (demo video: command-guided-0.mp4)

Contributors

ggerganov, Digipom, and katsu560
