Skip to content

v1.0.4

Compare
Choose a tag to compare
@ggerganov ggerganov released this 17 Dec 18:34
· 1484 commits to master since this release
1d716d6

What's Changed

Core ggml / whisper

  • Make ggml compatible with c99 9955fa4 | 0f11759
  • Fix UB causing asserts in Debug when reading the model vocabulary 124c718
  • Minor improvements in the Greedy decoding strategy 6a7c825
  • Add Windows build without OpenBLAS by @ggerganov in #282
  • Add whisper_tokenize() - basic text tokenization bf69b66
  • Language auto-detect option by @ggerganov in #286
  • Add AVX,AVX2 support for ggml_vec_scale_f32 by @katsu560 in #285
  • Implement extra cases for ggml_compute_forward_dup_f16() a7047b2
  • Added Roadmap and updated F.A.Q. discussion #126

C-style API

  • Add whisper_tokenize()
  • Add whisper_lang_max_id()
  • Add whisper_lang_str()
  • Add whisper_lang_auto_detect()
  • Add whisper_token_lang()

Examples

  • Improve prompting in "talk" example a613f16
  • Add "sliding window" mode to "stream" example b0f8013
  • Add Android sample by @Digipom in #277
  • Guided mode for the "command" example by @ggerganov in #271
  • Example "main" supports --prompt option b8065d9
  • Example "main" supports --print-progress option 32fbc8c
  • Example "main" supports --lang auto option fba10a4

New Contributors

Full Changelog: 1.0.3...1.0.4

Highlights

image image

  • General-purpose, short voice command detection on Raspberry Pi 4 using example/command:

    command-guided-0.mp4