v1.5.5
Overview
Many small incremental updates + Token level timestamps with DTW by @denersc in #1485
Feedback is welcome!
Full Changelog: v1.5.4...v1.5.5
What's Changed
- server : fix server temperature + add temperature_inc by @ggerganov in #1729
- main : add cli option to disable system prints by @ggerganov in #1740
- server: add request path by @eschmidbauer in #1741
- Optional Piper TTS support for talk-llama example. by @RhinoDevel in #1749
- fix/1748 by @nank1ro in #1750
- Don't compute timestamps when not printing them. by @ghindle in #1755
- Add more parameters to server api by @ghindle in #1754
- Add SetInitialPrompt method to go bindings by @blib in #1753
- ggml : fix 32-bit ARM compat for IQ2_XS by @ggerganov in #1758
- refactor: get all scripts to be POSIX Compliant by @sonphantrung in #1725
- whisper : load the model into multiple buffers of max size 1GB by @ggerganov in #1763
- rebase against your -np changes (thx) and add better python file to be used on the command line or as library by @contractorwolf in #1744
- examples/talk-llama: Add optional commandline parameter to set the bot name. by @RhinoDevel in #1764
- server : fix building and simplify lib deps on Windows by @przemoc in #1772
- talk-llama: optional wake-up command and audio confirmation by @Rakksor in #1765
- examples/server: implement "verbose_json" format with token details by @rmmh in #1781
- whisper.android: Return output from benchmarks by @luciferous in #1785
- libwhisper.so should be position independent by @trixirt in #1792
- Docs: try to make model options / model install methods clearer by @mrienstra in #1806
- common : fix input buffer check by @ggerganov in #1812
- Update Makefile by @jwijffels in #1813
- Add fields to
verbose_json
response and show examples on the home page by @JacobLinCool in #1802 - common: fix wav buffer detection by @JacobLinCool in #1819
- Add macOS deployment target option to Makefile by @didzis in #1839
- Expose CUDA device setting in public API by @didzis in #1840
- whisper.android: How to build with CLBlast by @luciferous in #1809
- server: Allow CORS request with authorization headers by @valenting in #1850
- Embed Metal library source into compiled binary by @didzis in #1842
- added audio_ctx argument to main and server examples by @dscripka in #1857
- whisper : fix external encoder by @ggerganov in #1860
- swift : package no longer use ggml dependency by @ggerganov in #1861
- fix openvino setup docs by @jumpers775 in #1874
- clean up common code in examples by @felrock in #1871
- main : check if input files exist before proceeding by @Theldus in #1872
- Linking issue fix via Makefile when CUBLAS enabled in the WSL #1876 by @lbluep in #1878
- main : fix file existence check in main.cpp by @Theldus in #1889
- openvino : fix convert-whisper-to-openvino.py for v2023.0.0 (#1870) by @st-gr in #1890
- ggml : 32-bit arm compat by @ggerganov in #1891
- Add SYCL logic in whisper by @abhilash1910 in #1863
- talk and talk-llama: Pass text_to_speak as a file by @tamo in #1865
- Stream.wasm: Fix invalid memory access when no segments are returned by @Andrews54757 in #1902
- Update README to Recommend MacOS Sonoma for Core ML to avoid hallucination by @gavin1818 in #1917
- Add library versioning by @kenneth-ge in #1352
- Fix SF(segment fault) issue in Android JNI by @zhouwg in #1929
- Fix typo in source file whisper.cpp by @zhouwg in #1925
- bench:fix typo by @zhouwg in #1933
- Auto lowercase language parameter by @F1L1Pv2 in #1928
- ggml : try fix 32-bit arm compat by @ggerganov in #1938
- whisper : make beam candidate sort more stable by @josharian in #1943
- bindings/go : add linker flags to make metal work by @josharian in #1944
- whisper : improve beam search candidate diversity by @josharian in #1947
- whisper : document whisper_batch.n_seq_id by @josharian in #1942
- Rename --audio-context to --audio-ctx, as per help text by @joliss in #1953
- [DRAFT] Token level timestamps with DTW (#375) by @denersc in #1485
- Fedora dependencies needed (SDL2) by @Man2Dev in #1970
- libcuda.so.1 in PATH in Docker Container by @tiagofassoni in #1966
- ruby : fix build by @ggerganov in #1980
- Improve support for distil-large-v3 by @sanchit-gandhi in #1982
- whisper : improve handling of prompts by @ggerganov in #1981
- sync : ggml by @ggerganov in #2001
- Implemented command-style grammar in the main example. by @ulatekh in #1998
- Use pkg-config for OpenBLAS by @przemoc in #1778
- ci : add building in MSYS2 environments (Windows) by @przemoc in #1994
- Support CUDA versions < 11.1 by @primenko-v in #2020
- Create solution folders in the CMake build by @ulatekh in #2004
- Allow a regular expression to describe tokens to suppress by @ulatekh in #1997
- "main" example now allows a response-file as the sole parameter by @ulatekh in #2019
- Support for CPU BLAS build via Intel MKL by @slashlib in #2024
- Set stdin to binary mode on Windows. Fixes #2023 by @rotemdan in #2025
- Fix file-handle leak in read_wav() by @ulatekh in #2026
- Fix DTW memory access by @bradmurray-dt in #2012
- whisper: update grammar-parser.cpp by @eltociear in #2058
- fix missing reference to "model" variable in actual shell command run in whisper.nvim by @sixcircuit in #2049
- build : detect AVX512 in Makefile, add AVX512 option in CMake by @didzis in #2043
- feature/no timestamps node by @pprobst in #2048
- Update embedded Metal library generation process to include dependency by @didzis in #2045
- server.cpp: add dtw by @eschmidbauer in #2044
New Contributors
- @eschmidbauer made their first contribution in #1741
- @RhinoDevel made their first contribution in #1749
- @nank1ro made their first contribution in #1750
- @ghindle made their first contribution in #1755
- @blib made their first contribution in #1753
- @sonphantrung made their first contribution in #1725
- @contractorwolf made their first contribution in #1744
- @Rakksor made their first contribution in #1765
- @rmmh made their first contribution in #1781
- @luciferous made their first contribution in #1785
- @trixirt made their first contribution in #1792
- @mrienstra made their first contribution in #1806
- @JacobLinCool made their first contribution in #1802
- @valenting made their first contribution in #1850
- @dscripka made their first contribution in #1857
- @jumpers775 made their first contribution in #1874
- @Theldus made their first contribution in #1872
- @lbluep made their first contribution in #1878
- @st-gr made their first contribution in #1890
- @abhilash1910 made their first contribution in #1863
- @Andrews54757 made their first contribution in #1902
- @gavin1818 made their first contribution in #1917
- @kenneth-ge made their first contribution in #1352
- @zhouwg made their first contribution in #1929
- @F1L1Pv2 made their first contribution in #1928
- @josharian made their first contribution in #1943
- @joliss made their first contribution in #1953
- @Man2Dev made their first contribution in #1970
- @tiagofassoni made their first contribution in #1966
- @sanchit-gandhi made their first contribution in #1982
- @ulatekh made their first contribution in #1998
- @primenko-v made their first contribution in #2020
- @slashlib made their first contribution in #2024
- @rotemdan made their first contribution in #2025
- @bradmurray-dt made their first contribution in #2012
- @sixcircuit made their first contribution in #2049
- @pprobst made their first contribution in #2048
Full Changelog: v1.5.4...v1.5.5