* Download CoreML models on macOS if possible
When whisper.cpp runs with CoreML, it needs an additional encoder model, so attempt to download the appropriate model and
symlink it to the location where whisper.cpp expects to find it
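A minimal sketch of the symlink step, not the plugin's actual code. It assumes the encoder is looked up next to the ggml model as `<model stem>-encoder.mlmodelc`; the helper name `link_coreml_encoder` and both path parameters are hypothetical.

```cpp
#include <filesystem>
#include <string>
#include <system_error>

namespace fs = std::filesystem;

// Hypothetical helper: expose a downloaded CoreML encoder directory under the
// name "<model stem>-encoder.mlmodelc" next to the ggml model file.
// Returns true if a usable entry exists at that location after the call.
bool link_coreml_encoder(const fs::path &ggml_model, const fs::path &downloaded_encoder)
{
    std::error_code ec;
    if (!fs::exists(downloaded_encoder, ec))
        return false; // nothing was downloaded, so there is nothing to link

    const fs::path link = ggml_model.parent_path() /
                          (ggml_model.stem().string() + "-encoder.mlmodelc");

    if (fs::is_symlink(fs::symlink_status(link, ec))) {
        // Replace a stale link left over from an earlier download.
        fs::remove(link, ec);
    } else if (fs::exists(link, ec)) {
        return true; // a real directory is already in place, leave it alone
    }

    fs::create_directory_symlink(fs::absolute(downloaded_encoder), link, ec);
    return !ec;
}
```

Using the `std::error_code` overloads keeps the helper from throwing, so a failed link only means CoreML acceleration is skipped.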
* Try to fix Windows build
* Remove erroneous entry in models registry
* Enable some CT2 backends on Linux
* Might as well enable OpenBLAS for whisper too
* Format cmake files
* Update README to add info about OpenBLAS now being a required dependency
* Fix OpenBLAS dependencies and linking for Ubuntu package
Simplify the translation options in the transcription-filter-properties.cpp file by adding a new option "translate_only_full_sentences". This option will be visible only when the "translate_enabled" flag is true and the "is_advanced" flag is set.
Remove unnecessary code in model-infos.cpp
Remove the code that logs a warning message when the "sha256" field is missing or invalid in the model JSON file. This code is no longer needed as it does not affect the functionality of the program.
Comment out download_json_from_github in model-infos.cpp
Comment out the call to the "download_json_from_github" function in the load_models_info() function in model-infos.cpp. This function is currently not working as intended and needs further investigation.
* refactor: Handle file exceptions when writing raw sentence and translations
This commit modifies the code in transcription-filter-callbacks.cpp to handle file exceptions when writing raw sentence and translations to files. It adds exception handling using try-catch blocks to ensure that file operations are properly handled. This change improves the robustness of the code and prevents crashes or unexpected behavior when file operations fail.
* refactor: Update models_info function to use cached models information
The models_info function in model-downloader.cpp has been updated to use a cached version of the models information. This improves performance by avoiding unnecessary file reads and JSON parsing. The function now returns a const reference to the cached models_info map. This change ensures that the models_info function is more efficient and reduces the overhead of loading the models information.
Refactor the code in model-downloader.cpp to use the updated models_info function and remove the unnecessary file read and JSON parsing code.
Closes #123
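A rough sketch of the caching idea, assuming a function-local static is used. The `ModelInfo` fields, the loader body and the `g_load_count` counter are placeholders for illustration, not the real definitions.

```cpp
#include <map>
#include <string>

// Placeholder record; the real ModelInfo has different fields.
struct ModelInfo {
    std::string friendly_name;
    std::string local_folder_name;
};

// Counts how often the expensive load runs (illustration only).
int g_load_count = 0;

// Stand-in for the expensive step: reading the JSON file and parsing it.
std::map<std::string, ModelInfo> load_models_info()
{
    ++g_load_count;
    return {{"example-model", {"Example Model", "example-model-folder"}}};
}

// Loads the map on the first call and returns the same object afterwards.
// Initialization of a function-local static is thread-safe since C++11.
const std::map<std::string, ModelInfo> &models_info()
{
    static const std::map<std::string, ModelInfo> cached = load_models_info();
    return cached;
}
```

Callers that previously copied the map can bind the result to a `const auto &` and avoid both the reload and the copy.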
* refactor: Simplify file handling in transcription-filter-callbacks.cpp
* refactor: Add script to query Hugging Face models and update models_directory.json
This commit adds two new scripts, hugging_face_model_query.py and hugging_face_model_query_all.py, to query Hugging Face models and update the models_directory.json file. The hugging_face_model_query.py script fetches model information from the Hugging Face API and adds new models to the models_directory.json file. The hugging_face_model_query_all.py script fetches a list of models matching specific search criteria and adds the matching models to the models_directory.json file. These scripts help keep the models_directory.json file up to date with the latest models available on Hugging Face.
Refactor the file handling in transcription-filter-callbacks.cpp
This commit simplifies the file handling in the transcription-filter-callbacks.cpp file. The changes aim to improve the readability and maintainability of the code by reducing complexity and removing unnecessary code.
Update the models_info function to use cached models information
This commit updates the models_info function to use cached models information instead of fetching it every time the function is called. This change improves the performance of the function by reducing the number of API calls and improves the overall efficiency of the code.
Handle file exceptions when writing raw sentence and translations
This commit adds exception handling code to handle file exceptions when writing raw sentence and translations. The changes ensure that any file-related exceptions are caught and properly handled, preventing the program from crashing or producing incorrect results.
Simplify the Onnxruntime installation in FetchOnnxruntime.cmake
This commit simplifies the Onnxruntime installation process in the FetchOnnxruntime.cmake file. The changes aim to make the installation steps more concise and easier to understand, improving the overall maintainability of the code.
Update the version to 0.3.6 and adjust the website URL
This commit updates the version of the software to 0.3.6 and adjusts the website URL accordingly. The changes ensure that the software is properly versioned and the website URL is up to date.
* refactor: Add ExtraInfo struct to ModelInfo and update models_info function
* refactor: Update model names in models_directory.json and fix URL in transcription-filter.h
* Add accessor for VAD window size in samples
* Feed buffered audio data to VAD in proper window sizes
* Wake whisper thread whenever audio is received
* Update Silero VAD to v5
* Only reset VAD state between chunks of activity
* Add Silero VAD model and integrate it into the transcription filter
* Fix Silero VAD model path and enable n_threads
* Update translation strings for multiple locales
* Update Onnxruntime library linking and fix compiler warning
* Fix variable naming and type casting in Silero VAD implementation
* Update Silero VAD model path and enable n_threads
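The "feed buffered audio data to VAD in proper window sizes" change above can be sketched as follows. This is a simplified illustration, not the plugin's code: `feed_vad_windows`, the `std::deque` buffer and the callback are hypothetical, and the window size is whatever the VAD accessor reports.

```cpp
#include <algorithm>
#include <cstddef>
#include <deque>
#include <functional>
#include <vector>

// Hypothetical helper: drain as many full windows as are buffered and hand
// each one to `process_window`. Samples that do not fill a whole window stay
// in the buffer for the next call. Returns the number of windows processed.
std::size_t feed_vad_windows(std::deque<float> &buffer, std::size_t window_size_samples,
                             const std::function<void(const std::vector<float> &)> &process_window)
{
    if (window_size_samples == 0)
        return 0;

    const auto step = static_cast<std::ptrdiff_t>(window_size_samples);
    std::vector<float> window(window_size_samples);
    std::size_t windows = 0;

    while (buffer.size() >= window_size_samples) {
        // Copy one window out, then drop those samples from the buffer.
        std::copy(buffer.begin(), buffer.begin() + step, window.begin());
        buffer.erase(buffer.begin(), buffer.begin() + step);
        process_window(window);
        ++windows;
    }
    return windows;
}
```

For example, with 1300 buffered samples and a 512-sample window, two windows are passed to the callback and 276 samples remain buffered until more audio arrives.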