obs-localvocal

mirror of https://github.com/royshil/obs-localvocal.git synced 2026-01-10 04:48:02 -05:00

Author	SHA1	Message	Date
Roy Shilkrot	71f2feafae	refactor: Update FindOpenSSL.cmake to use latest OpenSSL version - Update the URL and URL_HASH in FindOpenSSL.cmake to use the latest stable version of OpenSSL (3.3.1) - This ensures that the project is using the most up-to-date and secure version of OpenSSL for macOS	2024-07-17 08:12:02 -04:00
Roy Shilkrot	f2ef058dd2	refactor: Update whisper-processing.cpp to add cleared_last_sub flag	2024-07-16 06:12:54 -04:00
Roy Shilkrot	f30503e3fa	refactor: Add cleared_last_sub flag to whisper-processing.cpp - Add a new boolean flag, cleared_last_sub, to the whisper-processing.cpp file - Use the flag to determine if the current subtitle should be cleared based on the minimum subtitle duration - If the duration exceeds the threshold, clear the current subtitle and call the callback with an empty string - Update the last_sub_render_time when clearing the subtitle	2024-07-15 18:46:43 -04:00
Roy Shilkrot	92eea00b33	refactor: Separate cloning and bootstrap steps in FindOpenSSL.cmake - Separate the cloning of vcpkg and running the bootstrap script into individual steps in FindOpenSSL.cmake - Set the VCPKG_ROOT environment variable and add the vcpkg executable to the PATH - Install necessary packages after cloning vcpkg	2024-07-11 12:55:13 -04:00
Roy Shilkrot	c0366e96be	refactor: Add vcpkg installation step to build-project.yaml - Add a new step to install vcpkg by cloning the repository and running the bootstrap script - Set the VCPKG_ROOT environment variable to the vcpkg directory in the workspace - Add the vcpkg executable to the PATH	2024-07-11 12:21:43 -04:00
Roy Shilkrot	af3256ac0a	refactor: Update vcpkg.json, .gitignore, and vcpkg-configuration.json - Add vcpkg.json file with "openssl" dependency - Include vcpkg.json and vcpkg-configuration.json in .gitignore - Create vcpkg-configuration.json with default and additional registries - Update vcpkg-configuration.json with Amazon IVS settings - Add Amazon IVS settings variables to transcription-filter-data.h - Add timed-metadata-utils.h and timed-metadata-utils.cpp for timed metadata handling - Add FindOpenSSL.cmake for finding and linking OpenSSL libraries - Update CMakeLists.txt to include FindOpenSSL.cmake and timed-metadata-utils.cpp - Update CMakePresets.json to include CMAKE_TOOLCHAIN_FILE for VCPKG - Update transcription-filter-callbacks.cpp to send timed metadata to server	2024-07-10 21:48:03 -04:00
Roy Shilkrot	9ecd759968	refactor: Update whispercpp dependency to version 0.0.3 (#103)	2024-05-30 23:00:04 -04:00
Roy Shilkrot	5227a437b6	VAD based segmentation (#97) * refactor: Add whisper_buffer to transcription_filter_data struct * refactor: Add sentence_psum_accept_thresh to transcription_filter_data struct * refactor: Update buffer size and overlap size in whisper-processing.cpp * refactor: Update buffer size and overlap size in whisper-processing.cpp * refactor: Add audio-file-utils.cpp for audio file handling * refactor: Update buffer size and overlap size in whisper-processing.cpp * refactor: Add external model option to translation settings * refactor: Add support for input tokenization style in translation settings * refactor: Update buffer size and overlap size in whisper-processing.cpp	2024-05-16 15:07:00 -04:00
Roy Shilkrot	31c41a9574	Offline transcription accuracy tests (#96) * Update translation-utils.h, transcription-filter.h, whisper-model-utils.h, model-find-utils.h, and model-downloader.h * Update create_context function to include ct2ModelFolder parameter * fix: add fix_utf8 flag to transcription_filter_data struct * Update create_context function to include ct2ModelFolder parameter * Update read_text_from_file function to include join_sentences parameter * fix: Update VadIterator::reset_states to include reset_hc parameter * Update create_context function to include whisper_sampling_method parameter * Update tests README with additional configuration options * feat: Add function to find file in folder by regex expression * refactor: Improve text conditioning logic in transcription-filter.cpp * refactor: Improve text conditioning logic in transcription-filter.cpp * chore: Update ctranslate2 dependency to version 1.2.0 * refactor: Improve text conditioning logic in transcription-filter.cpp * chore: Update cmake BuildCTranslate2.cmake to disable -Wno-comma warning * refactor: Update translation context in whisper-processing.cpp and translation-utils.cpp	2024-05-10 17:37:09 -04:00
Roy Shilkrot	493ecad254	Update CTranslate2 and cpu_features dependencies (#94) * Update CTranslate2 and cpu_features dependencies * Update CTranslate2 and cpu_features dependencies * Update dependencies and fix special tokens handling * Add BUILD_BYPRODUCTS to CMake build command * Update version to 0.2.5 in buildspec.json	2024-04-30 09:48:23 -04:00
Roy Shilkrot	ab1b74a35c	Overlap analysis (#92) * Update buffer size and overlap size in whisper-processing.h and default buffer size in msec in transcription-filter.cpp * Update buffer size and overlap size in whisper-processing.h and default buffer size in msec in transcription-filter.cpp * Update suppress_sentences in en-US.ini and transcription-filter-data.h * Update suppress_sentences and fix whitespace in transcription-filter-data.h, whisper-processing.h, transcription-utils.cpp, and transcription-filter.h * Update whisper-processing.cpp and whisper-utils.cpp files * Update findStartOfOverlap function signature to use int instead of size_t * Update Whispercpp_Build_GIT_TAG to use commit 7395c70a748753e3800b63e3422a2b558a097c80 in BuildWhispercpp.cmake * Update buffer size and overlap size in whisper-processing.h and default buffer size in msec in transcription-filter.cpp * Update unused parameter in transcription-filter-properties function * Update log level and add suppress_sentences feature in transcription-filter.cpp and whisper-processing.cpp * Add translation output feature in en-US.ini and transcription-filter-data.h * Add DTW token timestamps and buffered output feature * trigger rebuild * Refactor remove_leading_trailing_nonalpha function to improve readability and performance * Refactor is_lead_byte and is_trail_byte macros for improved readability and maintainability * Refactor is_lead_byte and is_trail_byte macros for improved readability and maintainability * trigger build	2024-04-25 17:14:13 -04:00
Roy Shilkrot	65da380f9f	Bump whisper, clblast, add buffered output (#90) * Bump whisper, clblast, add buffered output * Update CPU_OR_CUDA environment variable error messages * Update Cublas validation in Package-Windows.ps1 and initialize function in captions-thread.h * Update Cublas validation and fix typo in Package-Windows.ps1 * Update default whisper model path to Whisper Tiny English (74Mb) * Update translation strings for multiple locales	2024-04-18 10:28:32 -04:00
Kaito Udagawa	e5a10f48cc	Fix add_custom_command to accept the argument with paren (#88) * Update FetchOnnxruntime.cmake * Update FetchOnnxruntime.cmake	2024-04-15 21:38:46 -04:00
Kaito Udagawa	f4307168de	Update build scripts according to the latest obs-plugintemplate (#87) * Update build-project.yaml * Update action.yaml * Update helpers_common.cmake * Update compilerconfig.cmake * Update .clang-format * Fix * Fix * Update build-project.yaml * Update check-format.yaml * Update push.yaml * Update build-project.yaml	2024-04-15 08:19:40 -04:00
Roy Shilkrot	f79571f316	Add Silero VAD (#85) * Add Silero VAD model and integrate it into the transcription filter * Fix Silero VAD model path and enable n_threads * Update translation strings for multiple locales * Update Onnxruntime library linking and fix compiler warning * Fix variable naming and type casting in Silero VAD implementation * Update Silero VAD model path and enable n_threads	2024-04-13 22:39:28 -04:00
Roy Shilkrot	3afe7670fe	Readme update	2024-04-01 22:24:32 -04:00
Roy Shilkrot	4638ce80fe	Remove Cublas input from build script (#80) * Remove Cublas input from build script * Remove CUDA Toolkit installation and curl submodule	2024-04-01 21:59:37 -04:00
Roy Shilkrot	a569da2ed3	Built-in Translation (#79) * Add translation feature and dependencies * Add model-infos.cpp and translate_add_context to en-US.ini * Fix formatting and whitespace issues * Update build plugin and version, fix translation and whisper-utils * Fix compiler warning and simplify code in transcription-filter.cpp * Update CMakePresets.json and buildspec.json * Fix Clang compiler warnings * Enable QT in CMakePresets.json * Fix compiler warnings and create missing config folder * Fix formatting of is_lead_byte and is_trail_byte macros	2024-04-01 14:37:31 -04:00
Roy Shilkrot	0c7d7234af	Update CUDA support and model versions (#78)	2024-03-24 21:23:06 -04:00
Roy Shilkrot	17ffcfc2c1	Enable MacOS ARM64 and Windows CUDA builds (#76) * Enable CoreML and allow fallback to CPU on MacOS ARM64 * Disable CoreML support on MacOS ARM64 * Fix build configuration for MacOS * Update macOS build configuration based on MACOS_ARCH environment variable * Update BuildWhispercpp.cmake to disable FMA instructions on non-Apple platforms * Add cuBLAS support to build and package actions * Update Cublas versions in Windows build and packaging scripts * Update CUDA_TOOLKIT_ROOT_DIR environment variable * Add sub-packages and non-cuda-sub-packages options to CUDA toolkit setup * Update CUDA sub-packages in build-project.yaml * Add "visual_studio_integration" to sub-packages in CUDA build workflow * Fix typo in build-project.yaml * Fix typo in CUDA build method * Update sub-packages in CUDA toolkit installation * Remove unnecessary CUDA sub-packages and method	2024-03-22 13:33:07 -04:00
Roy Shilkrot	7a1a6f8d69	Bump libcurl	2024-03-18 00:06:16 -04:00
Roy Shilkrot	8fe7da6d42	Fix Max Channels, Update macOS brew command and fix compiler warnings (#75)	2024-03-17 13:16:01 -04:00
Roy Shilkrot	4c15b9514c	Update Whispercpp_Build_GIT_TAG in BuildWhispercpp.cmake (#72)	2024-03-08 10:25:03 -05:00
Roy Shilkrot	d8f64971c2	Update version and Whispercpp build tag (#65)	2024-01-25 11:59:05 -05:00
Roy Shilkrot	b45b235ad6	Bump whisper.cpp. Simple settings mode (#60) * bump whispercpp, simple settings mode * lint	2023-12-21 11:08:36 -05:00
Roy Shilkrot	8c02e0c3fc	Fix CUDA build, shuffle whisper files around (#58) * fix CUDA build, shuffle whisper files around * lint	2023-11-20 09:18:06 -05:00
Roy Shilkrot	677c08c672	roll back to faster whispercpp ver	2023-11-15 22:17:38 -05:00
Roy Shilkrot	1d80602bbe	Bump whispercpp, fix mac build (#56)	2023-11-15 18:49:25 -05:00
Roy Shilkrot	ec65ffbbf7	cmake-format	2023-11-13 16:40:09 -05:00
Roy Shilkrot	dcfaddeedb	add fpic to plugin-support on linux	2023-11-13 16:33:57 -05:00
Roy Shilkrot	ea34206400	bump whisper	2023-11-04 22:30:23 -04:00
Roy Shilkrot	9e8f2064a8	fetch libcurl instead of build	2023-10-17 14:17:00 -04:00
Roy Shilkrot	9299e7592e	srt saving	2023-10-07 13:46:58 -04:00
Roy Shilkrot	15b6e1ed00	Merge pull request #28 from obs-ai/roy.fix_linux_crash	2023-09-28 02:04:23 -04:00
Roy Shilkrot	a2003d59db	lint	2023-09-28 01:46:21 -04:00
Roy Shilkrot	280bd420fc	remove blas	2023-09-28 01:45:03 -04:00
Roy Shilkrot	3fa1f11418	fix osx build	2023-09-28 00:09:56 -04:00
Roy Shilkrot	6a47a90b7f	disable AVX on linux	2023-09-26 15:11:00 -04:00
Roy Shilkrot	6721b66ade	lint	2023-09-25 23:58:10 -04:00
Roy Shilkrot	01412a1094	add openblas	2023-09-25 23:55:32 -04:00
Roy Shilkrot	7805574c7d	solve backslash win path problem	2023-09-20 16:18:39 -04:00
Roy Shilkrot	a025799cd6	install CUDA dlls	2023-09-20 15:50:43 -04:00
Roy Shilkrot	745937af4d	openblas sha	2023-09-16 18:40:16 +03:00
Roy Shilkrot	27730fa407	add caption to stream option	2023-09-13 20:36:21 -04:00
Roy Shilkrot	06675eb0bb	lint	2023-09-12 01:03:32 -04:00
Roy Shilkrot	b92bf4d36c	add cuda build options	2023-09-12 01:03:05 -04:00
Roy Shilkrot	b60451f9e3	lean libcurl build	2023-09-11 23:21:48 -04:00
Roy Shilkrot	a0713f4bce	fix osx linux	2023-09-11 01:14:44 -04:00
Roy Shilkrot	adb2ede905	add openblas	2023-09-11 00:48:49 -04:00
Roy Shilkrot	d794906bdc	whisper universal	2023-08-13 23:53:25 +03:00

55 Commits