Commit Graph

  • 615de0d2d9 add WhisperLive to community integration (#647) makaveli 2024-01-18 17:24:14 +05:30
  • 44f7e58947 Update whisper-standalone-win description in README.md (#508) Purfview 2023-12-14 12:03:46 +00:00
  • ebcfd6b964 Fix broken prompt_reset_on_temperature (#604) Purfview 2023-12-13 12:14:39 +00:00
  • 19329a3611 Word timing tweaks (#616) trungkienbkhn 2023-12-13 18:38:44 +07:00
  • 65094b779e Update info on cuBLAS and cuDNN libs in README.md (#513) Purfview 2023-11-27 11:12:47 +00:00
  • 9641d5f56a Force read-mode in av.open (#566) Clayton Yochum 2023-11-27 02:43:35 -07:00
  • e1a218fab1 Bump version to 0.10.0 v0.10.0 Dang Chuan Nguyen 2023-11-24 23:19:47 +01:00
  • 3084409633 Add V3 Support (#578) Oscaarjs 2023-11-24 23:16:12 +01:00
  • 5a0541ea7d Bump version to 0.9.0 v0.9.0 Guillaume Klein 2023-09-18 16:21:37 +02:00
  • e94711bb5c Add property WhisperModel.supported_languages (#476) Guillaume Klein 2023-09-14 17:42:02 +02:00
  • 0048844f54 Expose function available_models (#475) Guillaume Klein 2023-09-14 17:17:01 +02:00
  • a49097e655 Add some missing typing annotations in transcribe.py Guillaume Klein 2023-09-12 15:45:54 +02:00
  • 81086f6d33 Always run the encoder at the beginning of the loop (#468) Guillaume Klein 2023-09-12 14:44:37 +02:00
  • f697945691 Update tokenizers requirement to include version 0.14 (#469) Guillaume Klein 2023-09-12 14:44:22 +02:00
  • 727ab81f31 Improve error message for invalid task and language parameters (#466) Guillaume Klein 2023-09-12 10:02:23 +02:00
  • 0285d46f6f Add more details about the requirements in the README (#463) Guillaume Klein 2023-09-08 14:35:17 +02:00
  • ad388cd394 Bump version to 0.8.0 v0.8.0 Guillaume Klein 2023-09-04 11:56:48 +02:00
  • 4a41746e55 Log a warning when the model is English-only but the language is set to something else (#454) Guillaume Klein 2023-09-04 11:55:40 +02:00
  • 1e6eb967c9 Add "large" alias for "large-v2" model (#453) Guillaume Klein 2023-09-04 11:54:42 +02:00
  • f0ff12965a Expose generation parameter no_repeat_ngram_size (#449) Guillaume Klein 2023-09-01 17:31:30 +02:00
  • 5871858a5f Force the garbage collector to run after decoding the audio with PyAV (#448) Guillaume Klein 2023-09-01 15:25:13 +02:00
  • e87fbf8a49 Added audio duration after VAD to TranscriptionInfo object (#445) MinorJinx 2023-08-31 10:19:48 -05:00
  • 7b271da035 docs: add wscribe to community integrations (#427) Hrishikesh Barman 2023-08-17 12:20:24 +05:30
  • 1562b02345 added repetition_penalty to TranscriptionOptions (#403) Aisu Wata 2023-08-06 05:08:24 -03:00
  • 1ce16652ee Adds DEBUG log message for prompt_reset_on_temperature (#399) Purfview 2023-08-04 08:06:17 +01:00
  • 857be6f621 Rename clear_previous_text_on_temperature argument (#398) Purfview 2023-08-03 17:44:37 +01:00
  • 1a1eb1a027 Add clear_previous_text_on_temperature parameter (#397) KH 2023-08-03 22:40:58 +09:00
  • 5c17de1771 Bump version to 0.7.1 v0.7.1 Guillaume Klein 2023-07-24 11:10:12 +02:00
  • 0f55c436fe Invalidate the cached encoder output when no_speech threshold is met (#376) Guillaume Klein 2023-07-24 10:57:15 +02:00
  • e786e26f75 Return result with best log prob when all temperature fallbacks failed (#356) KH 2023-07-20 23:13:11 +09:00
  • 687db319e0 Remove duplicate code (#359) KH 2023-07-18 23:03:01 +09:00
  • 171d90dd1f Bump version to 0.7.0 v0.7.0 Guillaume Klein 2023-07-18 15:23:47 +02:00
  • 0e051a5b77 Prepend prefix tokens with the initial timestamp token (#358) Guillaume Klein 2023-07-18 15:22:39 +02:00
  • 2a37390fed Minor reformatting in code snippet Guillaume Klein 2023-07-18 15:08:53 +02:00
  • 3b4a6aa1c2 Improve timestamp heuristics (#336) Hoon 2023-07-05 22:16:53 +09:00
  • c7cb2aa8d4 Add support for using whisper models from Huggingface by specifying the model id. (#334) zh-plus 2023-07-03 23:40:10 +08:00
  • c0d93d0829 Avoid computing higher temperatures on no_speech segments (#225) Guillaume Klein 2023-07-03 10:20:36 +02:00
  • 19c294f978 Squash long words at window and sentence boundaries (#226) Guillaume Klein 2023-07-03 10:20:20 +02:00
  • fee52c9229 Allow users to input an Iterable of token ids into initial_prompt (#306) FlippFuzz 2023-06-21 20:46:20 +08:00
  • efc4f61d85 Do not specify the vocabulary file extension in the download pattern (#311) Guillaume Klein 2023-06-20 10:53:11 +02:00
  • ad58ba26ab Fix typo (#304) kh 2023-06-16 14:37:45 +09:00
  • 20d4e9418b Add Open-Lyrics as a community project. (#291) zh-plus 2023-06-10 14:22:29 +08:00
  • d4222da952 Update README with community repo using FW (#284) Antonio Zarauz Moreno 2023-06-07 11:30:53 +02:00
  • 1bb7e33b93 Reformat code snippet in README Guillaume Klein 2023-05-24 18:22:44 +02:00
  • 2a00621564 Bump version to 0.6.0 v0.6.0 Guillaume Klein 2023-05-24 16:15:01 +02:00
  • a150adcc19 Enable onnxruntime dependency for Python 3.11 (#260) Guillaume Klein 2023-05-24 16:07:54 +02:00
  • ae1e6d9883 Remove reference to the VAD function from the README Guillaume Klein 2023-05-24 15:56:03 +02:00
  • cf7c021573 Export __version__ at the module level (#258) Guillaume Klein 2023-05-24 15:50:37 +02:00
  • 4db549b800 Make get_speech_timestamps backward compatible with the previous usage (#259) Guillaume Klein 2023-05-24 15:49:36 +02:00
  • c99feb22dc Include requirements files in sdist (#240) Guillaume Klein 2023-05-24 12:55:15 +02:00
  • 723cb97483 Fix occasional IndexError on empty segments (#227) Guillaume Klein 2023-05-24 12:55:04 +02:00
  • 6a2da9a95c Also catch client-side network exceptions when synchronizing models (#228) Guillaume Klein 2023-05-11 15:07:15 +02:00
  • 6a1d331d66 Add CONTRIBUTING.md (#229) Guillaume Klein 2023-05-11 15:06:46 +02:00
  • 2d7c984bfc Reformat function download_model for clarity Guillaume Klein 2023-05-11 14:47:22 +02:00
  • 8e5c747ab5 Reformat list of community integrations Guillaume Klein 2023-05-11 12:15:41 +02:00
  • 32b962bed8 Adds: whisper-standalone-win (#216) Purfview 2023-05-09 19:20:41 +01:00
  • 53d247b0bb retry model download locally if huggingface throws an http error. (#215) David Axelrod 2023-05-09 11:20:22 -04:00
  • 91f948b0d6 transcribe: return all language probabilities if requested (#210) Ozan Caglayan 2023-05-09 13:53:47 +01:00
  • 5d8f3e2d90 Implement VadOptions (#198) FlippFuzz 2023-05-09 18:47:02 +08:00
  • d889345e07 added whisper-diarize (#193) Mahmoud Ashraf 2023-04-28 10:56:13 +02:00
  • 5d203d2757 Update Github link to community project (#187) Jordi Mas 2023-04-27 14:53:28 +02:00
  • a3dcb90081 Bump version to 0.5.1 v0.5.1 Guillaume Klein 2023-04-26 17:38:16 +02:00
  • 89a4c7f1f0 Update docstring to clarify download_root and output_dir Guillaume Klein 2023-04-26 17:37:51 +02:00
  • 6f9d68dd6b Fix typing of local_files_only Guillaume Klein 2023-04-26 17:36:24 +02:00
  • 68df3214ba Use cache_dir instead of local_dir (#182) Jordi Mas 2023-04-26 16:35:18 +02:00
  • 67cce3f552 Bump version to 0.5.0 v0.5.0 Guillaume Klein 2023-04-25 17:00:41 +02:00
  • 8340e04dc6 Assign words to the speech chunk with the greatest coverage (#180) Guillaume Klein 2023-04-25 15:54:31 +02:00
  • 8cf5d5a4b3 Increase the default value of speech_pad_ms to 400 ms (#179) Guillaume Klein 2023-04-25 15:54:22 +02:00
  • 32dc625f11 Update README.md Guillaume Klein 2023-04-25 15:47:38 +02:00
  • e06511f96b Rename AudioInfo to TranscriptionInfo (#174) Guillaume Klein 2023-04-24 16:29:17 +02:00
  • 338a725ff8 fix where the tokens are reset (#175) Anthony 2023-04-24 16:28:47 +02:00
  • f893113759 Align segment structure with openai/whisper (#154) Amar Sood 2023-04-24 09:04:42 -04:00
  • 2b51a97e61 Add transcription_options to AudioInfo (#170) FlippFuzz 2023-04-24 21:02:19 +08:00
  • 358d373691 Allow specifying local_files_only to prevent checking the Internet everytime (#166) Jordi Mas 2023-04-20 14:26:06 +02:00
  • 3adcc12d0f Clarify that the returned segments value is a generator (#144) Guillaume Klein 2023-04-13 09:50:53 +02:00
  • 2b53dee6b6 Expose download location in WhisperModel constructor (#126) Ewald Enzinger 2023-04-08 10:02:36 +02:00
  • 2f6790a6f5 Fit words timestamps in VAD speech chunks fit-words-timestamps Guillaume Klein 2023-04-07 10:51:53 +02:00
  • 06d24056e9 Configure ignore for more files. (#122) Bekir Bakar 2023-04-06 20:13:09 +03:00
  • e9a082dcf2 Keep segment timestamps aligned with words timestamps after VAD (#119) Guillaume Klein 2023-04-06 11:54:40 +02:00
  • 051b3350e5 Add some info and debug logs (#113) Guillaume Klein 2023-04-05 16:57:59 +02:00
  • 746f2698db Bump version to 0.4.1 v0.4.1 Guillaume Klein 2023-04-04 12:16:23 +02:00
  • a5d03e55fa Prevent out of range error in method split_tokens_on_unicode (#111) Guillaume Klein 2023-04-04 10:51:14 +02:00
  • 9fa1989073 Revert "Prevent out of range error in method split_tokens_on_unicode" Guillaume Klein 2023-04-04 10:25:41 +02:00
  • 36160c1e7e Prevent out of range error in method split_tokens_on_unicode Guillaume Klein 2023-04-04 10:17:56 +02:00
  • 2f266eb844 Fix VAD index error when a predicted timestamps is too large (#107) Guillaume Klein 2023-04-03 19:34:54 +02:00
  • 8c36ac1be8 Bump version to 0.4.0 v0.4.0 Guillaume Klein 2023-04-03 17:24:49 +02:00
  • 19698c95f8 Support VAD filter (#95) Guillaume Klein 2023-04-03 17:22:48 +02:00
  • b4c1c57781 Added retrieval mechanism (avg_log_prob/no_speech_prob) (#103) palladium123 2023-04-03 22:56:35 +08:00
  • f20bb258de Support separating the left and right audio channels (#97) Guillaume Klein 2023-04-03 11:22:43 +02:00
  • 1a968a4323 Pass prefix only to the first window Guillaume Klein 2023-04-01 09:26:42 +02:00
  • def70d8496 Update headings in the Usage section Guillaume Klein 2023-03-31 18:54:55 +02:00
  • 7301df7f8b Update README.md (#101) mayeaux 2023-03-31 17:06:44 +02:00
  • d03383f902 Simplify reuse of the encoder output Guillaume Klein 2023-03-30 15:58:27 +02:00
  • 39fddba886 Suppress some special tokens when the default set is not used Guillaume Klein 2023-03-30 12:42:29 +02:00
  • eda840f8ff Always disable the progress bar specific to snapshot_download Guillaume Klein 2023-03-29 12:11:24 +02:00
  • 0224400584 Add large-v1 model Guillaume Klein 2023-03-28 14:36:10 +02:00
  • 8246479fda Ignore the invalid audio frames (#82) Guillaume Klein 2023-03-27 10:19:22 +02:00
  • e2705d11c9 Raise an explicit error message if the model size is invalid Guillaume Klein 2023-03-26 16:29:11 +02:00
  • f8d2fb169f Fix variable name reference (#77) Jordi Mas 2023-03-25 10:00:59 +01:00
  • a10732c74a Only download the required model files Guillaume Klein 2023-03-24 17:59:11 +01:00