Purfview
2eeafe05de
Update Silero-VAD weights to v6.2 ( #1390 )
...
* Update Silero-VAD weights to v6.2
Overall slight quality improvement (no metrics update);
Higher stability on OOD / rare / strange / unique data;
Significant quality improvements on various known edge cases:
Unusual voices
Child voices
Cartoon voices
Muted voices
Muted speech
Lower quality phone calls
https://github.com/snakers4/silero-vad/releases/tag/v6.2
* Changes: tiny -> base in test_monotonic_timestamps()
2025-11-19 17:14:42 +03:00
Mahmoud Ashraf
dea24cbcc6
Upgrade to Silero-VAD V6 ( #1373 )
...
Co-authored-by: sssshhhhhh 193317444+sssshhhhhh@users.noreply.github.com
2025-10-14 15:29:56 +03:00
Mahmoud Ashraf
2dbca5e559
Use Silero VAD in Batched Mode ( #936 )
...
Replace Pyannote VAD with Silero to reduce code duplication and requirements
2024-10-24 12:05:25 +03:00
Jilt Sebastian
eb8390233c
New PR for Faster Whisper: Batching Support, Speed Boosts, and Quality Enhancements ( #856 )
...
Batching Support, Speed Boosts, and Quality Enhancements
---------
Co-authored-by: Hargun Mujral <83234565+hargunmujral@users.noreply.github.com >
Co-authored-by: MahmoudAshraf97 <hassouna97.ma@gmail.com >
2024-07-18 16:48:52 +07:00
Ki Hoon Kim
8d400e9870
Upgrade to Silero-Vad V5 ( #884 )
...
* Fix window_size_samples to 512
* Update SileroVADModel
* Replace ONNX file with V5 version
2024-07-01 15:40:37 +07:00
otakutyrant
91c8307aa6
make faster_whisper.assets as a valid python package to distribute ( #772 ) ( #774 )
2024-04-02 18:22:22 +02:00
Guillaume Klein
19698c95f8
Support VAD filter ( #95 )
...
* Support VAD filter
* Generalize function collect_samples
* Define AudioSegment class
* Only pass prompt and prefix to the first chunk
* Add dict argument vad_parameters
* Fix isort format
* Rename method
* Update README
* Add shortcut when the chunk offset is 0
* Reword readme
* Fix end property
* Concatenate the speech chunks
* Cleanup diff
* Increase default speech pad
* Update README
* Increase default speech pad
2023-04-03 17:22:48 +02:00