tfhe-rs

mirror of https://github.com/zama-ai/tfhe-rs.git synced 2026-01-07 22:04:10 -05:00

Author	SHA1	Message	Date
Nicolas Sarlin	fc78450245	chore: test vector generation tool for core algorithms	2025-11-25 13:45:49 +01:00
Nicolas Sarlin	580e545b14	chore(wasm): install wasm bindgen before using wasm pack	2025-11-24 15:58:01 +01:00
David Testé	b3c3647530	chore(ci): add workflow to update documentation benchmark tables This new workflow can trigger all the required benchmarks needed to populate benchmarks tables in documentation. It also can generate SVG tables and store them as artifacts. Optionally, it can open a pull-request to update the current tables in documentation.	2025-11-24 14:03:08 +01:00
David Testé	58378b7972	chore(bench): add dedicated targets for aes cuda benchmarks	2025-11-20 16:58:06 +01:00
Nicolas Sarlin	edb435bd46	chore: update msrv to 1.91.1	2025-11-20 09:29:37 +01:00
David Testé	522a612ad4	chore(ci): update zizmor and use zizmor-action in workflow	2025-11-14 15:24:10 +01:00
David Testé	c3017341bd	chore(ci): refactor cpu benchmarks workflows Following the same pattern as GPU benchmarks, CPU benchmarks rely on a common workflow. Weekly benchmarks are all gathered in one place. Also, all the manual launches via workflow_dispatch event are now done in one place. That way, one doesn't have to browse the workflow tree to find the right CPU benchmark to trigger. Signed-off-by: David Testé <david.teste@zama.ai>	2025-11-03 16:14:02 +01:00
Arthur Meyre	00ce0deec9	chore: make typos version fixed - add a script to properly install the correct version - correct new typos	2025-11-03 14:58:23 +01:00
Nicolas Sarlin	83b82091bd	chore: use common msrv for the workspace Since cargo commands create a lock using the smallest msrv in the workspace, it can prevent getting up-to-date dependencies	2025-10-31 09:31:43 +01:00
David Testé	2a8885aa9f	chore(ci): run erc20 and dex throughput bench only on demand Following the same pattern as other benchmarks.	2025-10-30 09:52:30 +01:00
Agnes Leroy	231d0c5e50	chore(gpu): disable lto in gpu bench compilation	2025-10-28 09:37:14 +01:00
Pedro Alves	70773e442c	fix(gpu): fix 128-bit compression benchmark	2025-10-27 17:06:45 +01:00
Thomas Montaigu	20b7b06ffb	chore: add check_fmt_js to pcc_batch	2025-10-20 14:37:36 +02:00
Arthur Meyre	205b767fc1	chore: fix various target issues for benchmarks following renames - renames were done to uniformize and make it easier to setup perf regression measurements, some names were not updated this PR fixes that	2025-10-20 13:45:27 +02:00
Thomas Montaigu	eed5a6c5ba	chore(bench): add grep check for trivial in benches	2025-10-20 12:26:44 +02:00
David Testé	70b0c0ff19	chore(ci): echo post-commit checks sub-recipe names This is done to improve readability in case of recipe failure.	2025-10-17 15:30:19 +02:00
David Testé	206553e9ee	chore(ci): check for performance regression and create report After running performances regression benchmarks, a performance changes checking is executed. It will fetch results data with an external tool then it will look for anomaly in changes. Finally it will produce a report as an issue comment with any anomaly display in a Markdown array. A folded section of the report message contains all the results from the benchmark. Note that a fully custom benchmark triggered from an issue comment would not generate a report. In addition HPU performance regression benchmark is not supported yet.	2025-10-17 15:05:24 +02:00
Nicolas Sarlin	2cdc804670	chore(backward): backward compat data targeted generation	2025-10-17 12:43:13 +02:00
Thomas Montaigu	41a41278e6	chore(docs): fix docs for docs.rs doc_auto_cfg is no longer available in nightly >= 1.92 This prevents the docs to be build on docs.rs, as docs.rs uses the latest nightly This commit also make the `make doc` target use the lastest nightly so that we can catch these errors	2025-10-10 13:07:30 +02:00
pgardratzama	b61dd21ef7	fix(hpu): HPU HLAPI ERC20 bench was missing pbs-stats feature	2025-10-07 10:14:43 +02:00
Andrei Stoian	0604d237eb	chore(gpu): multi-gpu debug target	2025-10-03 16:48:42 +02:00
Thomas Montaigu	e523fd2cb6	feat: add KVStore to the high level api * Added Value type name to crate::integer::KVStore impl of Named trait as well as a bool to check we deserialize the correct value type (Radix vs SignedRadix) * Add KVStore to high_level_api * Add KVStore hlapi benches * Remove specialized `[add,mul,sub]_to_slot` as `map` is now the intended API. - mul_to_slot was way slower than using `map` - add/mul_to_slot were a bit faster (~5% latency-wise), but returned less information (no old_value, no new_value, no boolean to check) if the key matched - Some known improvement can be made to map, which should result in it being better than add/sub_to_slot * Add FheIntegerType trait to make the KVStore generic over FheUint/FheInt, and should make GPU integration "easy"	2025-10-03 15:01:23 +02:00
pgardratzama	f3cddb5635	chore(hpu): force benches to run on specific board	2025-10-02 13:20:36 +02:00
pgardratzama	39b81a8ded	feat(hpu): move to new bitstream at 400Mhz with GRAM_NB 3 - update SIMD_N and min_batch_size to 12 which seems to give better latency and ERC20 throughput - support IOp on several lines in ami /proc file - reduce amount of ERC_20_SIMD per batch in HLAPI bench	2025-10-02 13:20:36 +02:00
David Testé	6494a82fb3	chore(ci): split cargo builds into several jobs Post-commit checks was the bottleneck regarding running duration. It's now split into 7 batches to improve parallelism. Builds that are specific to Ubuntu are run in their own jobs, so that only build_tfhe_full recipe call remains in the os matrix. A final check is performed to ensure all the checks have passed, this very job is used as branch protection rule.	2025-09-29 11:22:56 +02:00
Andrei Stoian	87c0d646a4	fix(gpu): coprocessor bench	2025-09-18 13:56:55 +02:00
Andrei Stoian	1dcc3c8c89	chore(gpu): structure to encapsulate streams	2025-09-18 09:43:17 +02:00
Nicolas Sarlin	1a2643d1da	fix(ci): use precise wasm-bindgen version for the cli	2025-09-17 13:17:57 +02:00
David Testé	f8684d1f67	chore(ci): add regression benchmark workflow Regression benchmarks are meant to be run in pull-request. They can be launched in two flavors: * issue comment: using command like "/bench --backend cpu" * adding a label: `bench-perfs-cpu` or `bench-perfs-gpu` Benchmark definitions are written in TOML and located at ci/regression.toml. While not exhaustive, it can be easily modified by reading the embbeded documentation. "/bench" commands are parsed by a Python script located at ci/perf_regression.py. This script produces output files that contains cargo commands and a shell script generating custom environment variables. The Python script and generated files are meant to be used only by the workflow benchmark_perf_regression.yml.	2025-09-16 13:33:49 +02:00
Nicolas Sarlin	b4066df77f	chore(ci): run cargo audit	2025-09-16 12:03:32 +02:00
Agnes Leroy	daee3f1850	chore(gpu): fix out of memory error in 4090 doc tests	2025-09-10 10:46:04 +02:00
pgardratzama	bd7df4a03b	chore(hpu): enable hpu hlapi workflow and throughput bench in integer workflow	2025-09-05 10:42:36 +02:00
pgardratzama	c6aa1adbe7	chore(hpu): update benches to run new operations	2025-09-05 10:42:36 +02:00
Agnes Leroy	f62e5b3e3b	chore(gpu): fix oom in 4090 tests	2025-08-28 16:12:52 +02:00
Andrei Stoian	c06b513182	chore(gpu): add valgrind and fix leaks	2025-08-28 14:21:57 +02:00
Nicolas Sarlin	44ac59099b	chore(csprng): run clippy without the software-prng feature	2025-08-26 19:32:40 +02:00
David Testé	b3f1a85e1d	chore(bench): write parameters to disk for hlapi operations	2025-08-13 18:34:26 +02:00
Arthur Meyre	1169096058	chore: fix whitespace in Makefile	2025-08-13 09:16:49 +02:00
Arthur Meyre	a63207af9e	chore(ci): add MSRV build to check we are compliant with what we announce - have to downgrade param_dedup edition as 1.84 cannot handle 2024 in a workspace	2025-08-08 18:06:29 +02:00
Arthur Meyre	04d4ccc16c	chore(ci): remove TFHE_SPEC from Makefile - this is a leftover from a complicated attempt at backward compatibility no need to keep this	2025-08-08 18:06:29 +02:00
Andrei Stoian	7bf2ec6ff2	chore(gpu): fix warnings detection	2025-07-31 18:47:08 +02:00
Arthur Meyre	82a5cc7f2d	chore(ci): increase timeout for noise checks	2025-07-31 12:00:15 +02:00
Andrei Stoian	36eceaf05e	feat(gpu): utility debug workflows in ci	2025-07-30 12:55:40 +01:00
Arthur Meyre	e8986cbd7c	chore: setup CI for noise checks	2025-07-29 15:29:24 +02:00
Andrei Stoian	494e0e0601	chore(gpu): add short op sequence test for GPU on PRs	2025-07-15 16:03:45 +02:00
Agnes Leroy	068cbc0f41	chore(gpu): add hl api noise squash latency and throughput bench	2025-07-11 14:04:32 +01:00
Arthur Meyre	bd739c2d48	chore(docs): uniformize paths in docs to use "-" instead of "_" - this is to avoid conflicts with gitbook	2025-07-09 14:36:04 +02:00
Arthur Meyre	17d3a492b6	chore: only run backward compat clippy on x86 machines - older versions of the crates are only compilable with x86, disable on arm for now - revisit when the crates are split ?	2025-07-09 08:29:12 +02:00
Nicolas Sarlin	bb1ff363d3	chore(ci): use Cargo.lock for installed tools	2025-07-07 13:10:55 +02:00
Nicolas Sarlin	7bcd6b94da	chore: use script to pull hpu files	2025-07-07 13:10:55 +02:00

1 2 3 4 5 ...

310 Commits