tfhe-rs

mirror of https://github.com/zama-ai/tfhe-rs.git synced 2026-01-10 07:08:03 -05:00

Author	SHA1	Message	Date
David Testé	c3017341bd	chore(ci): refactor cpu benchmarks workflows Following the same pattern as GPU benchmarks, CPU benchmarks rely on a common workflow. Weekly benchmarks are all gathered in one place. Also, all the manual launches via workflow_dispatch event are now done in one place. That way, one doesn't have to browse the workflow tree to find the right CPU benchmark to trigger. Signed-off-by: David Testé <david.teste@zama.ai>	2025-11-03 16:14:02 +01:00
Arthur Meyre	00ce0deec9	chore: make typos version fixed - add a script to properly install the correct version - correct new typos	2025-11-03 14:58:23 +01:00
Nicolas Sarlin	83b82091bd	chore: use common msrv for the workspace Since cargo commands create a lock using the smallest msrv in the workspace, it can prevent getting up-to-date dependencies	2025-10-31 09:31:43 +01:00
David Testé	2a8885aa9f	chore(ci): run erc20 and dex throughput bench only on demand Following the same pattern as other benchmarks.	2025-10-30 09:52:30 +01:00
Agnes Leroy	231d0c5e50	chore(gpu): disable lto in gpu bench compilation	2025-10-28 09:37:14 +01:00
Pedro Alves	70773e442c	fix(gpu): fix 128-bit compression benchmark	2025-10-27 17:06:45 +01:00
Thomas Montaigu	20b7b06ffb	chore: add check_fmt_js to pcc_batch	2025-10-20 14:37:36 +02:00
Arthur Meyre	205b767fc1	chore: fix various target issues for benchmarks following renames - renames were done to uniformize and make it easier to setup perf regression measurements, some names were not updated this PR fixes that	2025-10-20 13:45:27 +02:00
Thomas Montaigu	eed5a6c5ba	chore(bench): add grep check for trivial in benches	2025-10-20 12:26:44 +02:00
David Testé	70b0c0ff19	chore(ci): echo post-commit checks sub-recipe names This is done to improve readability in case of recipe failure.	2025-10-17 15:30:19 +02:00
David Testé	206553e9ee	chore(ci): check for performance regression and create report After running performances regression benchmarks, a performance changes checking is executed. It will fetch results data with an external tool then it will look for anomaly in changes. Finally it will produce a report as an issue comment with any anomaly display in a Markdown array. A folded section of the report message contains all the results from the benchmark. Note that a fully custom benchmark triggered from an issue comment would not generate a report. In addition HPU performance regression benchmark is not supported yet.	2025-10-17 15:05:24 +02:00
Nicolas Sarlin	2cdc804670	chore(backward): backward compat data targeted generation	2025-10-17 12:43:13 +02:00
Thomas Montaigu	41a41278e6	chore(docs): fix docs for docs.rs doc_auto_cfg is no longer available in nightly >= 1.92 This prevents the docs to be build on docs.rs, as docs.rs uses the latest nightly This commit also make the `make doc` target use the lastest nightly so that we can catch these errors	2025-10-10 13:07:30 +02:00
pgardratzama	b61dd21ef7	fix(hpu): HPU HLAPI ERC20 bench was missing pbs-stats feature	2025-10-07 10:14:43 +02:00
Andrei Stoian	0604d237eb	chore(gpu): multi-gpu debug target	2025-10-03 16:48:42 +02:00
Thomas Montaigu	e523fd2cb6	feat: add KVStore to the high level api * Added Value type name to crate::integer::KVStore impl of Named trait as well as a bool to check we deserialize the correct value type (Radix vs SignedRadix) * Add KVStore to high_level_api * Add KVStore hlapi benches * Remove specialized `[add,mul,sub]_to_slot` as `map` is now the intended API. - mul_to_slot was way slower than using `map` - add/mul_to_slot were a bit faster (~5% latency-wise), but returned less information (no old_value, no new_value, no boolean to check) if the key matched - Some known improvement can be made to map, which should result in it being better than add/sub_to_slot * Add FheIntegerType trait to make the KVStore generic over FheUint/FheInt, and should make GPU integration "easy"	2025-10-03 15:01:23 +02:00
pgardratzama	f3cddb5635	chore(hpu): force benches to run on specific board	2025-10-02 13:20:36 +02:00
pgardratzama	39b81a8ded	feat(hpu): move to new bitstream at 400Mhz with GRAM_NB 3 - update SIMD_N and min_batch_size to 12 which seems to give better latency and ERC20 throughput - support IOp on several lines in ami /proc file - reduce amount of ERC_20_SIMD per batch in HLAPI bench	2025-10-02 13:20:36 +02:00
David Testé	6494a82fb3	chore(ci): split cargo builds into several jobs Post-commit checks was the bottleneck regarding running duration. It's now split into 7 batches to improve parallelism. Builds that are specific to Ubuntu are run in their own jobs, so that only build_tfhe_full recipe call remains in the os matrix. A final check is performed to ensure all the checks have passed, this very job is used as branch protection rule.	2025-09-29 11:22:56 +02:00
Andrei Stoian	87c0d646a4	fix(gpu): coprocessor bench	2025-09-18 13:56:55 +02:00
Andrei Stoian	1dcc3c8c89	chore(gpu): structure to encapsulate streams	2025-09-18 09:43:17 +02:00
Nicolas Sarlin	1a2643d1da	fix(ci): use precise wasm-bindgen version for the cli	2025-09-17 13:17:57 +02:00
David Testé	f8684d1f67	chore(ci): add regression benchmark workflow Regression benchmarks are meant to be run in pull-request. They can be launched in two flavors: * issue comment: using command like "/bench --backend cpu" * adding a label: `bench-perfs-cpu` or `bench-perfs-gpu` Benchmark definitions are written in TOML and located at ci/regression.toml. While not exhaustive, it can be easily modified by reading the embbeded documentation. "/bench" commands are parsed by a Python script located at ci/perf_regression.py. This script produces output files that contains cargo commands and a shell script generating custom environment variables. The Python script and generated files are meant to be used only by the workflow benchmark_perf_regression.yml.	2025-09-16 13:33:49 +02:00
Nicolas Sarlin	b4066df77f	chore(ci): run cargo audit	2025-09-16 12:03:32 +02:00
Agnes Leroy	daee3f1850	chore(gpu): fix out of memory error in 4090 doc tests	2025-09-10 10:46:04 +02:00
pgardratzama	bd7df4a03b	chore(hpu): enable hpu hlapi workflow and throughput bench in integer workflow	2025-09-05 10:42:36 +02:00
pgardratzama	c6aa1adbe7	chore(hpu): update benches to run new operations	2025-09-05 10:42:36 +02:00
Agnes Leroy	f62e5b3e3b	chore(gpu): fix oom in 4090 tests	2025-08-28 16:12:52 +02:00
Andrei Stoian	c06b513182	chore(gpu): add valgrind and fix leaks	2025-08-28 14:21:57 +02:00
Nicolas Sarlin	44ac59099b	chore(csprng): run clippy without the software-prng feature	2025-08-26 19:32:40 +02:00
David Testé	b3f1a85e1d	chore(bench): write parameters to disk for hlapi operations	2025-08-13 18:34:26 +02:00
Arthur Meyre	1169096058	chore: fix whitespace in Makefile	2025-08-13 09:16:49 +02:00
Arthur Meyre	a63207af9e	chore(ci): add MSRV build to check we are compliant with what we announce - have to downgrade param_dedup edition as 1.84 cannot handle 2024 in a workspace	2025-08-08 18:06:29 +02:00
Arthur Meyre	04d4ccc16c	chore(ci): remove TFHE_SPEC from Makefile - this is a leftover from a complicated attempt at backward compatibility no need to keep this	2025-08-08 18:06:29 +02:00
Andrei Stoian	7bf2ec6ff2	chore(gpu): fix warnings detection	2025-07-31 18:47:08 +02:00
Arthur Meyre	82a5cc7f2d	chore(ci): increase timeout for noise checks	2025-07-31 12:00:15 +02:00
Andrei Stoian	36eceaf05e	feat(gpu): utility debug workflows in ci	2025-07-30 12:55:40 +01:00
Arthur Meyre	e8986cbd7c	chore: setup CI for noise checks	2025-07-29 15:29:24 +02:00
Andrei Stoian	494e0e0601	chore(gpu): add short op sequence test for GPU on PRs	2025-07-15 16:03:45 +02:00
Agnes Leroy	068cbc0f41	chore(gpu): add hl api noise squash latency and throughput bench	2025-07-11 14:04:32 +01:00
Arthur Meyre	bd739c2d48	chore(docs): uniformize paths in docs to use "-" instead of "_" - this is to avoid conflicts with gitbook	2025-07-09 14:36:04 +02:00
Arthur Meyre	17d3a492b6	chore: only run backward compat clippy on x86 machines - older versions of the crates are only compilable with x86, disable on arm for now - revisit when the crates are split ?	2025-07-09 08:29:12 +02:00
Nicolas Sarlin	bb1ff363d3	chore(ci): use Cargo.lock for installed tools	2025-07-07 13:10:55 +02:00
Nicolas Sarlin	7bcd6b94da	chore: use script to pull hpu files	2025-07-07 13:10:55 +02:00
Nicolas Sarlin	57cbab9fe1	chore(backward): integrate backward compat data Code is taken from `59a6179831` Adapted to make ci work	2025-07-07 13:10:55 +02:00
Baptiste Roux	eb0b9643bb	fix(hpu): Fix clippy_hpu_mockup makefile entry	2025-07-03 10:28:52 +02:00
Baptiste Roux	6432b98591	chore(mockup): Add clippy target for tfhe_hpu_mockup Also fix all clippy lint	2025-07-02 14:41:41 +02:00
Nicolas Sarlin	950915a108	chore(ci): use the correct data branch in clippy_ws_tests	2025-07-01 14:18:10 +02:00
Andrei Stoian	5e6562878a	chore(gpu): add cuda debug target for integer tests	2025-07-01 10:37:17 +02:00
Nicolas Sarlin	940a9ba860	chore(zk): enable tfhe-lints on zk pok	2025-06-27 14:34:25 +02:00

1 2 3 4 5 ...

304 Commits