pgardratzama
b1657876fb
fix(hpu): fix double import when hpu feature is on
2026-01-06 18:01:14 +01:00
pgardratzama
d2a570bdd6
chore: uses if_then_zero only in HPU ERC20 whitepaper (to be updated when encrypt_trivial becomes available on HPU), adds test of if_then_zero for both CPU & HPU
2026-01-06 16:55:07 +01:00
pgardratzama
ed84387bba
chore: trying to insure GPU ERC20 bench are not impacted while CPU & HPU uses if_then_zero
2026-01-06 16:55:07 +01:00
Baptiste Roux
e645ee3397
feat: Add IfThenZero impl for Cpu
2026-01-06 16:55:07 +01:00
pgardratzama
569abd9a3b
fix(hpu): fix whitepaper erc20 for HPU using if_then_zero
2026-01-06 16:55:07 +01:00
David Testé
e20aea90df
chore(bench): set ks32 parameters as default for integer benchmarks
2026-01-06 14:59:34 +01:00
Nicolas Sarlin
70a0021cbf
chore: fix bincode dependency version
2026-01-02 18:57:32 +01:00
Nicolas Sarlin
312ce494bf
chore(zk): add 1 * 64 benches with production CRS
2025-12-17 15:06:37 +01:00
Thomas Montaigu
d394af7f4d
chore: bump dyn-stack to 0.13
...
Notable changes:
- StackReq methods no longer returns Result<StackReq, SizeOverflow>
instead, StackReq contains the invalid state.
Now, its when we create a PodBuffer that we can check/catch if the
size req is invalid by catching errors when calling
`PodBuffer::try_new`. Its also possible to manually check that
`stack_req != StackReq::OVERFLOW`
- GlobalaPodBuffer is now PodBuffer
2025-12-15 10:02:17 +01:00
Andrei Stoian
78d1ce18c1
feat(gpu): support keyswitch 64/32
2025-12-12 22:01:49 +01:00
Agnes Leroy
b7a706a3db
chore(bench): remove constraint in pcc to not use trivial name in bench
2025-12-11 14:41:07 +01:00
Agnes Leroy
8e4bec0b2a
chore(bench): modify whitepaper erc20 to match newest litepaper version
2025-12-11 14:41:07 +01:00
Enzo Di Maria
cf969ff930
refactor(gpu): creating benchmarks for match_value
2025-12-11 12:01:43 +01:00
David Testé
5eb4cc5a22
chore(bench): add fast benchmark capability for hlapi
...
Run only a small subset of the current benchmarks to speed up developers feedback
2025-12-09 11:34:53 +01:00
Agnes Leroy
100b4200c2
chore(gpu): update number of streams in erc20 throughput bench
2025-12-08 09:21:55 +01:00
David Testé
e85fd936d0
chore(bench): suffix hlapi ops bench with measured type name
2025-12-04 17:59:27 +01:00
Agnes Leroy
e6625521ad
chore(gpu): add the possibility to run classical bench for erc20 and dex
2025-12-02 15:59:40 +01:00
Andrei Stoian
e2063c8ef4
chore(gpu): bench KS latency batches
2025-11-27 17:32:44 +01:00
Nicolas Sarlin
f8a958663b
chore(tfhe): rename nightly feature flag to avx512
2025-11-26 11:28:21 +01:00
Nicolas Sarlin
01367368ed
chore(zk): do not bench zkv1 at the integer level
2025-11-25 17:20:06 +01:00
Nicolas Sarlin
33f77458e9
chore(zk): fix elements count for zk throughput benches
2025-11-25 17:20:06 +01:00
Arthur Meyre
caf5e9d879
chore: fix scalar benchmarks generating fixed values
...
- this would not give an average runtime for scalar benchmarks and for
small precisions could give super good timings (for lucky values)
- the timings for other precisions could still be favorable or unfavorable
depending on the value that was drawn
2025-11-25 14:23:55 +01:00
David Testé
6141ad2eee
chore(bench): fix bench prefix pattern for hlapi ops
...
To follow the standard used by other HLAPI benchmarks and ease parsing for data_extractor.
2025-11-24 17:56:10 +01:00
David Testé
b0393c0acb
chore(bench): run scalar ops in integer deduplicated cpu bench
2025-11-24 14:03:08 +01:00
David Testé
b3c3647530
chore(ci): add workflow to update documentation benchmark tables
...
This new workflow can trigger all the required benchmarks needed
to populate benchmarks tables in documentation.
It also can generate SVG tables and store them as artifacts.
Optionally, it can open a pull-request to update the current
tables in documentation.
2025-11-24 14:03:08 +01:00
David Testé
58378b7972
chore(bench): add dedicated targets for aes cuda benchmarks
2025-11-20 16:58:06 +01:00
David Testé
071e70c037
chore(bench): fix benchmark id pattern for aes and aes256
2025-11-19 17:23:05 +01:00
Mayeul@Zama
f9268b889f
chore(bench): revert print bench id
...
This reverts commit ef07963767 .
2025-11-17 11:23:50 +01:00
Enzo Di Maria
54c8c5e020
chore(gpu): no crash with aes benches if oom error
2025-11-14 17:02:33 +01:00
David Testé
ef07963767
chore(bench): print bench id before running the benchmark
...
Done to circumvent criterion limitation regarding automatic
truncation of long benchmark ID.
Using a println() call we ensure the complete name is displayed
before benchmark execution to ease manual parsing and debugging.
2025-11-14 13:45:04 +01:00
David Testé
d53bf79592
chore(bench): fix naming order for erc20 hpu benchmarks
2025-11-10 11:46:41 +01:00
Enzo Di Maria
4ff95e3a42
feat(gpu): AES 256
2025-11-05 13:37:08 +01:00
David Testé
0c977a3996
chore(bench): insert params name in bench id for hlapi
...
To ease parsing and filtering by third parties.
2025-11-04 10:53:25 +01:00
Arthur Meyre
00ce0deec9
chore: make typos version fixed
...
- add a script to properly install the correct version
- correct new typos
2025-11-03 14:58:23 +01:00
Nicolas Sarlin
83b82091bd
chore: use common msrv for the workspace
...
Since cargo commands create a lock using the smallest msrv in the workspace, it
can prevent getting up-to-date dependencies
2025-10-31 09:31:43 +01:00
David Testé
2a8885aa9f
chore(ci): run erc20 and dex throughput bench only on demand
...
Following the same pattern as other benchmarks.
2025-10-30 09:52:30 +01:00
Pedro Alves
867f8fb579
feat(gpu): implement re-randomization
...
- exposed to integer and HL API
- test on the HL API
- benchmarks for GPU and CPU implementation
2025-10-29 17:55:45 -03:00
Guillermo Oyarzun
0f0438c8cf
feat(gpu): add 1_1 classical pbs params for specialized version
2025-10-29 09:18:18 +01:00
David Testé
b0b49ae533
chore(bench): new parameters set to run core_crypto bench for docs
...
This creates extended parameters set to reflect what's displayed
in the documentation.
2025-10-27 17:25:41 +01:00
Pedro Alves
70773e442c
fix(gpu): fix 128-bit compression benchmark
2025-10-27 17:06:45 +01:00
Mayeul@Zama
777bbe437a
feat(shortint): add multi bit decompression
2025-10-24 09:28:17 +02:00
Arthur Meyre
23246f63f7
chore: update fast_dedup opset to match the latency benchmarks in the docs
...
- signed bench update
2025-10-23 10:42:19 +02:00
Arthur Meyre
11c79b5237
chore: update fast_dedup opset to match the latency benchmarks in the docs
2025-10-23 10:42:19 +02:00
Guillermo Oyarzun
e12638dabe
feat(gpu): extend specialized version to classical pbs
2025-10-22 09:20:40 +02:00
pgardratzama
f9c89212ea
fix(hpu): display name on shift looked wrong
2025-10-21 13:29:59 +02:00
Agnes Leroy
b4b6275ca5
chore(gpu): remove device synchronize in drop for cudavec
2025-10-21 11:33:46 +02:00
Arthur Meyre
205b767fc1
chore: fix various target issues for benchmarks following renames
...
- renames were done to uniformize and make it easier to setup perf
regression measurements, some names were not updated this PR fixes that
2025-10-20 13:45:27 +02:00
Thomas Montaigu
0dd0ead4e2
chore(bench): remove trivial encryptions
...
It makes benches not accurate
2025-10-20 12:26:44 +02:00
Agnes Leroy
c30835fc30
chore(gpu): remove async entry points for abs, add, sub, aes
2025-10-17 15:42:06 +02:00
David Testé
206553e9ee
chore(ci): check for performance regression and create report
...
After running performances regression benchmarks, a performance
changes checking is executed. It will fetch results data with an
external tool then it will look for anomaly in changes.
Finally it will produce a report as an issue comment with any
anomaly display in a Markdown array. A folded section of the
report message contains all the results from the benchmark.
Note that a fully custom benchmark triggered from an issue comment
would not generate a report. In addition HPU performance
regression benchmark is not supported yet.
2025-10-17 15:05:24 +02:00