tfhe-rs

mirror of https://github.com/zama-ai/tfhe-rs.git synced 2026-01-07 22:04:10 -05:00

Author	SHA1	Message	Date
Mayeul@Zama	e1620d4087	feat(shortint): add support for centered modulus switch in parameters	2025-07-01 14:18:10 +02:00
pgardratzama	702989f796	fix(hpu): it seems transfer_safe is not totally safe with HPU	2025-06-20 10:04:16 +02:00
Nicolas Sarlin	343cad641c	chore: TFHE-rs 1.3.0	2025-06-18 10:20:49 +02:00
David Testé	39d77299ed	chore(bench): harmonize dex benchmark function names	2025-06-18 09:47:57 +02:00
Andrei Stoian	7986e0bf1d	chore(gpu): skip packing ks test if it needs more ram than available	2025-06-12 17:47:10 +02:00
David Testé	11c0340eca	chore(bench): plug server-side proof in zk benchmarks	2025-06-10 18:00:39 +02:00
Baptiste Roux	443e02215f	feat(hpu): Add recent IOp in integer benchmarks	2025-06-10 17:43:35 +02:00
Baptiste Roux	96c8c44c71	feat(hpu): Enable some erc20 impl With the support of overflowing ops, those impl are now available to Hpu	2025-06-10 17:43:35 +02:00
Guillermo Oyarzun	0d81623a23	feat(gpu): add squash noise in the hlapi	2025-06-10 13:14:29 +02:00
Agnes Leroy	3bfacc1e9d	chore(bench): add swap throughput benchmark	2025-05-27 12:08:31 +02:00
Agnes Leroy	a47a418d41	chore(gpu): rework dex bench to prepare throughput benchmark	2025-05-27 12:08:31 +02:00
Nicolas Sarlin	f51c70d536	feat(shortint): adds generic client key for atomic pattern support	2025-05-26 16:53:35 +02:00
Pedro Alves	408e81c45a	feat(gpu): add support for GPU-accelerated expand on the HL Api - includes documentation about GPU's accelerated expand on the HL API - rework CudaKeySwitchingKey - Cloning the key is no longer necessary on the HL API	2025-05-23 11:54:29 +02:00
Nicolas Sarlin	25d008bae8	fix(bench): add missing internal keycache feature	2025-05-22 16:14:30 +02:00
Pedro Alves	259d125434	fix(gpu): fix pbs and ks benchmarks	2025-05-20 17:37:48 +02:00
David Testé	e29d615b9d	chore(bench): add suitable heuristic for zk throughput Heuristic based on PBS count was flawed since a ZK verification operation will eat up to 32 threads on the machine. The previous heuristic could generate an input data vector way bigger than the total of threads divided by 32. This in turn lead to long execution time for benchmark and generate bad results.	2025-05-20 15:02:59 +02:00
Nicolas Sarlin	a01949e630	fix(bench): compilation error without the internal-keycache feature	2025-05-19 09:50:29 +02:00
Baptiste Roux	9ee8259002	feat(hpu): Add Hpu backend implementation This backend abstract communication with Hpu Fpga hardware. It define it's proper entities to prevent circular dependencies with tfhe-rs. Object lifetime is handle through Arc<Mutex<T>> wrapper, and enforce that all objects currently alive in Hpu Hw are also kept valid on the host side. It contains the second version of HPU instruction set (HIS_V2.0): * DOp have following properties: + Template as first class citizen + Support of Immediate template + Direct parser and conversion between Asm/Hex + Replace deku (and it's associated endianess limitation) by + bitfield_struct and manual parsing * IOp have following properties: + Support various number of Destination + Support various number of Sources + Support various number of Immediat values + Support of multiple bitwidth (Not implemented yet in the Fpga firmware) Details could be view in `backends/tfhe-hpu-backend/Readme.md`	2025-05-16 16:30:23 +02:00
David Testé	97b5973e4c	chore(bench): store object measurements results in tfhe-benchmark	2025-05-13 16:05:16 +02:00
Agnes Leroy	fd79c4f972	chore(bench): parallelize transfer bench	2025-05-13 10:45:48 +02:00
David Testé	a96970e8c3	chore: update clap dependency version to 4.5.30	2025-05-13 10:35:51 +02:00
Agnes Leroy	67f11a44df	chore(gpu): parallelize dex bench	2025-05-12 18:14:24 +02:00
David Testé	67ec4a28c1	chore(bench): move benchmarks to their own crate This is done to speed-up compilation duration by avoiding recompiling tfhe each time a modification is made in a benchmark file.	2025-05-09 13:46:27 +02:00

123 Commits