icicle

mirror of https://github.com/pseXperiments/icicle.git synced 2026-01-08 20:48:06 -05:00

Author	SHA1	Message	Date
Jeremy Felder	999167afe1	[PATCH]: Update module with v2 versioning (#495 ) ## Describe the changes This PR fixes the issue of v2 ICICLE not being discovered by Go's packaging service by adding the required "v2" to the module path: https://go.dev/doc/modules/release-workflow#breaking	2024-04-24 10:09:45 +03:00
release-bot	ff374fcac7	Bump rust crates' version icicle-babybear@2.0.0 icicle-bls12-377@2.0.0 icicle-bls12-381@2.0.0 icicle-bn254@2.0.0 icicle-bw6-761@2.0.0 icicle-core@2.0.0 icicle-cuda-runtime@2.0.0 icicle-grumpkin@2.0.0 icicle-hash@2.0.0 Generated by cargo-workspaces v2.0.0	2024-04-23 02:30:18 +00:00
ChickenLover	7265d18d48	ICICLE V2 Release (#492 ) This PR introduces major updates for ICICLE Core, Rust and Golang bindings --------- Co-authored-by: Yuval Shekel <yshekel@gmail.com> Co-authored-by: DmytroTym <dmytrotym1@gmail.com> Co-authored-by: Otsar <122266060+Otsar-Raikou@users.noreply.github.com> Co-authored-by: VitaliiH <vhnatyk@gmail.com> Co-authored-by: release-bot <release-bot@ingonyama.com> Co-authored-by: Stas <spolonsky@icloud.com> Co-authored-by: Jeremy Felder <jeremy.felder1@gmail.com> Co-authored-by: ImmanuelSegol <3ditds@gmail.com> Co-authored-by: JimmyHongjichuan <45908291+JimmyHongjichuan@users.noreply.github.com> Co-authored-by: pierre <pierreuu@gmail.com> Co-authored-by: Leon Hibnik <107353745+LeonHibnik@users.noreply.github.com> Co-authored-by: nonam3e <timur@ingonyama.com> Co-authored-by: Vlad <88586482+vladfdp@users.noreply.github.com> Co-authored-by: LeonHibnik <leon@ingonyama.com> Co-authored-by: nonam3e <71525212+nonam3e@users.noreply.github.com> Co-authored-by: vladfdp <vlad.heintz@gmail.com>	2024-04-23 05:26:40 +03:00
release-bot	a1dc0539ce	Bump rust crates' version icicle-bls12-377@1.10.1 icicle-bls12-381@1.10.1 icicle-bn254@1.10.1 icicle-bw6-761@1.10.1 icicle-core@1.10.1 icicle-cuda-runtime@1.10.1 icicle-grumpkin@1.10.1 Generated by cargo-workspaces v1.10.1	2024-04-11 07:56:32 +00:00
Jeremy Felder	cda806ff0c	[PATCH]: Fix underflow and div by 0 (#471 ) ## Describe the changes This PR fixes an underflow in `get_optimal_c` when bitsize is less than 16 and adds `max(1,NUM_THREADS)` to prevent div by 0 when calculating NUM_THREADS	2024-04-11 10:45:34 +03:00
release-bot	8498a962f9	Bump rust crates' version icicle-bls12-377@1.10.0 icicle-bls12-381@1.10.0 icicle-bn254@1.10.0 icicle-bw6-761@1.10.0 icicle-core@1.10.0 icicle-cuda-runtime@1.10.0 icicle-grumpkin@1.10.0 Generated by cargo-workspaces v1.10.0	2024-04-09 10:02:34 +00:00
Leon Hibnik	a7b0dc40c1	[FEAT] ReleaseDomain API (#465 ) ## Describe the changes This PR adds a NTT ReleaseDomain API in Golang and Rust ## Linked Issues Resolves # --------- Co-authored-by: Yuval Shekel <yshekel@gmail.com>	2024-04-09 12:58:19 +03:00
Vlad	4a35eece51	transpose kernel in vec_ops and rust binding (#462 ) ## Describe the changes This PR adds an extern C link to the transpose kernel, now in vec_ops.cu. Also Rust binding, and I updated the test check_ntt_batch to use the new transpose function. The test passes. ## Linked Issues Resolves # --------- Co-authored-by: LeonHibnik <leon@ingonyama.com>	2024-04-09 08:47:33 +03:00
VitaliiH	4c9b3c00a5	Devmode to Reduce compilation time (including G2 and ECNTT) (#395 ) devmode to reduce compilation time	2024-04-09 06:09:04 +02:00
Jeremy Felder	c6719167ce	[FEAT]: golang device slice ranges (#463 ) ## Describe the changes This PR adds the capability to slice a DeviceSlice, allowing portions of data that are already on the device to be reused. Additionally, this PR removes the need for a HostSlice underlying type to implement a Size function and uses unsafe.Sizeof instead. This together with #407 will allow direct usage of gnark-crypto types with HostSlice without the need for converting to ICICLE types --------- Co-authored-by: nonam3e <timur@ingonyama.com>	2024-04-08 19:42:03 +03:00
Leon Hibnik	cd3769d6b7	Fix Golang TestNttDeviceAsync (#461 ) ## Describe the changes This PR fixes TestNttDeviceAsync by adding a missing call to initDomain ## Linked Issues Resolves #	2024-04-08 17:47:10 +03:00
DmytroTym	b93b1d0aaf	NTT inplace in Rust (#453 ) ## Describe the changes Due to Rust's ownership rules, we can't run NTT inplace using the [`ntt`](https://github.com/ingonyama-zk/icicle/blob/v1.9.1/wrappers/rust/icicle-core/src/ntt/mod.rs#L139) function. Which is why we saw a need to add a separate function a couple of times. Incidentally an issue with radix-2 NTT was found when ran inplace, `__syncthreads()` was used in reverse order kernel as if it was a global barrier for all blocks and not block-local one. Thus data race happened that is fixed by this PR.	2024-04-08 10:04:04 +03:00
Leon Hibnik	6a96eef8dc	add golang multigpu to sidebar (#449 ) This PR adds multi GPU golang documentation to dev sidebar	2024-04-08 09:20:29 +03:00
JimmyHongjichuan	95ab6de059	fix: use the log2 in lib std explicitly to prevent makefile from link… (#459 ) …ing other log2 func ## Describe the changes This PR adds "std" as prefix on log2 function of icicle/appUtils/msm/msm.cu to explicitly use std::log2 for MSM calculatation. ## Linked Issues https://github.com/ingonyama-zk/icicle/issues/458 Resolves # Co-authored-by: pierre <pierreuu@gmail.com>	2024-04-07 07:58:53 +03:00
Yuval Shekel	9c9311bee0	golang multi-device MSM test temporarily disabled due to issues related to golang tests env	2024-04-04 23:23:18 +03:00
Yuval Shekel	406020bda6	fix: NTT release domain linkage	2024-04-04 23:23:18 +03:00
release-bot	25ac705c3b	Bump rust crates' version icicle-bls12-377@1.9.1 icicle-bls12-381@1.9.1 icicle-bn254@1.9.1 icicle-bw6-761@1.9.1 icicle-core@1.9.1 icicle-cuda-runtime@1.9.1 icicle-grumpkin@1.9.1 Generated by cargo-workspaces v1.9.1	2024-03-27 19:00:07 +00:00
VitaliiH	ef757e8210	hotfix for large ecntt (#448 ) hotfix for large ECNTTs	2024-03-27 18:31:50 +01:00
Otsar	2c1431d904	Update Hall of fame in 'README.md' (#445 ) Added v1.8's contributors to hall of fame	2024-03-27 16:57:41 +02:00
ImmanuelSegol	77ebc4848e	Docs 1.8 (#436 )	2024-03-25 08:54:17 -04:00
Yuval Shekel	919ff42f49	fix: NTT input is const	2024-03-24 16:26:10 +02:00
release-bot	a1ff989740	Bump rust crates' version icicle-bls12-377@1.9.0 icicle-bls12-381@1.9.0 icicle-bn254@1.9.0 icicle-bw6-761@1.9.0 icicle-core@1.9.0 icicle-cuda-runtime@1.9.0 icicle-grumpkin@1.9.0 Generated by cargo-workspaces v1.9.0	2024-03-21 07:11:47 +00:00
Otsar	1f2144a57c	Removed "machines using ICICLE" static badge (#442 )	2024-03-21 09:04:19 +02:00
Jeremy Felder	db4c07dcaf	Golang bindings for ECNTT (#433 )	2024-03-21 09:04:00 +02:00
ChickenLover	d4f39efea3	Add Keccak hash function (#435 ) This PR adds support for Keccak-256 and Keccak-512. It only adds them in c++. There is no way of adding rust or golang wrappers rn as it requires having an `icicle-common` create / mod	2024-03-20 22:30:19 +02:00
Yuval Shekel	7293058246	fix: (golang) MSM multi device test reset to original device after test is done	2024-03-20 16:27:11 +02:00
Yuval Shekel	03136f1074	fix: (golang) add missing NttAlgorithm field in NTTConfig	2024-03-20 16:27:11 +02:00
Yuval Shekel	3ef0d0c66e	MSM scalars and points params are const - This is required to be able to compute MSM on polynomial coefficients that are accessible by const only.	2024-03-20 16:27:11 +02:00
Stas	0dff1f9302	Use multi-threaded CUDA compilation to spped up compilation (#439 ) ## Describe the changes Speed up CUDA c++ compile time using multi-threaded compilation (--split-compile flag). The tests on 8 core machine show ~2x acceleration. ## Linked Issues Compiling c++ takes long time	2024-03-18 16:40:30 -04:00
ChickenLover	0d806d96ca	tidy (#437 )	2024-03-19 00:59:10 +07:00
release-bot	b6b5011a47	Bump rust crates' version icicle-bls12-377@1.8.0 icicle-bls12-381@1.8.0 icicle-bn254@1.8.0 icicle-bw6-761@1.8.0 icicle-core@1.8.0 icicle-cuda-runtime@1.8.0 icicle-grumpkin@1.8.0 Generated by cargo-workspaces v1.8.0	2024-03-13 21:38:17 +00:00
DmytroTym	7ac463c3d9	MSM pre-computation (#427 ) ## Brief description This PR adds pre-computation to the MSM, for some theory see [this](https://youtu.be/KAWlySN7Hm8?si=XeR-htjbnK_ySbUo&t=1734) timecode of Niall Emmart's talk. In terms of public APIs, one method is added. It does the pre-computation on-device leaving resulting data on-device as well. No extra structures are added, only `precompute_factor` from `MSMConfig` is now activated. ## Performance While performance gains are for now often limited by our inflexibility in choice of `c` (for example, very large MSMs get basically no speedup from pre-compute because currently `c` cannot be larger than 16), there's still a number of MSM sizes which get noticeable improvement: \| Pre-computation factor \| bn254 size `2^20` MSM, ms. \| bn254 size `2^12` MSM, size `2^10` batch, ms. \| bls12-381 size `2^20` MSM, ms. \| bls12-381 size `2^12` MSM, size `2^10` batch, ms. \| \| ------------- \| ------------- \| ------------- \| ------------- \| ------------- \| \| 1 \| 14.1 \| 82.8 \| 25.5 \| 136.7 \| \| 2 \| 11.8 \| 76.6 \| 20.3 \| 123.8 \| \| 4 \| 10.9 \| 73.8 \| 18.1 \| 117.8 \| \| 8 \| 10.6 \| 73.7 \| 17.2 \| 116.0 \| Here for example pre-computation factor = 4 means that alongside each original base point, we pre-compute and pass into the MSM 3 of its "shifted" versions. Pre-computation factor = 1 means no pre-computation. GPU used for benchmarks is a 3090Ti. ## TODOs and open questions - Golang APIs are missing; - I mentioned that to utilise pre-compute to its full potential we need arbitrary choice of `c`. One issue with this is that pre-compute will become dependent on `c`. For now this is not the case as `c` can only be a power of 2 and powers of 2 can always share the same pre-computation. So apparently we need to make `c` a parameter of the precompute function to future-proof it from a breaking change. This is pretty unnatural and counterintuitive as `c` is typically chosen in runtime after pre-compute is done but I don't really see another way, pls let me know if you do. UPD: `c` is added into pre-compute function, for now it's unused and it's documented how it will change in the future. Resolves https://github.com/ingonyama-zk/icicle/issues/147 Co-authored with @ChickenLover --------- Co-authored-by: ChickenLover <romangg81@gmail.com> Co-authored-by: nonam3e <timur@ingonyama.com> Co-authored-by: nonam3e <71525212+nonam3e@users.noreply.github.com> Co-authored-by: LeonHibnik <leon@ingonyama.com>	2024-03-13 23:25:16 +02:00
HadarIngonyama	287f53ff16	NTT columns batch (#424 ) This PR adds the columns batch feature - enabling batch NTT computation to be performed directly on the columns of a matrix without having to transpose it beforehand, as requested in issue #264. Also some small fixes to the reordering kernels were added and some unnecessary parameters were removes from functions interfaces. --------- Co-authored-by: DmytroTym <dmytrotym1@gmail.com>	2024-03-13 18:46:47 +02:00
Jeremy Felder	89082fb561	FEAT: MultiGPU for golang bindings (#417 ) ## Describe the changes This PR adds multi gpu support in the golang bindings. Tha main changes are to DeviceSlice which now includes a `deviceId` attribute specifying which device the underlying data resides on and checks for correct deviceId and current device when using DeviceSlices in any operation. In Go, most concurrency can be done via Goroutines (described as lightweight threads - in reality, more of a threadpool manager), however, there is no guarantee that a goroutine stays on a specific host thread. Therefore, a function `RunOnDevice` was added to the cuda_runtime package which locks a goroutine into a specific host thread, sets a current GPU device, runs a provided function, and unlocks the goroutine from the host thread after the provided function finishes. While the goroutine is locked to the hsot thread, the Go runtime will not assign other goroutines to that host thread	2024-03-13 16:19:45 +02:00
hhh_QC	08ec0b1ff6	update go install source in Dockerfile (#428 )	2024-03-10 10:47:08 +02:00
Jeremy Felder	fa219d9c95	Fix release flow with deploy key and caching (#425 ) ## Describe the changes This PR fixes the release flow action	2024-03-10 08:57:35 +02:00
DmytroTym	0e84fb4b76	feat: add warmup for CudaStream (#422 ) ## Describe the changes Add a non-blocking `warmup` function to `CudaStream` > when you run the benchmark (e.g. the msm example you have) the first instance is always slow, with a constant overhead of 200~300ms cuda stream warmup. and I want to get rid of that in my application by warming it up in parallel while my host do something else.	2024-03-07 19:11:34 +02:00
Alex Xiong	d8059a2a4e	Merge pull request #1 from ingonyama-zk/feat/warmup Warmup function added	2024-03-07 18:18:18 +08:00
DmytroTym	4a65758408	Merge branch 'main' into feat/warmup	2024-03-06 22:08:45 +02:00
Jeremy Felder	1abd2ef9c9	Bump rust crates' version icicle-bls12-377@1.7.0 icicle-bls12-381@1.7.0 icicle-bn254@1.7.0 icicle-bw6-761@1.7.0 icicle-core@1.7.0 icicle-cuda-runtime@1.7.0 icicle-grumpkin@1.7.0 Generated by cargo-workspaces v1.7.0	2024-03-06 22:05:10 +02:00
Jeremy Felder	9d402df0cf	Release flow CI (#423 ) ## Describe the changes This PR: - Moves common crate attributes to the workspace Cargo.toml. - Adds a manual release flow for bumping, tagging, and draft release	2024-03-06 21:41:48 +02:00
DmytroTym	7185657ff7	Warmup function	2024-03-06 18:13:23 +02:00
Alex Xiong	b22aa02e91	fix: cargo fmt	2024-03-06 13:10:12 +00:00
ImmanuelSegol	af6bfc9ab0	golang docs (#413 ) ## Describe the changes This PR... ## Linked Issues Resolves # --------- Co-authored-by: Jeremy Felder <jeremy.felder1@gmail.com> Co-authored-by: DmytroTym <dmytrotym1@gmail.com>	2024-03-06 08:59:32 -04:00
Alex Xiong	b108c71bdd	feat: add rust api for cudaFreeAsync	2024-03-06 12:44:43 +00:00
ChickenLover	9fc083916d	Small features (#415 ) This PR is a compilation of small improvements - Lock bindgen version for `icicle-cuda-runtime` - Add an error message when trying to build on Mac (or any non windows/linux machine) - Add documentation and template files for adding new curve - Add documentation on _params.cuh contents - Add the script to bump all the rust crates versions to the same version Resolves #313	2024-03-06 13:48:34 +02:00
Jeremy Felder	87ccd62976	Fix go setup in CI (#420 ) ## Describe the changes This PR adds the use of setup-go in the CI to ensure that `go` is installed properly and caches dependencies and build outputs by default	2024-03-06 12:31:24 +02:00
Jeremy Felder	d8f2313a01	Fix the rust changed files glob pattern (#419 ) ## Describe the changes This PR fixes the glob pattern of changed files for rust	2024-03-05 12:27:08 +02:00
ChickenLover	a2ae7a9e2f	minor changes to cuda (#414 ) ## Describe the changes - Fix include statements to use absolute path - Remove stale comments and code parts - Fix test_kernels.cu bug	2024-03-04 16:52:03 +07:00
ImmanuelSegol	d98b851d62	fix-primitives-links (#411 )	2024-03-03 13:42:28 +07:00

1 2 3 4 5 ...

328 Commits