* refactor ops_gpu ctypes
  - remove redundant byref, as ctypes automatically handles passing `type` as `POINTER(type)`
  - use walrus operator instead of init_c_var when possible
* clSetKernelArg argtype is POINTER(None)
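
A minimal sketch of the two ctypes points above, not the ops_gpu code itself. Real OpenCL calls need a driver, so this uses `PyLong_AsLongAndOverflow` from the CPython C API as a stand-in for a function with an `int *` out-parameter. The helper name `as_long` is hypothetical, and `init_c_var` is not reproduced here.

```python
import ctypes

# Stand-in for an OpenCL entry point: a C function with an `int *` out-parameter.
# C signature: long PyLong_AsLongAndOverflow(PyObject *obj, int *overflow)
_as_long = ctypes.pythonapi.PyLong_AsLongAndOverflow
_as_long.argtypes = [ctypes.py_object, ctypes.POINTER(ctypes.c_int)]
_as_long.restype = ctypes.c_long


def as_long(value):
    """Hypothetical helper: return (converted value, overflow flag)."""
    # Because argtypes declares POINTER(c_int), a plain c_int instance can be
    # passed and ctypes applies the byref() conversion itself.
    # The walrus operator creates the out-variable inline and keeps a name
    # for it so the result can be read after the call.
    result = _as_long(value, (overflow := ctypes.c_int()))
    return result, overflow.value


if __name__ == "__main__":
    print(as_long(5))
    print(as_long(2**100))
```

The automatic conversion only applies when `argtypes` is set; without it, the out-parameter would still need an explicit `ctypes.byref(...)`.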
_reshape_mask