You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The default stack size on CUDA is 1024, which is blown by simple operations like addition. Maybe we have some space for optimization, or maybe we simply need to be more aggressive with forced inline on CUDA.
The default stack size on CUDA is 1024, which is blown by simple operations like addition. Maybe we have some space for optimization, or maybe we simply need to be more aggressive with forced inline on CUDA.