Warp shuffles, or why OpenCL should expose low-level interfaces
Posted by Oblomov