What's New 2021.17:
- New API: Technical preview of NCCL*-like API alignment, adding the onecclcommDestroy, onecclGetErrorstring, and onecclGetLastError APIs
- Support for a single process with multiple threads: Currently supports Allgather, Allreduce, Alltoall, ReduceScatter, Broadcast, point-to-point (pt2pt), and the Group API for scale-up
- Added Operations: Added support for user-defined reduction operations for scale-up and extended the Group API to also support pt2pt operations
- Improved Performance: Allgather optimizations for large messages in scale-out on up to 8 nodes
- Support for BMG: Added BMG support, currently available only in the open-source version
- Bug fixes and performance optimizations