New in CUDA 8
Pascal Architecture Support
- Enhance performance out-of-the-box on Pascal GPUs
- Simplify programming using Unified Memory including support for large datasets, concurrent data access and atomics
- Optimize Unified Memory performance using new data migration APIs
- Increase throughput using NVIDIA® NVLINK™, a new high-speed interconnect
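
As an illustration of the Unified Memory items above, the following is a minimal sketch of how a managed allocation might be combined with the data migration calls `cudaMemAdvise` and `cudaMemPrefetchAsync`. The kernel `scale` and the helper `scale_with_prefetch` are hypothetical names made up for this example, and the signatures shown are assumed to be those of the CUDA 8 runtime API (later toolkit releases may differ). The hints are skipped when the device does not report concurrent managed access, since they are expected to fail there.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel for illustration: multiplies each element by `factor`.
__global__ void scale(float *data, size_t n, float factor) {
    size_t i = blockIdx.x * static_cast<size_t>(blockDim.x) + threadIdx.x;
    if (i < n) data[i] *= factor;
}

// Hypothetical helper: fills a managed buffer with 1.0f on the host, scales it
// on the GPU, and returns true if every element then equals `factor`.
// Assumes n > 0.
bool scale_with_prefetch(size_t n, float factor, int device) {
    const size_t bytes = n * sizeof(float);
    float *data = nullptr;

    // One allocation and one pointer, valid on both host and device.
    if (cudaMallocManaged(&data, bytes) != cudaSuccess) return false;
    for (size_t i = 0; i < n; ++i) data[i] = 1.0f;  // first touched on the host

    // Migration hints are only issued if the device reports concurrent
    // managed access (assumption: Pascal-class GPU on a supported OS).
    int concurrent = 0;
    cudaDeviceGetAttribute(&concurrent, cudaDevAttrConcurrentManagedAccess, device);
    if (concurrent) {
        // Advise the driver that the data should preferably live on the GPU,
        // then move it there before the kernel runs instead of faulting it in.
        cudaMemAdvise(data, bytes, cudaMemAdviseSetPreferredLocation, device);
        cudaMemPrefetchAsync(data, bytes, device, 0);
    }

    const unsigned threads = 256;
    const unsigned blocks = static_cast<unsigned>((n + threads - 1) / threads);
    scale<<<blocks, threads>>>(data, n, factor);

    // Bring the results back to host memory ahead of the CPU reads below.
    if (concurrent) cudaMemPrefetchAsync(data, bytes, cudaCpuDeviceId, 0);
    bool ok = (cudaDeviceSynchronize() == cudaSuccess);

    for (size_t i = 0; ok && i < n; ++i) ok = (data[i] == factor);
    cudaFree(data);
    return ok;
}

int main() {
    const bool ok = scale_with_prefetch(1 << 20, 2.0f, 0);
    std::printf("%s\n", ok ? "ok" : "failed");
    return ok ? 0 : 1;
}
```

Without the prefetch calls the program should still produce the same result, because pages migrate on demand when first accessed. The hints only aim to move data ahead of time.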
Developer Tools
- Identify latent system-level bottlenecks using critical path analysis
- Improve productivity by up to 2x with faster NVCC compile times
- Tune OpenACC applications and overall host code using new profiling extensions
Libraries
- Accelerate graph analytics algorithms with nvGRAPH
- Speed up deep learning applications using native support for FP16 and INT8, and support for batched operations in cuBLAS
Last modified on Oct 13, 2016, 1:07:26 PM