'''New in CUDA 8''' '''Pascal Architecture Support''' \\ * Enhance performance out-of-the-box on Pascal GPUs * Simplify programming using Unified Memory including support for large datasets, concurrent data access and atomics * Optimize Unified Memory performance using new data migration APIs * Increase throughput at ultra-fast speeds using NVIDIA® NVLINK™, new high-speed interconnect '''Developer Tools''' \\ * Identify latent system-level bottlenecks using critical path analysis * Improve productivity by up to 2x with faster NVCC compile times * Tune OpenACC applications and overall host code using new profiling extensions '''Libraries''' * Accelerate graph analytics algorithms with nvGRAPH * Speed-up Deep Learning applications using native support for FP16 and INT8, support for batch operation in cuBLAS