git-svn-id: svn://svn.icms.temple.edu/lammps-ro/trunk@15078 f3b2605a-c512-4ea7-a41b-209d697bcdaa
This commit is contained in:
@ -124,9 +124,8 @@ multiple threads can be launched by each MPI task running on a CPU.
|
||||
|
||||
To build with Kokkos support for NVIDIA GPUs, NVIDIA Cuda software
|
||||
version 6.5 or later must be installed on your system. See the
|
||||
discussion for the :doc:`USER-CUDA <accelerate_cuda>` and
|
||||
:doc:`GPU <accelerate_gpu>` packages for details of how to check and do
|
||||
this.
|
||||
discussion for the :doc:`GPU <accelerate_gpu>` package for details of
|
||||
how to check and do this.
|
||||
|
||||
.. note::
|
||||
|
||||
@ -393,8 +392,8 @@ Generally speaking, the following rules of thumb apply:
|
||||
* When running on CPUs only, with multiple threads per MPI task,
|
||||
performance of a KOKKOS style is a bit slower than the USER-OMP
|
||||
package.
|
||||
* When running on GPUs, KOKKOS is typically faster than the USER-CUDA
|
||||
and GPU packages.
|
||||
* When running large number of atoms per GPU, KOKKOS is typically faster
|
||||
than the GPU package.
|
||||
* When running on Intel Xeon Phi, KOKKOS is not as fast as
|
||||
the USER-INTEL package, which is optimized for that hardware.
|
||||
See the `Benchmark page <http://lammps.sandia.gov/bench.html>`_ of the
|
||||
|
||||
Reference in New Issue
Block a user