Commit Graph

24 Commits

Author SHA1 Message Date
baf3e614fb Add comments for tip4p GPU kernels 2022-08-07 22:26:11 +03:00
59dc63d003 Add typecasting for consts in tip4p GPU kernels 2022-06-01 01:29:18 +03:00
e9051620a5 Cleanup 2022-05-28 00:39:07 +03:00
d8f8a3a36a Handle inconsistent J molecules in tip4p/gpu 2022-05-28 00:24:24 +03:00
bbb3f35aa3 "int64_t" is defined in stdint.h; must use "long" in OpenCL kernels instead 2021-08-26 22:46:08 -04:00
94ce07b2de Merge branch 'lammps:master' into tip4p_fix_types 2021-07-21 14:09:39 +03:00
d982d153f8 Fix conditions for correct results on other types 2021-06-18 18:26:53 +03:00
8fef6a10dd Fix atom types handling in the tip4p/gpu kernels 2021-06-18 00:52:23 +03:00
0e1fa44b9d fix compilation error on latest intel OpenCL compiler due to missing tagint definition 2021-05-22 13:16:31 -04:00
e7e2d2323b Feb2021 GPU Package Update - GPU Package Files 2021-02-15 08:20:50 -08:00
496bd55dc9 Fix lj/cut/tip4p/long/gpu virial 2020-06-25 17:30:51 +03:00
4f6dd381c3 Merge branch 'master' into gpu_hip 2020-03-30 17:15:55 -04:00
d024b3d340 Add shared memory optimization 2020-02-12 01:35:44 +03:00
66c5fa2abd Merge 'gpu_hip_port' into master 2020-01-28 20:35:08 +03:00
43a9936241 Fix tagint usage 2020-01-20 03:38:58 +03:00
f69a17dd3a Reduce unnecessary data exchanges 2020-01-19 02:05:16 +03:00
514271c984 recover compilation with -DLAMMPS_BIGBIG and fix whitespace and formatting issues 2020-01-08 17:57:54 -05:00
a3fca53e97 Separate the computation of newsite into another kernel
This eliminates the need for thread fence and makes the calculation stable on GTX1070 (CUDA and OpenCL) and TitanV
2019-12-07 15:54:20 +03:00
4a51e1660f Add thread fence in kernel to fix Volta indeterminacy 2019-12-07 01:16:39 +03:00
a2f9fa8e78 Separate the computation of hneigh into another kernel
Simplify the main GPU kernel and add another kernel 'k_pair_reneigh'. It works good on GTX1070 (Pascal), but still there is a problem with non-deterministic results on Volta.

I reimplement BaseCharge::compute methods in the child class LJ_TIP4PLong to correctly embed a new kernel in the code.

Also commit includes some codestyle fixes.
2019-12-06 21:41:02 +03:00
66a076b819 Delete "float.h" from TIP4P GPU kernel for better OpenCL compatibility
According to Khronos OpenCL docs, "The C99 standard headers <...>, float.h, <...> are not available and cannot be included by a program"
2019-11-27 19:49:12 +03:00
4febc7f794 Add copyright and fix style 2019-11-18 02:01:32 +03:00
f803ba5655 Add shfl_xor sum to kernel for ARCH>=300 2019-11-12 21:35:36 +03:00
64bdc59623 Implement GPU pair style lj/cut/tip4p/long/gpu
Source code, Makefiles and Install for GPU-accelerated TIP4P pair style.
It is implemented as a part of the standard GPU package.
The style is compatible with the standard  lj/cut/tip4p/long.
Also, this commit modifies "atom.h" just to
add a getter for variable 'max_same'.
2019-11-10 02:38:58 +03:00