lammps

Author	SHA1	Message	Date
Vsevak	baf3e614fb	Add comments for tip4p GPU kernels	2022-08-07 22:26:11 +03:00
Vsevak	59dc63d003	Add typecasting for consts in tip4p GPU kernels	2022-06-01 01:29:18 +03:00
Vsevak	e9051620a5	Cleanup	2022-05-28 00:39:07 +03:00
Vsevak	d8f8a3a36a	Handle inconsistent J molecules in tip4p/gpu	2022-05-28 00:24:24 +03:00
Axel Kohlmeyer	bbb3f35aa3	"int64_t" is defined in stdint.h; must use "long" in OpenCL kernels instead	2021-08-26 22:46:08 -04:00
Vsevolod Nikolskiy	94ce07b2de	Merge branch 'lammps:master' into tip4p_fix_types	2021-07-21 14:09:39 +03:00
Vsevak	d982d153f8	Fix conditions for correct results on other types	2021-06-18 18:26:53 +03:00
Vsevak	8fef6a10dd	Fix atom types handling in the tip4p/gpu kernels	2021-06-18 00:52:23 +03:00
Axel Kohlmeyer	0e1fa44b9d	fix compilation error on latest intel OpenCL compiler due to missing tagint definition	2021-05-22 13:16:31 -04:00
Michael Brown	e7e2d2323b	Feb2021 GPU Package Update - GPU Package Files	2021-02-15 08:20:50 -08:00
Vsevak	496bd55dc9	Fix lj/cut/tip4p/long/gpu virial	2020-06-25 17:30:51 +03:00
Richard Berger	4f6dd381c3	Merge branch 'master' into gpu_hip	2020-03-30 17:15:55 -04:00
Vsevak	d024b3d340	Add shared memory optimization	2020-02-12 01:35:44 +03:00
Vsevak	66c5fa2abd	Merge 'gpu_hip_port' into master	2020-01-28 20:35:08 +03:00
Vsevak	43a9936241	Fix tagint usage	2020-01-20 03:38:58 +03:00
Vsevak	f69a17dd3a	Reduce unnecessary data exchanges	2020-01-19 02:05:16 +03:00
Axel Kohlmeyer	514271c984	recover compilation with -DLAMMPS_BIGBIG and fix whitespace and formatting issues	2020-01-08 17:57:54 -05:00
Vsevak	a3fca53e97	Separate the computation of newsite into another kernel. This eliminates the need for thread fence and makes the calculation stable on GTX1070 (CUDA and OpenCL) and TitanV.	2019-12-07 15:54:20 +03:00
Vsevak	4a51e1660f	Add thread fence in kernel to fix Volta indeterminacy	2019-12-07 01:16:39 +03:00
Vsevak	a2f9fa8e78	Separate the computation of hneigh into another kernel. Simplify the main GPU kernel and add another kernel 'k_pair_reneigh'. It works well on GTX1070 (Pascal), but there is still a problem with non-deterministic results on Volta. I reimplement BaseCharge::compute methods in the child class LJ_TIP4PLong to correctly embed the new kernel in the code. The commit also includes some code style fixes.	2019-12-06 21:41:02 +03:00
Vsevak	66a076b819	Delete "float.h" from TIP4P GPU kernel for better OpenCL compatibility. According to Khronos OpenCL docs, "The C99 standard headers <...>, float.h, <...> are not available and cannot be included by a program".	2019-11-27 19:49:12 +03:00
Vsevak	4febc7f794	Add copyright and fix style	2019-11-18 02:01:32 +03:00
Vsevak	f803ba5655	Add shfl_xor sum to kernel for ARCH>=300	2019-11-12 21:35:36 +03:00
Vsevak	64bdc59623	Implement GPU pair style lj/cut/tip4p/long/gpu. Source code, Makefiles and Install for GPU-accelerated TIP4P pair style. It is implemented as a part of the standard GPU package. The style is compatible with the standard lj/cut/tip4p/long. Also, this commit modifies "atom.h" just to add a getter for variable 'max_same'.	2019-11-10 02:38:58 +03:00

24 Commits