lammps

Author	SHA1	Message	Date
Trung Nguyen	cd3a00c2c4	Added timing breakdown for fphi_uind	2022-09-14 15:28:44 -05:00
Trung Nguyen	9c4d3db558	Cleaned up and converted arrays to ucl_vector of numtyp4	2022-09-13 16:48:39 -05:00
Trung Nguyen	31047b4a31	Removed mem alloc in precompute_induce, used buffer for packing, and switched to using ucl_vector	2022-09-13 12:53:48 -05:00
Trung Nguyen	7f4efa380a	Re-arranged memory allocation for cgrid_brick, some issues need to be fixed	2022-09-11 18:58:34 -05:00
Trung Nguyen	5e59c95be4	Moved temp variables inside loops	2022-09-10 02:45:06 -05:00
Trung Nguyen	363b6c51d0	Used local arrays and re-arranged for coalesced global memory writes	2022-09-10 02:31:39 -05:00
Trung Nguyen	c58343b2e2	Cleaned up debugging stuffs, need more refactoring and add to hippo	2022-09-09 13:50:41 -05:00
Trung Nguyen	b72b71837e	Moved first_induce_iteration in induce() to the right place	2022-09-09 13:34:57 -05:00
Trung Nguyen	4b8caac727	Made some progress with fphi_uind in the gpu pair style	2022-09-09 12:14:36 -05:00
Trung Nguyen	a0af9627e5	Fixed memory bugs with device array allocations	2022-09-06 16:19:17 -05:00
Trung Nguyen	21b7fb2fcf	Exposing fphi_uind to the gpu pair style, still keeping the part not ready though	2022-09-02 14:55:20 -05:00
Trung Nguyen	cad7e1b364	Moved fphi_uind up to BaseAmoeba	2022-09-02 10:18:59 -05:00
Trung Nguyen	aac264f2e2	Working on the fphi_uind kernel and array allocations	2022-08-30 23:40:04 -05:00
Trung Nguyen	c5c3c697df	Adding fphi_uind kernel, working on the arrays allocation	2022-08-29 00:13:30 -05:00
Trung Nguyen	9e7bbad4d4	Working on fphi_uind in the GPU lib	2022-08-27 13:19:52 -05:00
Trung Nguyen	b160460dcc	Added preprocessors to comment out cufft entirely for now	2022-08-26 12:55:46 -05:00
Trung Nguyen	b2d6df5bfb	Re-arranged some for loops in umutual1 to improve cache-friendly memory access; made placeholder for grid_uind on the GPU lib, maybe FFT is not that heavy to be put on the device.	2022-08-25 23:18:13 -05:00
Trung Nguyen	f4a90c62c0	First attempt to port the forward FFT in the k-space induce term to the GPU, not working yet	2022-08-23 15:42:05 -05:00
Trung Nguyen	921796a15f	Cleaned up unused variables in the hippo kernels	2022-08-16 16:29:38 -05:00
Trung Nguyen	28dabb9687	Cleaned up unused variables in the amoeba kernels, made room for convolution gpu	2022-08-16 15:37:49 -05:00
Trung Nguyen	46b8b00a4f	Working on fft on the device	2022-08-15 15:51:43 -05:00
Trung Nguyen	538aa13693	Only transfer data that is needed for umutual2b; allowed convolution and kspace term umutual1 to be overridden by the gpu counterparts	2022-08-10 16:21:30 -05:00
Trung Nguyen	aad4e417f9	Moved temp variables inside neighbor loops	2022-08-03 12:33:48 -05:00
Trung Nguyen	a54f0b684d	Moved temp variables inside the loop over neighbors	2022-08-03 10:56:52 -05:00
Trung Nguyen	93784f35e3	Added ucl_erfc to the opencl, cuda and hip backends; reverted to using erfc instead of approximation to ensure double-precision matches	2022-07-25 15:34:44 -05:00
Trung Nguyen	66ee2bf989	Cleaned up	2022-07-14 11:01:30 -05:00
Trung Nguyen	0c44bd1086	Rearranged the order of real-space and kspace part of ufield0c(), delayed device-host transfer from umutual2b() to overlap with kspace part	2022-07-08 14:45:31 -05:00
Trung Nguyen	675c2d38a3	Flipped sign of forces and virial terms in the hippo kernels	2022-07-05 14:37:26 -05:00
Trung Nguyen	5dab809522	Flipped force sign in polar_real, made sure that multipole_real is true for precompute() to be invoked, ubdirect2b() is segfault and needs work	2022-07-04 01:38:22 -05:00
Trung Nguyen	f4900d131a	Working on the multipole term on the gpu side, incorrect virials	2022-07-01 16:26:25 -05:00
Trung Nguyen	a14f0cfd6c	Merge branch 'amoeba' into amoeba-gpu, update the gpu pair styles with the base class	2022-06-28 12:54:27 -05:00
Axel Kohlmeyer	d4ea5ca49e	more clang-tidy fixes after re-running it with added settings	2022-05-14 07:18:05 -04:00
Axel Kohlmeyer	f09556018b	fix bugs reported by @jibril-b-coulibaly	2022-04-28 14:47:53 -04:00
Trung Nguyen	531e553162	Merge branch 'amoeba' into amoeba-gpu	2022-04-22 16:10:24 -05:00
Axel Kohlmeyer	bd6d7b9136	clarify CUDA versus OpenCL build and runtime restrictions	2022-04-13 14:24:43 -04:00
Axel Kohlmeyer	f3363070e7	remove support for CUDA toolkits before version 8 and GPUs older than Kepler	2022-04-12 15:48:16 -04:00
Axel Kohlmeyer	b36262cd11	cosmetic	2022-04-10 20:47:42 -04:00
Axel Kohlmeyer	d6f7570d57	avoid redundant use of boolean literals	2022-04-10 20:47:31 -04:00
Axel Kohlmeyer	39b316729b	use auto type when assigning from cast or using new	2022-04-10 18:16:36 -04:00
Nick Curtis	77565add6e	Add C++14 to Makefile build system Change-Id: I24f72b4aaca93a49877775c3d181507c83cd7f82	2022-04-07 17:54:20 -04:00
Axel Kohlmeyer	537e3f2539	silence compiler warnings	2022-02-17 23:13:13 -05:00
Axel Kohlmeyer	5f47ff770c	Merge pull request #3127 from akohlmey/gpu-opencl-updates Update compiling OpenCL loader lib	2022-02-14 16:10:53 -05:00
Axel Kohlmeyer	83beffbb9f	whitespace	2022-02-13 17:08:18 -05:00
Trung Nguyen	193dea6327	Updated the comment to be precise on the case	2022-02-13 14:51:00 -06:00
Trung Nguyen	159f107abd	Corrected the added comment	2022-02-13 11:16:01 -06:00
Trung Nguyen	bae4e45978	Added a comment to the added check while looping over the neighbors of ghost atoms	2022-02-13 09:17:05 -06:00
Trung Nguyen	55a500cf8a	Fixed bugs with in the kernel sw_three_end kernel	2022-02-13 00:42:26 -06:00
Axel Kohlmeyer	a17bdf5652	silence compiler warnings and avoid infinite recursion in aspherical pair styles	2022-02-11 21:06:16 -05:00
Axel Kohlmeyer	7e92809288	Merge pull request #3069 from Vsevak/fix-hip-ffast-math Fix HIP Makefile under lib/gpu	2022-01-03 11:37:55 -05:00
Axel Kohlmeyer	14e5474174	restore obsolete compilation settings similar to parallel makefile	2021-12-27 20:31:42 -05:00

479 Commits