lammps

Author	SHA1	Message	Date
Axel Kohlmeyer	fbded8d8f4	include fix from @ndtrung81	2023-04-13 20:06:35 -04:00
Axel Kohlmeyer	92302d0f22	Merge branch 'develop' into mar2023_gpu_misc	2023-03-13 19:02:20 -04:00
W. Michael Brown	c96ac858bf	GPU Package: Adding JIT test for OpenCL prefetch support.	2023-03-07 21:43:19 -08:00
Axel Kohlmeyer	68ed9ec85e	Merge branch 'develop' into collected-small-changes	2023-03-07 19:39:26 -05:00
W. Michael Brown	d7c783560a	GPU Package: Fixing out of bounds memory access issues with special kernel unroll optimizations.	2023-03-06 22:11:10 -08:00
W. Michael Brown	37f22c8627	Misc Improvements to GPU Package - Optimizations for molecular systems - Improved kernel performance and greater CPU overlap - Reduced GPU to CPU communications for discrete devices - Switch classic Intel makefiles to use LLVM-based compilers - Prefetch optimizations supported for OpenCL - Optimized data repack for quaternions	2023-03-05 21:03:12 -08:00
Richard Berger	f658a947c6	gpu: fix broken HIP code path	2023-03-05 19:35:32 -07:00
Axel Kohlmeyer	646ef15d83	more consistent way to disable fast math for Intel OpenCL with double precision	2023-03-05 07:18:12 -05:00
Axel Kohlmeyer	2aff321187	Revert "use sin(x+pi/2) instead of cos(x) on Intel OpenCL with double precision" This reverts commit `fa38047749` It is no longer needed since we disable fast math with Intel OpenCL and double precision setting.	2023-03-05 07:04:20 -05:00
Axel Kohlmeyer	9cdb9283ca	another workaround for Intel GPU with OpenCL and double precision	2023-03-03 03:08:03 -05:00
Axel Kohlmeyer	fa38047749	use sin(x+pi/2) instead of cos(x) on Intel OpenCL with double precision	2023-03-02 18:47:34 -05:00
Trung Nguyen	2ccfe635ce	Removed the outdated CUDA_PROXY flag, using CUDA_MPS_SUPPORT consistently in CMake and traditional builds	2023-03-01 16:38:50 -06:00
Trung Nguyen	2511872823	Updated GPU.cmake and Makefile.cuda_mps to not error out with exclusive process mode with the CUDA MPS server in use	2023-02-28 00:41:47 -06:00
Trung Nguyen	88ccd546d8	Fixed bugs with gauss/gpu in bonded systems, including factor_lj in forces and energies	2023-02-09 23:55:14 -06:00
Axel Kohlmeyer	04bed1a6e0	roll back changes for vec3_scale() and vec3_scaleadd() and use temporary vector	2023-02-08 20:32:47 -05:00
Axel Kohlmeyer	912f046cd7	recover compilation of tersoff kernels with CUDA	2023-02-08 11:16:46 -05:00
Axel Kohlmeyer	30abe68c82	recover kernel failure for tersoff with mixed and single precision	2023-02-08 09:13:04 -05:00
Axel Kohlmeyer	3b4c873beb	another OpenCL bugfix attempt from Trung	2023-02-07 17:31:43 -05:00
Axel Kohlmeyer	acf7f9184d	fix failing unit tests with OpenCL	2023-02-06 18:40:59 -05:00
Axel Kohlmeyer	0885edc154	re-enable new neighbor lists with CUDA 12.0 and later	2023-02-05 03:02:19 -05:00
Axel Kohlmeyer	34e8a74989	apply changes suggested by clang-tidy	2023-01-27 11:49:43 -05:00
Axel Kohlmeyer	5bd7b95e60	Merge pull request #3613 from lammps/gpu-neigh-macos Attempted to allow GPU acceleration on MacOS with neighbor builds on …	2023-01-25 13:34:52 -05:00
Trung Nguyen	6fefd8821a	Attempted to allow GPU acceleration on MacOS with neighbor builds on the device by enforcing the old neighbor list code path (will revisit)	2023-01-25 10:42:55 -06:00
Axel Kohlmeyer	722e583b59	use available introspection API to get accumulator data type. update name of flag.	2023-01-25 05:22:49 -05:00
Trung Nguyen	adf43d7fee	Fixed the issues with some OpenCL implementation to avoid errors casting changing the pointer address spaces	2023-01-25 00:02:25 -06:00
Axel Kohlmeyer	64b5ad8966	Merge branch 'develop' into amoeba-gpu	2023-01-24 20:22:59 -05:00
Trung Nguyen	aaa918cbe7	Fixed bugs with access mode on the host side of thetai[1-3]	2023-01-24 17:05:48 -06:00
Trung Nguyen	5014e04341	Removed commented out code, ensured that ic_kspace is not nullptr when call precompute_kspace for hippo/gpu	2023-01-24 08:40:08 -06:00
Axel Kohlmeyer	27da716852	getting out of the insurance business :-)	2023-01-23 16:45:41 -05:00
Trung Nguyen	8e79e2efa5	More cleanup, fixed bugs with hippo fphi kernels for mixed precision	2023-01-23 00:18:42 -06:00
Trung Nguyen	658328dd9d	Added a note in the amoeba doc page on the not-yet resolved issue with integrated GPUs, removed commented out and debugging stuffs in the AM/HP kernels	2023-01-22 17:24:15 -06:00
Trung Nguyen	8eb722a32a	Enforced synchronous host-device transfers for cgrid_brick and fdip arrays	2023-01-19 13:22:27 -06:00
Axel Kohlmeyer	03ab42fd52	correct calling sequence for matching argument types	2023-01-19 08:57:24 -05:00
Axel Kohlmeyer	4244d2e6cd	silence compiler warnings about unused parameters and variables	2023-01-19 08:56:54 -05:00
Trung Nguyen	eddd3d6f25	Fixed a bug with extra being nullptr when _host_view is true: always allocate extra (Note that BaseAmoeba has its own cast_extra_data() that doesn't know if extra is allocated properly, it is the case when _host_view is false for dedicated GPUs for example)	2023-01-18 20:04:45 -06:00
Trung Nguyen	f86375c992	Attempted to ensure that extra gets allocated in the exactly same way as other added fields (charge, quat and vel)	2023-01-17 09:47:09 -06:00
Trung Nguyen	71931d1d44	Cleaned up, and added missing zero timers for extra fields transfers	2023-01-17 09:39:03 -06:00
Trung Nguyen	973b46a907	Attempted to resolve the memory access runtime errors when acquiring single and mixed precision arrays from the GPU lib	2023-01-16 10:12:42 -06:00
Trung Nguyen	9dc0369cee	Attempted to resolve the address space change issue when casting for OpenCL 2.0 (ref: https://www.intel.com/content/www/us/en/developer/articles/technical/the-generic-address-space-in-opencl-20.html#06_address_space_casting )	2023-01-15 23:28:48 -06:00
Trung Nguyen	c9ae41246d	Ran the four make commands in the src folder: make fix-whitespace; make fix-homepage; make fix-errordocs; make fix-permissions	2023-01-15 16:05:36 -06:00
Trung Nguyen	212da7f109	Merge branch 'develop' into amoeba-gpu	2023-01-14 18:36:26 -06:00
Axel Kohlmeyer	5cbe303af4	Merge branch 'develop' into collected-small-changes	2023-01-04 07:28:03 -05:00
Axel Kohlmeyer	d9abc3fcc0	update CUDA Toolkit / GPU compatibility lists and GPU package compilation settings	2023-01-03 11:56:44 -05:00
Axel Kohlmeyer	396d577f40	port DPD exclusions corrections to GPU package	2023-01-02 12:04:10 -05:00
Trung Nguyen	8af77c690c	Merge branch 'develop' into amoeba-gpu	2022-12-14 13:16:41 -06:00
Trung Nguyen	959b9c220f	Cleaned up unused member functions and hd_balancer calls	2022-11-07 15:49:37 -06:00
Trung Nguyen	a3cc0e8432	Reverted the block size tuning, which caused bugs for low atom counts (will revisit later)	2022-11-04 13:45:59 -05:00
Trung Nguyen	2f1f7ee0fa	Cleaned up code	2022-11-03 23:45:40 -05:00
Axel Kohlmeyer	e5a808fb8d	apply correct platform selection for OpenCL context	2022-11-01 04:05:57 -04:00
Axel Kohlmeyer	80a141d9c8	silence compiler warnings	2022-11-01 03:38:08 -04:00

1 2 3 4 5 ...

570 Commits