607 Commits

SHA1 Message Date
80ef8656e8 remove some more dead code to silence compiler warnings 2024-01-20 23:11:20 -05:00
35a6bf0cac silence compiler warning 2024-01-20 22:59:23 -05:00
48e83b2298 remove unused parameter and silence compiler warnings 2024-01-18 22:55:02 -05:00
991663feff Avoid (harmless) errors when shutting down the GPU. 2024-01-18 22:42:32 -05:00
9c4597b5e9 more GPU package cleanup from Trung 2024-01-17 16:12:36 -05:00
6f0fab8b08 bugfix for pair style coul/slater/long/gpu from Trung 2024-01-15 12:29:05 -05:00
569c23a1e6 Fixed typo with the kernel names in lj/cut/coul/cut/soft 2023-12-12 22:54:50 -06:00
669782cd5f Saved some exp operations 2023-12-11 21:26:12 -06:00
93f8ada8dc Fixed ecoul issue with the coul/slater/long kernels 2023-12-11 21:21:31 -06:00
6e0e1dc44f Fixed atom force issues with the coul/slater/long kernels 2023-12-11 17:10:43 -06:00
76bf35fa4d Added -allow-unsupported-compiler to nvcc for both CMake and traditional CUDA builds 2023-12-10 14:42:27 -06:00
e727ec1eac Fixed compiling bugs revealed by CUDA builds, removed unused variables 2023-12-09 16:46:10 -06:00
267e360bac Fixed bugs with acquiring depsh from lib/gpu, updated the doc page 2023-12-09 15:39:54 -06:00
3830711dec Added the GPU version of sph/heatconduction 2023-12-09 15:15:13 -06:00
54a6143e9e Updated the doc pages for sph/lj/gpu and sph/taitwater/gpu 2023-12-08 14:27:00 -06:00
46a670889d Added BaseSPH for sph pair styles, fixed bugs with per-type arrays 2023-12-08 14:09:39 -06:00
0f78afc66f Added atom tag for dpd to work with tag and v 2023-12-08 00:20:53 -06:00
7fad795141 Added pair sph/taitwater/gpu in src/GPU 2023-12-08 00:10:01 -06:00
cad4c25750 Adding sph/taitwater in lib/gpu 2023-12-07 17:22:45 -06:00
6fe16c7606 Added pair style sph/lj/gpu in src/GPU, removed commented-out code 2023-12-07 14:33:47 -06:00
fef28c9daa Updated sph_lj kernels 2023-12-07 11:41:37 -06:00
379d3c8e20 Updated host_esph to extra data and cut to coeff 2023-12-07 11:06:17 -06:00
26c7358a84 Working on sph_lj kernels 2023-12-07 10:24:25 -06:00
1849df15e0 Removed unused member functions in edpd 2023-12-07 10:16:40 -06:00
0940793537 Updated the coul_soft and coul_long_soft kernels with forces as acctyp3 2023-12-07 09:23:49 -06:00
fe96d9f836 Added the GPU version of pair edpd and mdpd 2023-12-05 13:34:37 -06:00
086cf49a8c Added the GPU version of coul/slater/long 2023-12-05 13:32:33 -06:00
9588ddabf6 Added the GPU versions of lj/cut/coul/cut/soft and lj/cut/coul/long/soft 2023-12-05 13:30:26 -06:00
2f415c8b62 bugfix for pair style dpd/gpu from Trung 2023-08-09 09:42:01 -04:00
021eeae860 add workaround for failing overhead estimation test on (Intel) GPUs that only support single precision 2023-07-18 13:24:11 -04:00
74c5ca0996 move definition of struct containing double to avoid failure on Intel Xe GPU 2023-07-17 18:34:03 -04:00
fa4a3a0622 using NBOR_PREFETCH=2 gives errors with Intel Xe GPUs, so set it to 0 2023-07-17 18:33:21 -04:00
d27ab47ed9 for clarity 2023-07-17 18:32:18 -04:00
be2e437cec use approximation for erfc() on OpenCL for Intel since the OpenCL version seems broken 2023-07-14 10:08:59 -04:00
620c60122a should not use pow() for computing cube, use multiplication. 2023-07-14 10:08:09 -04:00
d6412dc97b Attempted to resolve issues with switching from acctyp4 to acctyp3 in tep, fieldp since the changes in PR #3675, noting some changes with Intel OCL PR #3663 2023-07-08 00:50:19 -05:00
f128de7dd0 Bugfix from @ndtrung81 for indexing bug when tallying per-atom Coulomb energy 2023-06-13 15:13:16 -04:00
fbded8d8f4 include fix from @ndtrung81 2023-04-13 20:06:35 -04:00
92302d0f22 Merge branch 'develop' into mar2023_gpu_misc 2023-03-13 19:02:20 -04:00
c96ac858bf GPU Package: Adding JIT test for OpenCL prefetch support. 2023-03-07 21:43:19 -08:00
68ed9ec85e Merge branch 'develop' into collected-small-changes 2023-03-07 19:39:26 -05:00
d7c783560a GPU Package: Fixing out of bounds memory access issues with special kernel unroll optimizations. 2023-03-06 22:11:10 -08:00
37f22c8627 Misc Improvements to GPU Package
- Optimizations for molecular systems
  - Improved kernel performance and greater CPU overlap
- Reduced GPU to CPU communications for discrete devices
- Switch classic Intel makefiles to use LLVM-based compilers
- Prefetch optimizations supported for OpenCL
- Optimized data repack for quaternions
2023-03-05 21:03:12 -08:00
f658a947c6 gpu: fix broken HIP code path 2023-03-05 19:35:32 -07:00
646ef15d83 more consistent way to disable fast math for Intel OpenCL with double precision 2023-03-05 07:18:12 -05:00
2aff321187 Revert "use sin(x+pi/2) instead of cos(x) on Intel OpenCL with double precision"
This reverts commit fa38047749
It is no longer needed since we disable fast math with
Intel OpenCL and double precision setting.
2023-03-05 07:04:20 -05:00
9cdb9283ca another workaround for Intel GPU with OpenCL and double precision 2023-03-03 03:08:03 -05:00
fa38047749 use sin(x+pi/2) instead of cos(x) on Intel OpenCL with double precision 2023-03-02 18:47:34 -05:00
2ccfe635ce Removed the outdated CUDA_PROXY flag, using CUDA_MPS_SUPPORT consistently in CMake and traditional builds 2023-03-01 16:38:50 -06:00
2511872823 Updated GPU.cmake and Makefile.cuda_mps to not error out with exclusive process mode with the CUDA MPS server in use 2023-02-28 00:41:47 -06:00