|
|
a981843686
|
forgot updated Makefile
|
2022-10-17 11:51:50 -04:00 |
|
|
|
9f20347375
|
update PACE library for conventional build
|
2022-10-15 15:10:01 -04:00 |
|
|
|
c113253e2d
|
Merge branch 'develop' of https://github.com/lammps/lammps into kk_update_3.7
|
2022-10-10 13:44:02 -07:00 |
|
|
|
1fb07387b9
|
Merge pull request #3479 from yury-lysogorskiy/feature/pace-al
BUGFIX: address issue with compilation KOKKOS + pace/extrapolation
|
2022-10-10 14:57:13 -04:00 |
|
|
|
341bb57555
|
Update Install.py
|
2022-10-10 08:45:33 -07:00 |
|
|
|
7c9666798e
|
whitespace
|
2022-10-08 09:34:20 -04:00 |
|
|
|
7551c0a3ca
|
GPU Package: Documenting some additional preprocessor flags, updating oneapi Makefile.
|
2022-10-07 22:44:21 -07:00 |
|
|
|
00f46120c7
|
Removed max_cus() from Device, used device->gpu->cus() instead
|
2022-10-07 15:50:30 -05:00 |
|
|
|
5a98a38e24
|
GPU Package: Switching to parallel GPU initialization / JIT compilation.
|
2022-10-07 13:25:14 -07:00 |
|
|
|
f715f174bb
|
GPU Package: Print OCL platform name to screen when multiple platforms
|
2022-10-06 21:40:42 -07:00 |
|
|
|
a6a39d47e1
|
Fixing potential issues with automatic splitting of accelerators for NUMA.
|
2022-10-06 20:48:02 -07:00 |
|
|
|
e9f39f85d2
|
Fixing issue where shared main memory property only set for NVIDIA devices.
|
2022-10-06 20:05:33 -07:00 |
|
|
|
6b9e83fe20
|
Added timing for the induced dipole spreading part, computed the block size to ensure all the CUs are occupied by the fphi_uind and fphi_mpole kernels
|
2022-10-06 15:03:58 -05:00 |
|
|
|
7157643fdd
|
Merge pull request #3315 from yury-lysogorskiy/feature/pace-al
ML_PACE with extrapolation grade / active learning
|
2022-10-05 20:16:13 -04:00 |
|
|
|
e51be5d6e0
|
Need desul library
|
2022-10-04 15:00:14 -06:00 |
|
|
|
f9f9e44f2d
|
Update Kokkos library in LAMMPS to v3.7.0
|
2022-10-04 14:04:40 -06:00 |
|
|
|
2ef6a59c0a
|
Merge branch 'develop' into amoeba-gpu
|
2022-10-01 00:38:24 -05:00 |
|
|
|
9a1f23a079
|
Cosmetic changes and cleanup
|
2022-09-30 17:32:25 -05:00 |
|
|
|
1d75ca3b20
|
Moved precompute() out of the terms in amoeba and hippo, to be involed in the first term in a time step: multipole for amoeba and repulsion for hippo
|
2022-09-30 16:31:13 -05:00 |
|
|
|
fb675028b9
|
whitespace
|
2022-09-29 02:42:11 -04:00 |
|
|
|
71464d8314
|
GPU Package: Fixing logic in OpenCL backend that could result in unnecessary device allocations.
|
2022-09-28 22:30:09 -07:00 |
|
|
|
6e34d21b24
|
GPU Package: Switching back to timer disabling with multiple MPI tasks per GPU. Logic added to prevent mem leak.
|
2022-09-28 21:02:16 -07:00 |
|
|
|
e6d2582642
|
Updated fphi_mpole, renamed precompute_induce to precompute_kspace
|
2022-09-28 15:08:18 -05:00 |
|
|
|
de28c9b19c
|
propagate new pace lib version tage and hash to lib/pace/Install.py
|
2022-09-27 15:27:43 -04:00 |
|
|
|
2e3fc4c054
|
Merge branch 'develop' into feature/pace-al
|
2022-09-25 16:01:48 -04:00 |
|
|
|
166701f13a
|
Fixed missing commas in the argument list of the macros in amoeba and hippo cu files, added amoeba_convolution_gpu.cpp and .h to the source file list in GPU.cmake
|
2022-09-23 11:53:09 -05:00 |
|
|
|
785131932c
|
Added fphi_mpole in amoeba/gpu, fixed a bug in the kernel when indexing grid
|
2022-09-20 13:58:17 -05:00 |
|
|
|
356c46c913
|
Replaced mem allocation/deallocation inside moduli() with using member variables and mem resize if needed
|
2022-09-18 16:28:30 -05:00 |
|
|
|
caa66d904e
|
Cleaned up GPU lib functions
|
2022-09-18 15:54:12 -05:00 |
|
|
|
f9f777b099
|
Refactored precompute_induce to overlap data transfers with kernel launches
|
2022-09-18 15:09:26 -05:00 |
|
|
|
8d6629cb80
|
update MDI library to version 1.4.12 which plugs memory leaks on initialization
|
2022-09-18 11:04:57 -04:00 |
|
|
|
62ecf98cda
|
Enabled fphi_uind in hippo/gpu, really need to refactor hippo and amoeba in the GPU lib to remove kernel duplicates
|
2022-09-16 14:47:16 -05:00 |
|
|
|
880f20c285
|
Cleaned up kernels
|
2022-09-15 15:29:14 -05:00 |
|
|
|
797a45232c
|
Merge branch 'fix-pair-dump-skip' into feature/pace-al
# Conflicts:
# src/fix_pair.cpp
|
2022-09-15 11:07:24 +02:00 |
|
|
|
cd3a00c2c4
|
Added timing breakdown for fphi_uind
|
2022-09-14 15:28:44 -05:00 |
|
|
|
9c4d3db558
|
Cleaned up and converted arrays to ucl_vector of numtyp4
|
2022-09-13 16:48:39 -05:00 |
|
|
|
31047b4a31
|
Removed mem alloc in precompute_induce, used buffer for packing, and switched to using ucl_vector
|
2022-09-13 12:53:48 -05:00 |
|
|
|
1abfec066c
|
update MDI library version to 1.4.11
|
2022-09-12 12:30:34 -04:00 |
|
|
|
7f4efa380a
|
Re-arranged memory allocation for cgrid_brick, some issues need to be fixed
|
2022-09-11 18:58:34 -05:00 |
|
|
|
5e59c95be4
|
Moved temp variables inside loops
|
2022-09-10 02:45:06 -05:00 |
|
|
|
363b6c51d0
|
Used local arrays and re-arranged for coalesced global memory writes
|
2022-09-10 02:31:39 -05:00 |
|
|
|
1364033055
|
Merge pull request #3432 from benmenadue/develop
Use primary context in CUDA GPU code.
|
2022-09-09 16:24:46 -04:00 |
|
|
|
d1fb2244e2
|
make downloaded version consistent
|
2022-09-09 15:21:42 -04:00 |
|
|
|
c58343b2e2
|
Cleaned up debugging stuffs, need more refactoring and add to hippo
|
2022-09-09 13:50:41 -05:00 |
|
|
|
b72b71837e
|
Moved first_induce_iteration in induce() to the right place
|
2022-09-09 13:34:57 -05:00 |
|
|
|
4b8caac727
|
Made some progress with fphi_uind in the gpu pair style
|
2022-09-09 12:14:36 -05:00 |
|
|
|
167abe9ce0
|
add preprocessor flags to select between the changed and the old code variant
|
2022-09-09 12:41:24 -04:00 |
|
|
|
ffb8b8ba97
|
Merge branch 'develop' into mdi-tweak
|
2022-09-09 00:03:39 -04:00 |
|
|
|
1cd47b762b
|
Update MDI plugin code
|
2022-09-09 02:28:06 +00:00 |
|
|
|
5c73befc66
|
upgrade to MDI 1.4.9
|
2022-09-07 13:57:20 -06:00 |
|