Commit 867e44a
[Fix] "too many resources requested for launch" error in the ASSET joint probability matrix computation when using CUDA (#667)
* Implemented parameter to override number of threads in PMatNeighbors
* Determine the number of threads in PMatNeighbors automatically or use the override
* Added logging functionality for debugging
* Added documentation on the new behavior for the number of threads in PMatNeighbors
* Updated logging statements to avoid errors
* Updated logging statements for simplified access to kernel parameters
* Added unit test for computing PMatNeighbors with varying thread numbers
* Restricted the override value to avoid exceeding the maximum number of threads
* Removed one parameter iteration as this is currently expected to fail
* Added unit test for the high level cuda_threads parameter in the ASSET class
* Added missing reasons for skipping
* Added missing rate variable
* Added cleanup if variables were not originally set
* Added validation for cuda_threads parameter
* Test cases not needed as tuple is enforced to be (int, int)
* Added invalid combinations for the revised validation
* Not allowing None in the tuple
* Setting upper bound for number of threads per block
* Complementary unit tests for upper bound
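
The thread handling described in the bullets above can be illustrated with a minimal sketch. It is not the code from this commit: the function names `validate_cuda_threads` and `threads_per_block`, the register-based limit, and the default of 1024 threads per block are assumptions chosen for illustration. The sketch relies on the fact that CUDA commonly reports "too many resources requested for launch" when a block asks for more registers than the device provides per block.

```python
WARP_SIZE = 32  # threads per warp on current NVIDIA GPUs


def validate_cuda_threads(cuda_threads, max_threads_per_block=1024):
    """Check a hypothetical `cuda_threads` argument: a tuple (int, int).

    None is rejected inside the tuple, and each value must lie between 1
    and `max_threads_per_block` (assumed upper bound).
    """
    if not isinstance(cuda_threads, tuple) or len(cuda_threads) != 2:
        raise ValueError("cuda_threads must be a tuple (int, int)")
    for n in cuda_threads:
        # bool is a subclass of int, so exclude it explicitly
        if isinstance(n, bool) or not isinstance(n, int):
            raise ValueError("cuda_threads entries must be integers")
        if n < 1 or n > max_threads_per_block:
            raise ValueError(
                f"cuda_threads entries must be in [1, {max_threads_per_block}]")
    return cuda_threads


def threads_per_block(regs_per_thread, regs_per_block,
                      max_threads_per_block=1024, override=None):
    """Pick a thread count that fits the register budget of one block.

    Without an override, the largest count that fits is chosen and rounded
    down to a multiple of the warp size. An override is capped at the
    same limit, so it can never request more than the device allows.
    """
    limit = min(max_threads_per_block, regs_per_block // regs_per_thread)
    if limit < 1:
        raise ValueError("kernel needs more registers than one block has")
    if override is not None:
        return min(override, limit)
    if limit >= WARP_SIZE:
        return limit // WARP_SIZE * WARP_SIZE
    return limit


if __name__ == "__main__":
    # Hypothetical kernel using 80 registers per thread on a device
    # with 65536 registers per block.
    print(threads_per_block(80, 65536))                 # automatic choice
    print(threads_per_block(80, 65536, override=1024))  # capped override
    print(validate_cuda_threads((64, 256)))
```

In a real CUDA setup the register count per thread and the device limits would be queried from the compiled kernel and the device at runtime; here they are passed in as plain integers so the sketch runs without a GPU.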
---------
Co-authored-by: Cristiano Köhler <c.koehler@fz-juelich.de>
Co-authored-by: Harris Jos <104043391+CozySocksAlways@users.noreply.github.com>
1 parent 867a0fc commit 867e44a
2 files changed
Lines changed: 210 additions & 6 deletions
File 1 of 2 (diff line contents not preserved; hunk ranges only): 85 additions, 6 deletions

| Original file lines | Diff lines | Added | Removed |
|---|---|---|---|
| 1113-1122 | 1113-1124 | 3 | 1 |
| 1249-1255 | 1251-1256 | 0 | 1 |
| 1307-1319 | 1308-1370 | 51 | 1 |
| 2446-2452 | 2497-2503 | 1 | 1 |
| 2455-2460 | 2506-2523 | 12 | 0 |
| 2502-2512 | 2565-2591 | 17 | 1 |
| 2527-2533 | 2606-2612 | 1 | 1 |
File 2 of 2 (diff line contents not preserved; hunk ranges only): 125 additions, 0 deletions

| Original file lines | Diff lines | Added | Removed |
|---|---|---|---|
| 320-325 | 320-355 | 30 | 0 |
| 700-705 | 730-830 | 95 | 0 |