madgraph5/madgraph4gpu
GPU development for the Madgraph5_aMC@NLO event generator software package
Fortran
Pinned issues
Issues
- 0
Port clang-format config to clang-format-18 and upgrade the github CI to ubuntu-22.04
#1023 opened by valassi - 1
- 0
- 2
cudacpp_backend card should be removed from output.py (it is only needed in launch_plugin.py)
#1015 opened by valassi - 2
- 0
"leading zeros in decimal integer literals are not permitted; use an 0o prefix for octal integers" from '1.00.01' cudacpp version
#1013 opened by valassi - 2
- 2
- 9
cxtype_ref problem for some gcc versions
#1004 opened by roiser - 3
FPEs in check.exe with clang16 and clang17
#1005 opened by valassi - 3
Crash (FPE) in check_hip.exe on LUMI
#1003 opened by valassi - 6
- 4
Question for Olivier: does run.sh support more than one CPU core? Can it? Should it?
#1001 opened by valassi - 2
- 0
Time profiles for DY+4j (and DY+3j) have high 'python/bash' component - especially for cuda
#1000 opened by valassi - 0
Cuda time profiles for DY+4j have very high 'HEL' component for helicity filtering?
#999 opened by valassi - 7
- 4
Permille cross section discrepancy fortran vs cudacpp (gg_tt.mad) after merging june24 and goodhel together
#991 opened by valassi - 1
GPU CI failing on itscrd-v100 "Failed to initialize NVML: Driver/library version mismatch"
#996 opened by valassi - 1
LHE file mismatch fortran vs cudacpp in tlau (for multi backend builds, possibly for normal builds too) - also: "fail to reach target"
#993 opened by valassi - 4
(june24) check if anything needs to be changed in dsample.f in the use of vecsize_used or nb_warp_used
#983 opened by valassi - 0
Support for multi-GPU nodes (3: orchestrate different madevent executables to target different GPUs)
#990 opened by valassi - 0
Support for multi-GPU nodes (2: send MEs to more than one GPU from the same madevent executable?)
#989 opened by valassi - 2
- 1
Allow LIMHEL>0 in cudacpp
#988 opened by valassi - 0
(low priority) improve passcuts interface?
#987 opened by valassi - 2
- 3
Port (or disable) rdtsc timers on ARM/MAC
#977 opened by valassi - 1
change name of variable warp/wrap ...
#961 opened by oliviermattelaer - 9
Reduce madevent 'Fortran overhead' by restricting cudacpp helicity calculation (which scales with SIMD) to only 16 events
#958 opened by valassi - 0
- 6
Compilation Error in cudacpp_backend=cpp for CMS DY+nj and TT+nj (nvcc exists but not NVTX or curand)
#965 opened by choij1589 - 5
Fortran missing again on Mac CI nodes?
#971 opened by valassi - 4
Faster timers based on rdtsc instead of chrono
#972 opened by valassi - 0
- 4
Remove htuple.f from generated code
#967 opened by valassi - 9
DY+3 jets cross section decreases by a factor 10 when changing vector size from 16384 to 32?
#959 opened by valassi - 7
Understand why CMS sees cross section discrepancy between fortran and cuda/cpp for DY+4 jets
#944 opened by valassi - 3
Trivial improvements for xbin_min and xbin_max may lead to speedups in sample_get_x
#969 opened by valassi - 7
Vectorise phase space sampling (port x_to_f_arg to cudacpp with SIMD and GPU support - starting with sample_get_x?)
#963 opened by valassi - 0
- 21
- 0
Vectorise running coupling scale updates (port update_scale_coupling_vec to cudacpp with SIMD/GPU)?? Maybe not
#964 opened by valassi - 3
Add support for nprocesses>2 (i.e. beyond mirror processes) in cudacpp to speed up directory handling?
#951 opened by valassi - 0
- 0
instrument python code in gridpacks to provide timing profiles for event generation
#957 opened by valassi - 0
Add events.lhe comparison in tlau tests
#956 opened by valassi - 0
Improve usability of cacche by experiment users?
#954 opened by valassi - 7
generate_events tries to remove GpuAbstraction.h
#947 opened by valassi - 0