View Issue Details

IDProjectCategoryView StatusLast Update
0004179OpenFOAMBugpublic2024-11-20 14:57
Reporterjherb Assigned Tohenry  
PrioritynormalSeverityminorReproducibilityalways
Status closedResolutionunable to reproduce 
PlatformGNU/LinuxOSOtherOS Version(please specify)
Product Version12 
Summary0004179: foamMultiRun crashes if config file is modified during run and fileHandler collated is used
DescriptionfoamMultiRun crashes (reproducible) if the following settings are chosen/actions taken:

Run in parallel
Run with fileHandler collated
Modify a config file, e.g. controlDict during run

Tested on a cluster with CentOS, Rocky Linux and on AWS with Ubuntu 24.04 LTS (ubuntu-noble-24.04-amd64-server-20240927)

Steps To Reproducecp $FOAM_TUTORIALS/multiRegion/CHT/heatedDuct/ $FOAM_RUN/ -r

Make the following changes:

diff -uBbwr $FOAM_TUTORIALS/multiRegion/CHT/heatedDuct/ $FOAM_RUN/heatedDuct/
diff -uBbwr /opt/openfoam12/tutorials/multiRegion/CHT/heatedDuct/system/controlDict /home/ubuntu/OpenFOAM/ubuntu-12/run/heatedDuct/system/controlDict
--- /opt/openfoam12/tutorials/multiRegion/CHT/heatedDuct/system/controlDict 2024-09-02 08:54:54.000000000 +0000
+++ /home/ubuntu/OpenFOAM/ubuntu-12/run/heatedDuct/system/controlDict 2024-11-15 10:14:21.021813745 +0000
@@ -57,4 +57,10 @@

 adjustTimeStep yes;

+OptimisationSwitches
+{
+ fileHandler collated;
+}
+
+
 // ************************************************************************* //

./Allrun

during the run of foamMultiRun

touch system/controlDict
Additional InformationStack trace on Ubutu:

/*---------------------------------------------------------------------------*\
  ========= |
  \\ / F ield | OpenFOAM: The Open Source CFD Toolbox
   \\ / O peration | Website: https://openfoam.org
    \\ / A nd | Version: 12
     \\/ M anipulation |
\*---------------------------------------------------------------------------*/
Build : 12-6aa359dae696

...

Starting time loop

fluid Courant Number mean: 0.000494161 max: 0.00103803
metal Diffusion Number mean: 0.346574 max: 0.529572
heater Diffusion Number mean: 0.0885803 max: 0.142947
       deltaT = 0.0012
       Time = 0.0012s

fluid diagonal: Solving for rho, Initial residual = 0, Final residual = 0, No Iterations 0
fluid DILUPBiCGStab: Solving for Ux, Initial residual = 1, Final residual = 1.86059e-08, No Iterations 1
fluid DILUPBiCGStab: Solving for Uy, Initial residual = 1, Final residual = 1.8871e-08, No Iterations 1
fluid DILUPBiCGStab: Solving for Uz, Initial residual = 1, Final residual = 6.92348e-08, No Iterations 1
fluid DILUPBiCGStab: Solving for h, Initial residual = 0.135164, Final residual = 3.39636e-10, No Iterations 1
metal GAMG: Solving for e, Initial residual = 0.00729615, Final residual = 0.00431141, No Iterations 10
heater GAMG: Solving for e, Initial residual = 1, Final residual = 4.83092e-10, No Iterations 1
fluid GAMG: Solving for p_rgh, Initial residual = 1, Final residual = 6.69575e-08, No Iterations 29
fluid diagonal: Solving for rho, Initial residual = 0, Final residual = 0, No Iterations 0
fluid time step continuity errors : sum local = 1.33981e-16, global = 1.86931e-17, cumulative = 1.86931e-17
       ExecutionTime = 6.39611 s ClockTime = 26 s

       regIOobject::readIfModified() :
           Re-reading object controlDict from file "/home/ubuntu/OpenFOAM/ubuntu-12/run/heatedDuct/system/controlDict"
[0] #0 Foam::error::printStack(Foam::Ostream&) at ??:?
[0] #1 Foam::sigSegv::sigHandler(int) at ??:?
[0] #2 ? in "/usr/lib/x86_64-linux-gnu/libc.so.6"
[0] #3 Foam::objectRegistry::readIfModified() at ??:?
[0] #4 Foam::objectRegistry::readModifiedObjects() at ??:?
[0] #5 Foam::Time::run() const at ??:?
[0] #6 ? in "/opt/openfoam12/platforms/linux64GccDPInt32Opt/bin/foamMultiRun"
[0] #7 ? in "/usr/lib/x86_64-linux-gnu/libc.so.6"
[0] #8 __libc_start_main in "/usr/lib/x86_64-linux-gnu/libc.so.6"
[0] #9 ? in "/opt/openfoam12/platforms/linux64GccDPInt32Opt/bin/foamMultiRun"
[ip-172-31-8-119:08412] *** Process received signal ***
[ip-172-31-8-119:08412] Signal: Segmentation fault (11)
[ip-172-31-8-119:08412] Signal code: (-6)
[ip-172-31-8-119:08412] Failing at address: 0x3e8000020dc
[ip-172-31-8-119:08412] [ 0] /usr/lib/x86_64-linux-gnu/libc.so.6(+0x45320)[0x70de86e45320]
[ip-172-31-8-119:08412] [ 1] /usr/lib/x86_64-linux-gnu/libc.so.6(pthread_kill+0x11c)[0x70de86e9eb1c]
[ip-172-31-8-119:08412] [ 2] /usr/lib/x86_64-linux-gnu/libc.so.6(gsignal+0x1e)[0x70de86e4526e]
[ip-172-31-8-119:08412] [ 3] /usr/lib/x86_64-linux-gnu/libc.so.6(+0x45320)[0x70de86e45320]
[ip-172-31-8-119:08412] [ 4] /opt/openfoam12/platforms/linux64GccDPInt32Opt/lib/libOpenFOAM.so(_ZN4Foam14objectRegistry14readIfModifiedEv+0x8b)[0x70de87b28e2b]
[ip-172-31-8-119:08412] [ 5] /opt/openfoam12/platforms/linux64GccDPInt32Opt/lib/libOpenFOAM.so(_ZN4Foam14objectRegistry19readModifiedObjectsEv+0x17)[0x70de87b2bbd7]
[ip-172-31-8-119:08412] [ 6] /opt/openfoam12/platforms/linux64GccDPInt32Opt/lib/libOpenFOAM.so(_ZNK4Foam4Time3runEv+0xc8)[0x70de87b5f818]
[ip-172-31-8-119:08412] [ 7] foamMultiRun(+0x5502)[0x5fc5312ea502]
[ip-172-31-8-119:08412] [ 8] /usr/lib/x86_64-linux-gnu/libc.so.6(+0x2a1ca)[0x70de86e2a1ca]
[ip-172-31-8-119:08412] [ 9] /usr/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0x8b)[0x70de86e2a28b]
[ip-172-31-8-119:08412] [10] foamMultiRun(+0x5955)[0x5fc5312ea955]
[ip-172-31-8-119:08412] *** End of error message ***
--------------------------------------------------------------------------
Primary job terminated normally, but 1 process returned
a non-zero exit code. Per user-direction, the job has been aborted.
--------------------------------------------------------------------------
--------------------------------------------------------------------------
mpirun noticed that process rank 0 with PID 0 on node ip-172-31-8-119 exited on signal 11 (Segmentation fault).

TagsNo tags attached.

Activities

henry

2024-11-15 13:05

manager   ~0013464

Maintaining the collated file format is challenging, it is very complex code which has introduced maintenance overheads all over OpenFOAM. Will you contribute to maintenance funding to cover working on this?

henry

2024-11-20 14:57

manager   ~0013467

Unable to reproduce in either OpenFOAM-12 or OpenFOAM-dev. Requires a support contract to investigate further including access to the reporters computer to analyse the problem.

Issue History

Date Modified Username Field Change
2024-11-15 10:36 jherb New Issue
2024-11-15 13:05 henry Note Added: 0013464
2024-11-20 14:57 henry Assigned To => henry
2024-11-20 14:57 henry Status new => closed
2024-11-20 14:57 henry Resolution open => unable to reproduce
2024-11-20 14:57 henry Note Added: 0013467