Error in Cray compiler when I am writing output file

263 Views Asked by At

I am using exactly the same code in two different clusters. One cluster runs with mpi intel fortran and the other runs with Cray Fortran. The former is an old cluster and the latter is the newest we have at school. The implementation woks perfectly on the old cluster (MPI INTEL FORTRAN) but the implementation does not work in the Cray Fortran cluster. The portion of the output subroutine that is giving error is this:

    Subroutine Output
use Variables
implicit none

! Formating section

398   format(6(e22.15,2x))
399   format(7(e22.15,2x))
39   format(5(e22.15,2x))

!!! Computing Cp for postprocessing purposes
Cp = gamma*R_gas/(gamma-1)

! Creating the Global mesh
If(MyRank ==0) Call GridGlobal
    If(MyRank==0) then
    open(330, file = 'Primitive_Variables.dat')
    write(330,*) 'TITLE = "Primitive Variables Contours"'
    write(330,*) 'VARIABLES = "X"'
    write(330,*) '"Y"'
    write(330,*) '"U Velocity"'
    write(330,*) '"V Velocity"'
    write(330,*) '"Density"'
    write(330,*) '"Temperature"'

    write(330,*)'  zone T = "zone1", I = ',ImaxGlobal,' J= ',JmaxGlobal,' F = point'

    do j = 1,JmaxGlobal
      do i = 1,ImaxGlobal
        write(330,398) xGlobal(i,j),yGlobal(i,j),u_oldGlobal(i,j),v_oldGlobal(i,j),r_oldGlobal(i,j),T_oldGlobal(i,j)
      enddo
    enddo
    close(330)
    End If

When I run my implementation the error that I get is the following:

Application 135822 exit codes: 134
Application 135822 exit signals: Killed
Application 135822 resources: utime ~185s, stime ~1s, Rss ~1444696, inblocks ~1023410, outblocks ~5529676

pwd

setenv OMP_NUM_THREADS 1

if ( -e IDS ) then
aprun -j 1 -n 32 ./IDS

sys-38 : UNRECOVERABLE error on system request
  Function not implemented

Encountered during a CLOSE of unit 330
Fortran unit 330 is not connected
_pmiu_daemon(SIGCHLD): [NID 00018] [c0-0c0s4n2] [Wed Aug  2 16:57:32 2017] PE RANK 0 exit signal Aborted
[NID 00018] 2017-08-02 17:53:44 Apid 135820: initiated application termination
else

exit

With this, the output subroutine stops printing the results and my computation is useless.

Thanks in advance

For the Record: This only occurs for big arrays. By big I mean greater than 2001x2001. I know that this is not big at all, but for smaller arrays the error does not pop up. The Subrotuine allocates the arrays required for the printing, it starts printing the file but it does not finish printing the whole file. After few elements the process stops and the error pops up. The program creates the file and it starts writing the solution in the file and then it stops printing the solution. I have tried running it with different number of PE and the problem always pops up.

The variables are declared in the following way:

integer, parameter :: dp = 8
real(kind=dp),dimension(:,:),allocatable::r_old,u_old,v_old,T_old,a_old

The code looks like:

        DO kk=1, 2001

 ! This section calls different subroutine
 ! They are not relevant for the discussion

Call MPI_BARRIER(MPI_COMM_WORLD,ierr) ! Barrier in MPI
Enddo

! Postprocessing tasks and restart file
Call MPI_BARRIER(MPI_COMM_WORLD,ierr) ! Barrier in MPI
Call KillArrays   ! Deallocating the arrays not needed for writing output
Call Write_SolutionRestart
Call Output
Call MPI_BARRIER(MPI_COMM_WORLD,ierr) 
Call MPI_FINALIZE(ierr)

The subroutine that is giving me problem is "Output".

The problem is always Rank 0 according to the error file.

0

There are 0 best solutions below