1 [node18:02644] [[37701,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 142
2 [node18:02644] [[37701,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 82
3 [node18:02644] [[37701,0],0] ORTE_ERROR_LOG: File open fail[node18:02645] [[37700,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 142
4 [node18:02645] [[37700,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 82
5 [node18:02645] [[37700,0],0] ORTE_ERROR_LOG: File open fail[node18:02633] [[37720,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 142
6 [node18:02633] [[37720,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 82
7 [node18:02633] [[37720,0],0] ORTE_ERROR_LOG: File open fail[node18:02635] [[37722,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 142
8 [node18:02635] [[37722,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 82
9 [node18:02635] [[37722,0],0] ORTE_ERROR_LOG: File open fail[node18:02646] [[37703,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 142
10 [node18:02646] [[37703,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 82
11 [node18:02646] [[37703,0],0] ORTE_ERROR_LOG: File open fail[node18:02643] [[37698,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 142
12 [node18:02643] [[37698,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 82
13 [node18:02643] [[37698,0],0] ORTE_ERROR_LOG: File open fail[node18:02647] [[37702,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 142
14 [node18:02647] [[37702,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 82
15 [node18:02647] [[37702,0],0] ORTE_ERROR_LOG: File open fail[node18:02637] [[37724,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 142
16 [node18:02637] [[37724,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 82
17 [node18:02637] [[37724,0],0] ORTE_ERROR_LOG: File open fail[node18:02641] [[37696,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 142
18 [node18:02641] [[37696,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 82
19 [node18:02641] [[37696,0],0] ORTE_ERROR_LOG: File open fail[node18:02636] [[37725,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 142
20 [node18:02636] [[37725,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 82
21 [node18:02636] [[37725,0],0] ORTE_ERROR_LOG: File open fail[node18:02634] [[37723,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 142
22 [node18:02634] [[37723,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 82
23 [node18:02634] [[37723,0],0] ORTE_ERROR_LOG: File open fail[node18:02640] [[37697,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 142
24 [node18:02640] [[37697,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 82
25 [node18:02640] [[37697,0],0] ORTE_ERROR_LOG: File open fail[node18:02638] [[37727,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 142
26 [node18:02638] [[37727,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 82
27 [node18:02638] [[37727,0],0] ORTE_ERROR_LOG: File open fail[node18:02648] [[37705,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 142
28 [node18:02648] [[37705,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 82
29 [node18:02648] [[37705,0],0] ORTE_ERROR_LOG: File open fail[node18:02632] [[37721,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 142
30 [node18:02632] [[37721,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 82
31 [node18:02632] [[37721,0],0] ORTE_ERROR_LOG: File open fail[node18:02642] [[37699,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 142
32 [node18:02642] [[37699,0],0] ORTE_ERROR_LOG: File open failure in file ras_tm_module.c at line 82
33 [node18:02642] [[37699,0],0] ORTE_ERROR_LOG: File open failure in file base/ras_base_allocate.c at line 149
34 [node18:02644] [[37701,0],0] ORTE_ERROR_LOG: File open failure in file orted/orted_main.c at line 574
35 ure in file base/ras_base_allocate.c at line 149
36 [node18:02645] [[37700,0],0] ORTE_ERROR_LOG: File open failure in file orted/orted_main.c at line 574
37 ure in file base/ras_base_allocate.c at line 149
38 [node18:02633] [[37720,0],0] ORTE_ERROR_LOG: File open failure in file orted/orted_main.c at line 574
39 ure in file base/ras_base_allocate.c at line 149
40 [node18:02635] [[37722,0],0] ORTE_ERROR_LOG: File open failure in file orted/orted_main.c at line 574
41 ure in file base/ras_base_allocate.c at line 149
42 [node18:02646] [[37703,0],0] ORTE_ERROR_LOG: File open failure in file orted/orted_main.c at line 574
43 ure in file base/ras_base_allocate.c at line 149
44 [node18:02643] [[37698,0],0] ORTE_ERROR_LOG: File open failure in file orted/orted_main.c at line 574
45 ure in file base/ras_base_allocate.c at line 149
46 [node18:02647] [[37702,0],0] ORTE_ERROR_LOG: File open failure in file orted/orted_main.c at line 574
47 ure in file base/ras_base_allocate.c at line 149
48 [node18:02637] [[37724,0],0] ORTE_ERROR_LOG: File open failure in file orted/orted_main.c at line 574
49 ure in file base/ras_base_allocate.c at line 149
50 [node18:02641] [[37696,0],0] ORTE_ERROR_LOG: File open failure in file orted/orted_main.c at line 574
51 ure in file base/ras_base_allocate.c at line 149
52 [node18:02636] [[37725,0],0] ORTE_ERROR_LOG: File open failure in file orted/orted_main.c at line 574
53 ure in file base/ras_base_allocate.c at line 149
54 [node18:02634] [[37723,0],0] ORTE_ERROR_LOG: File open failure in file orted/orted_main.c at line 574
55 ure in file base/ras_base_allocate.c at line 149
56 [node18:02640] [[37697,0],0] ORTE_ERROR_LOG: File open failure in file orted/orted_main.c at line 574
57 ure in file base/ras_base_allocate.c at line 149
58 [node18:02638] [[37727,0],0] ORTE_ERROR_LOG: File open failure in file orted/orted_main.c at line 574
59 ure in file base/ras_base_allocate.c at line 149
60 [node18:02648] [[37705,0],0] ORTE_ERROR_LOG: File open failure in file orted/orted_main.c at line 574
61 ure in file base/ras_base_allocate.c at line 149
62 [node18:02632] [[37721,0],0] ORTE_ERROR_LOG: File open failure in file orted/orted_main.c at line 574
63 ure in file base/ras_base_allocate.c at line 149
64 [node19:02642] [[37699,0],0] ORTE_ERROR_LOG: File open failure in file orted/orted_main.c at line 574
~
작업을 제출하고 종료한 후 오류 파일에서 이러한 메시지 중 64개를 받았습니다. 작업이 완료되면 생성된 파일 중 하나에서 일부 숫자를 가져와야 합니다. 대신 NaN과 숫자를 얻습니다.
참고로 내 클러스터의 노드 18이 제대로 작동하고 있습니다. 매번 메시지의 노드 번호가 다릅니다. (며칠 전엔 3이었는데, 계산을 다 다시 해보았어요)
구글링해서 오류를 봤습니다. ORTE_ERROR가 MPI 패키지와 관련된 것 같습니다. 내 클러스터 계정에 설치된 MPI 버전은 1.6.5입니다.
내가 입력한 값 중 일부에 문제가 있거나 패키지가 누락되었거나 오래되었다고 생각하시나요?