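I'm running CentOS 6.9 in a virtual machine, and this morning it started behaving strangely: it suddenly restarted. At first the restarts came exactly 10 minutes apart, then 5 minutes, then 3 minutes, and now the interval varies. Here are the messages from /var/log/messages: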
May 10 18:40:01 hwmaster01 init: tty (/dev/tty1) main process (2126) killed by TERM signal
May 10 18:40:01 hwmaster01 init: tty (/dev/tty2) main process (2128) killed by TERM signal
May 10 18:40:01 hwmaster01 init: tty (/dev/tty3) main process (2130) killed by TERM signal
May 10 18:40:01 hwmaster01 init: tty (/dev/tty4) main process (2132) killed by TERM signal
May 10 18:40:01 hwmaster01 init: tty (/dev/tty5) main process (2134) killed by TERM signal
May 10 18:40:01 hwmaster01 init: tty (/dev/tty6) main process (2136) killed by TERM signal
May 10 18:40:07 hwmaster01 ntpd[1767]: ntpd exiting on signal 15
May 10 18:40:08 hwmaster01 rpcbind: rpcbind terminating on signal. Restart with "rpcbind -w"
*a while later
May 10 18:45:02 hwmaster01 init: tty (/dev/tty1) main process (2137) killed by TERM signal
May 10 18:45:02 hwmaster01 init: tty (/dev/tty2) main process (2139) killed by TERM signal
May 10 18:45:02 hwmaster01 init: tty (/dev/tty3) main process (2141) killed by TERM signal
May 10 18:45:02 hwmaster01 init: tty (/dev/tty4) main process (2143) killed by TERM signal
May 10 18:45:02 hwmaster01 init: tty (/dev/tty5) main process (2146) killed by TERM signal
May 10 18:45:02 hwmaster01 init: tty (/dev/tty6) main process (2148) killed by TERM signal
May 10 18:45:08 hwmaster01 ntpd[1772]: ntpd exiting on signal 15
May 10 18:45:08 hwmaster01 rpcbind: rpcbind terminating on signal. Restart with "rpcbind -w"
*a while later
May 10 18:52:01 hwmaster01 init: tty (/dev/tty1) main process (2124) killed by TERM signal
May 10 18:52:01 hwmaster01 init: tty (/dev/tty2) main process (2126) killed by TERM signal
May 10 18:52:01 hwmaster01 init: tty (/dev/tty3) main process (2128) killed by TERM signal
May 10 18:52:01 hwmaster01 init: tty (/dev/tty4) main process (2131) killed by TERM signal
May 10 18:52:01 hwmaster01 init: tty (/dev/tty5) main process (2133) killed by TERM signal
May 10 18:52:01 hwmaster01 init: tty (/dev/tty6) main process (2135) killed by TERM signal
May 10 18:52:09 hwmaster01 ntpd[1767]: ntpd exiting on signal 15
May 10 18:52:10 hwmaster01 rpcbind: rpcbind terminating on signal. Restart with "rpcbind -w"
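No new stress-testing tools are running. This is the master node of a Hadoop cluster whose 4 nodes are separate virtual machines on the same hardware. All of the virtual machines appeared to be working fine at the hardware level, but this master node crashed and stopped all of its services. Does anyone know what causes this?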
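To put numbers on the restart intervals, the timestamps in the excerpts above can be compared directly. The following is a minimal Python 3 sketch, not a diagnostic tool: the function names are made up for illustration, the year 2000 is a placeholder because syslog timestamps carry no year, and it assumes an English locale for month names. It keys on the tty1 line as one marker per shutdown burst.

```python
from datetime import datetime

def shutdown_times(lines, marker="tty (/dev/tty1) main process"):
    """Return one timestamp per shutdown burst, keyed on the tty1 TERM line."""
    times = []
    for line in lines:
        if marker in line and "killed by TERM signal" in line:
            # The first 15 characters look like "May 10 18:40:01".
            # Syslog omits the year, so a placeholder year (2000) is prepended.
            times.append(datetime.strptime("2000 " + line[:15], "%Y %b %d %H:%M:%S"))
    return times

def intervals_seconds(times):
    """Seconds elapsed between each pair of consecutive bursts."""
    return [int((b - a).total_seconds()) for a, b in zip(times, times[1:])]

# The three tty1 lines copied from the log excerpts in this post.
sample = [
    "May 10 18:40:01 hwmaster01 init: tty (/dev/tty1) main process (2126) killed by TERM signal",
    "May 10 18:45:02 hwmaster01 init: tty (/dev/tty1) main process (2137) killed by TERM signal",
    "May 10 18:52:01 hwmaster01 init: tty (/dev/tty1) main process (2124) killed by TERM signal",
]

times = shutdown_times(sample)
print(intervals_seconds(times))      # [301, 419]
print([t.second for t in times])     # [1, 2, 1]
```

For these three excerpts the gaps are 301 and 419 seconds, and every burst begins one or two seconds after a minute boundary. That timing would be consistent with something scheduled, such as a cron job, but the log alone does not prove it.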
Answer 1
You can attach strace to this main process. It will tell you which process killed it.
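As a rough illustration of how the sender could be read out of strace output: after attaching with strace -p <PID>, recent strace versions print a signal-delivery line containing the siginfo fields, including si_pid (the sending process). The Python 3 sketch below parses such a line and looks the PID up under /proc. The sample line is illustrative and not taken from the poster's machine, the helper names are hypothetical, and the older strace shipped with CentOS 6 may print signals in a different format that this pattern would not match.

```python
import re

# Matches a signal-delivery line as printed by recent strace versions.
SIGNAL_LINE = re.compile(r"^--- (SIG\w+) \{(.*)\} ---$")

def parse_signal_line(line):
    """Return (signal_name, sender_pid) for a signal-delivery line, else None."""
    match = SIGNAL_LINE.match(line.strip())
    if not match:
        return None
    name, body = match.groups()
    # Split "si_signo=SIGTERM, si_code=SI_USER, ..." into key/value pairs.
    fields = dict(part.split("=", 1) for part in body.split(", ") if "=" in part)
    pid = fields.get("si_pid")
    return name, (int(pid) if pid is not None else None)

def command_of(pid):
    """Best-effort lookup of a PID's command line via /proc (Linux only)."""
    try:
        with open(f"/proc/{pid}/cmdline", "rb") as fh:
            return fh.read().replace(b"\0", b" ").decode(errors="replace").strip()
    except OSError:
        return None  # the process has already exited, or /proc is unavailable

# Illustrative line only; PID 1234 is made up.
sample = "--- SIGTERM {si_signo=SIGTERM, si_code=SI_USER, si_pid=1234, si_uid=0} ---"
print(parse_signal_line(sample))     # ('SIGTERM', 1234)
```

The /proc lookup only works while the sending process is still alive, so it has to run promptly after the signal is seen. If the reported sender turns out to be PID 1, that would suggest init is stopping the process as part of an orderly shutdown, and the next question becomes what requested that shutdown.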