Centos가 갑자기 다시 시작됩니다. 매번 TERM 신호에 의해 동일한 기본 프로세스가 종료됩니다.

Centos가 갑자기 다시 시작됩니다. 매번 TERM 신호에 의해 동일한 기본 프로세스가 종료됩니다.

가상 머신에서 centos6.9를 실행 중인데 오늘 아침에 이상해졌습니다. 갑자기 다시 시작되었습니다. 처음에는 재시작 간격이 정확히 10분이었다가 5분, 3분으로 줄어들었고 지금은 가끔씩 다릅니다. 다음은 /var/log/messages의 메시지입니다.

May 10 18:40:01 hwmaster01 init: tty (/dev/tty1) main process (2126) killed by TERM signal
May 10 18:40:01 hwmaster01 init: tty (/dev/tty2) main process (2128) killed by TERM signal
May 10 18:40:01 hwmaster01 init: tty (/dev/tty3) main process (2130) killed by TERM signal
May 10 18:40:01 hwmaster01 init: tty (/dev/tty4) main process (2132) killed by TERM signal
May 10 18:40:01 hwmaster01 init: tty (/dev/tty5) main process (2134) killed by TERM signal
May 10 18:40:01 hwmaster01 init: tty (/dev/tty6) main process (2136) killed by TERM signal
May 10 18:40:07 hwmaster01 ntpd[1767]: ntpd exiting on signal 15
May 10 18:40:08 hwmaster01 rpcbind: rpcbind terminating on signal. Restart with "rpcbind -w"

*잠시 후

May 10 18:45:02 hwmaster01 init: tty (/dev/tty1) main process (2137) killed by TERM signal
May 10 18:45:02 hwmaster01 init: tty (/dev/tty2) main process (2139) killed by TERM signal
May 10 18:45:02 hwmaster01 init: tty (/dev/tty3) main process (2141) killed by TERM signal
May 10 18:45:02 hwmaster01 init: tty (/dev/tty4) main process (2143) killed by TERM signal
May 10 18:45:02 hwmaster01 init: tty (/dev/tty5) main process (2146) killed by TERM signal
May 10 18:45:02 hwmaster01 init: tty (/dev/tty6) main process (2148) killed by TERM signal
May 10 18:45:08 hwmaster01 ntpd[1772]: ntpd exiting on signal 15
May 10 18:45:08 hwmaster01 rpcbind: rpcbind terminating on signal. Restart with "rpcbind -w"

*잠시 후

May 10 18:52:01 hwmaster01 init: tty (/dev/tty1) main process (2124) killed by TERM signal
May 10 18:52:01 hwmaster01 init: tty (/dev/tty2) main process (2126) killed by TERM signal
May 10 18:52:01 hwmaster01 init: tty (/dev/tty3) main process (2128) killed by TERM signal
May 10 18:52:01 hwmaster01 init: tty (/dev/tty4) main process (2131) killed by TERM signal
May 10 18:52:01 hwmaster01 init: tty (/dev/tty5) main process (2133) killed by TERM signal
May 10 18:52:01 hwmaster01 init: tty (/dev/tty6) main process (2135) killed by TERM signal
May 10 18:52:09 hwmaster01 ntpd[1767]: ntpd exiting on signal 15
May 10 18:52:10 hwmaster01 rpcbind: rpcbind terminating on signal. Restart with "rpcbind -w"

실행 중인 새로운 압력 도구가 없습니다. 4개의 노드가 서로 다른 가상 머신에 있지만 동일한 하드웨어에 있는 hadoop 클러스터 환경의 마스터 노드입니다. 모든 가상 머신이 하드웨어 수준에서 제대로 작동하는 것처럼 보였지만 이 마스터 노드가 충돌하여 모든 서비스를 중지했습니다. 이 문제에 대해 잘 아는 사람이 있나요?

답변1

strace이 기본 프로세스에 연결할 수 있습니다 . 어떤 프로세스에 의해 종료되었는지 알려줍니다.

관련 정보