DHCP lease renewal fails on an AWS Lightsail server

Question 1

Answer

Well, the problem turned out to be related to IPv6.

The networking service does not run properly. It is a one-shot script that is supposed to configure the network after boot.

systemctl status networking shows the following:

Oct 16 11:01:32 ip-172-26-9-21 dhclient[573]: DHCPACK of 172.26.9.21 from 172.26.0.1
Oct 16 11:01:32 ip-172-26-9-21 ifup[366]: RTNETLINK answers: File exists
Oct 16 11:01:32 ip-172-26-9-21 dhclient[573]: bound to 172.26.9.21 -- renewal in 1374 seconds.
Oct 16 11:01:32 ip-172-26-9-21 ifup[366]: bound to 172.26.9.21 -- renewal in 1374 seconds.
Oct 16 11:01:38 ip-172-26-9-21 ifup[366]: Could not get a link-local address
Oct 16 11:01:38 ip-172-26-9-21 ifup[366]: run-parts: /etc/network/if-pre-up.d/cloud_inet6 exited with return code 1
Oct 16 11:01:38 ip-172-26-9-21 ifup[366]: ifup: failed to bring up eth0
Oct 16 11:01:38 ip-172-26-9-21 systemd[1]: networking.service: Main process exited, code=exited, status=1/FAILURE
Oct 16 11:01:38 ip-172-26-9-21 systemd[1]: networking.service: Failed with result 'exit-code'.
Oct 16 11:01:38 ip-172-26-9-21 systemd[1]: Failed to start Raise network interfaces.
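
When the log is longer than the excerpt above, a small filter can pull out the hook that failed. This is a minimal sketch, assuming the log lines have the same "run-parts: <path> exited with return code <n>" shape as in the output above; the function name failed_hooks is made up for illustration.

```shell
#!/bin/sh
# failed_hooks: read log lines on stdin and print "<script path> <exit code>"
# for every line in which run-parts reports a hook's return code.
# Lines that do not match the pattern are dropped.
failed_hooks() {
    sed -n 's/.*run-parts: \(.*\) exited with return code \([0-9][0-9]*\).*/\1 \2/p'
}
```

It could be fed from the journal, for example with journalctl -u networking -b --no-pager | failed_hooks.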

As the output shows, the script /etc/network/if-pre-up.d/cloud_inet6, which is supposed to set up IPv6 via DHCP, does not run properly.

I disabled IPv6 support both in the AWS Lightsail console and in Debian:

echo 'net.ipv6.conf.all.disable_ipv6 = 1' > /etc/sysctl.d/70-disable-ipv6.conf
sysctl -p -f /etc/sysctl.d/70-disable-ipv6.conf
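
To confirm that the setting took effect, the kernel flag can be read back from procfs. This is a minimal sketch; the function name ipv6_disabled and its optional path argument are assumptions added here so the check can be tried against an ordinary file.

```shell
#!/bin/sh
# ipv6_disabled [FLAG_FILE]
# Succeeds (exit status 0) if FLAG_FILE is readable and contains exactly 1.
# FLAG_FILE defaults to the procfs view of net.ipv6.conf.all.disable_ipv6;
# the argument exists only to make the function easy to test.
ipv6_disabled() {
    flag_file=${1:-/proc/sys/net/ipv6/conf/all/disable_ipv6}
    [ -r "$flag_file" ] && [ "$(cat "$flag_file")" = "1" ]
}
```

For example: ipv6_disabled && echo "IPv6 is disabled".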

Since I don't need IPv6 support, I also disabled the IPv6 cloud-init script by moving it out of /etc/network/if-pre-up.d/:

mv /etc/network/if-pre-up.d/cloud_inet6 ~/
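
The same move can be wrapped in a small helper that refuses to overwrite an earlier backup, which makes it easier to restore the hook later. This is only a sketch of the step above, not a different fix; the function name stash_hook and the backup directory are hypothetical.

```shell
#!/bin/sh
# stash_hook HOOK BACKUP_DIR
# Move the hook script HOOK into BACKUP_DIR so run-parts no longer sees it.
# Fails without touching anything if HOOK does not exist or if a file of
# the same name is already present in BACKUP_DIR.
stash_hook() {
    hook=$1
    backup_dir=$2
    [ -f "$hook" ] || { echo "no such hook: $hook" >&2; return 1; }
    mkdir -p "$backup_dir" || return 1
    target=$backup_dir/$(basename "$hook")
    [ ! -e "$target" ] || { echo "backup already exists: $target" >&2; return 1; }
    mv "$hook" "$target"
}
```

For example, run as root: stash_hook /etc/network/if-pre-up.d/cloud_inet6 /root/disabled-hooks.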

After restarting the server, the networking service now runs properly and the server no longer crashes.

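I'm still puzzled as to why this suddenly became a problem. As far as I know, the cloud-init script has been in place since April. I can only speculate: since IPv6 support is fairly new on AWS Lightsail, the problem was probably caused by some change in the AWS infrastructure.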

This cloud-init report on Launchpad appears to document the same problem: https://bugs.launchpad.net/cloud-init/+bug/1863773

Question 2

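Recently I ran into a very similar problem on another server, this time at a different hosting provider. When the DHCP lease runs out, the server loses its connection, just like the AWS server did, and the networking service is in a failed state as well. The cause is again related to IPv6.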

The log file shows the error message "RTNETLINK answers: File exists". It looks as though the networking service fails after trying to set up the IPv6 address.

To fix this, I had to edit the /etc/network/interfaces file. I replaced:

post-up ip -6 route add fe80::1 dev eth0
post-up ip -6 route add default via fe80::1 dev eth0
post-down ip -6 route del default via fe80::1 dev eth0
post-down ip -6 route del fe80::1 dev eth0
iface eth0 inet6 static
        address XXXX:XXXX:XXXX:XXXX:XXXX:XXXX:XXXX
        netmask 64

with:

iface eth0 inet6 static
        address XXXX:XXXX:XXXX:XXXX:XXXX:XXXX:XXXX
        netmask 64
        post-up sleep 5; ip -6 route add fe80::1 dev eth0
        post-up sleep 5; ip -6 route add default via fe80::1 dev eth0
        post-down sleep 5; ip -6 route del default via fe80::1 dev eth0
        post-down sleep 5; ip -6 route del fe80::1 dev eth0

Adding the sleep 5 seems to do the trick.
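
A fixed delay is a guess at how long the interface needs. One possible alternative is to poll until the interface has no IPv6 address left in the tentative state. This is a sketch only: it assumes the delay helps because the address is still undergoing duplicate address detection, which the post does not confirm. The function name wait_for_dad and the IP_CMD override are hypothetical.

```shell
#!/bin/sh
# wait_for_dad IFACE [TIMEOUT_SECONDS]
# Poll once per second until IFACE has no IPv6 address flagged "tentative",
# or give up after TIMEOUT_SECONDS (default 10).
# Returns 0 once nothing is tentative, 1 on timeout.
# An error from the ip command (e.g. unknown interface) produces no output
# and is therefore treated as "nothing tentative".
# IP_CMD can be overridden for testing; it defaults to ip(8).
wait_for_dad() {
    iface=$1
    remaining=${2:-10}
    while [ -n "$(${IP_CMD:-ip} -6 addr show dev "$iface" tentative 2>/dev/null)" ]; do
        [ "$remaining" -gt 0 ] || return 1
        sleep 1
        remaining=$((remaining - 1))
    done
    return 0
}
```

Unlike sleep 5, this returns as soon as the address is ready and reports a timeout through its exit status. Whether it fixes the "File exists" error on a given host would need to be tested there.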

Answer