Network Connectivity Troubleshooting

Postby kwriley87 » 2019-02-12 02:56

Hello-

I am a relatively novice Debian user, seeking some help on troubleshooting a network connectivity problem with a recently deployed server.

I am running FusionPBX on Debian 9 on a Dell PowerEdge R610 server. I have another server just like this one in my datacenter that is working fine, but this particular server drops offline multiple times a day for a few seconds at a time. I first noticed this because I monitor the server with Constellix Sonar and was getting several down and up alerts a day. To rule out false positives from Constellix, I ran a continuous ping to the server from my PC. As soon as the Constellix Sonar down alerts arrived in my email, I saw 5-7 dropped packets in the ping output and then the server came right back up, so the alerts appear to be valid.
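To line the drops up with log entries, it helps to record exactly when each run of lost replies started and ended. The following is a minimal sketch, assuming the ping is run from a Linux machine with iputils `ping -D` (which prefixes each reply with a Unix timestamp) and its output is saved to a file; the function name `find_gaps` and the file name `ping.log` are made up for illustration.

```python
import re

# Matches iputils `ping -D` reply lines, for example:
# [1550000000.123456] 64 bytes from 10.0.0.5: icmp_seq=42 ttl=64 time=0.31 ms
REPLY = re.compile(r"^\[(\d+\.\d+)\].*icmp_seq=(\d+)")


def find_gaps(lines):
    """Return (last_ok_ts, next_ok_ts, missing) for each run of lost replies."""
    gaps = []
    prev = None  # (timestamp, sequence number) of the previous reply seen
    for line in lines:
        m = REPLY.match(line)
        if not m:
            continue  # skip headers, timeouts, and summary lines
        ts, seq = float(m.group(1)), int(m.group(2))
        # A jump of more than one in icmp_seq means replies were lost.
        if prev is not None and seq > prev[1] + 1:
            gaps.append((prev[0], ts, seq - prev[1] - 1))
        prev = (ts, seq)
    return gaps


if __name__ == "__main__":
    from datetime import datetime

    # Assumes the log was produced with: ping -D <server> | tee ping.log
    with open("ping.log") as fh:
        for start, end, missing in find_gaps(fh):
            print(datetime.fromtimestamp(start), "->",
                  datetime.fromtimestamp(end), f"({missing} lost)")
```

The sketch ignores sequence-number wraparound and out-of-order replies, which is usually acceptable for a short capture on a LAN.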

This server has an integrated 4-port Broadcom NIC. I've done the obvious things, such as swapping the patch cable between the server and the switch, moving the server to a different port on the switch, switching to a different interface on the NIC, etc. I even added a PCIe Intel NIC to the server to rule out a faulty network card, and the problem still persists. I am very certain the switch isn't the issue, as we have multiple servers plugged into it that do not drop offline whatsoever. I have also done a full apt update and upgrade, and this server still bounces up and down throughout the day.
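One way to tell whether the link itself is flapping during a drop is to compare the kernel's per-interface counters before and after an alert. This is only a sketch, assuming a Linux kernel that exposes `carrier_changes` and the `statistics/` counters under `/sys/class/net/<iface>/`; the interface name `eth0` and the helper names are placeholders, not anything taken from this server.

```python
from pathlib import Path

# Counters worth watching; paths are relative to /sys/class/net/<iface>/
COUNTERS = (
    "carrier_changes",
    "statistics/rx_errors",
    "statistics/tx_errors",
    "statistics/rx_dropped",
    "statistics/tx_dropped",
)


def read_counters(iface, sysfs_root="/sys/class/net"):
    """Read the counters for one interface; unreadable ones become None."""
    base = Path(sysfs_root) / iface
    result = {}
    for name in COUNTERS:
        try:
            result[name] = int((base / name).read_text().strip())
        except (OSError, ValueError):
            result[name] = None
    return result


def changed(before, after):
    """Return {counter: (old, new)} for every counter that moved."""
    return {k: (before.get(k), v) for k, v in after.items()
            if before.get(k) != v}


if __name__ == "__main__":
    import time

    iface = "eth0"  # placeholder: use the name shown by `ip link`
    snapshot = read_counters(iface)
    time.sleep(60)
    print(changed(snapshot, read_counters(iface)))
```

If `carrier_changes` increases across a drop, the link went down at the physical layer; if nothing moves, the cause is more likely above the NIC.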

Can anyone give me any pointers to lead me in the right direction? My gut tells me it's hardware related but since I've added an additional PCIe NIC and that didn't solve the problem, I'm a bit lost at this point.

Thank you for any help!
kwriley87
 
Posts: 4
Joined: 2019-02-12 02:45

Re: Network Connectivity Troubleshooting

Postby dilberts_left_nut » 2019-02-12 04:46

Any clues in the logfiles and dmesg?
AdrianTM wrote:There's no hacker in my grandma...
dilberts_left_nut
 
Posts: 4992
Joined: 2009-10-05 07:54
Location: enzed

Re: Network Connectivity Troubleshooting

Postby kwriley87 » 2019-02-14 02:10

Nothing that sticks out as far as I can see. The syslog just contains a bunch of PHP session file cleanup messages.

Here's the errors in red from a dmesg output:

[ 2.381020] scsi 0:0:32:0: Wrong diagnostic page; asked for 7 got 0
[ 2.431610] scsi 0:0:32:0: Wrong diagnostic page; asked for 7 got 0

[ 3.412460] ACPI Error: No handler for Region [IPMI] (ffff8943af827c18) [IPMI] (20160831/evregion-166)
[ 3.412613] ACPI Error: Region IPMI (ID=7) has no handler (20160831/exfldio-299)
[ 3.412758] ACPI Error: Method parse/execution failed [\_SB.PMI0._GHL] (Node ffff8947af1ac438), AE_NOT_EXIST (20160831/psparse-543)
[ 3.413002] ACPI Error: Method parse/execution failed [\_SB.PMI0._PMC] (Node ffff8947af1ac280), AE_NOT_EXIST (20160831/psparse-543)
[ 3.413246] ACPI Exception: AE_NOT_EXIST, Evaluating _PMC (20160831/power_meter-755)


[ 3.820151] kvm: disabled by bios


Not sure that any of this really explains the network connectivity issues?
kwriley87
 
Posts: 4
Joined: 2019-02-12 02:45

Re: Network Connectivity Troubleshooting

Postby Head_on_a_Stick » 2019-02-14 07:09

kwriley87 wrote:Here's the errors in red from a dmesg output

How about the systemd journal?
Head_on_a_Stick
 
Posts: 10334
Joined: 2014-06-01 17:46
Location: /dev/chair

Re: Network Connectivity Troubleshooting

Postby kwriley87 » 2019-02-15 01:29

Nothing that stands out. I got a drop notification at about 8:10 PM CST, so I ran sudo journalctl --since "2019-02-14 20:00:00" and the only entries were related to PHP session file cleanup, etc.
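A narrower window around each alert can make the journal easier to scan than everything since a fixed time. Below is a small sketch that builds such a command from an alert timestamp. It assumes the timestamp is in the server's local time zone, and the helper name `journal_window_cmd` is invented here; `--since`, `--until`, `--no-pager`, and `-k` (kernel messages from the current boot only) are standard journalctl options.

```python
from datetime import datetime, timedelta

FMT = "%Y-%m-%d %H:%M:%S"


def journal_window_cmd(alert_time, minutes=5, kernel_only=False):
    """Build a journalctl command covering +/- `minutes` around an alert."""
    when = datetime.strptime(alert_time, FMT)
    delta = timedelta(minutes=minutes)
    cmd = [
        "journalctl",
        "--since", (when - delta).strftime(FMT),
        "--until", (when + delta).strftime(FMT),
        "--no-pager",
    ]
    if kernel_only:
        cmd.append("-k")  # kernel ring buffer messages, current boot only
    return cmd


if __name__ == "__main__":
    import subprocess

    # The 20:10 alert mentioned above, with five minutes on either side.
    subprocess.run(journal_window_cmd("2019-02-14 20:10:00"), check=False)
```

Running it once with and once without `kernel_only=True` separates driver and link messages from application noise such as the session cleanup entries.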

It seems that nothing in any of the logs explains why this may be happening. So strange. If anyone has any further suggestions, I would appreciate it. Otherwise, we may just have to replace the server altogether.
kwriley87
 
Posts: 4
Joined: 2019-02-12 02:45

Re: Network Connectivity Troubleshooting

Postby kwriley87 » 2019-03-10 22:35

Back to the drawing board here. I have swapped the server out entirely (kept the hard drives and RAID controller) and the issue still persists.

At this point it has to be OS related. Is there anything else I can try here, short of reinstalling Debian entirely?
kwriley87
 
Posts: 4
Joined: 2019-02-12 02:45

