26. Troubleshooting Common Issues on Ubuntu Server: A Practical Guide

Introduction

Ubuntu Server is a dependable and widely-adopted Linux distribution, but even the most reliable systems can face challenges that impact performance or disrupt operations. Understanding how to troubleshoot common issues is crucial for maintaining uptime and ensuring seamless functionality. In this guide, “Troubleshooting Common Issues on Ubuntu Server,” we’ll explore practical techniques to diagnose and resolve frequent problems, such as boot failures, SSH connectivity errors, network issues, and more. You’ll also learn how to utilize system logs effectively to pinpoint the root cause of problems and restore your server to optimal performance.

Previous articles:

01. Introduction to Ubuntu Server – SysOSX: AI & Cloud

02. How to Setup Your First Ubuntu Server: A Beginner’s Guide – SysOSX: AI & Cloud

03.Mastering the Linux Command Line for Ubuntu Server – SysOSX: AI & Cloud

04. Managing Users and Permissions on Ubuntu Server: A Comprehensive Guide – SysOSX: AI & Cloud

05. Networking Basics for Ubuntu Server: A Comprehensive Guide – SysOSX: AI & Cloud

06. Installing and Managing Software on Ubuntu Server: A Complete Guide – SysOSX: AI & Cloud

07.Patching and Updating Ubuntu Server: A Comprehensive Guide – SysOSX: AI & Cloud

08. Securing Your Ubuntu Server: Practical Steps for Hardening and Protection – SysOSX: AI & Cloud

09. Ubuntu server auditing and logging

10. System Monitoring Tools for Ubuntu Server: A Comprehensive Guide – SysOSX: AI & Cloud

11. Centralized Logging for ubuntu server: a must read guide – SysOSX: AI & Cloud

12. Audit and Compliance for Ubuntu Server: Best Practices – SysOSX: AI & Cloud

13.Setting Up Your Ubuntu Server for Hosting and Web Applications: Ultimate Guide – SysOSX: AI & Cloud

14. Host Multiple Websites on a single Ubuntu Server: Useful tips – SysOSX: AI & Cloud

15. Managing Storage and Disks on Ubuntu Server: Simple Guide – SysOSX: AI & Cloud

16. Setting Up File Sharing Services on Ubuntu Server: a complete guide – SysOSX: AI & Cloud

17. Setting Up a Secure Database Server on Ubuntu: MySQL vs. MariaDB – SysOSX: AI & Cloud

18. How to Configure and Use LVM on Ubuntu Server: A Comprehensive Guide – SysOSX: AI & Cloud

19. Introduction to Virtualization with KVM: A Beginner’s Guide for Ubuntu Server – SysOSX: AI & Cloud

20. How to Install Docker on Ubuntu Server: A Step-by-Step Guide to Containerization – SysOSX: AI & Cloud

21. Getting Started with LXD: Install Containers with Ease + Cheat Sheet – SysOSX: AI & Cloud

22. Getting Started with Podman on Ubuntu Server: A complete guide – SysOSX: AI & Cloud

23. Deploying Ubuntu Server in the Cloud: A Step-by-Step Guide – SysOSX: AI & Cloud

24. Automating Ubuntu Server Setup with Ansible: A Practical Guide – SysOSX: AI & Cloud

25.Upgrading Ubuntu Server from 22.04 to 24.04: A Step-by-Step Guide – SysOSX: AI & Cloud

Common Ubuntu Server Issues and How to Troubleshoot

1. Boot Problems

Boot issues can occur due to corrupted files, hardware failures, or configuration errors.

Symptoms:

Server fails to boot and displays a blank screen or error messages.
The GRUB menu is inaccessible or missing.
Kernel panic errors appear during startup.

Troubleshooting Steps:

Step 1: Access GRUB Menu

Restart the server and hold down the Shift key (or Esc key for UEFI systems) to access the GRUB menu.

Step 2: Boot into Recovery Mode

Select Advanced Options > Recovery Mode to boot into a minimal environment for troubleshooting.

Step 3: Check System Logs

Inspect the system logs for errors related to the boot process:

sudo journalctl -b

Look for errors or warnings related to kernel modules, disk mounting, or services.

Step 4: Repair Broken Packages

Run the following commands to fix broken packages:

sudo apt update  
sudo apt install --fix-broken

Step 5: Reinstall GRUB

If GRUB is corrupted, reinstall it:

sudo grub-install /dev/sda  
sudo update-grub

Step 6: Check Disk Health

Use fsck to check and repair disk errors:

sudo fsck /dev/sda

2. SSH Errors

SSH errors can prevent remote access to the server, disrupting management and automation tasks.

Symptoms:

Unable to connect to the server via SSH.
“Connection refused” or “Permission denied” errors.
SSH hangs or times out.

Troubleshooting Steps:

Step 1: Verify SSH Service

Ensure the SSH service is running:

sudo systemctl status ssh

If the service is inactive, start it:

sudo systemctl start ssh

Step 2: Check SSH Logs

Inspect the SSH logs for detailed error messages:

sudo tail -n 50 /var/log/auth.log

Common issues include:

Authentication failures.
IP address blocks due to fail2ban or firewall rules.

Step 3: Check Firewall Rules

Ensure the firewall allows SSH traffic:

sudo ufw allow OpenSSH  
sudo ufw status

Step 4: Inspect SSH Configuration

Check for errors in the SSH configuration file:

sudo nano /etc/ssh/sshd_config

Restart SSH after making changes:

sudo systemctl restart ssh

Step 5: Verify Network Connectivity

Ensure the server is reachable:

ping <server-ip>

3. Network Connectivity Issues

Network issues can prevent the server from accessing the internet or communicating with other devices.

Symptoms:

Unable to ping external websites or other servers.
DNS resolution errors.
Network interfaces are down or misconfigured.

Troubleshooting Steps:

Step 1: Check Network Interfaces

List active network interfaces:

ip a

If an interface is down, bring it up:

sudo ip link set <interface-name> up

Step 2: Check Logs for Network Errors

Inspect network-related logs for potential issues:

sudo journalctl -u systemd-networkd

Step 3: Verify IP Configuration

Check the server’s IP address, gateway, and DNS settings:

cat /etc/netplan/*.yaml

If the configuration is incorrect, edit the file and apply changes:

sudo nano /etc/netplan/*.yaml  
sudo netplan apply

Example Netplan Configuration:

network:  
  version: 2  
  ethernets:  
    eth0:  
      dhcp4: true

Step 4: Test DNS Resolution

Verify DNS resolution:

nslookup google.com

If DNS resolution fails, update /etc/resolv.conf with valid DNS servers:

nameserver 8.8.8.8  
nameserver 8.8.4.4

4. Disk Space Issues

Low disk space can cause applications to crash or prevent updates.

Symptoms:

“No space left on device” errors.
Server becomes unresponsive.

Troubleshooting Steps:

Step 1: Check Disk Usage

Use df to check disk usage:

df -h

Step 2: Identify Large Files

Find large files consuming disk space:

sudo du -ah / | sort -rh | head -n 10

Step 3: Check Log File Sizes

Log files can grow excessively large. Check /var/log for oversized logs:

sudo du -sh /var/log/*

Remove or archive old logs:

sudo rm -rf /var/log/*.gz  
sudo rm -rf /var/log/*.log.old

Step 4: Extend Disk Space

If the disk is full, extend the volume (cloud platforms like AWS, Azure, and GCP allow resizing disks).

5. Package Installation Errors

Package installation errors can occur due to broken dependencies or repository issues.

Symptoms:

“Unable to locate package” or “Dependency errors” during installation.

Troubleshooting Steps:

Step 1: Check Logs for Package Errors

Inspect apt logs for detailed errors:

sudo tail -n 50 /var/log/apt/history.log

Step 2: Update Package Index

Ensure the package index is up-to-date:

sudo apt update

Step 3: Fix Broken Dependencies

Run the following command to resolve dependency issues:

sudo apt install --fix-broken

Step 4: Add Missing Repositories

If a package is unavailable, add the required repository:

sudo add-apt-repository universe  
sudo apt update

6. High CPU or Memory Usage

Excessive resource usage can cause performance degradation and slow response times.

Symptoms:

Server becomes unresponsive.
Applications crash or timeout.

Troubleshooting Steps:

Step 1: Monitor Resource Usage

Use top or htop to monitor CPU and memory usage:

top

Step 2: Check Logs for Application Errors

Inspect application-specific logs in /var/log for errors causing high resource usage.

Step 3: Identify Resource-Hungry Processes

Find processes consuming excessive resources:

ps aux --sort=-%cpu | head -n 10  
ps aux --sort=-%mem | head -n 10

Step 4: Kill Unnecessary Processes

Terminate resource-hungry processes:

sudo kill <pid>

Using Logs for Troubleshooting

Logs are one of the most valuable tools when troubleshooting server issues. Here’s how to check logs for common problems:

System Logs: sudo journalctl -xe Provides detailed logs about system-level errors and warnings.
Authentication Logs: sudo tail -n 50 /var/log/auth.log Useful for diagnosing SSH and login-related issues.
Application-Specific Logs:
Check /var/log for application-specific logs (e.g., Apache, MySQL).
Kernel Logs: sudo dmesg Displays kernel-related messages, useful for hardware and boot issues.

Conclusion

Troubleshooting Ubuntu Server issues requires a systematic approach to diagnosing and resolving problems. In this guide, we covered:

Resolving boot problems, SSH errors, and network connectivity issues.
Fixing disk space and package installation errors.
Using logs effectively to diagnose issues.

By following these steps and leveraging logs, you can identify and fix server problems efficiently, ensuring a stable and reliable Ubuntu Server environment.

Introduction

Table of Contents

Common Ubuntu Server Issues and How to Troubleshoot

1. Boot Problems

Symptoms:

Troubleshooting Steps:

Step 1: Access GRUB Menu

Step 2: Boot into Recovery Mode

Step 3: Check System Logs

Step 4: Repair Broken Packages

Step 5: Reinstall GRUB

Step 6: Check Disk Health

2. SSH Errors

Symptoms:

Troubleshooting Steps:

Step 1: Verify SSH Service

Step 2: Check SSH Logs

Step 3: Check Firewall Rules

Step 4: Inspect SSH Configuration

Step 5: Verify Network Connectivity

3. Network Connectivity Issues

Symptoms:

Troubleshooting Steps:

Step 1: Check Network Interfaces

Step 2: Check Logs for Network Errors

Step 3: Verify IP Configuration

Example Netplan Configuration:

Step 4: Test DNS Resolution

4. Disk Space Issues

Symptoms:

Troubleshooting Steps:

Step 1: Check Disk Usage

Step 2: Identify Large Files

Step 3: Check Log File Sizes

Step 4: Extend Disk Space

5. Package Installation Errors

Symptoms:

Troubleshooting Steps:

Step 1: Check Logs for Package Errors

Step 2: Update Package Index

Step 3: Fix Broken Dependencies

Step 4: Add Missing Repositories

6. High CPU or Memory Usage

Symptoms:

Troubleshooting Steps:

Step 1: Monitor Resource Usage

Step 2: Check Logs for Application Errors

Step 3: Identify Resource-Hungry Processes

Step 4: Kill Unnecessary Processes

Using Logs for Troubleshooting

Conclusion

Leave a Comment Cancel reply