Find Large Files in Linux

Updated on

4 min read

Find Largest Files in Linux

Over time, your disk drive may get cluttered with a lot of unnecessary files taking up large amounts of disk space. Usually, Linux systems run out of disk space due to large log or backup files.

This tutorial explains how to find the largest files and directories in Linux systems using the find and du commands.

Find Large Files Using the find Command

The find command is one of the most powerful tools in the Linux system administrators’ arsenal. It allows you to search for files and directories based on different criteria, including the file size.

For example, to search for files with size greater than 100 MB, in the current working directory , you would run the following command:

sudo find . -xdev -type f -size +100M
Replace . with the path to the directory where you want to search for the largest files.

The output will show a list of files without any additional information.

/var/lib/libvirt/images/centos-7-desktop_default.img
/var/lib/libvirt/images/bionic64_default.img
/var/lib/libvirt/images/win10.qcow2
/var/lib/libvirt/images/debian-9_default.img
/var/lib/libvirt/images/ubuntu-18-04-desktop_default.img
/var/lib/libvirt/images/centos-7_default.img

The find command also can be used in combination with other tools such as ls or sort to perform operations on those files.

In the example below, we are passing the output of the find command to ls which will print the size of each found file and then pipe that output to the sort command to sort it based on the 5th column which is the file size.

find . -xdev -type f -size +100M -print | xargs ls -lh | sort -k5,5 -h -r

The output will look something like this:

-rw-------  1 root   root 40967M Jan  5 14:12 /var/lib/libvirt/images/win10.qcow2
-rw-------  1 root   root  3725M Jan  7 22:12 /var/lib/libvirt/images/debian-9_default.img
-rw-------  1 root   root  1524M Dec 30 07:46 /var/lib/libvirt/images/centos-7-desktop_default.img
-rw-------  1 root   root   999M Jan  5 14:43 /var/lib/libvirt/images/ubuntu-18-04-desktop_default.img
-rw-------  1 root   root   562M Dec 31 07:38 /var/lib/libvirt/images/centos-7_default.img
-rw-------  1 root   root   378M Jan  7 22:26 /var/lib/libvirt/images/bionic64_default.img

If the output contains a lot of lines of information you can use the head command to print only the first 10 lines:

find . -xdev -type f -size +100M -print | xargs ls -lh | sort -k5,5 -h -r | head

Let’s break down the command:

  • find . -xdev -type f -size +100M -print - search only for files (-type f) in the current working directory (.), larger than than 100MB (-size +100M), don’t descend directories on other filesystems (-xdev) and print the full file name on the standard output, followed by a new line (-print).
  • xargs ls -lh - the output of the find command is piped to xargs which executes the ls -lh command that will print the output in long listing human-readable format.
  • sort -k5,5 -h -r - sort lines based on the 5th column (-k5,5), compare the values in human-readable format (-h) and reverse the result (-r).
  • head : prints only the first 10 lines of the piped output.

The find command comes with a lot of powerful options. For example, you can search for large files that are older than x days, large files with a specific extension, or large files that belong to a particular user.

Find Large Files and Directories Using the du Command

The du command is used to estimate file space usage, and it is particularly useful for finding directories and files that consume large amounts of disk space.

The following command will print the largest files and directories:

du -ahx . | sort -rh | head -5

The first column includes the size of the file and the second one the file name:

55G	.
24G	./.vagrant.d/boxes
24G	./.vagrant.d
13G	./Projects
5.2G	./.minikube

Explanation of the command:

  • du -ahx . : estimate disk space usage in the current working directory (.), count both files and directories (a), print sizes in a human-readable format (h), and skip directories on different file systems (x).
  • sort -rh : sort lines by comparing values in human-readable format (-h) and reverse the result (-r).
  • head -5 : prints only the first five lines of the piped output.

The du command has many other options that can be used to refine the output of the disk space usage.

Conclusion

We’ve shown you how to find the largest files and directories using the find and du commands.

Now that you learn how to found the largest files on your system, you may want to read our guide about How to Remove Files and Directories Using Linux Command Line .

If you have any questions or remarks, please leave a comment below.