Working with Files and Folders in Python
In today’s technologically driven society, we often need to work with files and folders on our computers, whether we’re managing documents, media, or code. However, finding the right files and folders can be time-consuming, especially when you have a large number of them.
Fortunately, there are tools available to streamline this process and make it easier to locate the files and folders you need. In this article, we’ll explore two such tools: the glob module and the os module.
We’ll show you how to use these modules to search for files with a specific extension, find files and folders based on a specific pattern, and more.
1) Listing Files with a Specific Extension
One of the most common tasks when managing files is finding all the files with a specific extension. This is where the glob and os modules come in handy.
Both modules provide a simple and effective way to list files in a directory that match a specific pattern.
Using the glob module
The glob module provides a simple way to search for files based on a pattern. When using the glob module, you specify a pattern with a wildcard character that matches one or more characters in the file name.
Here’s an example of how to use the glob module to list all files with a .txt extension:
import glob
txt_files = glob.glob('*.txt')
for file in txt_files:
print(file)
This code will print out a list of all the files in the current directory that end with .txt. You can modify the search pattern to match any other file extension, such as .pdf or .docx.
Using the os module
The os module provides another way to list files with a specific extension. Instead of using a wildcard character, you can use the os module to iterate through all the files in a directory and check if they have the desired extension.
Here’s an example of how to use the os module to list all files with a .txt extension:
import os
dir_path = '/path/to/directory'
for file in os.listdir(dir_path):
if file.endswith('.txt'):
print(file)
This code will print out a list of all the files in the specified directory that end with .txt. You can modify the search pattern to match any other file extension.
Listing files in directory and subdirectories with extension txt
If you need to search for files in subdirectories as well, you can use either the glob module or the os.walk() function.
Using the glob module
To search for files in subdirectories with the glob module, use the ** wildcard character to indicate that you want to search recursively through all subdirectories. Here’s an example of how to list all files with a .txt extension in the current directory and its subdirectories:
import glob
txt_files = glob.glob('**/*.txt', recursive=True)
for file in txt_files:
print(file)
This code will print out all files with a .txt extension in the current directory and its subdirectories.
Using the os.walk() function
The os.walk() function allows you to traverse a directory tree, starting at a specified directory, and return the path, directories, and files at each level.
Here’s an example of how to use the os.walk() function to list all files with a .txt extension in the current directory and its subdirectories:
import os
dir_path = '/path/to/directory'
for root, dirs, files in os.walk(dir_path):
for file in files:
if file.endswith('.txt'):
print(os.path.join(root, file))
This code will print out all files with a .txt extension in the current directory and its subdirectories.
2) Using the Glob Module to Find Files and Folders
The glob module can also be used to search for files and folders based on a specific pattern. This can be useful when you know part of the file or folder name but aren’t sure of the full name.
The glob module provides a way to search for files and folders based on pattern matching.
When using the glob module, you specify a pattern with a wildcard character that matches one or more characters in the file or folder name. Here’s an example of how to use the glob module to find all files and folders that start with “test”:
import glob
test_files = glob.glob('test*')
for file in test_files:
print(file)
This code will print out a list of all the files and folders in the current directory that start with “test”.
Finding files in a directory with a specific pattern
If you need to search for files in a directory that match a specific pattern, you can use the glob module to do so. Here’s an example of how to find all files in a directory with “test” in their name:
import glob
dir_path = '/path/to/directory'
test_files = glob.glob(dir_path + '/test*')
for file in test_files:
print(file)
This code will print out a list of all the files in the specified directory that have “test” in their name.
Searching for files in subdirectories with a specific pattern
Sometimes, you may need to search for files in subdirectories that match a specific pattern. The glob module can handle this as well.
Here’s an example of how to search for all files in the current directory and its subdirectories that have “test” in their name:
import glob
test_files = glob.glob('**/test*', recursive=True)
for file in test_files:
print(file)
This code will print out all files in the current directory and its subdirectories that have “test” in their name.
3) Using the OS Module to Work with Files and Directories
As you work with files and folders on your computer, you may need to perform various operations such as listing files, creating new directories, moving files, and checking file properties. The os module provides a platform-independent way to interact with the file system and perform these operations.
The os module is a part of the Python standard library that provides a range of functions for working with the operating system.
These functions allow you to perform tasks such as navigating directories, creating and deleting files and folders, and getting information about files and folders. The os module provides an abstraction layer over the file system, which allows your code to be portable across different operating systems.
Listing files and folders in a directory
If you need to list all the files and directories in a given directory, you can use the os.listdir() function. Here’s an example of how to use the os module to list all the files and directories in a given directory:
import os
dir_path = '/path/to/directory'
# List all files and directories in dir_path
for filename in os.listdir(dir_path):
# Print the filename
print(filename)
This code will print out all the files and directories in the specified directory.
Working with file paths
When working with files and directories, it’s important to be able to create and manipulate file paths. The os module provides a range of functions to work with file paths, including os.path.join(), os.path.basename(), os.path.exists(), os.path.isfile(), and os.path.isdir().
Here’s an example of how to use these functions to manipulate file paths:
import os
# Create a file path by joining directory and filename
dir_path = '/path/to/directory'
filename = 'myfile.txt'
file_path = os.path.join(dir_path, filename)
# Print the basename of the file path
print(os.path.basename(file_path))
# Check if the file exists
if os.path.exists(file_path):
# Check if it's a file
if os.path.isfile(file_path):
print('File found')
# Check if it's a directory
elif os.path.isdir(file_path):
print('Directory found')
else:
print('File not found')
In this example, we create a file path by joining a directory path and a file name using os.path.join(). We then use os.path.basename() to extract the file name from the file path.
Finally, we use os.path.exists(), os.path.isfile(), and os.path.isdir() to check if the file exists and whether it’s a file or a directory.
Conclusion
In this article, we’ve explored two powerful modules for working with files and folders in Python: the glob and os modules. We’ve shown you how to use these modules to list files with a specific extension, find files and folders based on a specific pattern, and more.
With these tools at your disposal, you’ll be able to work more efficiently with your files and folders, saving you time and effort. Happy coding!