Filter Commands in Linux System

weixin_0010034

44人浏览 · 2022-08-04 19:28:36

weixin_0010034 · 2022-08-04 19:28:36 发布

In the last post, we looked at viewing and sorting of files.

There are many times that we want to look at only a specific part of the file and even more times when just want to analyze the file content.

In this post, we are going to have a look at basic commands to filter the contents of a file.

1. grep command

The grep command is used to search text or search the given file for lines containing a match to the given strings or words.

The syntax for grep command

grep syntax

To filter a file using grep we use grep along with the options like "case", patterns we want it to match and the source files.

Let's take an example

We have the following log file

log-img

filter out which user is mentioned in the logs

grep-user

In the above example, we used the command cat logs.txt | grep user@ where

cat represents cat command to read the file content
logs.txt represents the file to read content from
grep represents the grep command
User@ represents the text we used to filter the logs.txt file.

Note: The grep command by default is case-sensitive

get the same output as above using User@ as the pattern

grep-i

In the above example, we first used the command to check whether we can directly use the User@ but it did not work sincegrep is case-sensitive so to fix this we used the command

cat logs.txt | grep -i User@ where
cat represents cat command to read the file content
logs.txt represents the file to read content from
grep represents the grep command
-i represents the option to filter case-insensitive
user@ represents the text we used to filter the logs.txt file.

We can also use the grep command to get the count of a specific word

find number of occurrences of the word "exceed"

grep-count

In the above example, we used the command cat logs.txt | grep -ic exceed where
cat represents cat command to read the file content
logs.txt represents the file to read content from
grep represents the grep command
-ic represents the option to count the number of **occurrences of the word irrespective of the case
exceed represents the text we used to filter the logs.txt file.

What if we need to revert the above case?

find the count of words that do not match the word "exceed"

grep-invert

In the above example, we used the command cat logs.txt | grep -cv exceed where
cat represents cat command to read the file content
logs.txt represents the file to read content from
grep represents the grep command
-cv represents the option to count the number of occurrences of the words not matching
exceed represents the text we used to filter the logs.txt file.

Let's move on to the next command

2.wc command

The wc stands for word count is used for printing the number of lines, words, and bytes contained in files

The syntax for wc command

wc-syntax

To use the wc command we simply use wc along with options and files

Let's take an example

wc-basic

In the above example, we used the command wc logs.txt to get the number of lines, words, and bytes.

get only the number of lines in a file

wc-no-of-lines

In the above example, we used the option -l and created a command wc -l logs.txt to get the number of lines.

Similarly, we can use -w for words, -c for bytes and -m for characters

3.uniq command

The uniq command is used to report or omit repeated lines.

By default, uniq prints its input lines, except that it filters all but the first of adjacent repeated lines so that no output lines are repeated.

uniq is mostly used with sort command. the sort command provides with the sorted input and uniq command removes the repeated lines.

Let's take an example of the following file

random-file

use unique to filter file contents

uniq-count

In the above example, we first used the command sort random-words.txt | we to find the current count and then to get the unique values only we used sort random-words.txt | uniq | wc where

sort represents sort command
random-words.txt represents the input file
uniq represents uniq command
wc represents wordcount command

4.tr command

The tr command is used to translate, squeeze, and/or delete characters.

The syntax for tr command

tr-syntax

By default, the tr command will replace each character in SET1 with each character in the same position in SET2.

Let's take examples for better understanding

convert lowercase to uppercase

tr-lower-upper

In the above example, we used the command tr "[:lower:]" "[:upper:]" where
tr represents tr command
[:lower:] represents SET1 all the lower-case letters[abcd...]
[:upper:] represents SET2 all the upper-case letters[ABCD...]

delete a specified character

tr-delete

In the above example, we used the command cat hello.txt | tr -d 'o' where
cat represents the concatenation command
hello.txt represents the input file
tr represents tr command
-d represents the delete option
o represents character to be deleted

join all the lines into a single line

tr-single-line

In the above example, we used the command tr -s '\n' ' ' < logs.txt where
tr represents tr command
-s represents squeeze repeats
\n represents a new line
' ' represents single space
logs.txt represents the input file

5.head and tail command

In the recent past, we already had a look at this command. Let's take a recap.

head command is used to print the lines from the start of the file whereas the tail command is used to print the files from the bottom of the file.

Let's take an example for better understanding