Section 4.3. File Types

team bbl


4.3. File Types

We've talked about two different types of files so far: regular files and directories. Most files on a UNIX system are either regular files or directories, but there are additional types of files. The types are:

  1. Regular file. The most common type of file, which contains data of some form. There is no distinction to the UNIX kernel whether this data is text or binary. Any interpretation of the contents of a regular file is left to the application processing the file.

    One notable exception to this is with binary executable files. To execute a program, the kernel must understand its format. All binary executable files conform to a format that allows the kernel to identify where to load a program's text and data.

  2. Directory file. A file that contains the names of other files and pointers to information on these files. Any process that has read permission for a directory file can read the contents of the directory, but only the kernel can write directly to a directory file. Processes must use the functions described in this chapter to make changes to a directory.

  3. Block special file. A type of file providing buffered I/O access in fixed-size units to devices such as disk drives.

  4. Character special file. A type of file providing unbuffered I/O access in variable-sized units to devices. All devices on a system are either block special files or character special files.

  5. FIFO. A type of file used for communication between processes. It's sometimes called a named pipe. We describe FIFOs in Section 15.5.

  6. Socket. A type of file used for network communication between processes. A socket can also be used for non-network communication between processes on a single host. We use sockets for interprocess communication in Chapter 16.

  7. Symbolic link. A type of file that points to another file. We talk more about symbolic links in Section 4.16.

The type of a file is encoded in the st_mode member of the stat structure. We can determine the file type with the macros shown in Figure 4.1. The argument to each of these macros is the st_mode member from the stat structure.

Figure 4.1. File type macros in <sys/stat.h>

Macro

Type of file

S_ISREG()

regular file

S_ISDIR()

directory file

S_ISCHR()

character special file

S_ISBLK()

block special file

S_ISFIFO()

pipe or FIFO

S_ISLNK()

symbolic link

S_ISSOCK()

socket


POSIX.1 allows implementations to represent interprocess communication (IPC) objects, such as message queues and semaphores, as files. The macros shown in Figure 4.2 allow us to determine the type of IPC object from the stat structure. Instead of taking the st_mode member as an argument, these macros differ from those in Figure 4.1 in that their argument is a pointer to the stat structure.

Figure 4.2. IPC type macros in <sys/stat.h>

Macro

Type of object

S_TYPEISMQ()

message queue

S_TYPEISSEM()

semaphore

S_TYPEISSHM()

shared memory object


Message queues, semaphores, and shared memory objects are discussed in Chapter 15. However, none of the various implementations of the UNIX System discussed in this book represent these objects as files.

Example

The program in Figure 4.3 prints the type of file for each command-line argument.

Sample output from Figure 4.3 is

     $ ./a.out /etc/passwd /etc /dev/initctl /dev/log /dev/tty \     > /dev/scsi/host0/bus0/target0/lun0/cd /dev/cdrom     /etc/passwd: regular     /etc: directory     /dev/initctl: fifo     /dev/log: socket     /dev/tty: character special     /dev/scsi/host0/bus0/target0/lun0/cd: block special     /dev/cdrom: symbolic link 

(Here, we have explicitly entered a backslash at the end of the first command line, telling the shell that we want to continue entering the command on another line. The shell then prompts us with its secondary prompt, >, on the next line.) We have specifically used the lstat function instead of the stat function to detect symbolic links. If we used the stat function, we would never see symbolic links.

To compile this program on a Linux system, we must define _GNU_SOURCE to include the definition of the S_ISSOCK macro.

Figure 4.3. Print type of file for each command-line argument
 #include "apue.h" int main(int argc, char *argv[]) {     int         i;     struct stat buf;     char        *ptr;     for (i = 1; i < argc; i++) {         printf("%s: ", argv[i]);         if (lstat(argv[i], &buf) < 0) {             err_ret("lstat error");             continue;          }          if (S_ISREG(buf.st_mode))             ptr = "regular";          else if (S_ISDIR(buf.st_mode))             ptr = "directory";          else if (S_ISCHR(buf.st_mode))             ptr = "character special";          else if (S_ISBLK(buf.st_mode))             ptr = "block special";          else if (S_ISFIFO(buf.st_mode))             ptr = "fifo";          else if (S_ISLNK(buf.st_mode))             ptr = "symbolic link";          else if (S_ISSOCK(buf.st_mode))             ptr = "socket";          else             ptr = "** unknown mode **";          printf("%s\n", ptr);   }    exit(0); } 

Historically, early versions of the UNIX System didn't provide the S_ISxxx macros. Instead, we had to logically AND the st_mode value with the mask S_IFMT and then compare the result with the constants whose names are S_IFxxx. Most systems define this mask and the related constants in the file <sys/stat.h>. If we examine this file, we'll find the S_ISDIR macro defined something like

     #define S_ISDIR(mode) (((mode) & S_IFMT) == S_IFDIR) 

We've said that regular files are predominant, but it is interesting to see what percentage of the files on a given system are of each file type. Figure 4.4 shows the counts and percentages for a Linux system that is used as a single-user workstation. This data was obtained from the program that we show in Section 4.21.

Figure 4.4. Counts and percentages of different file types

File type

Count

Percentage

regular file

226,856

88.22 %

directory

23,017

8.95

symbolic link

6,442

2.51

character special

447

0.17

block special

312

0.12

socket

69

0.03

FIFO

1

0.00


    team bbl



    Advanced Programming in the UNIX Environment
    Advanced Programming in the UNIX Environment, Second Edition (Addison-Wesley Professional Computing Series)
    ISBN: 0321525949
    EAN: 2147483647
    Year: 2005
    Pages: 370

    flylib.com © 2008-2017.
    If you may any questions please contact us: flylib@qtcs.net