Structured Query Language (SQL)

Structured Query Language (SQL) is a standardized language used to work with database objects and the data they contain. Using SQL, you can define, alter, and remove database objects, as well as add, update, delete, and retrieve data values. One of the strengths of SQL is that it can be used in a variety of ways: SQL statements can be executed interactively using tools such as the Command Center and the Command Line Processor, they can be placed directly in UNIX shell scripts or Windows batch files, and they can be embedded in high-level programming language source code files that are precompiled/compiled to create a database application. (Because SQL is nonprocedural by design, it is not an actual programming language; therefore, most embedded SQL applications are built by combining the decision and sequence control of a high-level programming language with the data storage, manipulation, and retrieval capabilities of SQL).

Like most other languages, SQL has a defined syntax and a set of language elements. Most SQL statements can be categorized according to the function they have been designed to perform; SQL statements typically fall under one of the following categories:

Embedded SQL Application Construct Statements. SQL statements used for the sole purpose of constructing embedded SQL applications.

Data Control Language (DCL) Statements. SQL statements used to grant and revoke authorities and privileges.

Data Definition Language (DDL) Statements. SQL statements used to create, alter, and delete database objects.

Data Manipulation Language (DML) Statements. SQL statements used to store data in and retrieve or remove data from database objects.

Transaction Management Statements. SQL statements used to establish and terminate database connections and active transactions.

You do not have to be familiar with the Embedded SQL Application Construct Statements in order to pass the DB2 UDB V8.1 Family Fundamentals certification exam (Exam 700). However, you must know how the more common DCL, DDL, and DML statements are used, and you must be familiar with the Transaction Management Statements available. With that in mind, this chapter will focus on introducing you to the most common SQL statements used to create database objects and manipulate data.

Although basic syntax is presented for most of the SQL statements covered in this chapter, the actual syntax supported may be much more complex. To view the complete syntax for a specific SQL statement or to obtain more information about a particular statement, refer to the IBM DB2 Universal Database, Version 8 SQL Reference Volume 2 product documentation.

Data Control Language (DCL) Statements

You may recall that in Chapter 3, "Security," we saw that authorities and privileges are used to control access to the DB2 Database Manager for an instance, to one or more databases running under that instance's control, and to a particular database's objects. Users are only allowed to work with those objects for which they have been given the appropriate authority or privilege; the authorities and privileges they possess also determine what they can and cannot do with the objects they have access to. Authorities and privileges are given using the GRANT SQL statement. Likewise, authorities and privileges are revoked using the REVOKE SQL statement. The GRANT statement and the REVOKE statement, along with the CONNECT SQL statement, make up the bulk of the Data Control Language (DCL) statements available with DB2 UDB.

The `CONNECT` statement

Before a user can access data stored in a DB2 UDB database (or do anything else with a database for that matter), they must first establish a connection to that database. In some cases, a database connection can be established using the Control Center; however, in most cases, a database connection is established by executing the CONNECT SQL statement. The basic syntax for this statement is:

 CONNECT TO [  DatabaseName  ] <USER [  UserID  ] USING [  Password  ]>

where:

DatabaseName	Identifies the name assigned to the database a connection is to be established with.
UserID	Identifies the authentication ID (or user ID) assigned to the user attempting to establish the database connection.
Password	Identifies the password assigned to the user trying to establish the database connection.

Thus, in order for a user whose authentication ID is "db2user" and whose password is "ibmdb2" to establish a connection to a database named SAMPLE, a CONNECT statement that looks something like this would need to be executed:

 CONNECT TO SAMPLE USER db2user USING ibmdb2

And as soon as the CONNECT statement is successfully executed, you might see a message that looks something like this:

 Database Connection Information Database server        = DB2/NT 8 SQL authorization ID   = DB2USER Local database alias   = SAMPLE

The CONNECT statement can be executed without specifying a user ID and password (since these are optional parameters). When the CONNECT statement is executed without this information, the DB2 Database Manager will either attempt to use the user ID and password you provided to gain access to the system or prompt you for this information. (Such a connect operation is called an implicit connect, as it is implied that the credentials of the current user are to be used; on the other hand, when a user ID and password are specified, the connect operation is called an explicit connect, because the required user credentials have been explicitly provided.

Once a database connection has been established, it will remain in effect until it is explicitly terminated or until the application that established the connection ends. Database connections can be explicitly terminated at any time by executing a special form of the CONNECT statement. The syntax for this form of the CONNECT statement is:

 CONNECT RESET

The `GRANT` statement (revisited)

In Chapter 3, "Security," we saw that authorities and privileges can be explicitly granted to an individual user or a group of users by executing the GRANT SQL statement. Several flavors of the GRANT statement are available, and the appropriate form to use is determined by the database object for which authorities and privileges are to be granted. (Objects for which authorities and privileges can be granted include databases, schemas, tablespaces, tables, indexes, views, packages, routines, sequences, servers, and nicknames.) In general, the basic syntax for the GRANT statement looks something like this:

 GRANT [Privilege] ON [ObjectType] [ObjectName] TO [  Recipient  , ...] <WITH GRANT OPTION>

where:

Privilege	Identifies one or more authorities or privileges to be given to one or more users and/or groups.
ObjectType	Identifies the type of object one or more authorities or privileges are to be granted for.
ObjectName	Identifies the name assigned to the object one or more authorities or privileges are to be granted for.
Recipient	Identifies the name of the user(s) and/or group(s) that are to receive the object privileges specified. The value specified for the this parameter can be any combination of the following:
`USER` [ UserName ]	Identifies a specific user that the privileges specified are to be given to.
`GROUP` [ GroupName ]	Identifies a specific group that the privileges specified are to be given to.
`PUBLIC`	Indicates that the specified privilege(s) are to be given to the special group PUBLIC. (All users are a member of the group PUBLIC).

If the WITH GRANT OPTION clause is specified with the GRANT statement, the user and/or group receiving the specified privileges is given the ability to grant the newly received privileges (except for the CONTROL privilege) to other users.

For more information about available authorities and privileges, and for examples of how the GRANT SQL statement can be used, refer to Chapter 3, "Security."

The `REVOKE` statement (revisited)

Authorities and privileges can be explicitly revoked from an individual user or a group of users by executing the REVOKE SQL statement. As with the GRANT SQL statement, several flavors of the REVOKE statement are available, and the appropriate form to use is determined by the database object for which authorities and privileges are to be revoked. In general, the basic syntax for the REVOKE statement looks something like this:

 REVOKE [Privilege] ON [ObjectType] [ObjectName] FROM [  Forfeiter  , ...] <BY ALL>

where:

Privilege	Identifies one or more authorities or privileges to be taken from one or more users and/or groups.
ObjectType	Identifies the type of object one or more authorities or privileges are to be revoked for.
ObjectName	Identifies the name assigned to the object one or more authorities or privileges are to be revoked for.
Forfeiter	Identifies the name of the user(s) and/or group(s) that are to lose DBADM authority or the privileges specified. As with the `GRANT` statement, the value specified for this parameter can be any combination of the following:
`USER` [ UserName ]	Identifies a specific user that the privileges specified are to be taken from.
`GROUP` [ GroupName ]	Identifies a specific group that the privileges specified are to be taken from.
`PUBLIC`	Indicates that the specified privilege(s) are to be taken from the special group PUBLIC. (All users are a member of the group PUBLIC.)

The BY ALL clause is optional and is provided as a courtesy for administrators who are familiar with the syntax of the DB2 for OS/390 REVOKE SQL statement. Whether it is included or not, the results will always be the same ”the privilege(s) specified will be revoked from all users and/or groups specified, regardless of who granted it originally.

Again, for more information about the authorities and privileges available, and for examples of how the REVOKE statement can be used, refer to Chapter 3, "Security."

Data Definition Language (DDL) Statements

When a database is first created, it cannot be used to store data because, aside from the read-only system catalog tables and views that get created by default, no other data objects exist. That's where the Data Definition Language (DDL) statements come into play. The Data Definition Language statements are a set of SQL statements that are used to define and create objects in a database that will be used both to store user data and to improve data access performance. Various forms of CREATE statements and ALTER statements, along with the DROP statement, make up the set of Data Definition Language (DDL) statements available with DB2 UDB.

The `CREATE BUFFERPOOL` statement

Earlier, we saw that a buffer pool is an area of main memory that has been allocated to the DB2 Database Manager for the purpose of caching table and index data pages as they are read from disk. By default, one buffer pool (named IBMDEFAULTBP) is created for a particular database when that database is first created. On Linux and UNIX platforms, this buffer pool is 1,000 4K (kilobyte) pages in size; on Windows platforms, this buffer pool is 250 4K pages in size. Additional buffer pools can be created by executing the CREATE BUFFERPOOL SQL statement. The basic syntax for this statement is:

 CREATE BUFFERPOOL [  BufferPoolName  ] <IMMEDIATE  DEFERRED> SIZE [  Size  ] <PAGESIZE [  PageSize  ] <K>> <NOT EXTENDED STORAGE  EXTENDED STORAGE>

where:

BufferPoolName	Identifies the name to be assigned to the buffer pool to be created.
Size	Identifies the number of pages (of PageSize size) to be allocated for the buffer pool to be created.
PageSize	Specifies the size that each page used by the buffer pool being created is to be. (Valid values include 4096, 8192, 16384, or 32768 bytes ”if the suffix K (for kilobytes) is provided, this parameter must be set to 4, 8, 16, or 32.) Unless otherwise specified, pages used by buffer pools are 4K in size. (It is important to note that the page size used by a buffer pool determines what tablespaces can be used with it; a buffer pool can only be used by a tablespace that has a corresponding page size.)

If the IMMEDIATE clause is specified with the CREATE BUFFERPOOL statement, the buffer pool will be created immediately unless the amount of memory required is not available, in which case a warning message will be generated and the buffer pool creation process will behave as if the DEFERRED clause were specified. If the DEFERRED clause is specified, the buffer pool will not be created until all connections to the database the buffer pool is to be created for have been terminated. If neither clause is specified, the buffer pool will not be created until all connections to the database the buffer pool is to be created for have been terminated.

So, if you wanted to create a buffer pool that has the name TEMP_BP, consists of 100 pages that are 8 kilobytes in size, and is to be created immediately if enough free memory is available, you could do so by executing a CREATE BUFFERPOOL SQL statement that looks something like this:

 CREATE BUFFERPOOL TEMP_BP IMMEDIATE SIZE 100 PAGESIZE 8 K

Buffer pools can also be created using the Create Buffer Pool dialog, which can be activated by selecting the appropriate action from the Buffer Pools menu found in the Control Center. Figure 5-1 shows the Control Center menu items that must be selected to activate the Create Buffer Pool dialog; Figure 5-2 shows how the Create Buffer Pool dialog might look after its input fields have been populated .

Figure 5-1. Invoking the Create Buffer Pool dialog from the Control Center.

graphics/05fig01.gif

Figure 5-2. The Create Buffer Pool dialog.

graphics/05fig02.gif

The `CREATE TABLESPACE` statement

As mentioned earlier, tablespaces are used to control where data is physically stored and to provide a layer of indirection between database objects and the directories, files, or raw devices (referred to as containers) in which the data physically resides. Depending upon how it is defined, a tablespace can be either a System Managed Space (SMS) tablespace or a Database Managed Space (DMS) tablespace. With SMS tablespaces, each container used must be a directory that resides within the file space of the operating system, and the operating system's file manager is responsible for managing data storage. With DMS tablespaces, each container used must be either a fixed-size preallocated file or a raw device, and the DB2 Database Manager is responsible for managing data storage. By default, three tablespaces (named SYSCATSPACE, USERSPACE1, and TEMPSPACE1) are created for a database as part of the database creation process. Additional tablespaces can be created by executing the CREATE TABLESPACE SQL statement. The basic syntax for this statement is:

[View full width]

  [View full width] 
 CREATE TABLESPACE [  TablespaceName  ] <PAGESIZE [  PageSize  ] <K>> MANAGED BY SYSTEM USING    ( '[  Container  ]' ,... ) <EXTENTSIZE [  ExtentPages   ExtentSize  <K  M  G>]> <PREFETCHSIZE [  PrefetchPages    PrefetchSize  <K  M  G>]> <BUFFERPOOL [  BufferPoolName  ]>

[View full width]

  [View full width] 
 CREATE TABLESPACE [  TablespaceName  ] <PAGESIZE [  PageSize  ] <K>> MANAGED BY DATABASE USING    ( [FILE  DEVICE] '[  Container  ]' [  ContainerSize  ] ,... ) <EXTENTSIZE [  ExtentPages   ExtentSize  <K  M  G>]> <PREFETCHSIZE [  PrefetchPages    PrefetchSize  <K  M  G>]> <BUFFERPOOL [  BufferPoolName  ]>

where:

TablespaceName	Identifies the name to be assigned to the tablespace to be created.
PageSize	Specifies the size that each page used by the tablespace being created is to be. (Valid values include 4096, 8192, 16384, or 32768 bytes ”if the suffix K (for kilobytes) is provided, this parameter must be set to 4, 8, 16, or 32.) Unless otherwise specified, pages used by tablespaces are 4K in size.
Container	Identifies one or more containers that are to be used to store the data associated with the tablespace to be created. (If multiple containers are used, data is written to each in a round- robin fashion, one extent at a time.)
ContainerSize	Identifies the number of pages (of PageSize size) to be stored in the tablespace container specified.
ExtentPages	Identifies the number of pages of data to be written to a single tablespace container before another container will be used.
ExtentSize	Identifies the amount of data to be written to a single tablespace container before another container will be used. The value specified for this parameter is treated as the total number of bytes, unless the letter K (for kilobytes), M (for megabytes), or G (for gigabytes) is also specified. (If an ExtentSize value is specified, it is converted to an ExtentPages value using the PageSize value provided.)
PrefetchPages	Identifies the number of pages of data to be read from the tablespace when data prefetching is performed ( prefetching allows data needed by a query to be read before it is referenced so that the query spends less time waiting for I/O).
PrefetchSize	Identifies the amount of data to be read from the tablespace when data prefetching is performed. The value specified for this parameter is treated as the total number of bytes, unless the letter K (for kilobytes), M (for megabytes), or G (for gigabytes) is also specified. (If a PrefetchSize value is specified, it is converted to a PrefetchPages value using the PageSize value provided.)
BufferPoolName	Identifies the name of the buffer pool to be used by the tablespace to be created. (The page size of the buffer pool specified must match the page size of the tablespace to be created, or the `CREATE TABLESPACE` statement will fail.)

If the MANAGED BY SYSTEM version of this statement is executed, the resulting tablespace will be an SMS tablespace. On the other hand, if the MANAGED BY DATABASE version is executed, the resulting tablespace will be a DMS tablespace. Furthermore, if an SMS tablespace is to be created, only existing directories can be used as that tablespace's storage container(s), and if a DMS tablespace is to be created, only fixed-size preallocated files or physical raw devices can be used as that tablespace's storage container(s).

The size of each container assigned to a single tablespace can differ ; however, optimal performance is achieved when all containers used are the same size.

Thus, if you wanted to create a DMS tablespace that has the name PAYROLL_TS, consists of pages that are 8 kilobytes in size, uses the file DMS_TBS, which is 1000 megabytes in size and resides in the directory C:\TABLESPACES as its storage container, and uses the buffer pool PAYROLL_BP, you could do so by executing a CREATE TABLESPACE SQL statement that looks something like this:

 CREATE TABLESPACE PAYROLL_TS PAGESIZE 8K MANAGED BY DATABASE USING    (FILE 'C:\TABLESPACES\DMS_TBSP.TSF', 100 M) BUFFERPOOL PAYROLL_BP

Tablespaces can also be created using the Create Table Space Wizard, which can be activated by selecting the appropriate action from the Table Spaces menu found in the Control Center. Figure 5-3 shows the Control Center menu items that must be selected to activate the Create Table Space Wizard; Figure 5-4 shows how the first page of the Create Table Space Wizard might look after its input fields have been populated.

Figure 5-3. Invoking the Create Table Space Wizard from the Control Center.

graphics/05fig03.gif

Figure 5-4. The first page of the Create Table Space Wizard.

graphics/05fig04.gif

The `ALTER TABLESPACE` statement

Because SMS tablespaces rely on the operating system for physical storage space management, they rarely need to be modified after they have been successfully created. DMS tablespaces, on the other hand, have to be monitored closely to ensure that the fixed-size preallocated file(s) or physical raw device(s) that they use for storage always have enough free space available to meet the database's needs. When the amount of free storage space available to a DMS tablespace becomes dangerously low (typically less than 10 percent), more free space can be added either by increasing the size of one or more of its containers or by making one or more new containers available to it. Existing tablespace containers can be resized, new containers can be made available to an existing tablespace, and an existing tablespace's properties can be altered by executing the ALTER TABLESPACE SQL statement. The basic syntax for this statement is:

 ALTER TABLESPACE [  TablespaceName  ] ADD ( [FILE  DEVICE] '[  Container  ]' [  ContainerSize  ]    ,... )

 ALTER TABLESPACE [  TablespaceName  ] DROP ( [FILE  DEVICE] '[  Container  ]' ,... )

 ALTER TABLESPACE [  TablespaceName  ] [EXTEND  REDUCE  RESIZE]    ( [FILE  DEVICE] '[  Container  ]' ,... )

 ALTER TABLESPACE [  TablespaceName  ] <PREFETCHSIZE [  PrefetchPages   PrefetchSize  <K  M  G>]> <BUFFERPOOL [  BufferPoolName  ]> <DROPPED TABLE RECOVERY [ON  OFF]>

where:

TablespaceName	Identifies the name assigned to the tablespace to be altered.
Container	Identifies one or more containers that are to be used to store the data associated with the tablespace to be altered.
ContainerSize	Identifies the number of pages to be stored in the tablespace container specified.
PrefetchPages	Identifies the number of pages of data to be read from the tablespace when data prefetching is performed.
PrefetchSize	Identifies the amount of data to be read from the tablespace when data prefetching is performed. The value specified for this parameter is treated as the total number of bytes, unless the letter K (for kilobytes), M (for megabytes), or G (for gigabytes) is also specified. (If a PrefetchSize value is specified, it is converted to a PrefetchPages value using the page size of the tablespace being altered.)
BufferPoolName	Identifies the name of the buffer pool to be used by the tablespace to be altered. (The page size of the buffer pool specified must match the page size used by the tablespace to be altered.)

Thus, if you wanted a fixed-size preallocated file named NEWFILE.TSF, which is 1000 megabytes in size and resides in the directory C:\TABLESPACES, to be used as a new storage container for an existing DMS tablespace named PAYROLL_TS, you could make the PAYROLL_TS tablespace use this file as a new storage container by executing an ALTER TABLESPACE SQL statement that looks like this:

 ALTER TABLESPACE PAYROLL_TS ADD (FILE 'C:\TABLESPACES\NEWFILE.TSF', 100 M)

Tablespaces can also be altered using the Alter Table Space dialog, which can be activated by selecting the appropriate action from the Table Spaces menu found in the Control Center. Figure 5-5 shows the Control Center menu items that must be selected in order to activate the Alter Table Space dialog; Figure 5-6 shows how the first page of the Alter Table Space dialog might look after its input fields have been populated.

Figure 5-5. Invoking the Alter Table Space dialog from the Control Center.

graphics/05fig05.gif

Figure 5-6. The first page of the Alter Table Space dialog.

graphics/05fig06.gif

The `CREATE TABLE` statement

Earlier, we saw that a table is a logical structure used to present data as a collection of unordered rows with a fixed number of columns . Each column contains a set of values of the same data type, and each row contains the actual table data. Because tables are the basic data objects used to store information, many are often created for a single database. Tables are created by executing the CREATE TABLE SQL statement. In its simplest form, the syntax for this statement is:

 CREATE TABLE [  TableName  ] ([ColumnName] [DataType] ,...)

where:

TableName	Identifies the name to be assigned to the table to be created. (A table name must be unique within the schema the table is to be defined in.)
ColumnName	Identifies the unique name (within the table definition) to be assigned to the column that is to be created.
DataType	Identifies the data type (built-in or user defined) to be assigned to the column to be created; the data type specified determines the kind of data values that can be stored in the column. (Table 5-1 contains a list of valid data type definitions.)

Table 5-1. Data Type Definitions That Can Be Used with the `CREATE TABLE` Statement

Definition(s)	Data Type
SMALLINT	Numeric
INTEGER INT	Numeric
BIGINT	Numeric
DECIMAL( Precision, Scale ) DEC( Precision, Scale ) NUMERIC( Precision, Scale ) NUM( Precision, Scale ) where Precision is any number between 1 and 31; Scale is any number between 0 and Precision	Numeric
REAL FLOAT( Precision ) where Precision is any number between 1 and 24	Numeric
DOUBLE FLOAT( Precision ) where Precision is any number between 25 and 53	Numeric
CHARACTER( Length ) <FOR BIT DATA> ^[] CHAR( Length ) <FOR BIT DATA> ^[] where Length is any number between 1 and 254	Character/Character string
CHARACTER VARYING( MaxLength ) <FOR BIT DATA> ^[] CHAR VARYING( MaxLength ) <FOR BIT DATA> ^[] VARCHAR( MaxLength ) <FOR BIT DATA> ^[*] where MaxLength is any number between 1 and 32,672	Character string
LONG VARCHAR	Character string
GRAPHIC( Length ) where Length is any number between 1 and 127	Double-byte character string
VARGRAPHIC( MaxLength ) where MaxLength is any number between 1 and 16,336	Double-byte character string
LONG VARGRAPHIC	Double-byte character string
DATE	Date
TIME	Time
TIMESTAMP	Date and time
BINARY LARGE OBJECT( Size <K M G>) BLOB( Size <K M G>) where Length is any number between 1 and 2,147,483,647; if K (for kilobyte) is specified, Length is any number between 1 and 2,097,152; if M (for megabyte) is specified, Length is any number between 1 and 2,048; if G (for gigabyte) is specified, Length is any number between 1 and 2.	Binary
CHARACTER LARGE OBJECT( Size <K M G>) CHAR LARGE OBJECT( Size <K M G>) CLOB( Size <K M G>) where Length is any number between 1 and 2,147,483,647; if K (for kilobyte) is specified, Length is any number between 1 and 2,097,152; if M (for megabyte) is specified, Length is any number between 1 and 2,048; if G (for gigabyte) is specified, Length is any number between 1 and 2.	Character string
DBCLOB( Size <K M G>) where Length is any number between 1 and 1,073,741,823; if K (for kilobyte) is specified, Length is any number between 1 and 1,048,576; if M (for megabyte) is specified, Length is any number between 1 and 1,024; if G (for gigabyte) is specified, Length is must be 1.	Double-byte character string

^[*] If the FOR BIT DATA option is used with any character string data type definition, the contents of the column the data type is assigned to are treated as binary data.

Thus, if you wanted to create a table that had three columns in it, two of which are used to store numeric values and one that is used to store character string values, you could do so by executing a CREATE TABLE SQL statement that looks something like this:

 CREATE TABLE EMPLOYEES     (EMPID  INTEGER,      NAME   CHAR(50),      DEPT   INTEGER)

It is important to note that this is an example of a relatively simple table. Table definitions can be quite complex, and as a result, the CREATE TABLE statement has several different permutations . Because the definition for a table object can be so complex, and because the syntax for the CREATE TABLE SQL statement can be complex as well, both are covered in much more detail in Chapter 6, "Working With DB2 UDB Objects." (A detailed description of available data types is presented in Chapter 6 as well.)

Like many of the other database objects available, tables can be created using a GUI tool that is accessible from the Control Center. In this case, the tool is the Create Table Wizard, and it can be activated by selecting the appropriate action from the Tables menu found in the Control Center. Figure 5-7 shows the Control Center menu items that must be selected to activate the Create Table Wizard; Figure 5-8 shows how the first page of the Create Table Wizard might look after its input fields have been populated.

Figure 5-7. Invoking the Create Table Wizard from the Control Center.

graphics/05fig07.jpg

Figure 5-8. The first page of the Create Table Wizard.

graphics/05fig08.gif

The `ALTER TABLE` statement

Over time, a table may be required to hold additional data values that did not exist or were not considered at the time the table was created. Or character data that was originally thought to be one size may have turned out to be larger than was anticipated. These are just a couple of reasons why it can become necessary to modify an existing table's definition. When the amount of data stored in a table is relatively small, and when the table has few or no dependencies, it is relatively easy to save the associated data, drop the existing table, create a new table incorporating the appropriate modifications, load the previously saved data into the new table, and redefine all appropriate dependencies. But how can you make such modifications to a table that holds a large volume of data or has numerous dependency relationships?

Certain properties of an existing table can be changed, additional columns and constraints can be added (constraints are covered in detail in Chapter 6, "Working with DB2 UDB Objects"), existing constraints can be removed, and the length of varying-length character data type values allowed for a particular column can be increased by executing the ALTER TABLE SQL statement. Like the CREATE TABLE statement, the ALTER TABLE statement can be quite complex. However, in its simplest form, the syntax for the ALTER TABLE statement is:

 ALTER TABLE [  TableName  ] ADD [  Element  ,...]

 ALTER TABLE [  TableName  ] ALTER COLUMN [  ColumnName  ] SET DATA TYPE    [VARCHAR  CHARACTER VARYING  CHAR VARYING] ([  Length  ])

 ALTER TABLE [  TableName  ] DROP [PRIMARY KEY        FOREIGN KEY [  ConstraintName  ]        UNIQUE [  ConstraintName  ]        CHECK [  ConstraintName  ]        CONSTRAINT [  ConstraintName  ]]

where:

TableName	Identifies the name assigned to the table whose definition is to be modified.
Element	Identifies one or more columns, unique/primary key constraints, referential constraints, and/or check constraints that are to be added to the existing table's definition. The syntax used for defining each of these elements varies according to the element being defined.
ColumnName	Identifies an existing varying-length character column in the table whose size is to be altered.
Length	Identifies the new maximum length that data values can be for the varying-length character column specified. (The new length value provided must be larger than the current length of the column.)
ConstraintName	Identifies an existing unique, foreign key, or check constraint that is to be removed from the table definition.

The basic syntax used to add a new column is:

 <COLUMN> [ColumnName] [DataType] <NOT NULL WITH DEFAULT <[  DefaultValue  ]  CURRENT DATE      CURRENT TIME  CURRENT TIMESTAMP>> <  UniqueConstraint  > <  CheckConstraint  > <  ReferentialConstraint  >

where:

ColumnName	Identifies the unique name to be assigned to the column to be created.
DataType	Identifies the data type (built-in or user-defined) to be assigned to the column to be created; the data type specified determines the kind of data values that can be stored in the column. (Table 5-1 contains a list of valid data type definitions).
DefaultValue	Identifies the value to be provided for the column in the event no value is supplied for the column when an insert or update operation is performed against the table.
UniqueConstraint	Identifies a unique or primary key constraint that is to be associated with the column.
CheckConstraint	Identifies a check constraint that is to be associated with the column.
ReferentialConstraint	Identifies a referential constraint that is to be associated with the column.

The ALTER TABLE statement syntax used to add a unique or primary key constraint, a check constraint, and/or a referential constraint as part of a column definition is the same as the syntax used with the more advanced form of the CREATE TABLE statement. This syntax, along with details on how each of these constraints is used, is covered in Chapter 6, "Working with DB2 UDB Objects."

So if you wanted to add two new columns that use a date data type to an existing table named EMPLOYEES, you could do so by executing an ALTER TABLE SQL statement that looks something like this:

 ALTER TABLE EMPLOYEES     ADD COLUMN BIRTHDAY  DATE     ADD COLUMN HIREDATE  DATE

On the other hand, if you wanted to change the maximum length of the data value allowed for a varying-length character data type column named DEPTNAME found in a table named DEPARTMENT from 50 to 100, you could do so by executing an ALTER TABLE statement that looks something like this:

 ALTER TABLE DEPARTMENT     ALTER COLUMN DEPTNAME SET DATA TYPE VARCHAR(100)

As you might imagine, existing tables can also be modified using the Alter Table dialog, which can be activated by selecting the appropriate action from the Tables menu found in the Control Center. Figure 5-9 shows the Control Center menu items that must be selected in order to activate the Alter Table dialog; Figure 5-10 shows how the Alter Table dialog might look after its input fields have been populated.

Figure 5-9. Invoking the Alter Table dialog from the Control Center.

graphics/05fig09.jpg

Figure 5-10. The Alter Table dialog.

graphics/05fig10.gif

The `CREATE INDEX` statement

It was mentioned earlier that an index is an object that contains an ordered set of pointers that refer to records stored in a base table. Indexes are important because they:

Provide a fast, efficient method for locating specific rows of data in very large tables. (In some cases, all the information needed to resolve a query may be found in the index itself, in which case the actual table data does not have to be accessed.)
Provide a logical ordering of the rows of a table. (When indexes are used, the values of one or more columns can be sorted in ascending or descending order; this property is very beneficial when processing queries that contain ORDER BY and GROUP BY clauses.)
Can enforce the uniqueness of records stored in a table.
Can require a table to use clustering storage, which causes the rows of a table to be physically arranged according to the ordering of their index column values. (Although all indexes provide a logical ordering of data, only a clustering index provides a physical ordering of data.)

A clustering index usually increases performance by decreasing the amount of I/O required to access data: When a logical set of rows are physically stored close together, a read operation on the set of rows will require less I/O, because adjacent rows are more likely to be found within the same extent (remember, data pages are written in batches called extents) instead of being widely distributed across multiple extents.

While some indexes are created implicitly to provide support for a table's definition (for example to provide support for a primary key), indexes are typically created explicitly, using the tools provided with DB2 UDB. Indexes can be created by executing the CREATE INDEX SQL statement. The basic syntax for this statement is:

 CREATE <UNIQUE> INDEX [  IndexName  ] ON [  TableName  ] ( [  PriColumnName  ] <ASC  DESC> ,... ) <INCLUDE ( [  SecColumnName  ] ,... )> <CLUSTER> <DISALLOW REVERSE SCANS  ALLOW REVERSE SCANS>

where:

IndexName	Identifies the name to be assigned to the index to be created.
TableName	Identifies the name assigned to the base table that the index to be created is to be associated with.
PriColumnName	Identifies one or more primary columns that are to be part of the index's key. (The combined values of each primary column specified will be used to enforce data uniqueness in the associated base table.)
SecColumnName	Identifies one or more secondary columns whose values are to be stored with the values of the primary columns specified, but are not to be used to enforce data uniqueness.

If the UNIQUE clause is specified when the CREATE INDEX statement is executed, rows in the table associated with the index to be created must not have two or more occurrences of the same values in the set of columns that make up the index key. If the base table the index is to be created for contains data, this uniqueness is checked when the DB2 Database Manager attempts to create the index specified; once the index has been created, this uniqueness is enforced each time an insert or update operation is performed against the table. In both cases, if the uniqueness of the index key is compromised, the index creation, insert, or update operation will fail, and an error will be generated. It is important to keep in mind that when the UNIQUE clause is used, it is possible to have an index key that contains one (and only one) NULL value.

So, if you wanted to create a unique index for a base table named EMPLOYEES that has the following characteristics:

Column Name	Data Type
EMPNO	INTEGER
FNAME	CHAR(20)
LNAME	CHAR(30)
TITLE	CHAR(10)
DEPARTMENT	CHAR(20)
SALARY	DECIMAL(6,2)

such that the index key consists of the column named EMPNO, you could do so by executing a CREATE INDEX statement that looks something like this:

 CREATE INDEX EMPNO_INDX ON EMPLOYEES (EMPNO)

Indexes can also be created using the Create Index dialog, which can be activated by selecting the appropriate action from the Indexes menu found in the Control Center. Figure 5-11 shows the Control Center menu items that must be selected to activate the Create Indexes dialog; Figure 5-12 shows how the Create Indexes dialog might look after its input fields have been populated.

Figure 5-11. Invoking the Create Index dialog from the Control Center.

graphics/05fig11.jpg

Figure 5-12. The Create Index dialog.

graphics/05fig12.gif

If an index is created for an empty table, that index will not have any entries stored in it until the table the index is associated with is populated. On the other hand, if an index is created for a table that already contains data, index entries will be generated for the existing data and added to the index as soon as it is created. Any number of indexes can be created for a table, using a wide variety of combinations of columns. However each index comes at a price in both storage requirements and performance: Since each index replicates its key values, and this replication requires additional storage space, and each modification to a table results in a similar modification to all indexes defined on the table, performance can decrease when insert, update, and delete operations are performed. In fact, if a large number of indexes are created for a table that is modified frequently, overall performance will decrease, rather than increase for all operations except data retrieval. Tables that are used for data mining, business intelligence, business warehousing, and other applications that execute many (and often complex) queries while rarely modifying data are prime targets for multiple indexes. On the other hand, tables that are used in OLTP (On-Line Transactional Processing) environments or other environments where data throughput is high should use indexes sparingly.

The `CREATE VIEW` statement

We saw earlier that a view is an object that acts as a named specification of a result data set that is produced each time the view is referenced. A view can be thought of as having columns and rows, just like base tables, and in most cases, data can be retrieved directly from a view in the same way that it can be retrieved from a base table. However, whether a view can be used to insert, update, or delete data depends on how it is defined. (In general, a view can be used in an insert, update, or delete operation if each row in the view can be uniquely mapped onto a single row of a base table.)

Although views look (and often behave) like base tables, they do not have their own physical storage (contrary to indexes, which are also based on base tables); therefore, they do not contain data. Instead, views refer to data that is physically stored in other base tables. And because a view can reference the data stored in any number of columns found in the base table it refers to, views can be used, together with view privileges, to control what data a user can and cannot see. For example, suppose a company has a database that contains a table that has been populated with information about each employee of that company. Managers might work with a view that only allow them to see information about employees they manage, while users in Corporate Communications might work with a view that only allows them to see contact information about each employee, and users in Payroll might work with a view that allows them to see both contact information and salary information. By creating views and coupling them with the view privileges available, a database administrator can have greater control over how individual users access specific pieces of data.

Views can be created by executing the CREATE VIEW SQL statement. The basic syntax for this statement is:

 CREATE VIEW [  ViewName  ] <( [  ColumnName  ] ,... )> AS [  SELECTStatement  ] <WITH CHECK OPTION>

where:

ViewName	Identifies the name to be assigned to the view to be created.
ColumnName	Identifies the names of one or more columns that are to be included in the view to be created. If a list of column names is specified, the number of column names provided must match the number of columns that will be returned by the `SELECT` statement used to create the view. (If a list of column names is not provided, the columns of the view will inherit the names that are assigned to the columns returned by the `SELECT` statement used to create the view.)
SELECTStatement	Identifies a `SELECT` SQL statement that, when executed, will produce data that can be seen using the view to be created.

Thus, if you wanted to create a view that references all data stored in a table named DEPARTMENT and assign it the name DEPT_VIEW, you could do so by executing a CREATE VIEW SQL statement that looks something like this:

 CREATE VIEW DEPT_VIEW AS SELECT * FROM DEPARTMENT

On the other hand, if you wanted to create a view that references specific data values stored in the table named DEPARTMENT and assign it the name ADV_DEPT_VIEW, you could do so by executing a CREATE VIEW SQL statement that looks something like this:

 CREATE VIEW ADV_DEPT_VIEW AS SELECT (DEPT_NO, DEPT_NAME, DEPT_SIZE) FROM DEPARTMENT WHERE DEPT_SIZE > 25

The view created by this statement would only contain department number, department name, and department size information for each department that has more than 25 people in it.

Views can also be created using the Create View dialog, which can be activated by selecting the appropriate action from the Views menu found in the Control Center. Figure 5-13 shows the Control Center menu items that must be selected to activate the Create View dialog; Figure 5-14 shows how the Create View dialog might look after its input fields have been populated.

Figure 5-13. Invoking the Create View dialog from the Control Center.

graphics/05fig13.gif

Figure 5-14. The Create View dialog.

graphics/05fig14.gif

If the WITH CHECK OPTION clause of with the CREATE VIEW SQL statement is specified, insert and update operations performed against the view that is created are validated to ensure that all rows being inserted into or updated in the base table the view refers to conform to the view's definition (otherwise, the insert/update operation will fail). So what exactly does this mean? Suppose a view was created using the following CREATE VIEW statement:

 CREATE VIEW PRIORITY_ORDERS AS SELECT * FROM ORDERS WHERE RESPONSE_TIME < 4 WITH CHECK OPTION

Now, suppose a user tries to insert a record into this view that has a RESPONSE_TIME value of 6. The insert operation will fail because the record violates the view's definition. Had the view not been created with the WITH CHECK OPTION clause, the insert operation would have been successful, even though the new record would not be visible to the view that was used to add it. Figure 5-15 illustrates how the WITH CHECK OPTION works.

Figure 5-15. How the `WITH CHECK OPTION` clause is used to ensure insert and update operations conform to a view's definition.

graphics/05fig15.gif

Views created with the WITH CHECK OPTION clause specified are referred to as symmetric views, because every record that can be inserted into them can also be retrieved from them.

The `CREATE ALIAS` statement

We saw earlier that an alias is simply an alternate name for a table or view and that once created, aliases can be referenced the same way tables or views can be referenced. By using aliases, SQL statements can be constructed such that they are independent of the qualified names that identify the base tables or views they reference. Whenever an alias is used in an SQL statement, the behavior is the same as when the target (source table or view) of the alias is used instead. Therefore, any application that uses an alias to access data can easily be made to work with many different targets. That's because when the target of an alias is changed, no changes to applications that use the alias are necessary.

Aliases can be created by executing the CREATE ALIAS SQL statement. The basic syntax for this statement is:

 CREATE [ALIAS  SYNONYM] [  AliasName  ] FOR [  TableName   ViewName   ExistingAlias  ]

where:

AliasName	Identifies the name to be assigned to the alias to be created.
TableName	Identifies the name assigned to the table the alias to be created is to reference.
ViewName	Identifies the name assigned to the view the alias to be created is to reference.
ExistingAlias	Identifies the name assigned to the alias the alias to be created is to reference.

Thus, if you wanted to create an alias that references a table named EMPLOYEES and you wanted to assign it the name EMPINFO, you could do so by executing a CREATE ALIAS SQL statement that looks something like this:

 CREATE ALIAS EMPINFO FOR EMPLOYEES

Aliases can also be created using the Create Alias dialog, which can be activated by selecting the appropriate action from the Alias menu found in the Control Center. Figure 5-16 shows the Control Center menu items that must be selected to activate the Create Alias dialog; Figure 5-17 shows how the Create Alias dialog might look after its input fields have been populated.

Figure 5-16. Invoking the Create Alias dialog from the Control Center.

graphics/05fig16.gif

Figure 5-17. The Create Alias dialog.

graphics/05fig17.gif

The `CREATE SCHEMA` statement

Earlier, we saw that a schema is an identifier that helps provide a means of classifying or grouping objects stored in a database. And because a schema is an object itself, it can be owned by an individual, who can control access to both the data and the objects that reside within it. Whenever a data object, such as a table, view, or index, is created, it is always assigned, either implicitly or explicitly, to a schema.

Schemas are implicitly created whenever a data object that has been assigned a qualifier name that is different from existing schema names found in the database is created ”provided the user creating the object holds IMPLICIT_SCHEMA authority. (If a qualifier is not included as part of the name assigned to an object during the creation process, the authorization ID of the user creating the object is used as the schema, by default.) Schemas can be explicitly created by executing the CREATE SCHEMA SQL statement. The basic syntax for this statement is:

 CREATE SCHEMA [  SchemaName  ] <  SQLStatement  ,...>

 CREATE SCHEMA AUTHORIZATION [  AuthorizationName  ] <  SQLStatement  ,...>

 CREATE SCHEMA [  SchemaName  ] AUTHORIZATION [  AuthorizationName  ] <  SQLStatement  ,...>

where:

SchemaName	Identifies the name to be assigned to the schema to be created.
AuthorizationName	Identifies the user to be given ownership of the schema to be created.
SQLStatement	Specifies one or more SQL statements that are to be executed together with the `CREATE SCHEMA` statement. (Only the following SQL statements are valid: `CREATE TABLE` , `CREATE VIEW` , `CREATE INDEX` , `COMMENT ON` , and `GRANT` ).

If a schema name is specified, but no authorization name is provided, the authorization ID of the user that issued the CREATE SCHEMA statement is given ownership of the new schema when it is created; if an authorization name is specified but no schema name is provided, the new schema is assigned the same name as the authorization name used.

So, if you wanted to explicitly create a schema named INVENTORY, along with a table named PARTS that is associated with the schema named INVENTORY, you could do so by executing a CREATE SCHEMA SQL statement that looks something like this:

 CREATE SCHEMA INVENTORY CREATE TABLE PARTS (PARTNO       INTEGER NOT NULL,                      DESCRIPTION  VARCHAR(50),                      QUANTITY     SMALLINT)

Schemas can also be created using the Create Schema dialog, which can be activated by selecting the appropriate action from the Schemas menu found in the Control Center. Figure 5-18 shows the Control Center menu items that must be selected to activate the Create Schema dialog; Figure 5-19 shows how the Create Schema dialog might look after its input fields have been populated.

Figure 5-18. Invoking the Create Schema dialog from the Control Center.

graphics/05fig18.gif

Figure 5-19. The Create Schema dialog.

graphics/05fig19.gif

Since schemas can be implicitly created by creating an object with a new schema name, you may be wondering why anyone would want to explicitly create a schema using the CREATE SCHEMA statement or the Create Schema dialog. The primary reason for explicitly creating a schema has to do with access control. An explicitly created schema has an owner, identified either by the authorization ID of the user who executed the CREATE SCHEMA statement or by the authorization ID provided to identify the owner when the schema was created. A schema owner has the authority to create, alter, and drop any object stored in the schema; to drop the schema itself; and to grant these privileges to other users. On the other hand, implicitly created schemas are considered to be owned by the user "SYSIBM." Any user can create an object in an implicitly created schema, and each object in the schema is controlled by the user who created it. Furthermore, only users with System Administrator (SYSADM) or Database Administrator (DBADM) authority are allowed to drop implicitly created schemas. Thus, in order for a user other than a system administrator or database administrator to have complete control over a schema, as well as all data objects stored in it, the schema must be created explicitly.

The `CREATE TRIGGER` statement

We saw earlier that a trigger is a group of actions that are automatically executed (or "triggered") whenever an insert, update, or delete operation is performed against a particular table. Triggers are often used in conjunction with constraints to enforce data integrity rules. However, triggers can also be used to automatically perform operations on other tables, to automatically generate and/or transform values for new or existing rows, to generate audit trails, and to detect exception conditions. By using triggers, the logic needed to enforce business rules can be placed directly in the database instead of in one or more applications; and by requiring the database to enforce business rules, the need to recode and recompile database applications each time business rules change is eliminated.

Before a trigger can be created, the following components must be identified:

Subject table. The table the trigger is to interact with.

Trigger event. An SQL operation that causes the trigger to be activated whenever it is performed against the subject table. This operation can be an insert operation, an update operation, or a delete operation.

Trigger activation time. Indicates whether the trigger should be activated before or after the trigger event occurs. A before trigger will be activated before the trigger event occurs; therefore, it will be able to see new data values before they are inserted into the subject table. An after trigger will be activated after the trigger event occurs; therefore, it can only see data values that have already been inserted into the subject table. (A before trigger might be used to trap and process unwanted values, while an after trigger could be used to copy data values entered to other tables or views.)

Set of affected rows. The rows of the subject table that are being inserted, updated, or deleted.

Trigger granularity. Specifies whether the actions the trigger will perform are to be performed once for the entire insert, update, or delete operation or once for every row affected by the insert, update, or delete operation.

Triggered action. An optional search condition and a set of SQL statements that are to be executed whenever the trigger is activated. (If a search condition is specified, the SQL statements will only be executed if the search condition evaluates to true.) If the trigger is a before trigger, the triggered action can include statements that retrieve data, set transition variables , or signal SQL states. If the trigger is an after trigger, the triggered action can include statements that retrieve data, insert records, update records, delete records, or signal SQL states.

Triggered actions can refer to the values in the set of affected rows using what are known as transition variables. Transition variables use the names of the columns in the subject table, qualified by a specified name that indicates whether the reference is to the original value (before the insert, update, or delete operation is performed) or the new value (after the insert, update, or delete operation is performed). Another means of referring to values in the set of affected rows is through the use of transition tables. Transition tables also use the names of the columns in the subject table, but they allow the complete set of affected rows to be treated as a table. Unfortunately, transition tables can only be used in after triggers.

Once the appropriate trigger components have been identified, a trigger can be created by executing the CREATE TRIGGER SQL statement. The basic syntax for this statement is:

 CREATE TRIGGER [  TriggerName  ] [NO CASCADE BEFORE  AFTER] [INSERT  DELETE  UPDATE <OF [  ColumnName  ], ... >] ON [  TableName  ] <REFERENCING [  Reference  ]> [FOR EACH ROW  FOR EACH STATEMENT] MODE DB2SQL <WHEN ( [  SearchCondition  ] )> [  TriggeredAction  ]

where:

TriggerName	Identifies the name to be assigned to the trigger to be created.
ColumnName	Identifies one or more columns in the subject table of the trigger whose values must be updated before the trigger's triggered action ( TriggeredAction ) will be executed.
Reference	Identifies one or more transition variables and/or transition tables that are to be used by the trigger's triggered action ( TriggeredAction ). The syntax used to create transition variables and/or transition tables that are to be used by the trigger's triggered action is:

 <OLD <AS> [  CorrelationName  ]> <NEW <AS> [  CorrelationName  ]> <OLD TABLE <AS> [  Identifier  ]> <NEW TABLE <AS> [  Identifier  ]>

where:

CorrelationName	Identifies a name to be used to identify a specific row in the subject table of the trigger, either before it was modified by the trigger's triggered action ( `OLD <AS>` ) or after it has been modified by the trigger's triggered action ( `NEW <AS>` ).
Identifier	Identifies a name that is to be used to identify a temporary table that contains a set of rows found in the subject table of the trigger, either before they were modified by the trigger's triggered action ( `OLD TABLE <AS>` ) or after they have been modified by the trigger's triggered action ( `NEW TABLE <AS>` ). Each column affected by an activation event (insert, update, or delete operation) can be made available to the trigger's triggered action by qualifying the column's name with the appropriate correlation name or table identifier.
SearchCondition	Specifies a search condition that, when evaluated, will return either TRUE, FALSE, or Unknown. This condition is used to determine whether or not the trigger's triggered action ( TriggeredAction ) is to be performed.
TriggeredAction	Identifies the action to be performed when the trigger is activated. The triggered action must consist of one or more SQL statements; when multiple statements are specified, the first statement must be preceded by the keywords `BEGIN ATOMIC` , the last statement must be followed by the keyword `END` , and every statement between these keywords must be terminated with a semicolon (;).

Thus, if you wanted to create a trigger for a base table named EMPLOYEES that has the following characteristics:

Column Name	Data Type
EMPNO	INTEGER
FNAME	CHAR(20)
LNAME	CHAR(30)
TITLE	CHAR(10)
DEPARTMENT	CHAR(20)
SALARY	DECIMAL(6,2)

that will cause the value for the column named EMPNO to be incremented each time a row is added to the table, you could do so by executing a CREATE TRIGGER statement that looks something like this:

 CREATE TRIGGER EMPNO_INC AFTER INSERT ON EMPLOYEES FOR EACH ROW MODE DB2SQL UPDATE EMPNO SET EMPNO = EMPNO + 1

Triggers can also be created using the Create Trigger dialog, which can be activated by selecting the appropriate action from the Triggers menu found in the Control Center. Figure 5-20 shows the Control Center menu items that must be selected to activate the Create Trigger dialog; Figure 5-21 shows how the Create Trigger dialog might look after its input fields have been populated.

Figure 5-20. Invoking the Create Trigger dialog from the Control Center.

graphics/05fig20.gif

Figure 5-21. The Create Trigger dialog.

graphics/05fig21.gif

The `DROP` statement

Just as it is important to be able to create and modify objects, it is important to be able to delete an existing object when it is no longer needed. Existing objects can be removed from a database by executing the DROP SQL statement. The basic syntax for this statement is:

 DROP [  ObjectType  ] [  ObjectName  ]

where:

ObjectType	Identifies the type of object to be deleted (dropped). (Valid values include: BUFFERPOOL, TABLESPACE, TABLE, INDEX, VIEW, ALIAS, SCHEMA, and TRIGGER.)
ObjectName	Identifies the name assigned to the object to be deleted.

So, if you wanted to delete a table that has been assigned the name SALES, you could do so by executing a DROP SQL statement that looks something like this:

 DROP TABLE SALES

Database objects can also be dropped from the Control Center by highlighting the appropriate object and selecting the appropriate action from any object menu found. Figure 5-22 shows the Control Center menu items that must be selected in order to drop a particular object (in this case, a tablespace object).

Figure 5-22. Dropping a tablespace object from the Control Center.

graphics/05fig22.gif

It is important to keep in mind that when an object is dropped, its removal may affect other objects that depend upon its existence. In some cases, when an object is dropped, all objects dependent upon that object are dropped as well (for example, if a tablespace containing one or more tables is dropped, all tables that resided in that tablespace, along with their corresponding data, are also dropped). In other cases, an object cannot be dropped if other objects are dependent upon its existence (for example, a schema can only be dropped after all objects that were in that schema have been dropped). And it goes without saying that built-in objects, such as the system catalog tables and views, cannot be dropped.

The rules that govern how objects can be dropped as well as the rules that determine how dependent objects are affected when an object is dropped can be found in the IBM DB2 Universal Database, Version 8 SQL Reference Volume 2 product documentation.

Data Manipulation Language (DML) Statements

Once the appropriate data objects have been created for a particular database, they can be used to store data values. And just as there is a set of SQL statements that is used to define and create objects, there is a set of SQL statements that is used exclusively to store, modify, remove, and retrieve data values. This set of statements is referred to as Data Manipulation Language (DML) statements. There are four Data Manipulation Language statements available: the INSERT statement, the UPDATE statement, the DELETE statement, and the SELECT statement.

The `INSERT` statement

As you might expect, when a table is first created, it is empty. But once a table is created, it can be populated in a variety of ways: It can be bulk-loaded using the LOAD utility, it can be bulk-loaded using the IMPORT utility, or one or more rows can be added to it by executing the INSERT SQL statement. Of the three, the INSERT statement is the method most commonly used. And it can work directly with the table to be populated or it can work with an updatable view that references the table to be populated. The basic syntax for the INSERT statement is:

 INSERT INTO [  TableName   ViewName  ] < ( [  ColumnName  ] ,... ) > VALUES ( [  Value  ] ,...)

 INSERT INTO [  TableName   ViewName  ] < ( [  ColumnName  ] ,... ) > [  SELECTStatement  ]

where:

TableName	Identifies the name assigned to the table data is to be added to.
ViewName	Identifies the name assigned to the updatable view data is to be added to.
ColumnName	Identifies the name of one or more columns that data values being added to the table/view are to be assigned to. Each name provided must identify an existing column in the table or updatable view specified.
Value	Identifies one or more data values that are to be added to the column(s), table, or updatable view specified.
SELECTStatement	Identifies a `SELECT` SQL statement that, when executed, will produce the data values to be added to the column(s), table, or updatable view specified (by retrieving data from other tables and/or views).

So, if you wanted to add a record to a base table named DEPARTMENT that has the following characteristics:

Column Name	Data Type
DEPTNO	INTEGER
DEPTNAME	CHAR(20)
MGRID	INTEGER

you could do so by executing an INSERT statement that looks something like this:

 INSERT INTO DEPARTMENT (DEPTNO, DEPTNAME, MGRID) VALUES (001, 'SALES', 1001)

It is important to note that the number of values provided in the VALUES clause must be equal to the number of column names provided in the column name list. Furthermore, the values provided will be assigned to the columns specified based upon the order in which they appear ”in other words, the first value provided will be assigned to the first column identified in the column name list, the second value provided will be assigned to the second column identified, and so on. Each value provided must also be compatible with the data type of the column the value is to be assigned to.

If values are provided for every column found in the table (in the VALUES clause), the column name list can be omitted. In this case, the first value provided will be assigned to the first column found in the table, the second value provided will be assigned to the second column found, and so on. Thus, the row of data that was added to the DEPARTMENT table in the previous example could just as well have been added by executing the following INSERT statement:

 INSERT INTO DEPARTMENT VALUES (001, 'SALES', 1001)

Along with literal values, two special tokens can be used to designate values that are to be assigned to base table columns. The first of these is the DEFAULT token, which is used to assign a system or user-supplied default value to a column defined with the WITH DEFAULT constraint. The second is the NULL token, which is used to assign a NULL value to any column that was not defined with the NOT NULL constraint. (Both of these constraints are covered in detail in Chapter 6, "Working With DB2 UDB Objects.") Thus, you could add a record that contains a NULL value for the MGRID column to the DEPARTMENT table we looked at earlier by executing an INSERT statement that looks something like this:

 INSERT INTO DEPARTMENT VALUES (001, 'SALES', NULL)

By using a special form of the INSERT SQL statement, the results of a query can also be used to provide values for one or more columns in a base table. With this form of the INSERT statement, a SELECT statement (known as a subselect ) is provided in place of the VALUES clause (we'll look at the SELECT statement shortly), and the results of the SELECT statement are assigned to the appropriate columns. (This form of the INSERT statement creates a type of "cut and paste" action where values are retrieved from one base table or view and inserted into another.) As you might imagine, the number of values returned by the subselect must match the number of columns provided in the column name list (or the number of columns found in the table if no column name list is provided), and the order of assignment is the same as that used when literal values are provided in a VALUES clause. Therefore, you could add a record to the DEPARTMENT table we looked at earlier, using the results of a query, by executing an INSERT statement that looks something like this:

 INSERT INTO DEPARTMENT (DEPTNO, DEPTNAME) SELECT DEPTNO, DEPTNAME FROM OLD_DEPARTMENT

You may have noticed that the INSERT statement used in the last example did not provide values for every column found in the DEPARTMENT table. Just as there are times you may want to insert complete records into a table, there may be times when you wish to insert partial records into a table. Such operations can be performed by listing just the columns you have data values for in the column names list and providing the corresponding values using either the VALUES clause or a subselect. However, in order for such an INSERT statement to execute correctly, all columns in the table the record is being inserted into that do not appear in the column name list provided must either accept null values or have a default value constraint defined. Otherwise the INSERT statement will fail.

The `UPDATE` statement

Data stored in a database is rarely static; over time, the need to modify (or even remove) one or more values residing in a database can arise. In such situations, specific data values can be changed by executing the UPDATE SQL statement. The basic syntax for this statement is:

 UPDATE [  TableName   ViewName  ] SET [[  ColumnName  ] = [  Value  ]  NULL  DEFAULT ,... ] <WHERE [  Condition  ]>

 UPDATE [  TableName   ViewName  ] SET ([  ColumnName  ] ,... ) =    ([  Value  ]  NULL  DEFAULT ,... ) <WHERE [  Condition  ]>

 UPDATE [  TableName   ViewName  ] SET ([  ColumnName  ] ,... ) = ( [  SELECTStatement  ] ) <WHERE [  Condition  ]>

where:

TableName	Identifies the name assigned to the table that contains the data to be modified.
ViewName	Identifies the name assigned to the updatable view that contains the data to be modified.
ColumnName	Identifies the name of one or more columns that contain data values to be modified. Each name provided must identify an existing column in the table or updatable view specified.
Value	Identifies one or more data values that are to be used to replace existing values found in the column(s) specified.
SELECTStatement	Identifies a `SELECT` SQL statement that, when executed, will produce the data values to be used to replace existing values found in the column(s) specified (by retrieving data from other tables and/or views).
Condition	Identifies the search criterion that is to be used to locate one or more specific rows whose data values are to be modified. (This condition is coded like the `WHERE` clause that can be used with a `SELECT` SQL statement; we will look at the `WHERE` clause and its predicates later.) If no condition is specified, the update operation will be performed on every row found in the table or updatable view specified.

So, if you wanted to modify the records stored in a base table named EMPLOYEES that has the following characteristics:

Column Name	Data Type
EMPNO	INTEGER
FNAME	CHAR(20)
LNAME	CHAR(30)
TITLE	CHAR(10)
DEPARTMENT	CHAR(20)
SALARY	DECIMAL(6,2)

such that the salary of every employee that has the title of DBA is increased by 10%, you could do so by executing an UPDATE statement that looks something like this:

 UPDATE EMPLOYEES SET SALARY = SALARY * 1.10 WHERE TITLE = 'DBA'

The UPDATE statement can also be used to remove values from nullable columns. This is done by changing the column's current value to NULL. Thus, the value assigned to the DEPARTMENT column of the EMPLOYEES table shown in the previous example could be removed by executing the following UPDATE statement:

 UPDATE EMPLOYEES SET SALARY = NULL

Like the INSERT statement, the UPDATE statement can either work directly with the table that contains the values to be modified or it can work with an updatable view that references the table containing the values to be modified. Similarly, the results of a query, or subselect, can be used to provide values for one or more columns identified in the column name list provided. (This form of the UPDATE statement creates a type of "cut and paste" action where values retrieved from one base table or view are used to modify values stored in another.) As you might imagine, the number of values returned by the subselect must match the number of columns provided in the column name list specified. Thus, you could change the value assigned to the DEPARTMENT columns of each record found in the EMPLOYEES table we looked at earlier, using the results of a query, by executing an UPDATE statement that looks something like this:

 UPDATE EMPLOYEES SET (DEPARTMENT) =   (SELECT DEPTNAME FROM DEPARTMENT WHERE DEPTNO = 1)

It is important to note that update operations can be conducted in one of two ways: by performing a searched update operation or by performing a positioned update operation. So far, all of the examples we have looked at have been searched update operations. To perform a positioned update, a cursor must first be created, opened, and positioned on the row that is to be updated. Then, the UPDATE statement that is to be used to modify one or more data values must contain a WHERE CURRENT OF [ CursorName ] clause ( CursorName identifies the cursor being used ”we'll look at cursors shortly). Because of their added complexity, positioned update operations are typically performed by embedded SQL applications.

It is very important that you provide the proper WHERE clause whenever the UPDATE statement is used. Failure to do so will cause an update operation to be performed on every row found in the table or updatable view specified.

The `DELETE` statement

Although the UPDATE statement can be used to delete individual values from a base table (by setting those values to NULL), it cannot be used to remove entire rows. When one or more rows of data need to be removed from a base table, the DELETE SQL statement must be used instead. As with the INSERT statement and the UPDATE statement, the DELETE statement can either work directly with the table that rows are to be removed from or it can work with an updatable view that references the table that rows are to be removed from. The basic syntax for the DELETE statement is:

 DELETE FROM [  TableName   ViewName  ] <WHERE [  Condition  ]>

where:

TableName	Identifies the name assigned to the table data is to be removed from.
ViewName	Identifies the name assigned to the updatable view data is to be removed from.
Condition	Identifies the search criterion to be used to locate one or more specific rows that are to be removed. (This condition is coded like the `WHERE` clause used with a `SELECT` SQL statement; we will look at the `WHERE` clause and its predicates later.) If no condition is specified, the delete operation will be performed on every row found in the table or updatable view specified.

Therefore, if you wanted to remove every record for company XYZ from a base table named SALES that has the following characteristics:

Column Name	Data Type
PONUMBER	CHAR(10)
COMPANY	CHAR(20)
PURCHASEDATE	DATE
SALESPERSON	INTEGER

you could do so by executing a DELETE statement that looks something like this:

 DELETE FROM SALES WHERE COMPANY = 'XYZ'

Like update operations, delete operations can be conducted in one of two ways: as searched delete operations or as a positioned delete operations. To perform a positioned delete, a cursor must first be created, opened, and positioned on the row to be deleted. Then, the DELETE statement used to remove the row must contain a WHERE CURRENT OF [ CursorName ] clause ( CursorName identifies the cursor being used). Because of their added complexity, positioned delete operations are typically performed by embedded SQL applications.

Because omitting the WHERE clause in a DELETE SQL statement causes the delete operation to be applied to all rows in the table or view specified, it is important to always provide a WHERE clause with a DELETE statement unless you explicitly want to erase all data stored in a table.

The `SELECT` statement

Although the primary function of a database is to act as a data repository, a database serves another, equally important purpose. Sooner or later, almost all database users and/or applications have the need to retrieve specific pieces of information (data) from the database they are interacting with. The operation used to retrieve data from a database is called a query (because it searches the database to find the answer to some question), and the results returned by a query are typically expressed in one of two forms: either as a single row of data values or as a set of rows of data values, otherwise known as a result data set (or result set). (If no data values that correspond to the query specification provided can be found in the database, an empty result data set will be returned.)

All queries begin with the SELECT SQL statement, which is an extremely powerful statement used to construct a wide variety of queries containing an infinite number of variations (using a finite set of rules). And because the SELECT statement is recursive, a single SELECT statement can derive its output from a successive number of nested SELECT statements (which are known as subqueries ). (We have already seen how SELECT statements can be used to provide input to INSERT and UPDATE statements; SELECT statements can be used to provide input to other SELECT statements in a similar manner.)

In its simplest form, the syntax for the SELECT statement is:

 SELECT * FROM [ [  TableName  ]  [  ViewName  ] ]

where:

TableName	Identifies the name assigned to the table data is to be retrieved from.
ViewName	Identifies the name assigned to the view data is to be retrieved from.

Consequently, if you wanted to retrieve all values stored in a base table named DEPARTMENT, you could do so by executing a SELECT statement that looks something like this:

 SELECT * FROM DEPARTMENT

Data Control Language (DCL) Statements

The CONNECT statement

The GRANT statement (revisited)

The REVOKE statement (revisited)

Data Definition Language (DDL) Statements

The CREATE BUFFERPOOL statement

Figure 5-1. Invoking the Create Buffer Pool dialog from the Control Center.

Figure 5-2. The Create Buffer Pool dialog.

The CREATE TABLESPACE statement

Figure 5-3. Invoking the Create Table Space Wizard from the Control Center.

Figure 5-4. The first page of the Create Table Space Wizard.

The ALTER TABLESPACE statement

Figure 5-5. Invoking the Alter Table Space dialog from the Control Center.

Figure 5-6. The first page of the Alter Table Space dialog.

The CREATE TABLE statement

Table 5-1. Data Type Definitions That Can Be Used with the CREATE TABLE Statement

Figure 5-7. Invoking the Create Table Wizard from the Control Center.

Figure 5-8. The first page of the Create Table Wizard.

The ALTER TABLE statement

Figure 5-9. Invoking the Alter Table dialog from the Control Center.

Figure 5-10. The Alter Table dialog.

The CREATE INDEX statement

Figure 5-11. Invoking the Create Index dialog from the Control Center.

Figure 5-12. The Create Index dialog.

The CREATE VIEW statement

Figure 5-13. Invoking the Create View dialog from the Control Center.

Figure 5-14. The Create View dialog.

Figure 5-15. How the WITH CHECK OPTION clause is used to ensure insert and update operations conform to a view's definition.

The CREATE ALIAS statement

Figure 5-16. Invoking the Create Alias dialog from the Control Center.

Figure 5-17. The Create Alias dialog.

The CREATE SCHEMA statement

Figure 5-18. Invoking the Create Schema dialog from the Control Center.

Figure 5-19. The Create Schema dialog.

The CREATE TRIGGER statement

Figure 5-20. Invoking the Create Trigger dialog from the Control Center.

Figure 5-21. The Create Trigger dialog.

The DROP statement

Figure 5-22. Dropping a tablespace object from the Control Center.

Data Manipulation Language (DML) Statements

The INSERT statement

The UPDATE statement

The DELETE statement

The SELECT statement

The `CONNECT` statement

The `GRANT` statement (revisited)

The `REVOKE` statement (revisited)

The `CREATE BUFFERPOOL` statement

The `CREATE TABLESPACE` statement

The `ALTER TABLESPACE` statement

The `CREATE TABLE` statement

Table 5-1. Data Type Definitions That Can Be Used with the `CREATE TABLE` Statement

The `ALTER TABLE` statement

The `CREATE INDEX` statement

The `CREATE VIEW` statement

Figure 5-15. How the `WITH CHECK OPTION` clause is used to ensure insert and update operations conform to a view's definition.

The `CREATE ALIAS` statement

The `CREATE SCHEMA` statement

The `CREATE TRIGGER` statement

The `DROP` statement

The `INSERT` statement

The `UPDATE` statement

The `DELETE` statement

The `SELECT` statement