Modeling the Application

At this point, you and your customer should have a pretty good idea of the problem you're trying to solve. More important, you and your customer should agree what the problem is!

Once the problem is defined, you can turn your attention to the conceptual design of the application. The four key tasks to accomplish during conceptual design are as follows:

Modeling persistent data
Modeling the data objects
Modeling the business objects
Modeling the presentation layer

We will discuss each of these tasks in detail in the sections that follow.

Modeling Persistent Data

You should begin the conceptual design process by examining the requirements documents for information about persistent data. These documents provide information that will help define the databases the application will use. These could be relational databases or object-oriented databases, but here we'll focus on issues related to relational databases since these are far more prevalent. In some cases, the databases you need to use will already be defined. This certainly simplifies the task, but it doesn't leave us much to talk about, so for the sake of discussion let's assume that you're starting from scratch.

Even if the exact database tables you need don't currently exist, you do need to consider how your data fits in with other data maintained by the organization. You need to consider where the data will reside and who can access it. Will the data be stored on centrally located servers or on individual workstations? Will data need to be replicated to multiple locations in order to meet performance requirements? If so, will the data be partitioned, or do you need to worry about conflicting updates from multiple sites? For the Island Hopper classified ads application, the database is maintained on a single database server. Your application might have more complex requirements.

Selecting entities

One very common way to model persistent data is a technique known as entity-relationship modeling. To begin data modeling, look for real-world things, or "entities," in the problem definition. An easy way to start is to look for nouns in the problem definition—these often correspond to entities in the model. An entity has a set of attributes that describe it. A particular entity has specific values for each attribute. All the entities that share a set of attributes define an entity type.

NOTE
In object-oriented terms, "entity" = "object," "entity type" = "class," and "attribute" = "property."

When you are using a relational database, entity types are modeled as database tables. Attributes correspond to columns in the database table. A complete definition of the entity type, including the names and meanings of all attributes and any constraints is called a schema. One of the main constraints to consider is which attribute(s) can be used to identify each entity. These attributes are used to define database keys. A database key is one or more columns in a table used to identify a record; database keys are commonly used as an index for the table.

NOTE
This chapter merely scratches the surface of data modeling. For more information, see the sources listed in the bibliography.

After examining the problem definition and functional specification for the Island Hopper classified ads application, you should be able to identify the following three main entity types:

Customer type: Includes name, mailing address, password, and customer ID attributes. The customer ID attribute uniquely identifies a customer. The name and mailing address attributes might need to be divided into separate elements—for example, first name, last name, street, city, state, and zip code.

Advertisement type: Includes category, title, body text, start date, end date, customer, and advertisement ID attributes. The advertisement ID attribute uniquely identifies an advertisement.

Invoice type: Includes customer, list of ads, date, total amount, and invoice ID attributes. The invoice ID uniquely identifies an invoice. The list of ads is a multivalued attribute. We'll see how to deal with multivalued attributes in the section "Data object methods" later in this chapter.

Defining relationships

Some implicit relationships exist between these entities. Advertisements and invoices both refer to customers. Invoices refer to a list of advertisements.

Relationship types define associations between entity types. Most relationships are between two entity types. Each entity type plays a particular role in the relationship type. Normally, you don't need to explicitly call out role names when the entity types are distinct, but the names can be very helpful when you are defining relationships between different entities of the same type. (These are called recursive relationships.) For example, two employee entities might be related by a supervises relationship, one entity in the role of supervisor and the other in the role of supervisee.

To locate the relationship types in your data model, you should look for attributes in your entities that seem to correspond to other entities. For example, both the advertisement and invoice entities mentioned above contain a customer attribute. You should also refer back to the problem definition. Verbs in the problem definition can point out relationships that you haven't recognized yet.

Relationship types usually have a structural constraint associated with them. This constraint defines how many relationship instances an entity can participate in. Usually, the constraint is specified as one-to-one (1:1), one- to-many (1:N), or many-to-many (M:N). For example, if one customer can submit many advertisements, each advertisement is submitted by exactly one customer, and a customer is deleted if no advertisements exist, the relationship between customers and advertisements is 1:N. You can also specify the minimum and maximum number of instances for each entity.

Relationship types can also have attributes associated with them. For 1:1 and 1:N relationships, these attributes can be stored in the entity representing the "1" side of the relationship. However, for M:N relationships, the attributes are a characteristic of the relationship itself.

In the relational database, relationship types can be represented as attributes or tables. For 1:N relationships, the table representing the "N" side of the relationship includes a column corresponding to the key of the other table. (1:1 is just a special case of 1:N.) Attributes for the relationship itself would be stored as columns in the table on the "1" side of the relationship. For example, the Island Hopper advertisement entity should include a customer ID column to represent the 1:N relationship between customers and advertisements. For M:N relationships, a separate table is usually used to represent the relationship. This table includes columns corresponding to keys for each of the tables in the relationship, as well as columns for each attribute of the relationship itself.

Figure 7-4 shows a data model for the Island Hopper classified ads application that includes both entity and relationship types. Notice that some of the attributes have been refined to reflect the relationships between the entities.

Normalizing the database

In a relational database, data is usually factored according to the following data normalization rules. These rules help prevent common data problems related to synchronization and consistency of changes.

OK, those aren't the formal rules…but you get the idea. (If you really want to know the rules, check out any book on database design.) Your first cut at a data model might not be normalized (unless you've been doing data modeling for a while and these rules are burned into your subconscious). After you've created the basic tables and relationships, normalize the database. But keep in mind that performance requirements might require denormalizing the database.

Figure 7-5 shows the final data model for the Island Hopper classified ads application. A new entity type, category, has been defined. This entity type will remove the name of the category from every advertisement, making it easier to maintain a category list. Customer passwords have been split into a separate entity to help secure the passwords. The product and payment entities have been introduced to help with invoicing.

In addition, the multivalued list of ads attribute in the invoice entity type has been converted to a 1:N relationship between invoices and advertisements, with an invoice ID attribute stored in each advertisement. This is how multivalued attributes are normalized in relational databases: the attribute is converted to a separate entity that has a 1:N relationship with the original entity.

You might also have noticed that the customer entity includes a balance attribute. The customer balance could be computed by querying invoices and payments, but this process would be very slow. So the database is slightly denormalized to include the outstanding balance in the customer entity.

Documenting the data model

Many tools are available for documenting a data model. In this book, we use the Microsoft Visual Database Tools to create our data models. With the Visual Database Tools, you can connect to and explore any ODBC-compliant database. The Database Designer tool provides a graphical environment for the following tasks:

Other data tools provide a graphical user interface that lets you add, update, and delete data in the database; design and execute complex queries; and so on.

*Data Object*	*Description*
db_AdC	Stores information about an advertisement
db_CategoryC	Stores the main descriptive category for advertised items
db_CustomerC	Stores information about a specific customer for billing and contact purposes
db_CustomerPasswordC	Stores customer password
db_InvoiceC	Stores information about a particular invoice
db_PaymentC	Stores information about a payment from a customer
db_ProductC	Stores product/billing codes for invoicing

*Business Object*	*Description*
bus_AdC	Places, updates, and deletes advertisements
bus_CustomerC	Validates customer information
bus_InvoiceC	Generates customer invoices for advertisements
bus_PaymentC	Processes payments from customers

Modeling Persistent Data

Selecting entities

Defining relationships

Normalizing the database

Documenting the data model

Designing Classes

Documenting the logical classes

Modeling the Data Objects

Managing data object state

Data object methods

Other factors

Island Hopper data objects

Modeling the Business Objects

Choosing business objects

Business object methods

Managing business object state

Composition, resource management, and transactions

Security considerations

Island Hopper business objects

Modeling the Presentation Layer