| Understanding and Deploying LDAP Directory Services > 10. Replication Design > Designing Your Directory Replication System | 
| 
 Designing Your Directory Replication SystemTo maximize the reliability and performance of your directory service through replication, you should spend some time planning for both your current requirements and future expansion. To begin your planning, gather the following information: 
 As we work through the design process, realize that it's very easy to over-engineer a replication solution. Think about airplanes: Very few have ten engines ”most commercial jets these days have two because that's been found to be an optimal number given fuel costs, plane sizes, and engine reliability. Now think about your directory: Although it's certainly possible to put three redundant directory servers on every Ethernet segment in an office building ”which would definitely increase your directory's reliability ”don't forget that someone has to set up and manage all those replicas! In an office in which all workstations and servers have 10Mbps connectivity, connectivity is probably so reliable that it's unnecessary to worry about the network being partitioned. Even if that's not a valid assumption, it's probably still cheaper to fix the network problem than it is to set up and manage a lot of directory replicas! On the other hand, if your organization is spread over a wide geographical area and your sites are linked by slower, less-reliable network links, you should definitely pay very close attention to how you allocate replicas across your network. Finally, view this design process as an iterative one. Make a pass through it, thinking about maximizing reliability and performance. As your solution evolves, you may find that you want to revisit a design decision. Also, as you think about replication, you may find that you want to revisit some of your design decisions about directory topology and even your namespace design. Don't be afraid to do this; it will be time well spent. Designing for Maximum ReliabilityWhen you design for maximum reliability, you make your directory impervious to the failure of a single directory server. Then, if one of your servers fails, directory clients can use another replica to obtain their directory services. How exactly do directory clients deal with the failure of a particular server? LDAP client applications are responsible for detecting the failure of their primary server and reconnecting to an alternate server. At present, there is no standard method for locating an alternate server that can provide service, so it's useful to ask the supplier of an LDAP-based application how this is handled. For example, client applications that use the Netscape C language Software Development Kit (SDK) can provide multiple server names when establishing a connection; if a given server is unavailable, the SDK tries another. Another option is to use a hardware failover device such as Cisco Systems' LocalDirector, which can balance client load across a number of servers. It can also detect when a server has failed and avoids directing clients to it until it is returned to service. To maximize reliability, locate within each major zone of your network at least one replica that is connected via a network link of less than 10Mbps. (If you have a single well-connected network, you have only one zone to worry about.) For example, if your network comprises a single set of buildings connected by high-speed fiber- optic links, you might choose to deploy two replicated servers. If either server fails, client requests would be handled by the remaining server (see Figure 10.19). Figure 10.19 Multiple replicas located at a site.  Suppose now that your organization consists of a central office with a series of remote offices connected by slower, 56Kbps network connections. In this case, you might choose to place a replica at each remote site (see Figure 10.20). This allows you to avoid wasting your scarce network bandwidth on LDAP client traffic. Figure 10.20 Replicas placed to limit client traffic on WAN links.  Under normal circumstances, the directory clients shown in Figure 10.20 in the remote office would contact the onsite replica for directory operations. In the event that the server fails, the remote clients could contact one of the servers in the central office. You could even place more than one replica in each remote office so that even if one of the onsite servers fail, the clients can obtain directory service without sending requests across the slower interoffice network link. If you use such a configuration, you may want to schedule replication updates to occur only during off-peak hours, if your directory server software supports it. Designing for Maximum PerformanceWhen you design replication to enhance performance, you strive to design a system that can handle your existing client load today but can be expanded easily to handle a larger load in the future. You should strive to provide a sufficient number of replica servers to handle your client load. Estimating your client load involves a bit of guesswork, but you should be able to get a pretty good idea by understanding how often a given client makes a request and how a typical request from that client would impact the server. Then multiply this by the number of clients you expect to make use of the directory. For example, suppose 1,000 users will use a Web-based address book application to look up other employees in the directory. Begin by making some assumptions about how often people will use the service. (These assumptions can be verified , and adjusted if necessary, during the pilot phase of your deployment; see Chapter 13, "Piloting Your Directory Service" ). Let's assume that each person will perform 10 lookups during the 8 a.m. “5 p.m. workday ; that means you should expect to see 10,000 queries in 8 hours. This translates to 1,250 queries per hour ”approximately 1 query every 3 seconds. If you have some additional knowledge that leads you to believe that usage will not be uniform, but instead it will have spikes during the day, take that into account as well. Next, understand the impact the load will place on the directory server. Are the queries typically made on indexed attributes ”which perform well and place a lighter load on the server ”or might some of the queries be on unindexed attributes? Will the searches typically return a single entry or many entries? Will the retrieved entries contain large attributes, or will the amount of transmitted data be small (perhaps up to a kilobyte or two)? Finally, determine how many replicated servers you will need to handle the load. You may be fortunate enough to have some data from the software vendor that tells you how many typical queries the software can service while running on some standard hardware in a given time period. For example, the Netscape Directory Server can perform several hundred searches per second on a typical 200Mhz Pentium II computer, so a single server would be entirely capable of handling the client load in the previous example. If, however, you don't really know how many operations the server can perform per second, you should measure it when evaluating server software or piloting your directory. Tip When doing capacity planning, always take vendor-supplied performance figures with a grain of salt. Performance figures that appear in vendor data sheets reflect the performance measured by the vendor on one system with one particular set of assumptions. Performance may be affected by many other factors, including the operating system platform, amount of available memory, speed of disk drives , namespace design, complexity of access control rules, characteristics of LDAP clients, and much more. If possible, measure your systems under the types of loads you expect them to handle to get a better idea of individual server performance capabilities before planning your total number of replicas. You should consider write performance of your directory separately from search performance. In a replicated environment, each change made by a client needs to be made at each replica as well. For this reason, write performance in a replicated environment does not scale as well as search performance. If you find that your application modifies the directory frequently, you may find it advantageous to partition your directory so that each server needs to handle fewer modifications. Remember that directories are typically optimized for searching, so don't be surprised to discover that your software can perform only a few write operations per second ”even if it can perform hundreds or even thousands of indexed searches per second. Another way to improve your directory's performance is to provide and tune dedicated servers for particular applications. For example, Netscape Messaging Server 3.5 imposes a heavy load on a server as it goes about its business of delivering mail. However, it uses only LDAP equality search filters (as opposed to substring or approximate filters). You could create a Netscape Directory Server replica that is dedicated to servicing the Messaging Server's queries. Because you know that substring and approximate indexes are not required on the server, you could remove them from the server's configuration, therefore improving its write performance and reducing the memory requirements. One other consideration is how your directory clients will know that multiple servers are able to handle their requests. Some directory services handle this automatically via their proprietary protocols, but LDAP does not currently provide a way for a server to inform clients about other replicas of itself. One way to achieve load-balancing in the absence of this capability is to use the Domain Name System (DNS) round-robin capability to map a given hostname to all the IP addresses of hosts containing a copy of the replicated data. With DNS round- robin , the DNS server reorders the list of IP addresses each time it responds to a query for the hostname. This way, client load can be divided among any number of servers. As previously mentioned, it is also possible to use a hardware load-balancing and failover device such as Cisco Systems' LocalDirector to distribute load across a number of servers. Other ConsiderationsIn the discussion of reliability and performance, we've focused primarily on where to put replicas. But we haven't considered some other factors that may be important when designing your replication system. First, you need to consider the maximum number of replicas your software can gracefully handle. This is highly dependent on the software in use and the number of updates that your system receives. In general, it's a good idea to try to limit the number of replicas supplied by a single server to somewhere between 5 and 10. If you try to manage a larger number of replicas, you may find that the servers spend so much time propagating updates that they are unable to answer client requests in a timely fashion. If your directory sees very few modifications, you can probably use more replicas; if your directory handles many modifications, you may need to use fewer replicas. What if you find that your directory service is bogged down with synchronization traffic? One option is to partition your directory tree among a larger number of servers. When you do this, each server needs to handle fewer update requests and therefore needs to send fewer updates to replicas. Of course, the total amount of network traffic would still be the same (in fact, it may actually increase somewhat because of the partition management overhead of some directory systems), so if you find that your network is the bottleneck, a network upgrade may be in order. But in practice, the network is rarely the bottleneck. More often the limiting factors are server CPU, disk I/O, and memory usage. Another option for reducing the replication burden on a master server is to use a cascading replication configuration. In a cascaded configuration, a change propagates from a supplier to a small number of consumers, and then from each of those consumers to a larger number of consumers, and so on until all replicas have been updated (see Figure 10.21). This approach lengthens the time it takes for a given change to propagate to all replicas, but it does make it possible to feed a larger number of replicas. Your directory server software may or may not allow this type of configuration, so consult your documentation. Figure 10.21 Cascaded replication.  The second factor to consider is the overhead associated with managing a very complex replication system. What if a replica goes down? How hard is it to bring it back up? How do you monitor the system to be sure it's working properly? How hard is it to find out whether a user 's complaint results from a replication problem? In general, use of the KISS principle (keep it simple, stupid) is a wonderful idea: The simpler you can make the replication configuration, the better off you'll be. It'll be easier to troubleshoot, simpler to fix, and probably more reliable overall. If your boss is unimpressed because the system looks too simple, you can describe it as "elegant." That usually works. Choosing Replication SolutionsIf you need to provide a reliable, high-performance directory service, you should evaluate the software you are considering in terms of its replication capabilities. The following are some general questions to ask when performing this evaluation: 
 More information about selecting appropriate directory server software can be found in Chapter 12, "Choosing Directory Products." 
 
 | 
| Index terms contained in this sectioncascaded replicationconnectivity replication design design replication systems 2nd choosing software 2nd connectivity maximizing performance 2nd 3rd 4th 5th 6th 7th 8th 9th 10th 11th 12th maximizing reliability 2nd 3rd 4th 5th network maps 2nd directories replication choosing software 2nd system design 2nd 3rd 4th 5th 6th 7th 8th 9th 10th 11th 12th 13th 14th 15th 16th 17th 18th 19th 20th 21st 22nd maps network replication design 2nd multiple replicas maximizing server performance networks maps replication design 2nd performance maximizing replication design 2nd 3rd 4th 5th 6th 7th 8th 9th 10th 11th 12th reliability maximizing replication design 2nd 3rd 4th 5th replication system design 2nd 3rd 4th choosing software 2nd connectivity maximizing performance 2nd 3rd 4th 5th 6th 7th 8th 9th 10th 11th 12th maximizing reliability 2nd 3rd 4th 5th servers maximizing performance multiple replicas software replication design choosing 2nd | 
| 2002, O'Reilly & Associates, Inc. | 
