Benchmarking
Benchmarking is a tool that, when used correctly, can help you plan for scalability, test throughput, and measure response time. When used incorrectly, benchmarking can give you a very wrong
In a good benchmark, you want to simulate your production environment as closely as possible. This includes things such as hardware available, software being used, and usage patterns. The following sections describe some common problems with benchmarks. Using the Wrong Data SizeWhen doing a benchmark, you need to have the same amount of data you plan on having in the system. Doing a benchmark with 50MB of data when you plan on having 10GB in production is not a useful benchmark. Using a Data Set That Is Not RepresentativeIf you are generating data, you need to make sure it isn't too random. In real life, most data has patterns that cause more repeats of certain data than of other data. For example, imagine that you have a set of categories for something. In real life, certain categories are likely to be much more common than other categories. If you have exactly the same distribution of these in your generated data, this can influence the benchmark. Using Inaccurate Data Access Patterns
Using inaccurate data access patterns is similar to using a data set that is not representative, but it
Failing to Consider Cache Effects
There are two ways failing to consider cache effects can affect your benchmarking. First, you can run a benchmark that makes overly heavy use of caches. For example, if you run just the same query over and over, caching plays a very big role in it. Second, you can do completely different things over and over, which
Using Too Little Load or Too Few Users
In order for a benchmark to be accurate, it needs to reflect the number of users who will be accessing your system. A very common problem with MySQL Cluster benchmarks is attempting to benchmark with only a single
Benchmarking Solutions
Now that you know some of the problems with benchmarking, how can you work around them? One of the
The other solution is to use a benchmarking tool that can more closely
|