UPDATE accounts
SET Abalance = Abalance + :delta
WHERE Aid = :Aid;
SELECT Abalance
INTO :Abalance FROM accounts
WHERE Aid = :Aid;
UPDATE tellers
SET Tbalance = Tbalance + :delta
WHERE Tid = :Tid;
UPDATE branches
SET Bbalance = Bbalance + :delta
WHERE Bid = :Bid;
INSERT INTO history(Tid, Bid, Aid, delta, time)
VALUES (:Tid, :Bid, :Aid, :delta, CURRENT);
The runs conducted varied the number of costumers: 1,8,16,32,64. These represented the number of simultaneous clients that were actually emulated.
I configured PostgreSQL (8.3.1), using postgresql.conf, according to the following options:
- max_connections = 100
- shared_buffers = 200MB
- effective_cache_size = 2GB
As for MySQL (5.1.24-rc) I used the transactional storage engine InnoDb. Then I configured the engine according to the following options, in my.cnf:
- innodb_buffer_pool_size = 500M
- innodb_flush_log_at_trx_commit=1
- innodb_thread_concurrency=0
- thread_cache_size=25
With respect to the hardware, I used two nodes, one for executing the emulated clients (node A) and another to holding the databases (node B). The following table details the environment.
Node A | Node B | |
CPU | Dual AMD Opteron(tm) at 1.5GHz | |
Memory | 4 Gb | 3 Gb |
Storage | One 55 Gb dedicated volume for each node (RAID5). | |
Network | Ethernet: 1 Gbps | |
Operating System | Ubuntu 7.10 |
Both of the machines were dedicated, meaning that no other job was running concurrently with the benchmark. As for the network I am unable to claim the same, as the physical network was shared among people here in the lab accessing other servers. The benchmark implementation was developed within the context of the GORDA project and integrated into Bristlecone benchmarking framework, from Continuent.
So, taking into account the aforementioned settings, I performed several runs in which the difference among them was the number of concurrent connections opened to the database. From the runs, I have measured latency (ms - milliseconds) and throughput (TPM - transactions per minute). The samples from each of the runs were taken by droping the initial two minutes (avoiding the ramp up effects). Additionally, the outliers samples were removed by taking only the 98th percentile of set of samples. The results are depicted in the following two figures.
From these two figures, one may immediately conclude that the system is overloaded even when there are only eight clients in the system. This is predictable, since TPC-b does not add think time on transaction submission, hence the system is in overload. As a consequence the latency increases, due to queuing and the throughput remains stable. One interesting conclusion is that MySQL outperforms completely PostgreSQL in throughput. Since TPC-b is an update, stress test it seems reasonable to consider that MySQL is more suitable for workloads that characterized by update peaks.
What I would like to do next is to compare both of these DBMS using workload with more read operations (TPC-w?), and also compare them using some TPC-c based benchmark for modeling a more complex OLTP setting, and not a stressing it as TPC-b does. When I have the time, I will do this and let you know about the results.
2 comments:
Very good information, thanks. I'm at ScaleDB and we're working on a storage engine with beta release in August. I'd like to get your feedback/benchmarks on it as well.
Thanks Mike. Right now, I cannot commit to anything within August time-frame. Nevertheless, if I have the time, I won't mind to give you some feedback on your beta release. I will probably need a "friendly reminder" then. So, please keep in touch, and drop me an email (luis.soares /at gmail.com ) when the beta release is out.
Post a Comment