| Weblog: |
|
The Python comunity has too many deceptive XML benchmarks
|
| Subject: |
|
deceptive benchmarks |
| Date: |
|
2005-01-24 08:34:35 |
| From: |
|
faassen
|
|
|
|
The benchmark in this article is indeed rather deceptive, as the effbot said.
You definitely want to use time.time() or something like that *inside* the program to avoid measuring the Python startup and shutdown time, which is hardly relevant, unless you build web applications using CGI scripts. :)
That said, it is good idea to communicate our benchmarking strategy. What I did when I was curious what Fredrik used was just mail him and ask him. I've been using the same strategy as a result. I get numbers slightly different from his, though not drastically. It's likely due to platform/compiler differences (I'm on Linux, he's on Windows). See my weblog for some numbers:
http://faassen.n--tree.net/blog
|
Showing messages 1 through 3 of 3.
-
deceptive benchmarks
2005-01-24 12:09:20
Fredrik Lundh |
[View]
-
deceptive benchmarks
2005-01-24 14:51:55
faassen
[View]
|
Showing messages 1 through 3 of 3.
|
(fwiw, the virtual debunking team currently suspects that Uche has done most or all of the following mistakes: included Python startup and shutdown times in his figures, included module load times in his figures (cET 0.9 can parse OT.XML nine times in the time it takes Python to load Amara's bindtools component), sent output to a terminal instead of a file or /dev/null, used non-idiomatic solutions for the SAX and cET samples (for cET, Uche's code is 40% slower than the most obvious solution), and, quite possibly, used an unreleased version of the underlying cDomlette library, which is reportedly 3-4 times faster than the current release. And yes, the pystone figures don't seem to match his hardware description, either. This article should be archived in the "the whole bloody breakfast on my face" category, and replaced with an apology.)